The string to insert for each position at which there is no token. Default is an underscore ("_"). Default value: '_'.
The maximum shingle size. Default and minimum value is 2. Default value: 2.
The minimum shingle size. Default and minimum value is 2. Must be less than the value of maxShingleSize. Default value: 2.
The name of the token filter. It must only contain letters, digits, spaces, dashes or underscores, can only start and end with alphanumeric characters, and is limited to 128 characters.
Polymorphic Discriminator
A value indicating whether the output stream will contain the input tokens (unigrams) as well as shingles. Default is true. Default value: true.
A value indicating whether to output unigrams for those times when no shingles are available. This property takes precedence when outputUnigrams is set to false. Default is false. Default value: false.
The string to use when joining adjacent tokens to form a shingle. Default is a single space (" "). Default value: ' '.
Creates combinations of tokens as a single token. This token filter is implemented using Apache Lucene.
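To make "combinations of tokens as a single token" concrete, here is a minimal sketch of shingling in TypeScript. It is an illustration only, not the Apache Lucene implementation and not part of this library's API. The `ShingleOptions` interface and the `shingles` function are hypothetical. The option names `minShingleSize` and `tokenSeparator` are assumed from the property descriptions above; only `maxShingleSize` and `outputUnigrams` are named in this page. The sketch ignores the filler-token and "unigrams if no shingles" settings.

```typescript
// Illustrative sketch only: a simplified shingle generator, NOT Lucene's code.
// All names here are hypothetical and chosen to mirror the descriptions above.
interface ShingleOptions {
  minShingleSize: number; // smallest number of tokens per shingle (assumed name)
  maxShingleSize: number; // largest number of tokens per shingle
  outputUnigrams: boolean; // also emit the original single tokens
  tokenSeparator: string; // string placed between joined tokens (assumed name)
}

function shingles(tokens: string[], opts: ShingleOptions): string[] {
  const out: string[] = [];
  for (let start = 0; start < tokens.length; start++) {
    // Emit the input token itself when unigrams are requested.
    if (opts.outputUnigrams) {
      out.push(tokens[start]);
    }
    // Emit every shingle of an allowed size that begins at this position.
    for (let size = opts.minShingleSize; size <= opts.maxShingleSize; size++) {
      if (start + size > tokens.length) {
        break; // not enough tokens left for this size or any larger one
      }
      out.push(tokens.slice(start, start + size).join(opts.tokenSeparator));
    }
  }
  return out;
}

// Example usage with the documented defaults (sizes of 2, unigrams on, space separator).
const example = shingles(["quick", "brown", "fox"], {
  minShingleSize: 2,
  maxShingleSize: 2,
  outputUnigrams: true,
  tokenSeparator: " ",
});
console.log(example);
```

With the defaults, each input token appears in the output followed by the two-token shingle that starts at it. Setting `outputUnigrams` to false in this sketch leaves only the shingles.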