Creates combinations of tokens as a single token. This token filter is implemented using Apache Lucene.
The string to insert for each position at which there is no token. Default is an underscore ("_"). Default value: '_'.
The maximum shingle size. Default and minimum value is 2. Default value: 2.
The minimum shingle size. Default and minimum value is 2. Must not be greater than the value of maxShingleSize (the two defaults are equal). Default value: 2.
The name of the token filter. It must only contain letters, digits, spaces, dashes or underscores, can only start and end with alphanumeric characters, and is limited to 128 characters.
A value indicating whether the output stream will contain the input tokens (unigrams) as well as shingles. Default is true. Default value: true.
A value indicating whether to output unigrams when no shingles are available, for example when the input has fewer tokens than the minimum shingle size. This property only has an effect when outputUnigrams is set to false; in that case it takes precedence. Default is false. Default value: false.
The string to use when joining adjacent tokens to form a shingle. Default is a single space (" "). Default value: ' '.
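
To illustrate how the properties above interact, here is a minimal sketch of shingling in plain TypeScript. It is not the Lucene implementation and does not call any search SDK. The `ShingleOptions` interface and the `shingles` function are hypothetical names introduced for this example, and the option names other than `maxShingleSize` and `outputUnigrams` are assumed. The filler string used for positions without a token is not modeled.

```typescript
// Hypothetical options type mirroring the properties described above.
interface ShingleOptions {
  minShingleSize: number;              // smallest shingle, at least 2
  maxShingleSize: number;              // largest shingle, at least minShingleSize
  outputUnigrams: boolean;             // also emit the original tokens
  outputUnigramsIfNoShingles: boolean; // emit tokens only if no shingle can be built
  tokenSeparator: string;              // joins adjacent tokens inside a shingle
}

// Defaults as documented on this page.
const defaultOptions: ShingleOptions = {
  minShingleSize: 2,
  maxShingleSize: 2,
  outputUnigrams: true,
  outputUnigramsIfNoShingles: false,
  tokenSeparator: " ",
};

// Illustrative only: builds shingles from an already tokenized input.
function shingles(
  tokens: string[],
  overrides: Partial<ShingleOptions> = {},
): string[] {
  const opts: ShingleOptions = { ...defaultOptions, ...overrides };
  const out: string[] = [];

  // A shingle can only be formed if there are enough tokens for the minimum size.
  const canShingle = tokens.length >= opts.minShingleSize;
  const emitUnigrams =
    opts.outputUnigrams || (opts.outputUnigramsIfNoShingles && !canShingle);

  for (let start = 0; start < tokens.length; start++) {
    if (emitUnigrams) {
      out.push(tokens[start]);
    }
    for (let size = opts.minShingleSize; size <= opts.maxShingleSize; size++) {
      if (start + size > tokens.length) {
        break; // not enough tokens left for this shingle size
      }
      out.push(tokens.slice(start, start + size).join(opts.tokenSeparator));
    }
  }
  return out;
}
```

With the defaults, `shingles(["please", "divide", "this"])` yields the unigrams interleaved with the two-token shingles `"please divide"` and `"divide this"`. Passing `{ outputUnigrams: false, maxShingleSize: 3 }` drops the unigrams and adds the three-token shingle.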