Flexibly separates text into terms via a regular expression pattern. This analyzer is implemented using Apache Lucene.
Regular expression flags. Possible values include: 'CANON_EQ', 'CASE_INSENSITIVE', 'COMMENTS', 'DOTALL', 'LITERAL', 'MULTILINE', 'UNICODE_CASE', 'UNIX_LINES'.
A value indicating whether terms should be lower-cased. Default value: true.
The name of the analyzer. It must only contain letters, digits, spaces, dashes or underscores, can only start and end with alphanumeric characters, and is limited to 128 characters.
A regular expression pattern to match token separators. Default is an expression that matches one or more non-word characters. Default value: '\W+'.
A list of stopwords.
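
The properties above can be illustrated with a minimal sketch. It is not the Lucene implementation: it approximates the described behavior with JavaScript's built-in RegExp, ignores the regular expression flags, and assumes stopwords are removed after lower-casing. The interface and its field names (name, lowerCaseTerms, pattern, stopwords), the helper functions, and the ASCII-only reading of "letters" in the naming rule are all assumptions made for illustration.

```typescript
// Hypothetical shape mirroring the properties described above; the field
// names are assumptions, not taken from the generated reference.
interface PatternAnalyzerOptions {
  name: string;
  lowerCaseTerms?: boolean; // default: true
  pattern?: string;         // default: "\\W+"
  stopwords?: string[];
}

// Checks the naming rule: letters, digits, spaces, dashes or underscores,
// alphanumeric first and last character, at most 128 characters.
// Assumption: "letters" is read as ASCII letters only.
function isValidAnalyzerName(name: string): boolean {
  return /^[A-Za-z0-9](?:[A-Za-z0-9 _-]{0,126}[A-Za-z0-9])?$/.test(name);
}

// Rough emulation of the described behavior: split on the separator
// pattern, optionally lower-case each term, then drop stopwords.
function analyzeSketch(text: string, options: PatternAnalyzerOptions): string[] {
  const separator = new RegExp(options.pattern ?? "\\W+");
  const lowerCase = options.lowerCaseTerms ?? true;
  const stopwords = new Set(options.stopwords ?? []);

  return text
    .split(separator)
    .filter((term) => term.length > 0) // discard empty leading/trailing pieces
    .map((term) => (lowerCase ? term.toLowerCase() : term))
    .filter((term) => !stopwords.has(term));
}

// Example: treat commas (plus any following whitespace) as the separator.
const options: PatternAnalyzerOptions = {
  name: "comma-separated",
  pattern: ",\\s*",
  stopwords: ["the"],
};

const terms = analyzeSketch("The, Quick Fox,  Lazy-Dog", options);
console.log(isValidAnalyzerName(options.name), terms);
```

Because the pattern describes separators rather than tokens, anything between two matches is kept as a single term, which is why "quick fox" survives as one term in this example.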