public final class WordDelimiterTokenFilter extends TokenFilter
Constructor and Description |
---|
WordDelimiterTokenFilter(String name) Constructor of WordDelimiterTokenFilter. |
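
The constructor's `name` argument is documented as containing only letters, digits, spaces, dashes or underscores, starting and ending with an alphanumeric character, and being at most 128 characters long. The following is a minimal sketch of a client-side check for that rule. The class `TokenFilterNameCheck` and the method `isValidName` are hypothetical and not part of the SDK, and the sketch assumes "letters" and "digits" mean ASCII characters, which the service may not require.

```java
import java.util.regex.Pattern;

public class TokenFilterNameCheck {

    // Assumption: letters and digits are read as ASCII only.
    // First and last characters must be alphanumeric; the middle may also
    // contain spaces, underscores, and dashes. Total length is 1 to 128.
    private static final Pattern NAME =
        Pattern.compile("[A-Za-z0-9](?:[A-Za-z0-9 _-]{0,126}[A-Za-z0-9])?");

    /** Hypothetical helper: returns true if the name satisfies the documented rule. */
    public static boolean isValidName(String name) {
        return name != null && NAME.matcher(name).matches();
    }

    public static void main(String[] args) {
        System.out.println(isValidName("my_word-delimiter 1")); // true
        System.out.println(isValidName("-leading-dash"));       // false
        System.out.println(isValidName("trailing_"));           // false
    }
}
```

A check like this only catches obviously invalid names early; the service remains the authority on what it accepts.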
Modifier and Type | Method and Description |
---|---|
Boolean | areNumbersCatenated() Get the catenateNumbers property: A value indicating whether maximum runs of number parts will be catenated. |
Boolean | areWordsCatenated() Get the catenateWords property: A value indicating whether maximum runs of word parts will be catenated. |
Boolean | catenateAll() Get the catenateAll property: A value indicating whether all subword parts will be catenated. |
Boolean | generateNumberParts() Get the generateNumberParts property: A value indicating whether to generate number subwords. |
Boolean | generateWordParts() Get the generateWordParts property: A value indicating whether to generate part words. |
List<String> | getProtectedWords() Get the protectedWords property: A list of tokens to protect from being delimited. |
Boolean | isPreserveOriginal() Get the preserveOriginal property: A value indicating whether original words will be preserved and added to the subword list. |
Boolean | isStemEnglishPossessive() Get the stemEnglishPossessive property: A value indicating whether to remove trailing "'s" for each subword. |
WordDelimiterTokenFilter | setCatenateAll(Boolean catenateAll) Set the catenateAll property: A value indicating whether all subword parts will be catenated. |
WordDelimiterTokenFilter | setGenerateNumberParts(Boolean generateNumberParts) Set the generateNumberParts property: A value indicating whether to generate number subwords. |
WordDelimiterTokenFilter | setGenerateWordParts(Boolean generateWordParts) Set the generateWordParts property: A value indicating whether to generate part words. |
WordDelimiterTokenFilter | setNumbersCatenated(Boolean numbersCatenated) Set the catenateNumbers property: A value indicating whether maximum runs of number parts will be catenated. |
WordDelimiterTokenFilter | setPreserveOriginal(Boolean preserveOriginal) Set the preserveOriginal property: A value indicating whether original words will be preserved and added to the subword list. |
WordDelimiterTokenFilter | setProtectedWords(List<String> protectedWords) Set the protectedWords property: A list of tokens to protect from being delimited. |
WordDelimiterTokenFilter | setProtectedWords(String... protectedWords) Set the protectedWords property: A list of tokens to protect from being delimited. |
WordDelimiterTokenFilter | setSplitOnCaseChange(Boolean splitOnCaseChange) Set the splitOnCaseChange property: A value indicating whether to split words on caseChange. |
WordDelimiterTokenFilter | setSplitOnNumerics(Boolean splitOnNumerics) Set the splitOnNumerics property: A value indicating whether to split on numbers. |
WordDelimiterTokenFilter | setStemEnglishPossessive(Boolean stemEnglishPossessive) Set the stemEnglishPossessive property: A value indicating whether to remove trailing "'s" for each subword. |
WordDelimiterTokenFilter | setWordsCatenated(Boolean wordsCatenated) Set the catenateWords property: A value indicating whether maximum runs of word parts will be catenated. |
Boolean | splitOnCaseChange() Get the splitOnCaseChange property: A value indicating whether to split words on caseChange. |
Boolean | splitOnNumerics() Get the splitOnNumerics property: A value indicating whether to split on numbers. |
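
To make the splitting-related properties more concrete, here is a simplified, self-contained sketch of how splitting on case changes, splitting on letter/digit boundaries, removing a trailing "'s", and protecting words might behave on a single token. It is a toy approximation for illustration only: it does not call the SDK, it is not the algorithm the service actually runs, and the class `WordDelimiterSketch` with its `split` method is a hypothetical name introduced here. The real filter's output for edge cases may differ.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Set;

public class WordDelimiterSketch {

    /** Hypothetical helper: splits one token into subword parts. */
    public static List<String> split(String token,
                                     boolean splitOnCaseChange,
                                     boolean splitOnNumerics,
                                     boolean stemEnglishPossessive,
                                     Set<String> protectedWords) {
        List<String> parts = new ArrayList<>();
        // Protected words pass through untouched.
        if (protectedWords.contains(token)) {
            parts.add(token);
            return parts;
        }
        String text = token;
        // Simplification: strip a trailing "'s" from the whole token.
        if (stemEnglishPossessive && text.endsWith("'s")) {
            text = text.substring(0, text.length() - 2);
        }
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            // Any non-alphanumeric character acts as a delimiter.
            if (!Character.isLetterOrDigit(c)) {
                flush(current, parts);
                continue;
            }
            if (current.length() > 0) {
                char prev = current.charAt(current.length() - 1);
                boolean caseChange = Character.isLowerCase(prev) && Character.isUpperCase(c);
                boolean numericChange = Character.isDigit(prev) != Character.isDigit(c);
                if ((splitOnCaseChange && caseChange) || (splitOnNumerics && numericChange)) {
                    flush(current, parts);
                }
            }
            current.append(c);
        }
        flush(current, parts);
        return parts;
    }

    // Moves the accumulated characters into the parts list, if any.
    private static void flush(StringBuilder current, List<String> parts) {
        if (current.length() > 0) {
            parts.add(current.toString());
            current.setLength(0);
        }
    }

    public static void main(String[] args) {
        Set<String> none = Collections.emptySet();
        System.out.println(split("PowerShot", true, true, true, none));  // [Power, Shot]
        System.out.println(split("SD500", true, true, true, none));      // [SD, 500]
        System.out.println(split("O'Neil's", true, true, true, none));   // [O, Neil]
        System.out.println(split("Wi-Fi", true, true, true, Collections.singleton("Wi-Fi"))); // [Wi-Fi]
    }
}
```

The catenate and preserveOriginal properties are left out of the sketch; per the descriptions above, they would add joined runs of parts or the original token to the emitted list.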
Methods inherited from class TokenFilter: getName
public WordDelimiterTokenFilter(String name)
Constructor of WordDelimiterTokenFilter.
name - The name of the token filter. It must only contain letters, digits, spaces, dashes or underscores, can only start and end with alphanumeric characters, and is limited to 128 characters.

public Boolean generateWordParts()
public WordDelimiterTokenFilter setGenerateWordParts(Boolean generateWordParts)
generateWordParts - the generateWordParts value to set.

public Boolean generateNumberParts()
public WordDelimiterTokenFilter setGenerateNumberParts(Boolean generateNumberParts)
generateNumberParts - the generateNumberParts value to set.

public Boolean areWordsCatenated()
public WordDelimiterTokenFilter setWordsCatenated(Boolean wordsCatenated)
wordsCatenated - the catenateWords value to set.

public Boolean areNumbersCatenated()
public WordDelimiterTokenFilter setNumbersCatenated(Boolean numbersCatenated)
numbersCatenated - the catenateNumbers value to set.

public Boolean catenateAll()
public WordDelimiterTokenFilter setCatenateAll(Boolean catenateAll)
catenateAll - the catenateAll value to set.

public Boolean splitOnCaseChange()
public WordDelimiterTokenFilter setSplitOnCaseChange(Boolean splitOnCaseChange)
splitOnCaseChange - the splitOnCaseChange value to set.

public Boolean isPreserveOriginal()
public WordDelimiterTokenFilter setPreserveOriginal(Boolean preserveOriginal)
preserveOriginal - the preserveOriginal value to set.

public Boolean splitOnNumerics()
public WordDelimiterTokenFilter setSplitOnNumerics(Boolean splitOnNumerics)
splitOnNumerics - the splitOnNumerics value to set.

public Boolean isStemEnglishPossessive()
public WordDelimiterTokenFilter setStemEnglishPossessive(Boolean stemEnglishPossessive)
stemEnglishPossessive - the stemEnglishPossessive value to set.

public List<String> getProtectedWords()
public WordDelimiterTokenFilter setProtectedWords(String... protectedWords)
protectedWords - the protectedWords value to set.

public WordDelimiterTokenFilter setProtectedWords(List<String> protectedWords)
protectedWords - the protectedWords value to set.

Copyright © 2021 Microsoft Corporation. All rights reserved.