Class DictionaryDecompounderTokenFilter

java.lang.Object
com.azure.search.documents.indexes.models.TokenFilter
com.azure.search.documents.indexes.models.DictionaryDecompounderTokenFilter

public final class DictionaryDecompounderTokenFilter extends TokenFilter
Decomposes compound words found in many Germanic languages. This token filter is implemented using Apache Lucene.
  • Constructor Details

    • DictionaryDecompounderTokenFilter

      public DictionaryDecompounderTokenFilter(String name, List<String> wordList)
      Creates an instance of DictionaryDecompounderTokenFilter.
      Parameters:
      name - The name of the token filter. It may contain only letters, digits, spaces, dashes, or underscores; it must start and end with an alphanumeric character; and it is limited to 128 characters.
      wordList - The list of words to match against.
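
      The naming rule for the name parameter can be checked up front. The sketch below is a hypothetical helper (FilterNameCheck is not part of the Azure SDK), and it assumes that "letters" and "digits" mean ASCII characters; the service may accept a wider set.

```java
import java.util.regex.Pattern;

/** Hypothetical helper for illustration; not part of the Azure SDK. */
final class FilterNameCheck {
    // Assumption: "letters" and "digits" are ASCII only.
    // The first and last characters must be alphanumeric; characters in
    // between may also be spaces, dashes, or underscores.
    private static final Pattern NAME =
        Pattern.compile("[A-Za-z0-9](?:[A-Za-z0-9 _-]*[A-Za-z0-9])?");

    /** Returns true if the name satisfies the documented rule. */
    static boolean isValid(String name) {
        return name != null
            && name.length() <= 128          // documented length limit
            && NAME.matcher(name).matches(); // whole string must match
    }

    public static void main(String[] args) {
        System.out.println(isValid("german_decompounder")); // true
        System.out.println(isValid("-leading-dash"));       // false
    }
}
```

      Validating locally only avoids a round trip to the service; the service remains the authority on which names it accepts.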
  • Method Details

    • getWordList

      public List<String> getWordList()
      Get the wordList property: The list of words to match against.
      Returns:
      the wordList value.
    • getMinWordSize

      public Integer getMinWordSize()
      Get the minWordSize property: The minimum word size. Only words longer than this are processed. Default is 5. Maximum is 300.
      Returns:
      the minWordSize value.
    • setMinWordSize

      public DictionaryDecompounderTokenFilter setMinWordSize(Integer minWordSize)
      Set the minWordSize property: The minimum word size. Only words longer than this are processed. Default is 5. Maximum is 300.
      Parameters:
      minWordSize - the minWordSize value to set.
      Returns:
      the DictionaryDecompounderTokenFilter object itself.
    • getMinSubwordSize

      public Integer getMinSubwordSize()
      Get the minSubwordSize property: The minimum subword size. Only subwords longer than this are output. Default is 2. Maximum is 300.
      Returns:
      the minSubwordSize value.
    • setMinSubwordSize

      public DictionaryDecompounderTokenFilter setMinSubwordSize(Integer minSubwordSize)
      Set the minSubwordSize property: The minimum subword size. Only subwords longer than this are output. Default is 2. Maximum is 300.
      Parameters:
      minSubwordSize - the minSubwordSize value to set.
      Returns:
      the DictionaryDecompounderTokenFilter object itself.
    • getMaxSubwordSize

      public Integer getMaxSubwordSize()
      Get the maxSubwordSize property: The maximum subword size. Only subwords shorter than this are output. Default is 15. Maximum is 300.
      Returns:
      the maxSubwordSize value.
    • setMaxSubwordSize

      public DictionaryDecompounderTokenFilter setMaxSubwordSize(Integer maxSubwordSize)
      Set the maxSubwordSize property: The maximum subword size. Only subwords shorter than this are output. Default is 15. Maximum is 300.
      Parameters:
      maxSubwordSize - the maxSubwordSize value to set.
      Returns:
      the DictionaryDecompounderTokenFilter object itself.
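
      To make the three size settings concrete, here is a plain-Java sketch of how they might interact. It is not the Lucene or Azure implementation: SizeBoundsSketch and its subwords method are hypothetical, the bounds are treated as inclusive (an assumption; the wording above could also be read as exclusive), and only the matched subwords are returned, not the original word.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

/** Illustrative sketch only; not the Lucene or Azure implementation. */
final class SizeBoundsSketch {
    /**
     * Returns every substring of word that appears in wordList and whose
     * length lies in [minSubwordSize, maxSubwordSize].
     * Assumption: all three bounds are inclusive.
     */
    static List<String> subwords(String word, Set<String> wordList,
                                 int minWordSize, int minSubwordSize,
                                 int maxSubwordSize) {
        List<String> out = new ArrayList<>();
        // Words below the minimum word size are not decomposed at all.
        if (word.length() < minWordSize) {
            return out;
        }
        // Try every start position and every permitted subword length.
        for (int start = 0; start + minSubwordSize <= word.length(); start++) {
            for (int len = minSubwordSize;
                 len <= maxSubwordSize && start + len <= word.length(); len++) {
                String candidate = word.substring(start, start + len);
                if (wordList.contains(candidate)) {
                    out.add(candidate);
                }
            }
        }
        return out;
    }

    public static void main(String[] args) {
        Set<String> words = Set.of("fuss", "ball", "pumpe", "fussball");
        // Documented defaults: minWordSize 5, minSubwordSize 2, maxSubwordSize 15.
        System.out.println(subwords("fussballpumpe", words, 5, 2, 15));
        // prints [fuss, fussball, ball, pumpe]
        // Raising minSubwordSize to 5 drops the four-letter subwords.
        System.out.println(subwords("fussballpumpe", words, 5, 5, 15));
        // prints [fussball, pumpe]
    }
}
```

      Under these assumptions, lowering maxSubwordSize to 4 would leave only fuss and ball, and a minWordSize larger than the word's length would produce no subwords.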
    • isOnlyLongestMatched

      public Boolean isOnlyLongestMatched()
      Get the onlyLongestMatch property: A value indicating whether to add only the longest matching subword to the output. Default is false.
      Returns:
      the onlyLongestMatch value.
    • setOnlyLongestMatched

      public DictionaryDecompounderTokenFilter setOnlyLongestMatched(Boolean onlyLongestMatched)
      Set the onlyLongestMatch property: A value indicating whether to add only the longest matching subword to the output. Default is false.
      Parameters:
      onlyLongestMatched - the onlyLongestMatch value to set.
      Returns:
      the DictionaryDecompounderTokenFilter object itself.
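
      The effect of onlyLongestMatch can be illustrated with another plain-Java sketch. LongestMatchSketch is hypothetical and is not the Lucene or Azure implementation; in particular, it assumes that "longest" is decided separately for each start position in the word, which may differ from what the service does.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

/** Illustrative sketch only; not the Lucene or Azure implementation. */
final class LongestMatchSketch {
    /**
     * Collects dictionary subwords of word. When onlyLongestMatch is true,
     * keeps at most one subword per start position: the longest one.
     * Assumption: subword size bounds are inclusive.
     */
    static List<String> subwords(String word, Set<String> wordList,
                                 int minSubwordSize, int maxSubwordSize,
                                 boolean onlyLongestMatch) {
        List<String> out = new ArrayList<>();
        for (int start = 0; start + minSubwordSize <= word.length(); start++) {
            String longest = null;
            for (int len = minSubwordSize;
                 len <= maxSubwordSize && start + len <= word.length(); len++) {
                String candidate = word.substring(start, start + len);
                if (!wordList.contains(candidate)) {
                    continue;
                }
                if (onlyLongestMatch) {
                    // len only grows, so the latest hit is the longest so far.
                    longest = candidate;
                } else {
                    out.add(candidate);
                }
            }
            if (longest != null) {
                out.add(longest);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        Set<String> words = Set.of("fuss", "ball", "pumpe", "fussball");
        System.out.println(subwords("fussballpumpe", words, 2, 15, false));
        // prints [fuss, fussball, ball, pumpe]
        System.out.println(subwords("fussballpumpe", words, 2, 15, true));
        // prints [fussball, ball, pumpe]
    }
}
```

      In this sketch, fuss disappears when the flag is true because fussball starts at the same position and is longer; ball and pumpe start elsewhere and are kept.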