java.lang.Object
de.uni_mannheim.informatik.dws.melt.matching_jena_matchers.external.services.labelToConcept.nGramTokenizers.NgramTokenizer
All Implemented Interfaces:
OneToManyLinkingStrategy

public class NgramTokenizer extends Object implements OneToManyLinkingStrategy
Creates regular n-grams
  • Field Details

    • delimiter

      private String delimiter
    • nGramsize

      private int nGramsize
  • Constructor Details

    • NgramTokenizer

      public NgramTokenizer(int nGramsize, String delimiter)
  • Method Details

    • getNgrams

      public HashSet<String> getNgrams(String[] tokens)
      Creates ngrams out of the given array tokens.
      Parameters:
      tokens - Array for which ngrams shall be created.
      Returns:
      ngrams in a set.
    • getDelimiter

      public String getDelimiter()
    • setDelimiter

      public void setDelimiter(String delimiter)
    • getnGramsize

      public int getnGramsize()
    • setnGramsize

      public void setnGramsize(int nGramsize)