Class WiktionaryLinker
java.lang.Object
de.uni_mannheim.informatik.dws.melt.matching_jena_matchers.external.wiktionary.WiktionaryLinker
- All Implemented Interfaces:
LabelToConceptLinker
This linker can link strings to dictionary entries.
-
Field Summary
Modifier and TypeFieldDescriptionprivate WiktionaryKnowledgeSource
The dictionary that is used to perform lookups.private String
Default name of the linker.(package private) LinkedList<StringModifier>
The list of operations that is performed to find a concept in the dictionary. -
Constructor Summary
-
Method Summary
Modifier and TypeMethodDescriptionGet instance specific name of the linker.linkLabelToTokensLeftToRight
(String labelToBeLinked) Splits the labelToBeLinked in ngrams up to infinite size and tries to link components.linkToPotentiallyMultipleConcepts
(String labelToBeLinked) This method tries to linklabelToBeLinked
to one concept if possible.linkToSingleConcept
(String labelToBeLinked) Queries for a concept and returns a link that represents an entity in the background knowledge source such as theSemanticWordRelationDictionary
.linkToSingleConcept
(String labelToBeLinked, Language language) The language can be specified.static String
normalizeForWiktionaryLookup
(String lookupString) Normalize for WordNet lookup, i.e., lowercasing and camel-case resolution.void
setNameOfLinker
(String nameOfLinker) Set instance specific name of the linker.
-
Field Details
-
nameOfLinker
Default name of the linker. -
dictionary
The dictionary that is used to perform lookups. -
stringModificationSequence
LinkedList<StringModifier> stringModificationSequenceThe list of operations that is performed to find a concept in the dictionary.
-
-
Constructor Details
-
WiktionaryLinker
Constructor- Parameters:
dictionary
- The dictionary that shall be used.
-
-
Method Details
-
linkToSingleConcept
Description copied from interface:LabelToConceptLinker
Queries for a concept and returns a link that represents an entity in the background knowledge source such as theSemanticWordRelationDictionary
. Note that the link may not always be something intuitive such as a URI but may also be an artificial identifier that is understood by the corresponding background knowledge source.- Specified by:
linkToSingleConcept
in interfaceLabelToConceptLinker
- Parameters:
labelToBeLinked
- The label which shall be linked to a single concept.- Returns:
- Concept or null if no link could be found.
-
linkToSingleConcept
The language can be specified.- Parameters:
labelToBeLinked
- Label that shall be linked.language
- The language of the label.- Returns:
- Linked concept or null.
-
linkToPotentiallyMultipleConcepts
Description copied from interface:LabelToConceptLinker
This method tries to linklabelToBeLinked
to one concept if possible. If it fails, it will try to link it to multiple concepts.- Specified by:
linkToPotentiallyMultipleConcepts
in interfaceLabelToConceptLinker
- Parameters:
labelToBeLinked
- The label which shall be linked.- Returns:
- One or multiple linked concepts in a set. Null if it could not fully link the label.
-
linkLabelToTokensLeftToRight
Splits the labelToBeLinked in ngrams up to infinite size and tries to link components. This corresponds to a MAXGRAM_LEFT_TO_RIGHT_TOKENIZER or NGRAM_LEFT_TO_RIGHT_TOKENIZER OneToManyLinkingStrategy.- Parameters:
labelToBeLinked
- The label that shall be linked.- Returns:
- A set of concept URIs that were found.
-
getNameOfLinker
Description copied from interface:LabelToConceptLinker
Get instance specific name of the linker.- Specified by:
getNameOfLinker
in interfaceLabelToConceptLinker
- Returns:
- Name as String.
-
setNameOfLinker
Description copied from interface:LabelToConceptLinker
Set instance specific name of the linker.- Specified by:
setNameOfLinker
in interfaceLabelToConceptLinker
- Parameters:
nameOfLinker
- Name to be set.
-
normalizeForWiktionaryLookup
Normalize for WordNet lookup, i.e., lowercasing and camel-case resolution. Dev-Remark: The WordNet library utilizes space-separation.- Parameters:
lookupString
- The string that shall be normalized for lookup.- Returns:
- Space-separated lookup word.
-