Uses of Interface
de.uni_mannheim.informatik.dws.melt.matching_jena.TextExtractor
Package
Description
This package contains preconfigured matcher content that can be used in matching pipelines or as an
example/reference for a new background-based matcher.
-
Uses of TextExtractor in de.uni_mannheim.informatik.dws.melt.matching_jena
Modifier and TypeMethodDescriptionstatic TextExtractor
TextExtractor.appendStringPostProcessing
(TextExtractor e, Function<String, String> postprocessing) static TextExtractor
TextExtractor.wrapLiteralExtractor
(LiteralExtractor e) static TextExtractor
TextExtractor.wrapLiteralExtractorMap
(LiteralExtractorMap e) static TextExtractor
TextExtractor.wrapTextExtractorMap
(TextExtractorMap e) Modifier and TypeMethodDescriptionstatic TextExtractor
TextExtractor.appendStringPostProcessing
(TextExtractor e, Function<String, String> postprocessing) static TextExtractorMap
TextExtractorMap.wrapTextExtractor
(TextExtractor e) -
Uses of TextExtractor in de.uni_mannheim.informatik.dws.melt.matching_jena_matchers.elementlevel
Modifier and TypeFieldDescriptionprivate List<TextExtractor>
StopwordExtraction.valueExtractors
Literal extractors to choose which literal/properties should be used.ModifierConstructorDescriptionStopwordExtraction
(Function<String, Collection<String>> tokenizer, boolean countDistinctTermsPerResource, int topNStopwords, double stopwordsPercentage, TextExtractor... valueExtractors) Extracts the stopwords based on two criteria.ModifierConstructorDescriptionStopwordExtraction
(Function<String, Collection<String>> tokenizer, boolean countDistinctTermsPerResource, int topNStopwords, double stopwordsPercentage, List<TextExtractor> valueExtractors) Extracts the stopwords based on two criteria. -
Uses of TextExtractor in de.uni_mannheim.informatik.dws.melt.matching_jena_matchers.elementlevel.scale
Modifier and TypeFieldDescriptionprotected Set<TextExtractor>
ScalableStringProcessingMatcher.usedValueExtractors
private List<TextExtractor>
PropertySpecificStringProcessingMultipleReturn.valueExtractors
Modifier and TypeMethodDescriptionprotected Map<TextExtractor,
Set<String>> ScalableStringProcessingMatcher.extractAllValues
(org.apache.jena.rdf.model.Resource r) PropertySpecificStringProcessingMultipleReturn.getValueExtractors()
Modifier and TypeMethodDescriptionScalableStringProcessingMatcher.getLiterals
(PropertySpecificStringProcessingMultipleReturn processing, Map<TextExtractor, Set<String>> valueMap) ModifierConstructorDescriptionPropertySpecificStringProcessing
(Function<String, Object> processing, double confidence, TextExtractor... valueExtractors) PropertySpecificStringProcessingMultipleReturn
(Function<String, Iterable<Object>> processing, double confidence, TextExtractor... valueExtractors) ModifierConstructorDescriptionPropertySpecificStringProcessing
(Function<String, Object> processing, double confidence, List<TextExtractor> valueExtractors) PropertySpecificStringProcessing
(Function<String, Object> processing, double confidence, List<TextExtractor> valueExtractors, int maxLevenshteinDistance, int minLengthForLevenshtein) PropertySpecificStringProcessingMultipleReturn
(Function<String, Iterable<Object>> processing, double confidence, List<TextExtractor> valueExtractors) PropertySpecificStringProcessingMultipleReturn
(Function<String, Iterable<Object>> processing, double confidence, List<TextExtractor> valueExtractors, int maxLevenshteinDistance, int minLengthForLevenshtein) -
Uses of TextExtractor in de.uni_mannheim.informatik.dws.melt.matching_jena_matchers.external.matcher
Modifier and TypeFieldDescriptionprivate final TextExtractor
BackgroundMatcher.valueExtractor
The value extractor used to obtain labels for resources.private TextExtractor
SimpleStringMatcher.valueExtractor
The value extractor used to obtain labels for resources.Modifier and TypeMethodDescriptionBackgroundMatcherTools.getURIlabelMap
(org.apache.jena.util.iterator.ExtendedIterator<? extends org.apache.jena.ontology.OntResource> iterator, TextExtractor valueExtractor) Creates a map of the formURI -> set<labels>
. -
Uses of TextExtractor in de.uni_mannheim.informatik.dws.melt.matching_jena_matchers.external.services.labelToConcept
Modifier and TypeMethodDescriptionLinksToFile.getLinks
(TestCase testCase, TextExtractor extractor, LabelToConceptLinker linker, int maxLabelTokenLength) LinksToFile.getLinksForIterator
(org.apache.jena.util.iterator.ExtendedIterator<? extends org.apache.jena.ontology.OntResource> iterator, TextExtractor valueExtractor, LabelToConceptLinker linker, int maxLabelTokenLength) LinksToFile.getLinksForOntModel
(org.apache.jena.ontology.OntModel ontModel, TextExtractor extractor, LabelToConceptLinker linker, int maxLabelTokenLength) LinksToFile.getLinksTestcases
(@NotNull List<TestCase> testCaseList, @NotNull TextExtractor extractor, @NotNull LabelToConceptLinker linker, int maxLabelTokenLength) LinksToFile.getLinksTracks
(@NotNull List<Track> trackList, @NotNull TextExtractor extractor, @NotNull LabelToConceptLinker linker, int maxLabelTokenLength) static void
LinksToFile.writeLinksToFile
(@NotNull File fileToWrite, @NotNull TestCase testCase, @NotNull TextExtractor extractor, @NotNull LabelToConceptLinker linker, int maxLabelTokenLength) Write all links of a track to file (UTF-8 encoded).static void
LinksToFile.writeLinksToFile
(@NotNull File fileToWrite, @NotNull Track track, @NotNull TextExtractor extractor, @NotNull LabelToConceptLinker linker, int maxLabelTokenLength) Write all links of a track to file (UTF-8 encoded).static <T> void
LinksToFile.writeLinksToFile
(@NotNull File fileToWrite, @NotNull List<T> list, @NotNull TextExtractor extractor, @NotNull LabelToConceptLinker linker, int maxLabelTokenLength) Write all links of the tracks/testcases to one file (UTF-8 encoded). -
Uses of TextExtractor in de.uni_mannheim.informatik.dws.melt.matching_jena_matchers.util.textExtractors
Modifier and TypeClassDescriptionclass
All annotation properties are followed (recursively).class
This extractor uses all literals of the resource.class
This extractor uses all literals which are also strings e.g.class
This extractor is a composer and uses the given extractor in the given order as long as an extractor will yield an result.class
ATextExtractor
which extracts texts from a resource which can be used by transformer based matchers likeTransformersFilter
orTransformersFineTuner
.class
Extracts a label for the given resource and also creates a text for the superclass such that more context is provided.class
Extracts all values from specific properties as long as it is a literal.class
Extracts only one speaking label (language can be set in constructor) which can be (in decreasing importance): skos:prefLabel, rdfs:label, fragment (only if more than 50 percent are not numbers), skos:altLabel, skos:hiddenLabel.class
Extracts all values from a specific property as long as it is a literal.class
The textExtractor is a base class for all extractors which lists all statements about a resource.class
class
ATextExtractor
which extracts texts from a resource which can be used by transformer based matchers likeTransformersFilter
orTransformersFineTuner
.class
ATextExtractor
which extracts texts from a resource which can be used by transformer based matchers likeTransformersFilter
orTransformersFineTuner
.class
Extracts the fragment of the URL, e.g.class
Extracts the local name from the URI.class
A text textractor which extracts texts from a resource which can be used by transformer based matchers like TransformersFilter or TransformersFilterFineTuner.class
This extractor creates only one text per resource which describes it by verbalizing each statement where the resource is in the subject position.Modifier and TypeFieldDescriptionprivate final List<TextExtractor>
TextExtractorFallback.extractors
Modifier and TypeMethodDescriptionTextExtractorFallback.getExtractors()
static List<TextExtractor>
TextExtractorProperty.wrapExtractor
(Collection<org.apache.jena.rdf.model.Property> properties) static List<TextExtractor>
TextExtractorProperty.wrapExtractor
(org.apache.jena.rdf.model.Property... properties) ModifierConstructorDescriptionManualInspection
(TextExtractor extractor) TextExtractorFallback
(TextExtractor... extractors) -
Uses of TextExtractor in de.uni_mannheim.informatik.dws.melt.matching_ml.python.nlptransformers
Modifier and TypeMethodDescriptionTransformersBase.getExtractor()
Returns the text extractor which extracts text from a given resource.Modifier and TypeMethodDescriptionvoid
TransformersBase.setExtractor
(TextExtractor extractor) Sets the extractor which computes the text from a given resource.ModifierConstructorDescriptionLLMBase
(TextExtractor extractor, String modelName, String promt) Constructor with all required parameters and default values for optional parameters (can be changed by setters).LLMBinaryFilter
(TextExtractor extractor, String modelName, String promt) Constructor with all required parameters and default values for optional parameters (can be changed by setters).LLMChooseGivenEntityFilter
(TextExtractor extractor, String modelName, String promt) Constructor with all required parameters and default values for optional parameters (can be changed by setters).SentenceTransformersFineTuner
(TextExtractor extractor, String initialModelName, File resultingModelLocation) Run the training of a NLP sentence transformers.SentenceTransformersMatcher
(TextExtractor extractor, String modelName) TransformersBase
(TextExtractor extractor, String modelName) Constructor with all required parameters.TransformersBaseFineTuner
(TextExtractor extractor, String initialModelName, File resultingModelLocation) Run the training of a NLP transformer.TransformersFilter
(TextExtractor extractor, String modelName) Constructor with all required parameters and default values for optional parameters (can be changed by setters).TransformersFineTuner
(TextExtractor extractor, String initialModelName, File resultingModelLocation) Run the training of a NLP transformer.TransformersFineTunerHpSearch
(TextExtractor extractor, String initialModelName, File resultingModelLocation) -
Uses of TextExtractor in de.uni_mannheim.informatik.dws.melt.matching_ml.python.nlptransformers.relationprediction
ModifierConstructorDescriptionRelationTypeFineTuner
(TextExtractor extractor, String modelName, File resultingModelLocation) RelationTypePredictor
(TextExtractor extractor, String modelName)