Class TextExtractorResourceDescriptionInRDF
java.lang.Object
de.uni_mannheim.informatik.dws.melt.matching_jena_matchers.util.textExtractors.TextExtractorRDFBase
de.uni_mannheim.informatik.dws.melt.matching_jena_matchers.util.textExtractors.TextExtractorResourceDescriptionInRDF
All Implemented Interfaces:
TextExtractor
Field Summary
Modifier and Type  Field  Description
protected boolean  includePrefixesInModel
private static final Set<org.apache.jena.rdf.model.Property>  LABEL_PROP
private static final TextExtractorOnlyLabel  labelExtractor
private static final org.slf4j.Logger  LOGGER
private static final Pattern  MULTIPLE_WHITESPACE
private static Set<org.apache.jena.riot.Lang>  noPrefixLang
protected boolean  removeMultipleSpaces
protected boolean  removeNewlines
protected boolean  removePrefixDefition
protected org.apache.jena.riot.RDFFormat  serializationFormat  The serialization format to use; more info at the corresponding Jena page.
protected boolean  useLabelInsteadOfResource
Fields inherited from class de.uni_mannheim.informatik.dws.melt.matching_jena_matchers.util.textExtractors.TextExtractorRDFBase
SKIP_DEFINITIONS, SKIP_DEFINITIONS_AND_LONG_LITERALS, SKIP_DEFINITIONS_AND_SHORTEN_LONG_LITERALS, statementProcessor
Constructor Summary
Constructor  Description
TextExtractorResourceDescriptionInRDF()
TextExtractorResourceDescriptionInRDF(boolean useLabelInsteadOfResource)
TextExtractorResourceDescriptionInRDF(boolean useLabelInsteadOfResource, org.apache.jena.riot.RDFFormat serializationFormat)
Method Summary
Modifier and Type  Method  Description
private org.apache.jena.rdf.model.Model  createEmptyModel(org.apache.jena.rdf.model.Resource r)
Set<String>  extract(org.apache.jena.rdf.model.Resource r)  Given a Jena resource this method extracts textual/string representations from it.
protected org.apache.jena.rdf.model.Model  getModelWithLabel(org.apache.jena.rdf.model.Resource r)
protected org.apache.jena.rdf.model.Model  getModelWithResource(org.apache.jena.rdf.model.Resource r)
private String  getPrefix
org.apache.jena.riot.RDFFormat  getSerializationFormat()
boolean  isIncludePrefixesInModel()
boolean  isRemoveMultipleSpaces()
boolean  isRemoveNewlines()
boolean  isRemovePrefixDefition()
boolean  isUseLabelInsteadOfResource()
private org.apache.jena.rdf.model.Model  removeUnusedPrefixes(org.apache.jena.rdf.model.Model m)
TextExtractorResourceDescriptionInRDF  setIncludePrefixesInModel(boolean includePrefixesInModel)
setRemoveMultipleSpaces(boolean removeMultipleSpaces)
setRemoveNewlines(boolean removeNewlines)
setRemovePrefixDefition(boolean removePrefixDefition)
TextExtractorResourceDescriptionInRDF  setSerializationFormat(org.apache.jena.riot.RDFFormat serializationFormat)
TextExtractorResourceDescriptionInRDF  setUseLabelInsteadOfResource(boolean useLabelInsteadOfResource)
Methods inherited from class de.uni_mannheim.informatik.dws.melt.matching_jena_matchers.util.textExtractors.TextExtractorRDFBase
getStatementProcessor, setStatementProcessor
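This page lists removeUnusedPrefixes(org.apache.jena.rdf.model.Model m) without describing its behaviour. One plausible reading is that it drops prefix mappings whose namespace is not used by any resource in the model. The following is a minimal sketch of that idea only, written against plain Java collections rather than a Jena Model. The class PrefixPruningSketch, the method pruneUnusedPrefixes, and the sample URIs are hypothetical and not part of MELT or Jena.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration; NOT the MELT implementation.
final class PrefixPruningSketch {

    // Keeps only those prefix mappings whose namespace is a leading
    // substring of at least one URI in usedUris (assumed semantics).
    static Map<String, String> pruneUnusedPrefixes(Map<String, String> prefixes,
                                                   List<String> usedUris) {
        Map<String, String> kept = new LinkedHashMap<>();
        for (Map.Entry<String, String> entry : prefixes.entrySet()) {
            String namespace = entry.getValue();
            boolean used = usedUris.stream().anyMatch(uri -> uri.startsWith(namespace));
            if (used) {
                kept.put(entry.getKey(), namespace);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        Map<String, String> prefixes = new LinkedHashMap<>();
        prefixes.put("rdfs", "http://www.w3.org/2000/01/rdf-schema#");
        prefixes.put("foaf", "http://xmlns.com/foaf/0.1/");
        prefixes.put("ex", "http://example.org/");

        List<String> usedUris = List.of(
                "http://example.org/Alice",
                "http://www.w3.org/2000/01/rdf-schema#label");

        // The foaf mapping is dropped because no URI starts with its namespace.
        System.out.println(pruneUnusedPrefixes(prefixes, usedUris).keySet());
    }
}
```

Pruning prefixes this way would keep a serialized resource description short, which may be the motivation for the includePrefixesInModel and removePrefixDefition flags, though the page itself does not say so.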
Field Details
LOGGER
private static final org.slf4j.Logger LOGGER
MULTIPLE_WHITESPACE
private static final Pattern MULTIPLE_WHITESPACE
removeNewlines
protected boolean removeNewlines
removeMultipleSpaces
protected boolean removeMultipleSpaces
useLabelInsteadOfResource
protected boolean useLabelInsteadOfResource
includePrefixesInModel
protected boolean includePrefixesInModel
removePrefixDefition
protected boolean removePrefixDefition
serializationFormat
protected org.apache.jena.riot.RDFFormat serializationFormat
The serialization format to use; more info at the corresponding Jena page.
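LABEL_PROP
private static final Set<org.apache.jena.rdf.model.Property> LABEL_PROP
labelExtractor
private static final TextExtractorOnlyLabel labelExtractor
noPrefixLang
private static Set<org.apache.jena.riot.Lang> noPrefixLang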
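The page does not document what the removeNewlines and removeMultipleSpaces flags do to the serialized text. The following is a minimal sketch of one plausible reading: newlines are replaced by spaces, and runs of whitespace are collapsed with a precompiled pattern. The class WhitespaceSketch, the method normalize, the regular expression, and the final trim are all assumptions, not the MELT source.

```java
import java.util.regex.Pattern;

// Hypothetical illustration; NOT the MELT implementation.
final class WhitespaceSketch {

    // Assumed definition: two or more consecutive whitespace characters.
    private static final Pattern MULTIPLE_WHITESPACE = Pattern.compile("\\s{2,}");

    static String normalize(String text, boolean removeNewlines, boolean removeMultipleSpaces) {
        String result = text;
        if (removeNewlines) {
            // Turn every line break into a single space.
            result = result.replace("\r\n", " ").replace('\n', ' ').replace('\r', ' ');
        }
        if (removeMultipleSpaces) {
            // Collapse each run of whitespace into one space.
            result = MULTIPLE_WHITESPACE.matcher(result).replaceAll(" ");
        }
        return result.trim();
    }

    public static void main(String[] args) {
        String turtle = "ex:a  a ex:B ;\n    rdfs:label \"b\" .";
        System.out.println(normalize(turtle, true, true));
    }
}
```

Applied to a multi-line Turtle snippet, both flags together yield a single-line string, which is often more convenient as input to text-based matchers.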
Constructor Details
TextExtractorResourceDescriptionInRDF
public TextExtractorResourceDescriptionInRDF(boolean useLabelInsteadOfResource, org.apache.jena.riot.RDFFormat serializationFormat)
TextExtractorResourceDescriptionInRDF
public TextExtractorResourceDescriptionInRDF(boolean useLabelInsteadOfResource)
TextExtractorResourceDescriptionInRDF
public TextExtractorResourceDescriptionInRDF()
Method Details
extract
public Set<String> extract(org.apache.jena.rdf.model.Resource r)
Description copied from interface: TextExtractor
Given a Jena resource, this method extracts textual/string representations from it.
Parameters:
r - the Jena resource, which also allows traversing the whole RDF graph
Returns:
a set of textual representations of the given resource
getModelWithResource
protected org.apache.jena.rdf.model.Model getModelWithResource(org.apache.jena.rdf.model.Resource r)
getModelWithLabel
protected org.apache.jena.rdf.model.Model getModelWithLabel(org.apache.jena.rdf.model.Resource r)
removeUnusedPrefixes
private org.apache.jena.rdf.model.Model removeUnusedPrefixes(org.apache.jena.rdf.model.Model m)
getPrefix
createEmptyModel
private org.apache.jena.rdf.model.Model createEmptyModel(org.apache.jena.rdf.model.Resource r)
isRemoveNewlines
public boolean isRemoveNewlines()
setRemoveNewlines
isUseLabelInsteadOfResource
public boolean isUseLabelInsteadOfResource()
setUseLabelInsteadOfResource
public TextExtractorResourceDescriptionInRDF setUseLabelInsteadOfResource(boolean useLabelInsteadOfResource)
isIncludePrefixesInModel
public boolean isIncludePrefixesInModel()
setIncludePrefixesInModel
public TextExtractorResourceDescriptionInRDF setIncludePrefixesInModel(boolean includePrefixesInModel)
isRemovePrefixDefition
public boolean isRemovePrefixDefition()
setRemovePrefixDefition
getSerializationFormat
public org.apache.jena.riot.RDFFormat getSerializationFormat()
setSerializationFormat
public TextExtractorResourceDescriptionInRDF setSerializationFormat(org.apache.jena.riot.RDFFormat serializationFormat)
isRemoveMultipleSpaces
public boolean isRemoveMultipleSpaces()
setRemoveMultipleSpaces
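The signatures above declare setUseLabelInsteadOfResource, setIncludePrefixesInModel and setSerializationFormat as returning TextExtractorResourceDescriptionInRDF, which suggests the setters are meant to be chained. The sketch below shows that pattern with a hypothetical stand-in class, so it compiles without MELT or Jena on the classpath. ExtractorOptionsSketch is not a MELT class, and the String field merely stands in for org.apache.jena.riot.RDFFormat.

```java
// Hypothetical stand-in mirroring three setter signatures from this page;
// NOT the MELT class.
final class ExtractorOptionsSketch {
    private boolean useLabelInsteadOfResource;
    private boolean includePrefixesInModel;
    private String serializationFormat; // stand-in for org.apache.jena.riot.RDFFormat

    ExtractorOptionsSketch setUseLabelInsteadOfResource(boolean useLabelInsteadOfResource) {
        this.useLabelInsteadOfResource = useLabelInsteadOfResource;
        return this; // returning the receiver is what enables chaining
    }

    ExtractorOptionsSketch setIncludePrefixesInModel(boolean includePrefixesInModel) {
        this.includePrefixesInModel = includePrefixesInModel;
        return this;
    }

    ExtractorOptionsSketch setSerializationFormat(String serializationFormat) {
        this.serializationFormat = serializationFormat;
        return this;
    }

    boolean isUseLabelInsteadOfResource() { return useLabelInsteadOfResource; }
    boolean isIncludePrefixesInModel() { return includePrefixesInModel; }
    String getSerializationFormat() { return serializationFormat; }

    public static void main(String[] args) {
        // Configure everything in a single expression.
        ExtractorOptionsSketch options = new ExtractorOptionsSketch()
                .setUseLabelInsteadOfResource(true)
                .setIncludePrefixesInModel(false)
                .setSerializationFormat("TURTLE");
        System.out.println(options.isUseLabelInsteadOfResource()
                + " " + options.getSerializationFormat());
    }
}
```

With the real class, the same chain would presumably start from one of the constructors listed under Constructor Details and pass an RDFFormat constant instead of a string.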