Class DatasetIDExtractorUrlPattern
java.lang.Object
de.uni_mannheim.informatik.dws.melt.matching_base.multisource.DatasetIDExtractorUrlPattern
- All Implemented Interfaces:
DatasetIDExtractor
Extracts the dataset id given a URL pattern which is currently a prefix and infix.
All information between prefix (which has to be at the beginning of the string/url) and postfix are extracted as a dataset ID.
-
Field Summary
Fields inherited from interface de.uni_mannheim.informatik.dws.melt.matching_base.multisource.DatasetIDExtractor
CONFERENCE_TRACK_EXTRACTOR, KG_TRACK_EXTRACTOR, LARGE_BIO_TRACK_EXTRACTOR
-
Constructor Summary
-
Method Summary
Modifier and TypeMethodDescriptiongetDatasetID
(String uri) Extracts from an URI the corresponding source / dataset identifier (which needs to be included in the URI like a specific domain etc).
-
Field Details
-
prefix
-
prefixLength
protected int prefixLength -
infix
-
postProcessing
-
-
Constructor Details
-
DatasetIDExtractorUrlPattern
-
-
Method Details
-
getDatasetID
Description copied from interface:DatasetIDExtractor
Extracts from an URI the corresponding source / dataset identifier (which needs to be included in the URI like a specific domain etc).- Specified by:
getDatasetID
in interfaceDatasetIDExtractor
- Parameters:
uri
- the uri which should contain any dataset specific component which is extracted.- Returns:
- the dataset identifier as string
-