Class DatasetIDExtractorUrlPattern

java.lang.Object
de.uni_mannheim.informatik.dws.melt.matching_base.multisource.DatasetIDExtractorUrlPattern
All Implemented Interfaces:
DatasetIDExtractor

public class DatasetIDExtractorUrlPattern extends Object implements DatasetIDExtractor
Extracts the dataset ID from a URL using a pattern that currently consists of a prefix and an infix. Everything between the prefix (which must be at the beginning of the string/URL) and the infix is extracted as the dataset ID.
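
The extraction described above can be sketched as follows. This is a minimal illustration, not the MELT implementation: the class name UrlPatternSketch, its constructor signature, the example.org URLs, and the choice to return an empty string when the pattern does not match are all assumptions made for this example. Only the field names (prefix, prefixLength, infix, postProcessing) are taken from this page.

```java
import java.util.function.Function;

// Hypothetical stand-in for illustration; not the actual MELT class.
class UrlPatternSketch {
    private final String prefix;
    private final int prefixLength;
    private final String infix;
    private final Function<String, String> postProcessing;

    // Assumed constructor shape; the real constructors are not shown on this page.
    UrlPatternSketch(String prefix, String infix, Function<String, String> postProcessing) {
        this.prefix = prefix;
        this.prefixLength = prefix.length();
        this.infix = infix;
        this.postProcessing = postProcessing;
    }

    String getDatasetID(String uri) {
        // The prefix has to be at the very beginning of the URI.
        if (uri == null || !uri.startsWith(prefix)) {
            return ""; // assumption: no match yields an empty ID
        }
        // Look for the first occurrence of the infix after the prefix.
        int end = uri.indexOf(infix, prefixLength);
        if (end < 0) {
            return ""; // assumption: a missing infix also yields an empty ID
        }
        // Everything between prefix and infix is the raw dataset ID.
        return postProcessing.apply(uri.substring(prefixLength, end));
    }

    public static void main(String[] args) {
        UrlPatternSketch extractor =
                new UrlPatternSketch("http://example.org/", "/", s -> s.toLowerCase());
        // Prints "dataseta"
        System.out.println(extractor.getDatasetID("http://example.org/DatasetA/resource/Item1"));
    }
}
```

Caching the prefix length once, as the prefixLength field suggests, avoids recomputing it for every URI; the postProcessing function gives callers a hook to normalize the extracted ID (here, lowercasing).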
  • Field Details

    • prefix

      protected String prefix
    • prefixLength

      protected int prefixLength
    • infix

      protected String infix
    • postProcessing

      protected Function<String,String> postProcessing
  • Constructor Details

  • Method Details

    • getDatasetID

      public String getDatasetID(String uri)
      Description copied from interface: DatasetIDExtractor
      Extracts the corresponding source / dataset identifier from a URI. The identifier must be part of the URI, for example as a specific domain.
      Specified by:
      getDatasetID in interface DatasetIDExtractor
      Parameters:
      uri - the URI, which should contain a dataset-specific component that is extracted.
      Returns:
      the dataset identifier as string