java.lang.Object
de.uni_mannheim.informatik.dws.melt.matching_jena_matchers.multisource.dispatchers.DatasetIDExtractorHelperJena

public class DatasetIDExtractorHelperJena extends Object
Extracts the dataset id from a whole model based on sampling some resources.
  • Constructor Details

    • DatasetIDExtractorHelperJena

      public DatasetIDExtractorHelperJena()
  • Method Details

    • getDatasetIDFromModel

      public static String getDatasetIDFromModel(org.apache.jena.rdf.model.Model model, DatasetIDExtractor extractor)
      Extracts the dataset id by iterating over all resources and return the most appearing dataset id.
      Parameters:
      model - the model
      extractor - the id extractor to use
      Returns:
      the most appearing dataset id.
    • getDatasetIDFromModelbySampling

      public static String getDatasetIDFromModelbySampling(org.apache.jena.rdf.model.Model model, DatasetIDExtractor extractor)
      Extracts the dataset id by sample 20 resources and return the most appearing dataset id.
      Parameters:
      model - the model
      extractor - the id extractor to use
      Returns:
      the most appearing dataset id.
    • getDatasetIDFromModelbySampling

      public static String getDatasetIDFromModelbySampling(org.apache.jena.rdf.model.Model model, DatasetIDExtractor extractor, int numSamples)
      Extracts the dataset id by sample resources and return the most appearing dataset id.
      Parameters:
      model - the model
      extractor - the id extractor to use
      numSamples - the number of samples to use
      Returns:
      the most appearing dataset id.
    • getDatasetIDFromModelbySampling

      public static String getDatasetIDFromModelbySampling(org.apache.jena.rdf.model.Model model, DatasetIDExtractor extractor, int numSamples, Set<String> excludedDatasetIDs)
      Extracts the dataset id by sample resources and return the most appearing dataset id.
      Parameters:
      model - the model
      extractor - the id extractor to use
      numSamples - the number of samples to use
      excludedDatasetIDs - dataset ids to exclude during counting
      Returns:
      the most appearing dataset id.