Introduction

  • Data integration involves combining data residing in different sources and providing user with a unified view of these data
  • Data integration over the Web can be implemented as follows:
    • Export the data sets to be integrated as RDF graphs
    • Merge identical resources (i.e. resources having the same URI) from different data sets
    • Start making queries on the integrated data, queries that were not possible on the individual data sets.

Export first data set as RDF graph

  • For example the following RDF graph contains information about book “The Glass Palace” by Amitav Ghosh

Export second data set as RDF graph

  • Information about the same book but in French this time is modeled in RDF graph below

Merge identical resources from different data sets

    • Merge identical resources (i.e. resources having the same URI) from different data sets

    Merge identical resources from different data sets

    • Merge identical resources (i.e. resources having the same URI) from different data sets

    Start making queries on the integrated data

    • A user of the second dataset may ask queries like: “give me the title of the original book”
    • This information is not in the second dataset 
    • This information can be however retrieved from the integrated dataset, in which the second dataset was connected with the the first dataset