|Title||:||Linked Open Data (LOD) Enrichment: State-of-the-Art and Future Scope|
|Speaker||:||Subhashree S (IITM)|
|Details||:||Tue, 26 Jul, 2016 3:30 PM @ BSB 361|
|Abstract:||:||Research Proposal Seminar: The Linked Data initiative aims to provide a set of guidelines and best practices for publishing structured data and associating it with other resources. The Linking Open Data community project works with the main objective of publishing open datasets as RDF triples and establish RDF links between objects from different datasets and has the potential of complementing the world wide web with a data space of entities connected to one another with labelled edges. Many organizations have built systems to exploit the power of Linked Data for specific purposes. For example, the British Broadcasting Corporation (BBC) uses linked datasets such as DBpedia and Musicbrainz to enable cross-domain navigation and enhanced search in their websites. IBM has been using Linked Data as an integration technology for several years and their new cognitive system, Watson has DBpedia and YAGO as part of its major data sources.
Though Linked Data can prove to be an influential technology in such scenarios, providing enormous benefits, its scope will be limited to a large extent unless it is fully grown and updated. For example, in the case mentioned above, Watson may not be able to answer questions about Greek mythology if it does not have information about personalities and facts from the Greek mythology represented as Linked Data. Hence, there arises a need to keep enriching the Linked Open Data cloud in all aspects, i.e, more domains need to be covered, and more entities, concepts and links between them are required. Due to the rapid growth of the Open Data and Linked Data movements, a large amount of data is being made available as Linked Data and this has helped fulfill the domain coverage part of the Linked Data growth, stated above. However, often these data sources remain isolated, lacking schemata and links to other datasets.
Hence, enriching the existing datasets with respect to the various dimensions becomes a mandatory prerequisite for the wider application and usage of the linked datasets. In this seminar, we will cover the relevant works in the literature, identify the gaps and propose interesting issues for future research in this area.