×

A distributional structured semantic space for querying RDF graph data. (English) Zbl 1250.68251

Summary: The vision of creating a linked data web brings together the challenge of allowing queries across highly heterogeneous and distributed datasets. In order to query linked data on the Web today, end users need to be aware of which datasets potentially contain the data and also which data model describes these datasets. The process of allowing users to expressively query relationships in RDF while abstracting them from the underlying data model represents a fundamental problem for Web-scale linked data consumption. This article introduces a distributional structured semantic space which enables data model independent natural language queries over RDF data. The center of the approach relies on the use of a distributional semantic model to address the level of semantic interpretation demanded to build the data model independent approach. The article analyzes the geometric aspects of the proposed space, providing its description as a distributional structured vector space, which is built upon the generalized vector space model (GVSM). The final semantic space proved to be flexible and precise under real-world query conditions achieving mean reciprocal rank = 0.516, avg. precision = 0.482 and avg. recall = 0.491.

MSC:

68T30 Knowledge representation
68T50 Natural language processing
68P10 Searching and sorting
Full Text: DOI

References:

[1] DOI: 10.1145/32206.32212 · doi:10.1145/32206.32212
[2] Sahlgren M., Rivista di Linguistica 20
[3] Turney P. D., Journal of Artificial Intelligence Research 37 pp 141– · Zbl 0823.68092
[4] DOI: 10.1162/coli.2007.33.2.161 · Zbl 1234.68431 · doi:10.1162/coli.2007.33.2.161
[5] DOI: 10.2307/2310058 · Zbl 0080.00702 · doi:10.2307/2310058
[6] DOI: 10.1016/j.websem.2009.08.001 · doi:10.1016/j.websem.2009.08.001
[7] Baeza-Yates R., Modern Information Retrieval (1999)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.