Coupling semantic and statistical techniques for dynamically enriching web ontologies

409 Accesses
3 Citations
7 Altmetric
Explore all metrics

Abstract

With the development of the Semantic Web technology, the use of ontologies to store and retrieve information covering several domains has increased. However, very few ontologies are able to cope with the ever-growing need of frequently updated semantic information or specific user requirements in specialized domains. As a result, a critical issue is related to the unavailability of relational information between concepts, also coined missing background knowledge. One solution to address this issue relies on the manual enrichment of ontologies by domain experts which is however a time consuming and costly process, hence the need for dynamic ontology enrichment. In this paper we present an automatic coupled statistical/semantic framework for dynamically enriching large-scale generic ontologies from the World Wide Web. Using the massive amount of information encoded in texts on the Web as a corpus, missing background knowledge can therefore be discovered through a combination of semantic relatedness measures and pattern acquisition techniques and subsequently exploited. The benefits of our approach are: (i) proposing the dynamic enrichment of large-scale generic ontologies with missing background knowledge, and thus, enabling the reuse of such knowledge, (ii) dealing with the issue of costly ontological manual enrichment by domain experts. Experimental results in a precision-based evaluation setting demonstrate the effectiveness of the proposed techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Artificial Intelligence

Notes

References

Buitelaar, P., et al. (2003). A multi-layered, XML-based approach to the integration of linguistic and semantic annotations. In Proceedings of EACL 2003 workshop on language technology and the Semantic Web. Budapest, Hungary
Cilibrasi, R.L., & Vitanyi, P.M.B. (2007). The Google similarity distance. IEEE Transactions on Knowledge and Data Engineering, 19(3), 370–383.
Article Google Scholar
Cimiano, P., Hotho, A., Staab, S. (2005). Learning concept hierarchies from text corpora using formal concept analysis. Journal of Artificial Intelligence Research, 24(1), 305–339.
MATH Google Scholar
Croft, W.B., Metzler, D., Strohman, T. (2010). Search engines information retrieval in practice. Addison Wesley.
Cunningham, H., et al. (2002). GATE: A framework and graphical development environment for robust NLP tools and applications. In 40th anniversary meeting of the association for computational linguistics. Phil, USA.
Faure, D., & Poibeau, T. (2000). First experiments of using semantic knowledge learned by ASIUM for information extraction task using INTEX. In Proceedings of ECAI workshop on ontology learning.
Giunchiglia, F., Shvaiko, P., Yatskevich, M. (2004). S-Match: An algorithm and an implementation of semantic matching. In C. Bussler, et al. (Eds.), The Semantic Web: Research and applications (pp. 61–75). Berlin/Heidelberg: Springer.
Chapter Google Scholar
Hahn, U., & Marko, K. (2002). Ontology and lexicon evolution by text understanding. In Proceedings of the ECAI 2002 workshop on machine learning and natural language processing for ontology engineering (OLT’2002).
Hearst, M.A. (1992). Automatic acquisition of hyponyms from large text corpora. In Proc. of COLING.
Maedche, A., & Staab, S. (2000). Discovering conceptual relations from text. In 14th European conference on artificial intelligence (ECAI’2000). Berlin, Germany.
Maedche, A., & Staab, S. (2001). Ontology learning for the Semantic Web. IEEE Intelligent Systems, 16(2), 72–79.
Article Google Scholar
Monachesi, P., & Markus, T. (2010). Using social media for ontology enrichment. In L. Aroyo, et al. (Eds.), The Semantic Web: Research and applications (pp. 166–180). Berlin/Heidelberg: Springer.
Chapter Google Scholar
Ruiz-Casado, M., Alfonseca, E., Castells, P. (2007). Automatising the learning of lexical patterns: an application to the enrichment of WordNet by extracting semantic relationships from Wikipedia. Data & Knowledge Engineering, 61(3), 484–499.
Article Google Scholar
Schutz, A., & Buitelaar, P. (2005). RelExt: A tool for relation extraction from text in ontology extension the Semantic Web. In Y. Gil, et al. (Eds.), ISWC 2005 (pp. 593–606).
Shamsfard, M., & Barforoush, A.A. (2003). The state of the art in ontology learning: a framework for comparison. Knowledge Engineering Review, 18(4), 293–316.
Article Google Scholar
Trojahn, C., et al. (2008). A cooperative approach for composite ontology mapping. In S. Spaccapietra (Ed.), Journal on data semantics X (pp. 237–263). Berlin/Heidelberg: Springer.
Chapter Google Scholar
Velardi, P., Fabriani, P., Missikoff, M. (2001). Using text processing techniques to automatically enrich a domain ontology. In Proceedings of the international conference on formal ontology in information systems (Vol. 2001). Ogunquit, Maine, U.S.A.: ACM.
Google Scholar
Wang, Q., Gauch, S., Luong, H. (2010). Ontology concept enrichment via text mining. In IADIS international conference on internet technologies & society (pp. 147–154).
Yamaguchi, T. (2001). Acquiring conceptual relations from domain-specific texts. In Proceedings of the IJCAI 2001, second workshop on ontology learning (OL’2001).

Download references

Author information

Authors and Affiliations

Dimensions Consulting Company, Ramallah, Palestine
Mohammed Maree
Faculty of Computer Science, University of Lyon, Lyon, France
Mohammed Belkhatir

Authors

Mohammed Maree
View author publications
You can also search for this author in PubMed Google Scholar
Mohammed Belkhatir
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mohammed Belkhatir.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Maree, M., Belkhatir, M. Coupling semantic and statistical techniques for dynamically enriching web ontologies. J Intell Inf Syst 40, 455–478 (2013). https://doi.org/10.1007/s10844-012-0233-4

Download citation

Received: 16 September 2011
Revised: 27 November 2012
Accepted: 06 December 2012
Published: 18 January 2013
Issue Date: June 2013
DOI: https://doi.org/10.1007/s10844-012-0233-4

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Statistical Knowledge Patterns: Identifying Synonymous Relations in Large Linked Datasets

Utilizing Big Data Analytics for Automatic Building of Language-agnostic Semantic Knowledge Bases

A statistically-based ontology matching tool

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Coupling semantic and statistical techniques for dynamically enriching web ontologies

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Statistical Knowledge Patterns: Identifying Synonymous Relations in Large Linked Datasets

Utilizing Big Data Analytics for Automatic Building of Language-agnostic Semantic Knowledge Bases

A statistically-based ontology matching tool

Explore related subjects

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation