research-article

LODFlow: a workflow management system for linked data processing

Published: 16 September 2015

Abstract

The extraction and maintenance of Linked Data datasets is a cumbersome, time-consuming, and resource-intensive activity. The cost of producing Linked Data can be reduced by a workflow management system, which describes plans to systematically support the lifecycle of RDF datasets. We present the LODFlow Linked Data Workflow Management System, which provides an environment for planning, executing, reusing, and documenting Linked Data workflows. The LODFlow approach is based on a comprehensive knowledge model for describing the workflows and a workflow execution engine supporting systematic workflow execution, reporting, and exception handling. The environment was evaluated in a large-scale real-world use case. As a result, LODFlow helps Linked Data engineers systematically plan, execute, and assess Linked Data production and maintenance workflows, thus improving efficiency, ease of use, reproducibility, reusability, and provenance.
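The idea of a workflow plan run by an engine that reports on each step and handles exceptions can be illustrated with a minimal sketch. This is not LODFlow's implementation or API: the names `Step`, `Report`, and `run_workflow`, the two sample steps, and the sample triple are all hypothetical, chosen only to show the general pattern.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Step:
    """One unit of work in a workflow plan (hypothetical structure)."""
    name: str
    action: Callable[[Dict], Dict]  # reads the shared context, returns updates


@dataclass
class Report:
    """Execution record: which steps succeeded, which failed and why."""
    succeeded: List[str] = field(default_factory=list)
    failed: Dict[str, str] = field(default_factory=dict)


def run_workflow(steps: List[Step], context: Dict) -> Report:
    """Run steps in order; stop at the first failure and record it."""
    report = Report()
    for step in steps:
        try:
            context.update(step.action(context))
            report.succeeded.append(step.name)
        except Exception as exc:  # exception handling: record and halt
            report.failed[step.name] = f"{type(exc).__name__}: {exc}"
            break
    return report


def extract(ctx: Dict) -> Dict:
    # Stand-in for an extraction step; the triple is made-up sample data.
    return {"triples": [("ex:a", "ex:knows", "ex:b")]}


def validate(ctx: Dict) -> Dict:
    # Stand-in for a quality check that fails when nothing was extracted.
    if not ctx.get("triples"):
        raise ValueError("no triples to validate")
    return {"valid": True}


plan = [Step("extract", extract), Step("validate", validate)]
report = run_workflow(plan, {})
print(report.succeeded, report.failed)
```

Because the plan is plain data, the same list of steps can be rerun or reused with a different starting context, and the report gives a simple record of what was executed.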


Published In

SEMANTICS '15: Proceedings of the 11th International Conference on Semantic Systems
September 2015
220 pages
ISBN: 9781450334624
DOI: 10.1145/2814864

Publisher

Association for Computing Machinery

New York, NY, United States


Author Tags

  1. linked data
  2. linked data workflow management system
  3. workflow management

Qualifiers

  • Research-article

Funding Sources

  • FP7 -- GeoKnow project
  • Brazilian Federal Agency for the Support and Evaluation of Graduate Education (CAPES/Brazil)
  • BMWi -- project SAKE

Conference

SEMANTiCS '15

Acceptance Rates

SEMANTiCS '15 paper acceptance rate: 22 of 97 submissions (23%)
Overall acceptance rate: 40 of 182 submissions (22%)


Cited By

  • (2022) Components.js: Semantic dependency injection. Semantic Web 14:1 (135-153). DOI: 10.3233/SW-222945. Online publication date: 30-Nov-2022
  • (2021) Sec4ML: An approach to support Cybersecurity Data Publishing for Machine Learning tasks. 2021 IEEE 25th International Enterprise Distributed Object Computing Workshop (EDOCW) (226-235). DOI: 10.1109/EDOCW52865.2021.00053. Online publication date: Oct-2021
  • (2019) The Linked Data Wiki: Leveraging Organizational Knowledge Bases with Linked Open Data. Primate Life Histories, Sex Roles, and Adaptability (294-319). DOI: 10.1007/978-3-030-15640-4_15. Online publication date: 15-Mar-2019
  • (2018) UnifiedViews. Semantic Web 9:5 (661-676). DOI: 10.3233/SW-180291. Online publication date: 1-Jan-2018
  • (2017) Linked data processing provenance. Proceedings of the International Conference on Web Intelligence (88-96). DOI: 10.1145/3106426.3106495. Online publication date: 23-Aug-2017
