Abstract
In the context of contemporary data, the processing of information is crucial. This paper proposes an extension to the traditional database relational algebra, which enriches the data model and provides additional complex-data operations. Specifically, we focus on analytical operators from the areas of data mining and similarity search, such as frequent pattern mining or similarity search queries. The proposed approach can be easily extended by additional algebraic operators. To demonstrate the capabilities of our analytical algebra, we show three practical use cases with different levels of the expression complexity.
This research was supported by ERDF “CyberSecurity, CyberCrime and Critical Information Infrastructures Center of Excellence” (No. CZ.02.1.01/0.0/0.0/16_019/0000822).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Adali, S., Bonatti, P., Sapino, M.L., Subrahmanian, V.: A multi-similarity algebra. In: Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data, pp. 402–413 (1998)
Agrawal, R., Imieliński, T., Swami, A.: Mining association rules between sets of items in large databases. In: Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, pp. 207–216 (1993)
Agrawal, R., Srikant, R.: Mining sequential patterns. In: Proceedings of the Eleventh International Conference on Data Engineering, pp. 3–14. IEEE (1995)
Ayres, J., Flannick, J., Gehrke, J., Yiu, T.: Sequential pattern mining using a bitmap representation. In: Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 429–435 (2002)
Budikova, P., Batko, M., Zezula, P.: Query language for complex similarity queries. In: Morzy, T., Härder, T., Wrembel, R. (eds.) ADBIS 2012. LNCS, vol. 7503, pp. 85–98. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33074-2_7
Calders, T., Lakshmanan, L.V., Ng, R.T., Paredaens, J.: Expressive power of an algebra for data mining. ACM Trans. Database Syst. (TODS) 31(4), 1169–1214 (2006)
Dohnal, V., Gennaro, C., Zezula, P.: Similarity join in metric spaces using eD-index. In: Mařík, V., Retschitzegger, W., Štěpánková, O. (eds.) DEXA 2003. LNCS, vol. 2736, pp. 484–493. Springer, Heidelberg (2003). https://doi.org/10.1007/978-3-540-45227-0_48
Han, J., et al.: PrefixSpan: mining sequential patterns efficiently by prefix-projected pattern growth. In: Proceedings of the 17th International Conference on Data Engineering, pp. 215–224. Citeseer (2001)
Heaton, J.: Comparing dataset characteristics that favor the Apriori, Eclat or FP-growth frequent itemset mining algorithms. In: SoutheastCon 2016, pp. 1–7. IEEE (2016)
Johnson, T., Lakshmanan, L.V., Ng, R.T.: The 3W model and algebra for unified data mining. In: VLDB, pp. 21–32 (2000)
Peschel, J., Batko, M., Valcik, J., Sedmidubsky, J., Zezula, P.: FIMSIM: discovering communities by frequent item-set mining and similarity search. In: Reyes, N., et al. (eds.) SISAP 2021. LNCS, vol. 13058, pp. 372–383. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-89657-7_28
Peschel, J., Batko, M., Zezula, P.: Algebra for complex analysis of data. In: Hartmann, S., Küng, J., Kotsis, G., Tjoa, A.M., Khalil, I. (eds.) DEXA 2020, Part I. LNCS, vol. 12391, pp. 177–187. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-59003-1_12
Strobl, C.: PostGIS. In: Shekhar, S., Xiong, H. (eds.) Encyclopedia of GIS, pp. 891–898. Springer, Boston (2008). https://doi.org/10.1007/978-0-387-35973-1_1012
Zezula, P., Amato, G., Dohnal, V., Batko, M.: Similarity Search: The Metric Space Approach, vol. 32. Springer Science, New York (2006). https://doi.org/10.1007/0-387-29151-2
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Peschel, J., Batko, M., Zezula, P. (2022). Analytical Algebra: Extension of Relational Algebra. In: Strauss, C., Cuzzocrea, A., Kotsis, G., Tjoa, A.M., Khalil, I. (eds) Database and Expert Systems Applications. DEXA 2022. Lecture Notes in Computer Science, vol 13427. Springer, Cham. https://doi.org/10.1007/978-3-031-12426-6_25
Download citation
DOI: https://doi.org/10.1007/978-3-031-12426-6_25
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-12425-9
Online ISBN: 978-3-031-12426-6
eBook Packages: Computer ScienceComputer Science (R0)