Journal of Digital Information Management


Vol No. 19 ,Issue No. 1 2021

TSV2RDF: Generating RDF Data Model from TSV File Format Using Semantic Web Technologies
Mammadov Hasan, Yan Li, Muhammad Waqas Ahmad
College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China & College of Electrical and Mechanical Engineering National University of Sciences and Technology, Islamabad, Pakistan
Abstract: The Semantic Web empowers machines to better understand the information by associating precise meanings with the data. Resource Description Framework (RDF) lies among basic building blocks for the Semantic Web, to give formal definitions to the data. However humongous amount of majority governments and official data exist in various legacy file formats, generally in the tabular form such as tab-separated values (TSV) or relational databases. Nevertheless, there existed no built-in or standardized implementation to convert TSV to RDF using Semantic Web technologies. This paper focused on deliberation to define and implement a novel mechanism for data conversion. The proposed technique involved in conversion of TSV data to RDF format based on RDF Mapping Language (RML)with direct mapping technique. RML is the RDF mapping language, which is meant to define mapping functions from heterogeneous file formats to RDF data structure. In this paper, an RML and direct mapping based RDF generation software TSV2RDF has been created. TSV2RDF has been evaluated at various benchmark TSV datasets. The proposed conversion method has produced comparative results with increased accuracy and reduced processing time. The performance of developed software exhibits significant performance compared with other datasets. Significant usability at TSV to RDF transformation is suggested by smart adoption of the TSV2RDF.
Keywords: RDF, TSV, RML, Relational to RDF Mapping Language (R2RML), Semantic Web TSV2RDF: Generating RDF Data Model from TSV File Format Using Semantic Web Technologies
DOI:https://doi.org/10.6025/jdim/2021/19/1/10-26
Full_Text   PDF 4.13 MB   Download:   13  times
References:

[1] Wagner, A., Bonduel, M., Pauwels, P., Rüppel, U. (2020). Representing construction-related geometry in a semantic web context: A review of approaches. Automation in Construction, 115, 103130.
[2] Dadkhah, M., Araban, S., Paydar, S. (2020). A systematic literature review on semantic web enabled software testing. Journal of Systems and Software, 162, 110485.
[3] Banane, M., Belangour, A., El Houssine, L. (2017, October). Storing RDF data into big data NoSQL databases. In: First International Conference on Real Time Intelligent Systems (p. 69-78). Springer, Cham.
[4] Faqir, A., Mahmood, A., Qazi, K., Malik, S. (2019, November). An Approach to Map Geography Mark-up Language Data to Resource Description Framework Schema. In: International Conference on Intelligent Technologies and Applications (p. 343-354). Springer, Singapore.
[5] Elbashir, M. K., Aboelhassan, M. A. (2018). An Algorithm for Mapping Relational Database to Resource Description Framework. Gezira Journal of Engineering and Applied Sciences, 11 (1).
[6] de Paula, G. C., de Farias, C. R. (2020). A competency question-oriented approach for the transformation of semi-structured bioinformatics data into linked open data. Engineering Applications of Artificial Intelligence, 90, 103495.
[7] Lefrançois, M., Zimmermann, A., Bakerally, N. (2017, May). A SPARQL extension for generating RDF from heterogeneous formats. In: European Semantic Web Conference (pp. 35-50). Springer, Cham.
[8] Karr, J. R., Liebermeister, W., Goldberg, A. P., Sekar, J. A., & Shaikh, B. (2020). Structured spreadsheets with ObjTables enable data reuse and integration. arXiv preprint arXiv:2005.05227.
[9] Chiarcos, C., Ionov, M. (2019). Ligt: An LLOD-native vocabulary for representing interlinear glossed text as RDF. In: 2nd Conference on Language, Data and Knowledge (LDK 2019). Schloss Dagstuhl-Leibniz-ZentrumfuerInformatik.
[10] Dotsika, F. (2010). Semantic APIs: Scaling up towards the semantic web. International Journal of Information Management, 30 (4) 335-342.
[11] Miller, E. (1998). An introduction to the resource description framework. Bulletin of the American Society for Information Science and Technology, 25 (1) 15-19.
[12] Lefrançois, M., Zimmermann, A., Bakerally, N. (2016, November). Flexible RDF generation from RDF and heterogeneous data sources with SPARQL-Generate. In European Knowledge Acquisition Workshop (p. 131- 135). Springer, Cham.
[13] Chiarcos, C., Ionov, M., Glaser, L., Fäth, C. (2020). An ontology for CoNLL-RDF: Formal data structures for TSV formats in language technology.
[14] Barisevièius, G., Coste, M., Geleta, D., Juric, D., Khodadadi, M., Stoilos, G., Zaihrayeu, I. (2018, October). Supporting digital healthcare services using semantic web technologies. In: International Semantic Web Conference (p. 291-306). Springer, Cham.
[15] Wang, X., Zhang, X., Li, M. (2015). A survey on semantic sensor web: sensor ontology, mapping and query. International Journal of u-and e-Service, Science and Technology, 8 (10) 325-342.
[16] Schwarz, J., Terrenghi, N., Legner, C. (2017). Towards comparable business model concepts: resource description framework (RDF) schemas for semantic business model representations. In: Designing the Digital Transformation: DESRIST 2017 Research in Progress Proceedings of the 12th International Conference on Design Science Research in Information Systems and Technology. Karlsruhe, Germany. 30 May-1 Jun. (p. 101- 109). KarlsruherInstitutfürTechnologie (KIT).
[17] Lin, Z., Tripunitara, M. (2017, March). Graph Automorphism- Based, Semantics-Preserving Security for the Resource Description Framework (RDF). In: Proceedings of the Seventh ACM on Conference on Data and Application Security and Privacy (p. 337-348).
[18] Liu, J., Yang, M., Zhang, L., Zhou, W. (2019). An effective biomedical data migration tool from resource description framework to JSON. Database.
[19] Fang, H., Zhao, B., Zhang, X. W., Yang, X. X. (2019). A united framework for large-scale resource description framework stream processing. Journal of Computer Science and Technology, 34 (4) 762-774.
[20] Hadi, A. S., Ali, S. H. (2019). Resource Description Framework Representation for Transaction Log File. Journal of Computational and Theoretical Nanoscience, 16 (3) 1093-1099.
[21] Faheem, M., Sattar, H., Bajwa, I. S., Akbar, W. (2018, October). Relational database to resource description framework and its schema. In: International Conference on Intelligent Technologies and Applications (p. 604-617). Springer, Singapore.
[22] Mandal, K., Sen, T. (2018). U.S. Patent No. 10,042,619. Washington, DC: U.S. Patent and Trademark Office.
[23] Matsumoto, S., Yamanaka, R., Chiba, H. (2018). Mapping RDF graphs to property graphs. arXiv preprint arXiv:1812.01801.
[24] Lin, Z., Tripunitara, M. (2017, March). Graph Automorphism-Based, Semantics-Preserving Security for the Resource Description Framework (RDF). In: Proceedings of the Seventh ACM on Conference on Data and Application Security and Privacy (p. 337-348).
[25] Riaz, A., Bajwa, I. S., Ali, M. (2019, November). Automatic RDF, Metadata Generation from Legacy Software Models. In: International Conference on Intelligent Technologies and Applications (p. 385-397). Springer, Singapore.
[26] De Una, D., Rümmele, N., Gange, G., Schachte, P., & Stuckey, P. J. (2018, January). Machine Learning and Constraint Programming for Relational-To-Ontology Schema Mapping. In: IJCAI (Vol. 2018, p. 27th).
[27] Dingman, P. C., Bunton, W. G., Van Dyken, K. E., Yogman, L. T., Zhang, Y. (2018). U.S. Patent No. 10,127,250. Washington, DC: U.S. Patent and Trademark Office.
[28] Cate, B. T., Kolaitis, P. G., Qian, K., Tan, W. C. (2017). Approximation algorithms for schema-mapping discovery from data examples. ACM Transactions on Database Systems (TODS), 42 (2) 1-41.