Automatic Verification of Correspondence Between Teaching Resources and Executive Regulations in the Field of Design and Implementation of Concrete Buildings: a Text Mining Approach

Document Type : Research Article

Authors

1 Faculty of Engineering, Department of Civil Engineering and Construction Management, Ferdowsi University, Mashhad, Iran

2 Department of Civil Engineering, Ferdowsi University of Mashhad

Abstract

One of the challenges of higher education in editing university educational texts is to achieve the maximum compatibility of the content of educational resources with the instructions. Therefore, to reach an efficient educational system in line with industry needs, the appropriateness of the content of the educational resources with the regulations should be evaluated and revised if necessary. The need to review educational resources in the field of engineering and technology is important because these disciplines are needed in the application of industry and services in the country, and in fact, the training of expert and technical forces that can meet the technical needs of the country at different levels is the most important task of curricula in universities. This issue doubles the importance of paying attention to the teaching resources of these disciplines. This research seeks to extract keywords of "Iranian Concrete Regulations" and "reinforced concrete structures" which using three statistical approaches, linguistic knowledge, and graph-based approaches proposes a hybrid method. Then, the keywords of each document are visualized in a clustered network and analyzed. Comparing the results shows that the contents of these two documents are not completely similar. In fact, it can be said that the Regulations is an instructional document in which all the details have been addressed and all the issues surrounding concrete structures have been discussed. However the book is not satisfied with the design instructions and has examined the concepts related to the design of concrete structures in detail.

Keywords

Main Subjects


[1] J. Gantz, D. Reinsel, The digital universe in 2020: Big data, bigger digital shadows, and biggest   growth in the far east, IDC iView: IDC Analyze the future, 2007(2012) (2012) 1-16.
[2] R. Feldman, I. Dagan, Knowledge Discovery in Textual Databases (KDT), in:  KDD, 1995, pp. 112-117.
[3] M. Allahyari, S. Pouriyeh, M. Assefi, S. Safaei, E.D. Trippe, J.B. Gutierrez, K. Kochut, A brief survey of text mining: Classification, clustering and extraction techniques, arXiv preprint arXiv:1707.02919,  (2017).
[4] H. Alrasheed, Word synonym relationships for text analysis: A graph-based approach, Plos one, 16(7) (2021) e0255127.
[5] G. Miner, Practical text mining and statistical analysis for non-structured text data applications, Academic Press, 2012.
[6] S. Beliga, A. Meštrović, S. Martinčić-Ipšić, An overview of graph-based keyword extraction methods and approaches, Journal of information and organizational sciences, 39(1) (2015) 1-20.
[7] S. Siddiqi, A. Sharan, Keyword and keyphrase extraction techniques: a literature review, International Journal of Computer Applications, 109(2) (2015).
[8] L. Page, S. Brin, R. Motwani, T. Winograd, The pagerank citation ranking: Bring order to the web, Technical report, stanford University, 1998.
[9] A. Gazni, Automatic extraction of key phrases from Persian article texts, Librarianship and Information Science, 9(3) (2006) 95-106. (in persian)
[10] O. Hajipoor, s. sadidpour, Automatic keyword extraction of short Persian texts using word2vec, Electronic and Cyber Defense, 8(2) (2020) 105-114. (in persian)
[11] R. Campos, V. Mangaravite, A. Pasquali, A. Jorge, C. Nunes, A. Jatowt, YAKE! Keyword extraction from single documents using multiple local features, Information Sciences, 509 (2020) 257-289.
[12] B. Hejazi, J.A. Nasiri, Keywords Extraction from Persian Thesis Using Statistical Features and Bayesian Classification, Language Related Research, 12(6) (2022) 339-367.
[13] M. Azarafza, M.-R. Feizi-Derakhshi, M.B. Shendi, Textrank-based microblogs keyword extraction method for Persian language, in:  Conference: 3rd International Congress on Science and Engineering, Hamburg, Germany, 2020.
[14] Z. Zhou, X. Zou, X. Lv, J. Hu, Research on weighted complex network based keywords extraction, in:  Chinese Lexical Semantics: 14th Workshop, CLSW 2013, Zhengzhou, China, May 10-12, 2013. Revised Selected Papers 14, Springer, 2013, pp. 442-452.
[15] D.A. Vega-Oliveros, P.S. Gomes, E.E. Milios, L. Berton, A multi-centrality index for graph-based keyword extraction, Information Processing & Management, 56(6) (2019) 102063.
[16] S.K. Biswas, Keyword extraction from tweets using weighted graph, in:  Cognitive Informatics and Soft Computing: Proceeding of CISC 2017, Springer, 2019, pp. 475-483.
[17] A. Bellaachia, M. Al-Dhelaan, Ne-rank: A novel graph-based keyphrase extraction in twitter, in:  2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, IEEE, 2012, pp. 372-379.
[18] V. Kalra, R. Aggarwal, Importance of Text Data Preprocessing & Implementation in RapidMiner, ICITKM, 14 (2017) 71-75.
[19] G. Salton, C. Buckley, Term-weighting approaches in automatic text retrieval, Information processing & management, 24(5) (1988) 513-523.
[20] f. hafezi moghaddas, m. maghrebi, Using text mining techniques to analyze technical texts: A case study, content analysis of the American Concrete Code (ACI 318-08), in:  13th National Congress on Civil Engineering, 2022. (in persian)
[21] J. Sun, K. Lei, L. Cao, B. Zhong, Y. Wei, J. Li, Z. Yang, Text visualization for construction document information management, Automation in construction, 111 (2020) 103048.
[22] Z. Xie, Centrality measures in text mining: prediction of noun phrases that appear in abstracts, in:  Proceedings of the ACL student research workshop, 2005, pp. 103-108.
[23] M. Bastian, S. Heymann, M. Jacomy, Gephi: an open source software for exploring and manipulating networks, in:  Proceedings of the international AAAI conference on web and social media, 2009, pp. 361-362.