Evolution of IT, management and industrial engineering research: A topic model approach

Document Type : Article


1 School of Industrial Engineering, Iran University of Science and Technology (IUST), University Ave. Narmak, 16846-13114, Tehran, Iran

2 School of Computer Engineering, Iran University of Science and Technology (IUST), University Ave. Narmak, 16846-13114, Tehran, Iran


Information Technology (IT), Management and Industrial Engineering are correlated academic disciplines which their publications rose significantly over the last decades. The aim of this study is analyzing the research evolution, determining the important topics and areas and depiction the trend of interdisciplinary topics in these domains. To accomplish this, the text mining techniques are used and the combination of bibliographic analysis and topic modeling approach are applied on their publications in the WOS repository over the last 20 years. In the topic extraction process, a heuristic function was suggested to key extraction, and some new applicable criteria were defined to compare the topics. Moreover, a novel approach was proposed to determine the high-level category for each topic. The results determined the hot-important topics and incremented, decremented and fixed topics are identified. Subsequently, comparing the high-level research areas confirmed the strong scientific relationships between them. This study presents a deep knowledge about internal research evolution of domains and illustrates the effect of topics on each other over the past 20 years. Furthermore, the methodology of this study could be applied to determine the interdisciplinary topics and observe the research evolution of other academic domains.


[1]    Pfeiffer, Alice, "Close Link Between Engineering and Business Management," in The New York Times, ed: The New York Times Company, 2009.
[2]    Porter, Alan and Rafols, Ismael, "Is science becoming more interdisciplinary? Measuring and mapping six research fields over time," Scientometrics, vol. 81, pp. 719-745, 2009.
[3]    He, Wu and Xu, Lida, "A state-of-the-art survey of cloud manufacturing," International Journal of Computer Integrated Manufacturing, vol. 28, pp. 239-250, 2015.
[4]    Heilig, Leonard and Voß, Stefan, "A scientometric analysis of cloud computing literature," IEEE Transactions on Cloud Computing, vol. 2, pp. 266-278, 2014.
[5]    Jin, Jong Beom, Leem, Choon Seong, and Lee, Choong Hyun, "Research issues and trends in industrial productivity over 44 years," International Journal of Production Research, vol. 54, pp. 1273-1284, 2016.
[6]    Ronda-Pupo, Guillermo Armando, "Knowledge map of Latin American research on management: Trends and future advancement," Social Science Information, vol. 55, pp. 3-27, 2016.
[7]    Sedighi, Mehri and Jalalimanesh, Ammar, "Mapping research trends in the field of knowledge management," Malaysian Journal of Library & Information Science, vol. 19, 2017.
[8]    Seth, Dinesh, Seth, Dinesh, Shrivastava, RL, Shrivastava, RL, Shrivastava, Sanjeev, and Shrivastava, Sanjeev, "An empirical investigation of critical success factors and performance measures for green manufacturing in cement industry," Journal of Manufacturing Technology Management, vol. 27, pp. 1076-1101, 2016.
[9]    Wagner, Caroline S, Roessner, J David, Bobb, Kamau, Klein, Julie Thompson, Boyack, Kevin W, Keyton, Joann, et al., "Approaches to understanding and measuring interdisciplinary scientific research (IDR): A review of the literature," Journal of informetrics, vol. 5, pp. 14-26, 2011.
[10]    Wagner, aroline S., Roessner, J. David, Bobb, Kamau, Thompson Klein, Julie, Boyack, Kevin W., Keyton, Joann, et al., "Approaches to understanding and measuring interdisciplinary scientific research (IDR): A review of the literature," Journal of Informetrics, vol. 5, pp. 14-26, 2011.
[11]    Garfield, Eugene, Sher, Irving H, and Torpie, Richard J, "The use of citation data in writing the history of science," INSTITUTE FOR SCIENTIFIC INFORMATION INC PHILADELPHIA PA1964.
[12]    Small, Henry G, "A co-citation model of a scientific specialty: A longitudinal study of collagen research," Social studies of science, vol. 7, pp. 139-166, 1977.
[13]    Fu, Hui-Zhen and Ho, Yuh-Shan, "Highly cited Antarctic articles using Science Citation Index Expanded: a bibliometric analysis," Scientometrics, vol. 109, pp. 337-357, 2016.
[14]    Osei-Kyei, Robert and Chan, Albert PC, "Review of studies on the Critical Success Factors for Public–Private Partnership (PPP) projects from 1990 to 2013," International Journal of Project Management, vol. 33, pp. 1335-1346, 2015.
[15]    Amado, Alexandra, Cortez, Paulo, Rita, Paulo, and Moro, Sérgio, "Research trends on Big Data in Marketing: A text mining and topic modeling based literature analysis," European Research on Management and Business Economics, vol. 24, pp. 1-7, 2018.
[16]    Choi, Hyo Shin, Lee, Won Sang, and Sohn, So Young, "Analyzing research trends in personal information privacy using topic modeling," Computers & Security, vol. 67, pp. 244-253, 2017/06/01/ 2017.
[17]    De Battisti, Francesca, Ferrara, Alfio, and Salini, Silvia, "A decade of research in statistics: a topic model approach," Scientometrics, vol. 103, pp. 413-433, 2015.
[18]    Gerdsri, Nathasit, Kongthon, Alisa, and Puengrusme, Sudatip, "Profiling the Research Landscape in Emerging Areas Using Bibliometrics and Text Mining: A Case Study of Biomedical Engineering (BME) in Thailand," International Journal of Innovation and Technology Management, vol. 14, p. 1740011, 2017.
[19]    Yau, Chyi-Kwei, Porter, Alan, Newman, Nils, and Suominen, Arho, "Clustering scientific documents with topic modeling," Scientometrics, vol. 100, pp. 767-786, 2014.
[20]    Furrer, Olivier, Thomas, Howard, and Goussevskaia, Anna, "The structure and evolution of the strategic management field: A content analysis of 26 years of strategic management research," International Journal of Management Reviews, vol. 10, pp. 1-23, 2008.
[21]    Cancino, Christian, Merigó, José M, Coronado, Freddy, Dessouky, Yasser, and Dessouky, Mohamed, "Forty years of Computers & Industrial Engineering: A bibliometric analysis," Computers & Industrial Engineering, vol. 113, pp. 614-629, 2017.
[22]    Lee, Won Sang and Sohn, So Young, "Effects of standardization on the evolution of information and communications technology," Technological Forecasting and Social Change, vol. 132, pp. 308-317, 2018.
[23]    Shi, Yingling and Liu, Xinping, "Research on the Literature of Green Building Based on the Web of Science: A Scientometric Analysis in CiteSpace (2002–2018)," Sustainability, vol. 11, p. 3716, 2019.
[24]    Liao, Huchang, Tang, Ming, Luo, Li, Li, Chunyang, Chiclana, Francisco, and Zeng, Xiao-Jun, "A bibliometric analysis and visualization of   research," Sustainability, vol. 10, p. 166, 2018.
[25]    Morkūnaitė, Žydrūnė, Kalibatas, Darius, and Kalibatienė, Diana, "A bibliometric data analysis of multi-criteria decision making methods in heritage buildings," Journal of Civil Engineering and Management, vol. 25, pp. 76-99, 2019.
[26]    Hosseini, Seyedmohsen , Ivanov, Dmitry, and Dolgui, Alexandre, "Review of quantitative methods for supply chain resilience analysis," TransportationResearchPart E: Logistics and Transportation Review, vol. 125, pp. 285-307, 2019.
[27]    Gaur, Ajai and Kumar, Mukesh, "A systematic approach to conducting review studies: An assessment of content analysis in 25 years of IB research," Journal of World Business, vol. 53, pp. 280-289, 2018.
[28]    Jones, Spencer S, Rudin, Robert S, Perry, Tanja, and Shekelle, Paul G, "Health information technology: an updated systematic review with a focus on meaningful use," Annals of internal medicine, vol. 160, pp. 48-54, 2014.
[29]    Rabiei, Mohammad, Hosseini-Motlagh, Seyyed-Mahdi, and Haeri, Abdorrahman, "Using text mining techniques for identifying research gaps and priorities: a case study of the environmental science in Iran," Scientometrics, vol. 110, pp. 815-842, 2017.
[30]    Sedighi, Mehri and Jalalimanesh, Ammar, "Mapping research trends in the field of knowledge management," Malaysian Journal of Library & Information Science, vol. 19, pp. 71-85, 2017-03-22 2017.
[31]    THOMSON-REUTERS. (2017, 8/12/2017). Web of ScienceTM Core Collection Help. Available: http://images.webofknowledge.com/WOKRS524B8/help/WOS/hp_subject_category_terms_tasca.html
[32]    Elango, Bakthavachalam and Ho, Yuh – Shan, "Top-cited articles in the field of tribology : A bibliometric analysis," Journal of Scientometrics and Information Management, vol. 12, pp. 289-307, 2018.
[33]    Kim, Meen Chul and Zhu, Yongjun, "Scientometrics of Scientometrics: Mapping Historical Footprint and Emerging Technologies in Scientometrics," in Scientometrics, ed: IntechOpen, 2018, p. 9.
[34]    Klarenbeek, Tracy and Boshoff, Nelius, "Measuring multidisciplinary health research at South African universities: a comparative analysis based on co-authorships and journal subject categories," Scientometrics, vol. 116, pp. 1461-1485, September 01 2018.
[35]    Lin, Hongli, Zhu, Yuming, Ahmad, Naveed, and Han, Qingye, "A scientometric analysis and visualization of global research on brownfields," Environmental Science and Pollution Research, vol. 26, pp. 17666-17684, June 01 2019.
[36]    Fu, Hui-Zhen, Wang, Ming-Huang, and Ho, Yuh-Shan, "The most frequently cited adsorption research articles in the Science Citation Index (Expanded)," Journal of Colloid and Interface Science, vol. 379, pp. 148-156, 2012.
[37]    Blei, David M, "Probabilistic topic models," Communications of the ACM, vol. 55, pp. 77-84, 2012.
[38]    Hu, Zhengyin, Fang, Shu, and Liang, Tian, "Empirical study of constructing a knowledge organization system of patent documents using topic modeling," Scientometrics, vol. 100, pp. 787-799, 2014.
[39]    Landauer, Thomas K, Foltz, Peter W, and Laham, Darrell, "An introduction to latent semantic analysis," Discourse processes, vol. 25, pp. 259-284, 1998.
[40]    Hofmann, Thomas, "Probabilistic latent semantic indexing," in Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, 1999, pp. 50-57.
[41]    Blei, David M, Ng, Andrew Y, and Jordan, Michael I, "Latent Dirichlet allocation," Journal of machine Learning research, vol. 3, pp. 993-1022, 2003.
[42]    Griffiths, Thomas L, Jordan, Michael I, Tenenbaum, Joshua B, and Blei, David M, "Hierarchical topic models and the nested Chinese restaurant process," in Advances in neural information processing systems, 2004, pp. 17-24.
[43]    Lafferty, John D and Blei, David M, "Correlated topic models," in Advances in neural information processing systems, 2006, pp. 147-154.
[44]    Chang, Jonathan and Blei, David M, "Relational topic models for document networks," in International conference on artificial intelligence and statistics, 2009, pp. 81-88.
[45]    Bosman, Jeroen, Mourik, Ineke van, Rasch, Menno, Sieverts, Eric, and Verhoeff, Huib, "Scopus reviewed and compared: The coverage and functionality of the citation database Scopus, including comparisons with Web of Science and Google Scholar," 2006.
[46]    Qin, Jian, "Semantic similarities between a keyword database and a controlled vocabulary database: An investigation in the antibiotic resistance literature," Journal of the Association for Information Science and Technology, vol. 51, pp. 166-180, 2000.
[47]    Chaudhry, SS and Luo, W, "Application of genetic algorithms in production and operations management: a review," International Journal of Production Research, vol. 43, pp. 4083-4101, 2005.
[48]    Gen, Mitsuo and Cheng, Runwei, Genetic algorithms and engineering optimization vol. 7: John Wiley & Sons, 2000.
[49]    Dasgupta, Dipankar and Michalewicz, Zbigniew, Evolutionary algorithms in engineering applications: Springer Science & Business Media, 2013.
[50]    Hornik, Kurt and Grün, Bettina, "Topicmodels: An R package for fitting topic models," Journal of Statistical Software, vol. 40, pp. 1-30, 2011.
[51]    Zhang, Yi, Porter, Alan L, Hu, Zhengyin, Guo, Ying, and Newman, Nils C, "“Term clumping” for technical intelligence: A case study on dye-sensitized solar cells," Technological Forecasting and Social Change, vol. 85, pp. 26-39, 2014.
[52]    Thukral, Anjali, Jain, Ayush, Aggarwal, Mudit, and Sharma, Mehul, "Semi-automatic Ontology Builder Based on Relation Extraction from Textual Data," Singapore, 2018, pp. 343-350.
[53]    Mohapatra, Prateeti, Deng, Yu, Gupta, Abhirut, Dasgupta, Gargi, Paradkar, Amit, Mahindru, Ruchi, et al., "Domain Knowledge Driven Key Term Extraction for IT Services," Cham, 2018, pp. 489-504.
[54]    Duari, Swagata and Bhatnagar, Vasudha, "sCAKE: Semantic Connectivity Aware Keyword Extraction," Information Sciences, vol. 477, pp. 100-117, 2019.
[55]    Huh, Jun-Ho, "Big Data Analysis for Personalized Health Activities: Machine Learning Processing for Automatic Keyword Extraction Approach," Symmetry, vol. 10, p. 93, 2018.
[56]    Momtazi, Saeedeh and Moradiannasab, Omid, "A statistical approach to knowledge discovery: Bootstrap analysis of language models for knowledge base population from unstructured text," Scientia Iranica, vol. 26, pp. 26-39, 2019.
[57]    Hall, David, Jurafsky, Daniel, and Manning, Christopher D, "Studying the history of ideas using topic models," in Proceedings of the conference on empirical methods in natural language processing, 2008, pp. 363-371.
[58]    Weston, Steve and Calaway, Rich, "Getting Started with doParallel and foreach," Date of access, vol. 30, 2017.
[59]    Klavans, Richard and Boyack, Kevin W., "Which Type of Citation Analysis Generates the Most Accurate Taxonomy of Scientific and Technical Knowledge?," Journal of the Association for Information Science and Technology, vol. 68, pp. 984-998, 2017.
[60]    Singhal, Amit, "Modern information retrieval: A brief overview," IEEE Data Eng. Bull., vol. 24, pp. 35-43, 2001.
[61]    Gronsbell, Jessica, Minnier, Jessica, Yu, Sheng, Liao, Katherine, and Cai, Tianxi, "Automated feature selection of predictors in electronic medical records data," Biometrics, vol. 75, pp. 268-277, 2019.
[62]    Hong, Chuan, Liao, Katherine P, and Cai, Tianxi, "Semi‐supervised validation of multiple surrogate outcomes with application to electronic medical records phenotyping," Biometrics, vol. 75, pp. 78-89, 2019.
[63]    Izadi, Nazanin, Aminian, Omid, and Esmaeili, Bahador, "Occupational Accidents in Iran: Risk Factors and Long Term Trend (2007–2016)," Journal of Research in Health Sciences, vol. 19, pp. 1-6, 2019.
[64]    Kogi, Kazutaka, "Work improvement and occupational safety and health management systems: common features and research needs," Industrial Health, vol. 40, pp. 121-133, 2002.
[65]    Ohniwa, Ryosuke L, Hibino, Aiko, and Takeyasu, Kunio, "Trends in research foci in life science fields over the last 30 years monitored by emerging topics," Scientometrics, vol. 85, pp. 111-127, 2010.
[66]    Bornmann, Lutz, Leydesdorff, Loet, and Wang, Jian, "How to improve the prediction based on citation impact percentiles for years shortly after the publication date?," Journal of Informetrics, vol. 8, pp. 175-180, 2014.
[67]    Wang, Jian, "Citation time window choice for research impact evaluation," Scientometrics, vol. 94, pp. 851-872, 2013.
[68]    Wei, Xiaobing, Xue, Hongyan, and Zhang, Jianhua, "Partnership in Supply Chain Risk Management Research," Journal of Applied Science and Engineering Innovation, vol. 2, 2015.
[69]    Lee, Jay, Lapira, Edzel, Bagheri, Behrad, and Kao, Hung-an, "Recent advances and trends in predictive manufacturing systems in big data environment," Manufacturing Letters, vol. 1, pp. 38-41, 2013.
[70]    Zhang, Lin, Luo, Yongliang, Tao, Fei, Li, Bo Hu, Ren, Lei, Zhang, Xuesong, et al., "Cloud manufacturing: a new manufacturing paradigm," Enterprise Information Systems, vol. 8, pp. 167-187, 2014.
[71]    Herrmann, Christoph, Schmidt, Christopher, Kurle, Denis, Blume, Stefan, and Thiede, Sebastian, "Sustainability in Manufacturing and Factories of the Future," International Journal of precision engineering and manufacturing-green technology, vol. 1, pp. 283-292, 2014.
[72]    Chai, Junyi, Liu, James N. K., and Ngai, Eric W. T., "Application of decision-making techniques in supplier selection: A systematic review of literature," Expert Systems with Applications, vol. 40, pp. 3872-3885, 2013/08/01/ 2013.
[73]    Wu, Fang, Yeniyurt, Sengun, Kim, Daekwan, and Cavusgil, S Tamer, "The impact of information technology on supply chain capabilities and firm performance: A resource-based view," Industrial Marketing Management, vol. 35, pp. 493-504, 2006.
[74]    Akbari, O Zohreh, "A survey of agent-oriented software engineering paradigm: Towards its industrial acceptance," International Journal of Computer Engineering Research, vol. 1, pp. 14-28, 2010.
[75]    Amezquita-Sanchez, J.P., Valtierra-Rodriguez, M., and Adeli, H., "Wireless smart sensors for monitoring the health condition of civil infrastructure," Scientia Iranica, vol. 25, pp. 2913-2925, 2018.
[76]    Entezami, Alireza, Shariatmadar, Hashem, and Karamodin, Abbas, "An improvement on feature extraction via time series modeling for structural health monitoring based on unsupervised learning methods," Scientia Iranica, pp. -, 2018.
[77]    Vazirizade, Sayyed Mohsen, Bakhshi, Ali, Bahar, Omid, and Nozhati, Saeed, "Online nonlinear structural damage detection using Hilbert Huang transform and artificial neural networks," Scientia Iranica, vol. 26, pp. 1266-1279, 2019.
[78]    Kontos, Emily, Blake, Kelly D, Chou, Wen-Ying Sylvia, and Prestin, Abby, "Predictors of eHealth usage: insights on the digital divide from the Health Information National Trends Survey 2012," Journal of medical Internet research, vol. 16, 2014.
[79]    Holzinger, Andreas, Dehmer, Matthias, and Jurisica, Igor, "Knowledge discovery and interactive data mining in Bioinformatics-state-of-the-art, future challenges and research directions," BMC Bioinformatics, vol. 15, p. I1, 2014.
[80]    Fan, Wei and Bifet, Albert, "Mining big data: current status, and forecast to the future," ACM sIGKDD Explorations Newsletter, vol. 14, pp. 1-5, 2013.
[81]    Hashem, Ibrahim Abaker Targio, Yaqoob, Ibrar, Anuar, Nor Badrul, Mokhtar, Salimah, Gani, Abdullah, and Khan, Samee Ullah, "The rise of “big data” on cloud computing: Review and open research issues," Information Systems, vol. 47, pp. 98-115, 2015.
[82]    Cheung, Lewis TO, Fok, Lincoln, Tsang, Eric PK, Fang, Wei, and Tsang, HY, "Understanding residents’ environmental knowledge in a metropolitan city of Hong Kong, China," Environmental Education Research, vol. 21, pp. 507-524, 2015.
[83]    Cornell, Sarah, Berkhout, Frans, Tuinstra, Willemijn, Tàbara, J David, Jäger, Jill, Chabay, Ilan, et al., "Opening up knowledge systems for better responses to global environmental change," Environmental Science & Policy, vol. 28, pp. 60-70, 2013.
[84]    Bala, Suman and Kumar, Krishan, "A literature review on kidney disease prediction using data mining classification technique," International Journal of Computer Science and Mobile Computing, vol. 3, pp. 960-967, 2014.
[85]    Chaurasia, Vikas and Pal, Saurabh, "Early prediction of heart diseases using data mining techniques," Caribbean Journal of Science and Technology, vol. 1, pp. 208-217, 2013.
[86]    Masethe, Hlaudi Daniel and Masethe, Mosima Anna, "Prediction of heart disease using classification algorithms," in Proceedings of the World Congress on Engineering and Computer Science, 2014, p. 2224.
[87]    Wager, Karen A, Lee, Frances W, and Glaser, John P, Health care information systems: a practical approach for health care management: John Wiley & Sons, 2017.
[88]    Wickens, Christopher D, Hollands, Justin G, Banbury, Simon, and Parasuraman, Raja, Engineering psychology & human performance: Psychology Press, 2015.
[89]    Landy, Frank J and Conte, Jeffrey M, Work in the 21st Century, Binder Ready Version: An Introduction to Industrial and Organizational Psychology: John Wiley & Sons, 2016.