Document Type : Original Research Paper

Authors

1 Department of Department of Industrial Engineering, Faculty of Industrial Engineering and Management Systems, Amirkabir University of Technology, Tehran, Iran

2 Department of Financial Engineering, Faculty of Industrial Engineering and Management Systems, Amirkabir University of Technology, Tehran, Iran

Abstract

Objective: Today, customers have become a critical factor in directing investors, producers, and even researchers and innovators. For this reason, organizations need to know about their customers and plan for them. Insurance companies and in general the insurance industry in each country, is one of the most important financial institutions active in financial markets, especially the capital market, which in addition to providing security for economic activities, can play a very fundamental role in providing insurance services. In other words, insurance companies play a vital role in the mobility, dynamic of financial markets and the provision of investable funds in economic activities. In this research, it has been attempted to answer one of the most important questions of insurance organizations, namely, predicting the level of customers’ losses and investing on profitable customers.
Methodology: Data mining methods were used to discover knowledge to meet business needs and customer relationship management strategies. In addition, an overview of the various applications of data mining in customer relationship management in various insurance companies has been done. In the model implementation stage, a real data-set is used to evaluate the proposed model. To perform the data mining techniques in the insurance industry as data of customers, the vehicle body insurance from 2015 to 2017 has been under investigation. The total number of data used in this study from the beginning was more than 19,356, which during data preparation using Rapidminer 7.1 software became 19,356. After the initial processing, an attempt is made to extract good features from the 15 variables in the data-set that is tangible and help this research in its goal. As a result, by using clustering, drivers are divided into separate clusters based on the amount of loss, and the characteristics of each cluster are expressed. In the clustering section, three algorithms of data mining are examined. First, k-means, k-medoids, and DBSCAN implemented on data-set. Then, the conclusion of three algorithms compared with each other based on the time of calculation and accuracy.
Finding: Data mining was a good tool in this research, owing to the large volume of data, to discover the needs and identify customers. The data mining technique which was the main approach of this study fully covered the information needs by methods such as classification, prediction and clustering. The k-means algorithm was selected as the most optimal one in time and accuracy. In the following, the implementation of the algorithms in the modeling step, the decision tree algorithm was selected and by the decision tree related to the forecasting model, it can be predicted future customers by what characteristics would be in what category. It will be valuable for the insurance companies. Using a decision tree, a forecasting model is proposed to help insurance companies to identify profitable customers which can be used for future plans of organizations.
Conclusion: The customer plays an important role in today's industry. Through studying the data obtained from customer behavior, appropriate action can be taken for marketing-related planning and customer acquisition. The use of predictive models and preventive roadmaps has always been one of the goals of the tools that various organizations have been looking for. In this research, the insurance industry as one of the most important pillars of economic in developing countries has been chosen. By reviewing the share of the insurance industry in the economy of a developing country, it can be seen that insurance has a significant role compared to other services.  In this study, the role of insurance companies in optimizing the investment process and ways to expand the interaction between insurance examined. Customers can lead to the growth and development of the insurance industry and the capital market and thus the growth and development of the national economy. Therefore, in the implementation of this research, the data of insurance customers have been used and a forecasting model has been presented. As a good prediction model, the decision tree with 86.21% accuracy was the best model that reached in this study. The insurers’ income criterion is considered as the node root, which shows the used method can help insurance companies make more profit by focusing on profitable customers.
JEL Classification: B31, C38, C22, D12

Keywords

مترجم، کیومرث و نیاکان، لیلی. (۱۴۰۰). سنجش و ارزیابی رضایت‎مندی مشتریان بیمه‌های زندگی. پژوهشنامه بیمه، 36(1): 119-87.
حاجی­حیدری، نسترن.، خاله­ء، سامرند‌ و فراهی، احمد. (1390). طبقه­بندی میزان ریسک بیمه­گذاران بیمه بدنه خودرو با استفاده از الگوریتم­های داده­کاوی (مورد مطالعه: یک شرکت بیمه). پژوهشنامه بیمه، ‌26(4): 129-107.
شهرابی، جمال. (1395). داده­کاوی. تهران: انتشارات دانشگاه صنعتی امیرکبیر(پلی تکنیک تهران)، چاپ اول.
غضنفری، مهدی، علیزاده، سمیه و تیمورپور، بابک. (1387). داده­کاوی و کشف دانش. تهران: انتشارات دانشگاه علم و صنعت، چاپ اول.
قره­نژاد، سحر. (1389). لزوم حفظ مشتریان بیمه با استفاده از ابزارهای داده‌کاوی. تازه‌های جهان بیمه، 13(150-151): 23-15.
قره‌خانی، محسن و ابوالقاسمی، مریم. (1390). کاربردهای داده‌کاوی در صنعت بیمه. تازه­های جهان بیمه، 14(158)، 21-5.
Baecke, P. & Bocca, L. (2017). The value of vehicle telematics data in insurance risk selection processes. Decision Support Systems, 98: 69–79.
Balaji, S. & Srivatsa, S. K. (2012). Decision tree induction based classification for mining life insurance databases. International Journal of Computer Science and Information Technology & Security, 2(3): 699-703.
Bhowmik, R. (2011). Detecting auto insurance fraud by data mining techniques. Journal of Emerging Trends in Computing and Information Sciences, 2(4): 156-162.
Morik, K. & Kӧpcke, H. (2004). Analyzing customer churn in insurance data –A case study. European Conference on Principles of Data Mining and Knowledge Discovery, 325-336.
Oshini, G. T. L. & Caldera H. A. (2013). Mining life insurance data for customer attrition analysis. Journal of Industrial and Intelligent Information, 1(1): 52-58. ‌
Raedel, M., Hartmann, A., Priess, H. W., Bohm, S., Samietz, S., Konstantinidis, I. & Walter, M. H. (2017). Re-interventions  after  restoring  teeth—mining  an  insurance  database. Journal  of  Dentistry, 57: 14–19.
Rahman, M. S., Arefin, K. Z., Masud, S., Sultana, S. & Rahman, R. M. (2017). Analyzing life insurance data with different classification techniques for customers’ behavior analysis. Asian Conference on Intelligent Information and Database Systems, 15-25.
Ranjan, R. (2011). Self insurance and insurance demand under self-deception. Asia-Pacific Journal of Risk and Insurance, 5(2): 1-27.
Sundarkumar, G. G. & Ravi, V. (2015). A novel hybrid under sampling method for mining unbalanced datasets in banking and insurance. Engineering Applications of Artificial Intelligence, 37: 368–377.
Thakur, S. S. & Sing, J. K. (2013). Mining customer’s data for vehicle insurance prediction system using k-means clustering - An Application. International Journal of Computer Applications in Engineering Sciences, 3(4): 148-153.
Umamaheswari, K. & Janakiraman, S. (2014). Role of data mining in insurance industry. An International Journal of Advanced Computer Technology, 3(6): 961-966.
Wanke, P. & Barros, C. P. (2015). Efficiency drivers in Brazilian insurance: A two-stage DEA meta frontier-data mining approach. Economic Modelling, 53(C): 8–22.

Letters to Editor


IJIR Journal welcomes letters to the editor for the post-publication discussions and corrections which allows debate post publication on its site, through the Letters to Editor. Letters pertaining to manuscript published in IJIR should be sent to the editorial office of IJIR within three months of either online publication or before printed publication, except for critiques of original research. Following points are to be considering before sending the letters (comments) to the editor.

[1] Letters that include statements of statistics, facts, research, or theories should include appropriate references, although more than three are discouraged.

[2] Letters that are personal attacks on an author rather than thoughtful criticism of the author’s ideas will not be considered for publication.

[3] Letters can be no more than 300 words in length.

[4] Letter writers should include a statement at the beginning of the letter stating that it is being submitted either for publication or not.

[5] Anonymous letters will not be considered.

[6] Letter writers must include their city and state of residence or work.

[7] Letters will be edited for clarity and length.
CAPTCHA Image