پیش بینی خسارت با استفاده از تکنیک های رگرسیون عمیق متوالی در روش های یادگیری عمیق

پرستش, مونا; بهشتی‌فرد, ضیاء الدین; راد, عباس

doi:10.22056/ijir.2025.04.02

نوع مقاله : مقاله پژوهشی

نویسندگان

¹ دانشجوی دکتری، گروه مهندسی کامپیوتر، دانشکده فنی و مهندسی، واحد تهران جنوب، دانشگاه آزاد اسلامی، تهران، ایران

² مربی، گروه مهندسی کامپیوتر، دانشکده فنی و مهندسی، واحد تهران جنوب، دانشگاه آزاد اسلامی، تهران، ایران

³ استادیار، گروه مدیریت، دانشکده مدیریت و حسابداری، دانشگاه شهید بهشتی، تهران، ایران

https://doi.org/10.22056/ijir.2025.04.02

چکیده

پیشینه و اهداف: یکی از چالش‌های مهم شرکت‌های بیمه‌ تعیین نرخ بهینة ‌بیمه‌نامه‌های اموال مانند خودرو و آتش‌سوزی است. اگر شرکت بیمه بتواند امکان وقوع خسارت را در یک ‌بیمه‌نامه تشخیص دهد، تصمیمات مؤثر و بهتری در تعیین نرخ بیمه‌، میزان تخفیف اختصاص‌یافته به ‌بیمه‌نامه یا تصمیم‌گیری دربارة تمدید آن ‌بیمه‌نامه خواهد داشت. شرکت‌های بیمه و خبرگان رشتة صدور و خسارت‌، به‌دنبال روش‌های جدیدی برای ارزیابی ریسک مشتریان و ‌بیمه‌نامه‌ها از طریق پیش‌بینی وقوع خسارات احتمالی در این رشته هستند. روش‌های سنتی مانند مدل‌های خطی عمومی (GLMs) غالباً در شناسایی الگوهای پیچیده در داده‌های بیمه ناکام‌اند. یادگیری عمیق با توانایی شناسایی روابط غیرخطی و الگوهای پیچیده می‌تواند این محدودیت‌ها را رفع کند. در این مقاله روشی برای بهبود عملکرد یادگیری عمیق با استفاده از تکنیک‌های رگرسیون عمیق ترتیبی ارائه شده است. رویکرد پیشنهادی ترکیبی از یادگیری عمیق و مدل‌های ترتیبی است. از شبکه‌های حافظة طولانی کوتاه‌مدت (LSTM) برای مدل‌سازی داده‌های سری زمانی استفاده می‌شود.
روش‌شناسی: در این مقاله از داده‌های 7 سال اخیر صدور و خسارت بیمة آتش‌سوزی شرکت بیمة البرز برای بررسی و پیش‌بینی خسارت در این رشته استفاده شده است. در این مقاله با تمرکز بر پیش‌پردازش داده‌ها و استخراج ویژگی‌های برتر برای ارائة بهترین نتیجه و پس از اعمال روش‌های مختلف استخراج ویژگی، در نهایت از مجموع 40 ویژگی، 20 ویژگی انتخاب و سپس با استفاده یادگیری عمیق آموزش داده شد. رویکرد پیشنهادی، ترکیبی از یادگیری عمیق و مدل‌های ترتیبی با استفاده از شبکه‌های حافظة طولانی کوتاه‌مدت (LSTM) برای مدل‌سازی داده‌های سری زمانی استفاده‌ می‌کند.
یافته‌ها: در این مقاله با بررسی روش‌های مختلف یادگیری ماشینی روی داده‌های صدور و خسارت شرکت بیمه‌، این نتیجه به دست آمد که مدل رگرسیون عمیق ترتیبی نسبت به روش‌های سنتی عملکرد بهتری دارد. بهبود دقت پیش‌بینی، قابلیت اطمینان بالاتر و تأکید بر اهمیت ویژگی‌های زمانی از نتایج اصلی‌اند.
نتیجه‌گیری: بهینه‌سازی فرایندهای ارزیابی ریسک و خسارت از طریق مدل‌های یادگیری عمیق می‌تواند به ارائة نرخ‌های ‌حق‌بیمه دقیق‌تر و کاهش ضررهای ناشی از پرداخت‌های نامناسب منجر شود. همچنین، استفاده از این مدل‌ها می‌تواند به شرکت‌های بیمه کمک کند تا استراتژی‌های مدیریت ریسک بهتری اتخاذ کنند و فرایندهای کشف تقلب را بهبود بخشند. این موضوع به‌ویژه در بیمه‌های آتش‌سوزی، که خسارت‌های مالی سنگینی به همراه دارند، از اهمیت بالایی برخوردار است.

کلیدواژه‌ها

موضوعات

ارزیابی ریسک در رشته های بیمه ‎

عنوان مقاله [English]

Predicting damage using sequential deep regression techniques in deep learning methods

نویسندگان [English]

Mona Parastesh ¹
Ziaeddin Beheshtifard ²
Abbas Raad ³

¹ PhD Student, Department of Computer Engineering, Faculty of Technical and Engineering, South Tehran Branch, Islamic Azad University, Tehran, Iran

² Instructor, Department of Computer Engineering, Faculty of Technical and Engineering, South Tehran Branch, Islamic Azad University, Tehran, Iran

³ Assistant Professor, Department of Management, Faculty of Management and Accounting, Shahid Beheshti University, Tehran, Iran

چکیده [English]

BACKGROUND AND OBJECTIVES: The role of the insurance industry is changing nowadays. The reason is that companies are using new analytical methods to predict losses and risks, and these methods help them assess potential risks. In this era, traditional business models and old methods have always been under threat from technology. New insurance companies are using the power of innovative technologies to eliminate the traditional leaders of the insurance market. The protection provided to the insured against risks and the proposed solutions provided to deal with risks are obtained through services designed to identify potential risks, and these services can be used to warn of danger (in high-risk cases). As a result, these services will be the most important distinction of these companies and the key to their success in the future. Powerful artificial intelligence and analysis of large volumes of big data give insurers the power to move towards predicting losses and incidents. The more information insurance companies have about their policyholders, the better they can use these valuable data to predict policyholders’ behavior and create a historical profile for each individual, thereby reducing the volume of claims and associated risks. Insurance companies enjoying leverage innovative technologies have a significant opportunity for growth. However, those that continue to rely on basic questions such as age, gender, and occupation to determine premiums are unlikely to survive in the digital era and amidst the rise of insurtech. Insurers that fail to adopt predictive analytics and continue to use outdated traditional systems may experience longer delays in processing and paying claims compared to innovative companies. This gap will allow tech-driven insurers to attract more customers and cover a wider range of policyholders in the long term. Insurance data often contains nonlinear and complex relationships that simple models—such as linear regression or decision trees—cannot fully capture or model effectively. These companies are also faced with vast volumes of data. Traditional methods such as general linear models often fail to identify complex patterns in insurance data. Therefore, we seek to improve existing methods by applying modern techniques such as deep learning, since deep neural networks can more accurately identify complex patterns in insurance data, process large datasets efficiently, and uncover hidden insights. Deep learning, with the ability to identify nonlinear relationships and complex patterns, can overcome these limitations. In this paper, a method to improve the performance of deep learning using sequential deep regression techniques is presented. The proposed approach is a combination of deep learning and sequential models. Long Short-Term Memory (LSTM) neural networks are used to model time series data.
METHODS: In this study, data spanning the past seven years from Alborz Insurance Company—specifically related to the issuance and loss records of fire insurance policies—has been systematically utilized to analyze and forecast potential losses in this domain. The methodology places a strong emphasis on comprehensive data pre-processing, including cleaning, normalization, and transformation to ensure the reliability and quality of the input data. In the feature engineering stage, various techniques were applied to extract the most informative and relevant attributes from the raw dataset. Out of a total of 40 initially selected features, the top 20 features were identified through statistical analysis and machine learning-based selection methods. These refined features were then used to train the deep learning models. The proposed method is a hybrid approach that combines deep learning with sequential modeling techniques. Specifically, Long Short-Term Memory (LSTM) neural networks were employed due to their ability to capture time-dependent patterns in sequential data, making them particularly suitable for modeling the temporal dynamics inherent in insurance data over multiple years.

FINDINGS :The study involved the evaluation and comparison of multiple machine learning algorithms, including traditional models and advanced deep learning techniques. The results demonstrated that the proposed sequential deep regression approach significantly outperforms conventional models such as general linear models and decision trees. Notably, the LSTM-based model provided higher prediction accuracy and demonstrated superior performance in identifying complex, nonlinear patterns within the data. Key findings highlight the critical role of temporal features in enhancing prediction reliability and show that incorporating time series analysis is essential for improving the accuracy of damage occurrence forecasts in fire insurance.

CONCLUSION: The results of this research underscore the effectiveness of combining deep learning techniques with sequential models for predicting fire insurance losses. Using the confidential and comprehensive issuance and claims dataset from Alborz Insurance Company over seven years, the proposed hybrid model was capable of delivering better performance in comparison to previous methods. The approach not only improved the precision of predictions but also offered a more robust and scalable solution for risk assessment. Overall, the use of LSTM-based deep learning models represents a significant advancement in the insurance industry’s ability to make data-driven decisions regarding premium setting, policy issuance, and risk mitigation strategies.

کلیدواژه‌ها [English]

Deep learning
Fire damage
Insurance
Machine learning

مراجع

تجددی نودهی، م.، حسینی خطیبانی، س.، یزدی‌نژاد، م.، و زلفی، س. (۱۴۰۲). پیش‌بینی هزینه‌های بیمه درمانی افراد با استفاده از یادگیری ماشین و روش یادگیری جمعی. پژوهشنامه بیمه، 13(1)، 1-14. https://doi.org/10.22056/ijir.2024.01.01

رستمی، م.، بنی حسینی، س. م.، و صفائی، س. (1401). الگوریتم هوشمند ارزیابی و پیش‌بینی ریسک و خسارت بیمه درمان تکمیلی با استفاده از روش شبکه عصبی مصنوعی چندلایه]مقاله کنفرانسی.[ بیست و نهمین همایش ملی و دهمین همایش بین‌المللی بیمه و توسعه، تهران، ایران.https://civilica.com/doc/1770257 /

رئیسی‌ وانانی، ا.، تقوی‌فرد، م. ت، سهرابی، ب.، و امیرحسینی، م. (۱۴۰۲). طراحی سیستم ارزیابی هوشمند جهت پیش‌بینی خسارت بیمه‌های آتش‌سوزی با استفاده از یادگیری عمیق. پژوهشنامه بیمه، 12(4)، 251-264. https://doi.org/10.22056/ijir.2023.04.01

معنوی، ف.، دادگر، م.، و رحمتیان، م. (۱۴۰۰). ارائه روش پیش‌بینی خسارت در بیمه‌نامه شخص ثالث ]مقاله کنفرانسی [. بیست و نهمین همایش ملی و دهمین همایش بین‌المللی بیمه و توسعه، تهران، ایران. https://civilica.com/doc/1390851/

میرباقری جم، م.، شهیکی‌تاش، م. ن، زمانیان، غ.، و صفری، ا. (۱۴۰۱). تجمیع ریسک‌های بیمه‌گری صنعت بیمه ایران با استفاده از توابع مفصل (رویکرد توابع مفصل ارشمیدسی سلسله‌مراتبی). پژوهشنامه بیمه، 4(4)، 412-425. https://doi.org/10.22056/ijir.2015.04.02

Abdelhadi, S., ElBahnasy, K. A., & Abdelsalam, M. M. (2020). A proposed model to predict auto insurance claims using machine learning techniques. Journal of Theoretical and Applied Information Technology, 98(22), 3428–3437. https://www.jatit.org/volumes/Vol98No22/8Vol98No22.pdf

Abdulkadir, U.I., & Fernando, A. (2024). A deep learning model for insurance claims predictions. Journal on Artificial Intelligence, 6(1), 71–83. https://doi.org/10.32604/jai.2024.045332

Balaji, S., & Srivatsa, S. K. (2012). Naïve Bayes classification approach for mining life insurance databases for effective prediction of customer preferences. International Journal of Computer Applications, 51(3), 22–28. https://doi.org/10.5120/8023-0805

Banks, D. (2020). Discussion of “Machine learning applications in non-life insurance”. Applied Stochastic Models in Business and Industry, 36(4), 538–540. https://doi.org/10.1002/asmb.2537

Baran, S., & Rola, P. (2022). Prediction of motor insurance claims occurrence as an imbalanced machine learning problem [Preprint]. arXiv, 2204.06109. https://doi.org/10.48550/arXiv.2204.06109

Bärtl, M., & Krummaker, S. (2020). Prediction of claims in export credit finance: A comparison of four machine learning techniques. Risks, 8(1), 22. https://doi.org/10.3390/risks8010022

Bücher, A., & Rosenstock, A. (2023). Micro-level prediction of outstanding claim counts based on novel mixture models and neural networks. European Actuarial Journal, 13, (May), 55–90. https://doi.org/10.1007/s13385-022-00314-4

Cummings, J., & Hartman, B. (2022). Using machine learning to better model long‑term care insurance claims. North American Actuarial Journal, 26(3), 470–483. https://doi.org/10.1080/10920277.2021.2022497

Grize, Y. L., Fischer, W., & Lützelschwab, C. (2020). Machine learning applications in non-life insurance. Applied Stochastic Models in Business and Industry, 36(4), 545–547. https://doi.org/10.1002/asmb.2564

Gulseven, O. (2020). Estimating the demand factors and willingness to pay for agricultural insurance [Preprint]. arXiv, 2004.11279. https://doi.org/10.48550/arXiv.2004.11279

Hanafy, M., & Ming, R. (2021). Machine learning approaches for auto insurance big data. Risks, 9(2), 42. https://doi.org/10.3390/risks9020042

Hu, S., & O’Hagan, A. (2020). Insurance claims forecasting with cluster analysis. The Actuary, (8^th July). https://www.theactuary.com/features/2020/07/08/insurance-claims-forecasting-cluster-analysis

Jing, L., Zhao, W., Sharma, K., & Feng, R. (2018). Research on probability-based learning application on car insurance data. Proceedings of the 2017 4th International Conference on Machinery, Materials and Computer (MACMC 2017) (pp. 59–63) [Conference presentation]. Atlantis Press. https://doi.org/10.2991/macmc-17.2018.14

Kose, I., Gokturk, M., & Kilic, K. (2015). An interactive machine-learning-based electronic fraud and abuse detection system in healthcare insurance. Applied Soft Computing, 36, 283-299.https://doi.org/10.1016/j.asoc.2015.07.018

Lentz, T. J., Dotson, G. S., Williams, P. R. D., Maier, A., Gadagbui, B., Pandalai, S. P., Lamba, A., Hearl, F., & Mumtaz, M. (2015). Aggregate exposure and cumulative risk assessment—integrating occupational and non-occupational risk factors. Journal of Occupational and Environmental Hygiene, 12(sup1), S112–S126. https://doi.org/10.1080/15459624.2015.1060326

Manavi, F., Dadgar, M., & Rahmatian, M. (2021). Presenting a method for predicting damage in third-party car insurance policies [Conference presentation]. The 28th National and 9th International Conference on Insurance and Development, Tehran, Iran. https://civilica.com/doc/1390851/ [In Persian].

Mirbagheri Jam, M., Shahiki Tash, M. N., Zamanian, G., & Safari, A. (2022). Aggregation of underwriting risks in insurance industry of Iran using copulas (Hierarchical archimedean copulas approach.( Insurance Research Journal, 4(4), 412–425. https://doi.org/10.22056/ijir.2015.04.02 [In Persian].

Pabuçcu, H., Ongan, S., & Ongan, A. (2020). Forecasting the movements of Bitcoin prices: An application of machine learning algorithms. Quantitative Finance and Economics, 4(4), 679–692. https://doi.org/10.3934/QFE.2020031

Pesantez-Narvaez, J., Guillen, M., & Alcañiz, M. (2019). Predicting motor insurance claims using telematics data—XGBoost versus logistic regression. Risks, 7(2), 70. https://doi.org/10.3390/risks7020070

Raeesi Vanani, I., Taghavifard, M., Sohrabi, B., & Amirhosseini, M. (2023). Designing an intelligent evaluation system for predicting fire insurance claims using deep learning. Iranian Journal of Insurance Research, 12(4), 251-264. https://doi.org/10.22056/ijir.2023.04.01 [In Persian].

Reddy, D. B., Mounika, T., Peri, P., Rao, N. T., & Bhattacharyya, D. (2024). Prediction of automobile insurance claims using deep neural networks [Conference presentation]. In D. Bhattacharyya & R. Ghosh (Eds.), EAI International Conference on Computational Intelligence and Generative AI (EAI, Springer Innovations in Communication and Computing (pp. 31-41). Springer. https://doi.org/10.1007/978-3-031-76610-7_3

Rostami, M., Bani Hosseini, S. M., & Safaei, S. (2022). An intelligent algorithm for evaluating and predicting risk and damage in supplementary health insurance using multilayer artificial neural networks [Conference presentation]. The 29th National and 10th International Conference on Insurance and Development, Tehran, Iran. https://civilica.com/doc/1770257/ [In Persian].

Sundarkumar, G. G., & Ravi, V. (2015). A novel hybrid undersampling method for mining unbalanced datasets in banking and insurance. Engineering Applications of Artificial Intelligence, 37, 368–377. https://doi.org/10.1016/j.engappai.2014.09.019

Tajaddodi Nodehi, M., Hosseini Khatibani, S., Yazdinejad, M., & Zolfi, S. (2023). Predicting people's health insurance costs using machine learning and ensemble learning methods. Iranian Journal of Insurance Research, 13(1), 1-14. https://doi.org/10.22056/ijir.2024.01.01 [In Persian].

Thakur, S. S., & Sing, J. K. (2013). Mining customer data for vehicle insurance prediction system using k-means clustering—An application. International Journal of Computer Applications in Engineering Sciences, 3(4), 148. https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=9ad9d68128e22acfd5f4897b0e1e562099792b2f

Weerasinghe, K. P. M. L., & Wijegunasekara, M. C. (2016). A comparative study of data mining algorithms in the prediction of auto insurance claims. European International Journal of Science and Technology, 5(1), 47–54. https://rda.sliit.lk/handle/123456789/2525Wüthrich, M. V. (2020). Bias regularization in neural network models for general insurance pricing. European Actuarial Journal, 10(1), 179–202. https://doi.org/10.1007/s13385-019-00215-z

نامه به سردبیر

سردبیر نشریه پژوهشنامه بیمه، هرگونه پیشنهاد و انتقاد دیگر نویسندگان و خوانندگان را در خصوص نقد و بررسی این مقاله مندرج در سامانه نشریه را ظرف مدت 3 ماه از تاریخ انتشار آنلاین مقاله در سامانه و قبل از انتشار چاپی نشریه، به منظور اصلاح و نظردهی امکان پذیر نموده است.، البته این نقد در مورد تحقیقات اصلی مقاله نمی باشد.
توجه به موارد ذیل پیش از ارسال نامه به سردبیر لازم است در نظر گرفته شود:
[1] نامه هایی که شامل گزارش آماری، واقعیت ها، تحقیقات یا نظریه پردازی ها هستند، لازم است همراه با منابع معتبر و مناسب همراه باشد، اگرچه ارسال بیش از زمان 3 نامه توصیه نمی گردد.
[2] نامه هایی که بجای انتقاد سازنده به ایده های تحقیق، مشتمل بر حملات شخصی به نویسنده باشند، توجه و چاپ نمی شود.
[3] نامه ها نباید بیش از 300 کلمه باشد.
[4] نویسندگان نامه لازم است در ابتدای نامه تمایل یا عدم تمایل خود را نسبت به چاپ نظریه ارسالی نسبت به یک مقاله خاص اعلام نمایند.
[5] به نامه های ناشناس ترتیب اثر داده نمی شود.
[6] شهر، کشور و محل سکونت نویسندگان نامه باید در نامه مشخص باشد.
[7] به منظور شفافیت بیشتر و محدودیت حجم نامه، ویرایش بر روی آن انجام می پذیرد.

نام و نام خانوادگی *

پست الکترونیکی *

وابستگی سازمانی *

توضیحات *

شناسه امنیتی *

پژوهشنامه بیمه

پیش بینی خسارت با استفاده از تکنیک های رگرسیون عمیق متوالی در روش های یادگیری عمیق

مراجع

مراجع

ارسال نظر در مورد این مقاله

دوره 14، شماره 4 - شماره پیاپی 54
مهر 1404
صفحه 281-294

پیش بینی خسارت با استفاده از تکنیک های رگرسیون عمیق متوالی در روش های یادگیری عمیق

مراجع

مراجع

ارسال نظر در مورد این مقاله

دوره 14، شماره 4 - شماره پیاپی 54مهر 1404صفحه 281-294

دوره 14، شماره 4 - شماره پیاپی 54
مهر 1404
صفحه 281-294