A Mathematical Model for Identifying and Validating Suspicious Clusters Associated with Organized Fraud in Auto Insurance

Hamzeh, Asma; Nadjafi-Arani, Mohammad Javad

doi:10.22056/ijir.2025.03.03

Article type: Research article

Authors

¹ Assistant Professor, Department of New Insurance Technologies, Insurance Research Institute, Tehran, Iran

² Associate Professor, Department of Computer Science, Mahallat Institute of Higher Education, Mahallat, Iran

https://doi.org/10.22056/ijir.2025.03.03

Abstract

Background and objectives: Fraud is a common problem in the insurance industry and causes heavy losses, both in material terms and in public trust in the industry. Financial and monetary institutions are keen to understand precisely how fraudsters and swindlers operate. Because this understanding directly affects how institutions serve their customers, it leads to lower operating costs, the trust of other policyholders, and the preservation and growth of insurers' market share as reliable financial service providers. Among the most common violations are organized and opportunistic fraud in auto insurance. Deliberate accidents, especially those staged by groups, injuries to people caused by a vehicle, and staged scenes are among the common forms of fraud in this area. The aim of this article is to introduce a mathematical model based on graph (network) theory for identifying suspicious clusters in organized fraud.
Methodology: Network analysis is one of the methods used to detect fraud. It evaluates the relationships among various individuals and natural and legal persons and reveals new dimensions of those relationships. In this study, a network called the accident network is first introduced using graph theory. It is then shown that the network formed by vehicle accidents is a random process. Next, within the constructed accident network, the sets of suspicious vehicles that create order in this random structure are identified by means of a proposed algorithm.
Findings: This process assigns a label, fraudulent or not, to each accident and each individual. Given the structure and complexity of the algorithm, it can be concluded that the proposed algorithm can readily analyze very large volumes of data.
Conclusion: Examining this issue allows the insurer, depending on the label of each individual or accident, to adopt different policies for dealing with offenders, so that it can take steps toward reducing financial losses and increasing public trust.

Keywords

Subjects

Financial/Applied Mathematics

Article title [English]

A Mathematical Model for Identifying and Validating Suspicious Clusters Associated with Organized Fraud in Auto Insurance

Authors [English]

Asma Hamzeh ¹
Mohammad Javad Nadjafi-Arani ²

¹ Assistant Professor, Department of New Insurance Technologies, Insurance Research Institute, Tehran, Iran

² Associate Professor, Department of Computer Science, Mahallat Institute of Higher Education, Mahallat, Iran

Abstract [English]

BACKGROUND AND OBJECTIVES: Insurance fraud is a persistent challenge for the insurance industry, causing substantial financial losses and eroding public trust. Financial institutions are actively seeking accurate methods to identify the activities of fraudsters and scammers. Because this directly affects how institutions serve their clients, it leads to lower operating costs, the trust of other policyholders, and the preservation and growth of insurers' market share as reliable financial service providers. One of the most prevalent forms of fraud occurs in auto insurance, where organized and opportunistic fraudulent activities are rampant. Fabricated accidents, especially those involving groups, staged injuries, and orchestrated scenes, are among the common fraudulent practices in this area. Opportunistic fraud is typically committed by an individual who simply seizes an opportunity to inflate a claim or to obtain an exaggerated estimate for damages or repairs from their insurance company. In contrast, professional fraud is often carried out by organized groups. These rings typically use multiple fake identities and target multiple organizations or even brands. These criminal networks frequently rely on insiders to help them defraud companies, running several schemes at the same time. Although the amounts involved in professional fraud cases are much larger, such cases occur less frequently than opportunistic insurance fraud. Combating insurance fraud is challenging. Most traditional systems can detect opportunistic fraud; however, because of the significant financial losses involved, insurance companies are particularly focused on identifying organized fraud rings. Consequently, insurers need to adopt advanced technologies and sophisticated systems to address this problem effectively.
METHODS: Network analysis is a valuable technique for fraud detection. It evaluates the relationships among individuals and entities (both natural and legal persons) to uncover new dimensions of these interactions. This paper introduces a mathematical model, based on graph theory, to identify suspicious clusters associated with organized fraud. First, a network termed the “accident network” is defined using graph theory. This network is shown to have the characteristics of a random graph. Suspicious clusters within this network are then identified using a graph theory-based algorithm. Finally, the occurrence probability of such clusters in a random accident network is examined by defining a binomial distribution over its edges.
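The steps in METHODS can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' algorithm (the abstract does not specify it): accidents are given as hypothetical pairs of vehicle IDs, candidate clusters are simply connected components, and each possible edge is assumed to appear independently with an assumed probability `p`. The function names, `alpha`, and `min_size` are all introduced here for illustration.

```python
from collections import defaultdict
from itertools import combinations
from math import comb


def build_accident_network(accidents):
    """Map each vehicle to the set of vehicles it has had an accident with."""
    graph = defaultdict(set)
    for a, b in accidents:
        if a != b:
            graph[a].add(b)
            graph[b].add(a)
    return graph


def connected_components(graph):
    """Return the connected components of the graph as sets of vehicles."""
    seen, components = set(), []
    for start in graph:
        if start in seen:
            continue
        stack, comp = [start], set()
        while stack:
            v = stack.pop()
            if v in comp:
                continue
            comp.add(v)
            stack.extend(graph[v] - comp)
        seen |= comp
        components.append(comp)
    return components


def internal_edges(graph, nodes):
    """Count the edges whose two endpoints both lie in `nodes`."""
    return sum(1 for u, v in combinations(sorted(nodes), 2) if v in graph[u])


def binomial_tail(n, k, p):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def suspicious_clusters(accidents, p, alpha=1e-3, min_size=3):
    """Flag components that are denser than chance would plausibly allow.

    Assumption: under the random model, each of the C(m, 2) possible edges
    inside an m-vehicle group is present independently with probability p.
    """
    graph = build_accident_network(accidents)
    flagged = []
    for comp in connected_components(graph):
        if len(comp) < min_size:
            continue
        possible = comb(len(comp), 2)
        observed = internal_edges(graph, comp)
        tail = binomial_tail(possible, observed, p)
        if tail < alpha:
            flagged.append((sorted(comp), observed, tail))
    return flagged
```

Calling `suspicious_clusters(accidents, p=0.01)` on a list of `(vehicle, vehicle)` pairs returns each flagged group with its edge count and tail probability. Using connected components as candidates is a simplification; a real detector would search for dense subgraphs more carefully.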
FINDINGS: This process assigns a label (indicating fraudulent or non-fraudulent) to each accident and individual involved. Given the algorithm's structure and complexity, the proposed method is capable of efficiently analyzing large datasets.
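One way to picture the labeling step is the sketch below. It assumes suspicious clusters have already been found and are given as sets of vehicle IDs; the rule that an accident is suspicious when both vehicles belong to the same cluster is an assumption made here for illustration, not a rule stated in the abstract.

```python
def label_records(accidents, clusters):
    """Label vehicles and accidents given already-flagged vehicle clusters.

    accidents: list of (vehicle, vehicle) pairs (hypothetical input format).
    clusters:  list of sets of vehicle IDs flagged as suspicious.
    """
    # Index of the cluster each flagged vehicle belongs to.
    cluster_of = {v: i for i, cluster in enumerate(clusters) for v in cluster}

    vehicles = {v for pair in accidents for v in pair}
    vehicle_labels = {
        v: ("suspicious" if v in cluster_of else "normal") for v in vehicles
    }

    accident_labels = []
    for a, b in accidents:
        # Assumed rule: both vehicles must sit in the same flagged cluster.
        same = a in cluster_of and cluster_of.get(a) == cluster_of.get(b)
        accident_labels.append("suspicious" if same else "normal")
    return vehicle_labels, accident_labels
```

An insurer could then route "suspicious" accidents to manual review while processing "normal" ones as usual.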
CONCLUSION: Insurance fraud is an act committed to deceive insurers for financial gain. It has existed since the formation of commercial enterprises and imposes billions of dollars in costs on insurance companies every year. It takes various forms and occurs in all lines of insurance, covering claims that range from exaggerated ones to fabricated accidents and damages.
Auto insurance fraud, particularly the organized fraud studied in this research, is often carried out through group structures. This leads to significant cost increases for insurers and, consequently, higher insurance premiums. Today, given the need for fraud detection in various fields, data mining and machine learning techniques such as artificial neural networks, fuzzy logic, and genetic algorithms have become common tools for fraud detection because of their strong capabilities in modeling and handling complex problems.
Another tool used for detecting organized fraud is graph theory. In this approach, the problem is first modeled mathematically: the accident network is represented as a graph, and organized fraud is then detected using available graph tools. Computer science concepts are then used to identify networks suspected of fraud more precisely. In datasets such as a country's accident records, the volume of data is very large and finding relationships among the records is quite difficult.
Tools such as data mining, machine learning, neural networks, fuzzy logic, and genetic algorithms, whose main purpose is to find relationships among data, are very useful but have some shortcomings. When meta-heuristic algorithms are used, these tools can be inaccurate or can overfit on imbalanced data. With heuristic algorithms, finding relationships among large amounts of data has very high computational complexity and in some cases may take weeks or more to run.
In this research, the authors tried to address this shortcoming using mathematical models while accurately examining the probability of suspicious events. First, the accident network was modeled using graph theory; it was then shown that this model is a random process and that the presence of regular elements in the model indicates sets of vehicles suspected of fraud. Subsequently, suspicious vehicles were extracted from all vehicles using an algorithm for finding suspicious subgraphs, written as an m-file script in MATLAB.
Finally, it was proven that the accident network is a Poisson process and that its occurrence probability can be determined. This reasoning, based on the graph model, helps assign a degree of credibility to each accident and each vehicle regarding suspicion of fraud. For future research, it is suggested that a more extensive network, including all stakeholders in organized fraud, be created and examined. Specifically, future work should investigate labeling the main beneficiaries of an accident based on their profit shares, enabling insurers to adopt tailored policies for various stakeholders (e.g., policyholders, vehicle occupants, repair shops) to reduce financial losses and restore public trust.
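The link between a binomial model over edges and a Poisson description can be illustrated with a standard approximation. The following is a sketch under an assumed random-graph model, not the paper's derivation: there are n vehicles, and each of the N possible accident edges is present independently with probability p. The symbols n, N, p, λ, X, and m are introduced here for illustration only.

```latex
% X = number of edges in the random accident network (assumed model)
\[
X \sim \mathrm{Binomial}(N, p), \qquad N = \binom{n}{2}, \qquad
\Pr(X = k) = \binom{N}{k} p^{k} (1-p)^{N-k}
\;\approx\; \frac{e^{-\lambda}\lambda^{k}}{k!}, \quad \lambda = Np,
\]
% the Poisson approximation holds when N is large and p is small.
% Expected number of fully connected groups of m vehicles:
\[
\mathbb{E}\bigl[\#\{\text{complete clusters on } m \text{ vehicles}\}\bigr]
= \binom{n}{m}\, p^{\binom{m}{2}} .
\]
```

When this expected count is far below 1, a fully connected group of m vehicles is very unlikely to arise by chance, which is the intuition behind treating regular structure in a random network as suspicious.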

Keywords [English]

Car insurance
Graph theory
Poisson distribution
labeling

Iranian Journal of Insurance Research (پژوهشنامه بیمه)

Volume 14, Issue 3 - Serial Number 53
Tir 1404 (June-July 2025)
Pages 209-222