Insurance cases: analysis by machine learning

Keywords: insurance, fraud detection, machine learning, classifiers, visualization, Python

Abstract

One of the main problems of insurance is fraud, when the client wants to get overpayments by distorting information about the insured event. However, traditional methods of insurance fraud combating require a lot of routine manual work and are not very effective. The paper proposes the development of a prototype of the insurance case monitoring system in order to detect fraud using machine-learning methods. The development was carried out on the example of a database of insurance cases, which has 38 variables and contains 1000 records of insurance claims. The dataset provides information on 1) client – 10 features; 2) insurance contract – 7 features; 3) incident – 21 features. Preliminary data processing, modeling and development of the monitoring system was carried out using the Python. Classifiers (logistic regression, gradient boosting and random forest) with different combinations of variables were built. For each model, the conjugation matrix, accuracy, specificity, sensitivity, and ROC curves were analyzed. Simulation results allowed to select 5 main variables for monitoring, 3 of which characterize the client, 2 – incident. The proposed monitoring system allows to identify the following patterns: 1) in most cases, fraudsters were managers and technical support staff; 2) customers, who were practicing chess or CrossFit, were more prone to fraud; 3) most of the fraud was recorded in severe damage; 4) in case of absence of contact with emergency services, a large amount of the claim indicated fraud.

Downloads

Download data is not yet available.

Author Biographies

K. Kononova, V.N. Karazin Kharkiv National University

D.Sc. (Economics), Professor, Professor of the Department of Economic Cybernetics and Applied Economics

M. Tarabanov, V.N. Karazin Kharkiv National University

Master of the Department of Economic Cybernetics and Applied Economics

References

Plastun, V. (2014). Problems of insurance fraud and the practice of avoiding it. Economics: problems of theory and practice, 477-488. (in Ukrainian)

Bondarenko, E. (2020). Crimes in the field of insurance: features of their commission in Ukraine. Electronic repository of NAVS, 3. Retrieved from http://elar.naiau.kiev.ua/jspui/handle/123456789/8505. (in Ukrainian)

Nedzherya, V. (2020). Risks of insurance fraud and methods of combating them. Efficient economy, 3. doi: https://doi.org/10.32702/2307-2105-2020.3.150. (in Ukrainian)

Punith, A. (2021). Insurance claims – Fraud detection using machine learning. Retrieved from https://medium.com/geekculture/insurance-claims-fraud-detection-using-machine-learning-78f04913097.

Roshan, S. (2021). Fraud Detection in Insurance Claims. Retrieved from https://www.kaggle.com/roshansharma/fraud-detection-in-insurance-claims/notebook#Modelling-with-Ensemble-of-Samplers.

Ermoshenko, A. (2009). Insurance fraud as a source of threats in the interaction of insurers and banks. Ukrainian Academy of Banking of the NBU, 27. Retrieved from http://essuir.sumdu.edu.ua/handle/123456789/54055e. (in Ukrainian)

Zhabynetsʹ, O. (2009). Prevention of insurance abuse as one of the factors ensuring economic security of the insurer. Bulletin of Lviv State University of Internal Affairs, 1, 1-6 Retrieved from: https://www.lvduvs.edu.ua/documents_pdf/visnyky/nvse/01_2009/09zojebs.pdf. (in Ukrainian)

Shirinyan, L. (2010). Insurance fraud – economic and legal aspects, indicators and ways to fight. Economics and law, 3. Retrieved from http://dspace.nuft.edu.ua/jspui/handle/123456789/16581. (in Ukrainian)

FEDERAL BUREAU OF INVESTIGATION (FBI). 2020. Insurance Fraud. Retrieved from https://www.fbi.gov/stats-services/publications/insurance-fraud.

Published
2021-12-30
How to Cite
Kononova, K., & Tarabanov, M. (2021). Insurance cases: analysis by machine learning. Bulletin of V. N. Karazin Kharkiv National University Economic Series, (101), 35-44. https://doi.org/10.26565/2311-2379-2021-101-04
Section
Modelling, simulation and information technology in economics and management