Improving Income Tax Fraud Detection Accuracy with Logistic Regression

Authors

  • P Sudha
  • Rafeeda Fatima
  • Meena Kumari KS
  • Vinutha CB

DOI:

https://doi.org/10.52783/jns.v14.3764

Keywords:

Income Tax Fraud Detection, Logistic Regression, Decision Tree, Random Forest, Naive Bayes, K-Nearest Neighbors, Feed Forward Neural Network, Android Studio

Abstract

Another critical source of revenue for governments involves taxation on income for people and companies, which must be paid under law. Probably the biggest challenge is tax fraud, which is the act of intentionally machining the declaration to avoid taxation. In conjunction with this, our project aims to analyze the financial data of taxpayers and establish a strong machine learning model for flagging such errant behavior. The authors compared six machine learning algorithms, the six types being a Feedforward Neural Network, k nearest Neighbors, Random Forest, Naive Bayes, Decision Tree, and Logistic Regression, for the detection and classification of tax fraud. Logistic regression performed best in the detection of tax fraud. The proposed framework is found to be better, whereby it is able to identify complex patterns better than the previously existing methods since both linear and non-linear correlations are considered in between variables. We trained and ran our model on the OpenML dataset. The results seem promising. Our model also assures that, besides being financially viable, it promotes fruitfulness in policy design with much less tax revenue loss. Using Tensor Flow for deployment of the model on Android Studio also enhanced accessibility. It has also led to the build-up of a simple prediction tool that helped people understand the risks involved in tax fraud.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

References

Murorunkwere, B.F., Tuyishimire, O., Haughton, D., Nzabanita, J., “Fraud Detection Using Neural Networks: A Case Study of Income Tax”, Future Internet 2022, 14, 168.

Pérez López, C., Delgado Rodríguez, M.J., de Lucas Santos, S. “Tax Fraud Detection through Neural Networks: An Application Using a Sample of Personal Income Taxpayers”, Future Internet 2019, 11, 86.

M. S. Rad and A. Shahbahrami, “Detecting high risk taxpayers using data mining techniques”, 2016 2nd International Conference of Signal Processing and Intelligent Systems (ICSPIS), Tehran, Iran, 2016, pp. 1-5.

Mojahedi, Houri & Babazadeh sangar, Amin & Masdari, Mohammad. (2022). “Towards Tax Evasion Detection Using Improved Particle Swarm Optimization Algorithm”, Mathematical Problems in Engineering. 2022. 1-17.

de Roux, D., Perez, B., Moreno, A., Villamil, M.D.P., Figueroa, C. “Tax fraud detection for under-reporting declarations using an unsupervised machine learning approach”, In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, London, UK, 19–23 August 2018; pp. 215–222

Miloš Savić, Jasna Atanasijević, Dušan Jakovetić, Nataša Krejić, “Tax evasion risk management using a Hybrid Unsupervised Outlier Detection method”, Expert Systems with Applications, Volume 193, 2022, 116409, ISSN 0957-4174.

González, P.C., Velásquez, J.D, “Characterization and detection of taxpayers with false invoices using data mining techniques”, Expert Syst. Appl. 2013, 40, 1427–1436.

Ghosh, S., Douglas, L.R, “Credit card fraud detection with a neural-network”, In Proceedings of the Twenty-Seventh Hawaii International Conference, Wailea, HI, USA, 4–7 January 1994.

Chi-Hung Lin, I-Chun Lin, Ching-Huei Wu, Ya-Ching Yang & Jinsheng Roan (2012) “The application of decision tree and artificial neural network to income tax audit: the examples of profit-seeking enterprise income tax and individual income tax in Taiwan”, Journal of the Chinese Institute of Engineers, 35:4, 401-41.

B. Baesens, V. Vlasselaer, W. Verbeke. “Fraud Analytics Using Descriptive, Predictive, and Social Network Techniques”, Wiley, 2015.

Lee, Kidong, David E. Booth and Pervaiz Alam. “A comparison of supervised and unsupervised neural networks in predicting bankruptcy of Korean firms”, Expert Syst. Appl. 29 (2005): 1-16.

Clavería Navarrete, A. y Carrasco Gallego, A. (2021). “Neural network algorithms for fraud detection: a comparison of the complementary techniques in the last five years”, Journal of Management Information and Decision Sciences, 24 (special 1), 1-16.

Pérez López, César, María Jesús Delgado Rodríguez, and Sonia de Lucas Santos. 2019. “Tax Fraud Detection through Neural Networks: An Application Using a Sample of Personal Income Taxpayers”, Future Internet 11, no. 4: 86.

D. Buddhi, R. Singh and A. Gehlot, “Online Virtual Classroom Application In Android Studio,” 2022 International Interdisciplinary Humanitarian Conference for Sustainability (IIHC), Bengaluru, India, 2022, pp. 960-963.

Z. Zulfikar, Z. Zulhelmi, T. Y. Arif, A. Afdhal and P. N. Syawaldi, “Android Application: Skin Abnormality Analysis based on Edge Detection Technique,” 2018 International Conference on Electrical Engineering and Informatics (ICELTICs), Banda Aceh, Indonesia, 2018, pp. 89-94.

Downloads

Published

2025-04-15

How to Cite

1.
Sudha P, Fatima R, Kumari KS M, CB V. Improving Income Tax Fraud Detection Accuracy with Logistic Regression. J Neonatal Surg [Internet]. 2025Apr.15 [cited 2025Sep.29];14(14S):461-74. Available from: https://www.jneonatalsurg.com/index.php/jns/article/view/3764