Improving Income Tax Fraud Detection Accuracy with Logistic Regression
DOI:
https://doi.org/10.52783/jns.v14.3764Keywords:
Income Tax Fraud Detection, Logistic Regression, Decision Tree, Random Forest, Naive Bayes, K-Nearest Neighbors, Feed Forward Neural Network, Android StudioAbstract
Another critical source of revenue for governments involves taxation on income for people and companies, which must be paid under law. Probably the biggest challenge is tax fraud, which is the act of intentionally machining the declaration to avoid taxation. In conjunction with this, our project aims to analyze the financial data of taxpayers and establish a strong machine learning model for flagging such errant behavior. The authors compared six machine learning algorithms, the six types being a Feedforward Neural Network, k nearest Neighbors, Random Forest, Naive Bayes, Decision Tree, and Logistic Regression, for the detection and classification of tax fraud. Logistic regression performed best in the detection of tax fraud. The proposed framework is found to be better, whereby it is able to identify complex patterns better than the previously existing methods since both linear and non-linear correlations are considered in between variables. We trained and ran our model on the OpenML dataset. The results seem promising. Our model also assures that, besides being financially viable, it promotes fruitfulness in policy design with much less tax revenue loss. Using Tensor Flow for deployment of the model on Android Studio also enhanced accessibility. It has also led to the build-up of a simple prediction tool that helped people understand the risks involved in tax fraud.
Downloads
Metrics
References
Murorunkwere, B.F., Tuyishimire, O., Haughton, D., Nzabanita, J., “Fraud Detection Using Neural Networks: A Case Study of Income Tax”, Future Internet 2022, 14, 168.
Pérez López, C., Delgado Rodríguez, M.J., de Lucas Santos, S. “Tax Fraud Detection through Neural Networks: An Application Using a Sample of Personal Income Taxpayers”, Future Internet 2019, 11, 86.
M. S. Rad and A. Shahbahrami, “Detecting high risk taxpayers using data mining techniques”, 2016 2nd International Conference of Signal Processing and Intelligent Systems (ICSPIS), Tehran, Iran, 2016, pp. 1-5.
Mojahedi, Houri & Babazadeh sangar, Amin & Masdari, Mohammad. (2022). “Towards Tax Evasion Detection Using Improved Particle Swarm Optimization Algorithm”, Mathematical Problems in Engineering. 2022. 1-17.
de Roux, D., Perez, B., Moreno, A., Villamil, M.D.P., Figueroa, C. “Tax fraud detection for under-reporting declarations using an unsupervised machine learning approach”, In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, London, UK, 19–23 August 2018; pp. 215–222
Miloš Savić, Jasna Atanasijević, Dušan Jakovetić, Nataša Krejić, “Tax evasion risk management using a Hybrid Unsupervised Outlier Detection method”, Expert Systems with Applications, Volume 193, 2022, 116409, ISSN 0957-4174.
González, P.C., Velásquez, J.D, “Characterization and detection of taxpayers with false invoices using data mining techniques”, Expert Syst. Appl. 2013, 40, 1427–1436.
Ghosh, S., Douglas, L.R, “Credit card fraud detection with a neural-network”, In Proceedings of the Twenty-Seventh Hawaii International Conference, Wailea, HI, USA, 4–7 January 1994.
Chi-Hung Lin, I-Chun Lin, Ching-Huei Wu, Ya-Ching Yang & Jinsheng Roan (2012) “The application of decision tree and artificial neural network to income tax audit: the examples of profit-seeking enterprise income tax and individual income tax in Taiwan”, Journal of the Chinese Institute of Engineers, 35:4, 401-41.
B. Baesens, V. Vlasselaer, W. Verbeke. “Fraud Analytics Using Descriptive, Predictive, and Social Network Techniques”, Wiley, 2015.
Lee, Kidong, David E. Booth and Pervaiz Alam. “A comparison of supervised and unsupervised neural networks in predicting bankruptcy of Korean firms”, Expert Syst. Appl. 29 (2005): 1-16.
Clavería Navarrete, A. y Carrasco Gallego, A. (2021). “Neural network algorithms for fraud detection: a comparison of the complementary techniques in the last five years”, Journal of Management Information and Decision Sciences, 24 (special 1), 1-16.
Pérez López, César, María Jesús Delgado Rodríguez, and Sonia de Lucas Santos. 2019. “Tax Fraud Detection through Neural Networks: An Application Using a Sample of Personal Income Taxpayers”, Future Internet 11, no. 4: 86.
D. Buddhi, R. Singh and A. Gehlot, “Online Virtual Classroom Application In Android Studio,” 2022 International Interdisciplinary Humanitarian Conference for Sustainability (IIHC), Bengaluru, India, 2022, pp. 960-963.
Z. Zulfikar, Z. Zulhelmi, T. Y. Arif, A. Afdhal and P. N. Syawaldi, “Android Application: Skin Abnormality Analysis based on Edge Detection Technique,” 2018 International Conference on Electrical Engineering and Informatics (ICELTICs), Banda Aceh, Indonesia, 2018, pp. 89-94.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
You are free to:
- Share — copy and redistribute the material in any medium or format
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
Terms:
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.