Hindi Fake News Detection Using Machine Learning Models
Keywords:
Fake, Machine learning algorithms, Random Forest, CNN, XGBoost
Abstract
Social media and online platforms amplify the spread of misinformation, with direct effects on politics, economics, and society. This study applies machine learning and deep learning to the detection of fake news in Hindi. The results are derived from a dataset of over 2,100 Hindi news articles labeled as genuine or fabricated. Three algorithms are compared: XGBoost, Convolutional Neural Networks (CNN), and Random Forest (RF). The text was preprocessed with tokenization, stop word removal, and stemming, and features were extracted using TF-IDF before the models were evaluated. XGBoost performed best, with 94% accuracy, ahead of Random Forest at 85% and CNN at 84%. XGBoost also outperformed the other two models on RMSE and MAE. These findings underline the strong potential of both ensemble and deep learning models for detecting fake news in Hindi.
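The preprocessing, feature extraction, and evaluation steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the stop word list, the suffix list used for stemming, the smoothed IDF variant, and all function names are assumptions chosen for illustration, and the classifiers themselves are omitted.

```python
import math
import re
from collections import Counter

# Illustrative stop words and suffixes only (assumptions, not the study's lists).
STOP_WORDS = {"है", "हैं", "का", "की", "के", "में", "और", "से", "को", "पर", "यह", "ने"}
SUFFIXES = ("ियों", "ाओं", "ों", "ें", "ाँ")  # checked longest first


def tokenize(text):
    # Replace the danda and common punctuation with spaces, then split on whitespace.
    return re.sub(r"[।,.!?;:\"'()\-]", " ", text).split()


def stem(token):
    # Very light suffix stripping; a real Hindi stemmer is considerably richer.
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) > len(suffix) + 1:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    # Tokenize, drop stop words, then stem the remaining tokens.
    return [stem(t) for t in tokenize(text) if t not in STOP_WORDS]


def tfidf(docs):
    # Return one {term: weight} dict per document.
    # Weight = term frequency * smoothed IDF, ln((1 + n) / (1 + df)) + 1.
    tokenized = [preprocess(d) for d in docs]
    n = len(tokenized)
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1
        vectors.append({
            term: (count / total) * (math.log((1 + n) / (1 + df[term])) + 1)
            for term, count in counts.items()
        })
    return vectors


def accuracy(y_true, y_pred):
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def mae(y_true, y_pred):
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


if __name__ == "__main__":
    docs = ["सरकार ने नई योजना की घोषणा की।", "यह खबर झूठी है और लोगों में फैल रही है।"]
    for vector in tfidf(docs):
        print(vector)
    # Labels: 1 = genuine, 0 = fabricated (an assumed encoding).
    print(accuracy([1, 0, 1, 1], [1, 0, 0, 1]))
```

When labels and predictions are hard 0/1 values, as assumed here, MAE equals the misclassification rate and RMSE is its square root, so both metrics rank models in the same order as accuracy does.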
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
You are free to:
- Share — copy and redistribute the material in any medium or format
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
Terms:
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.