Comparative Analysis of Machine Learning Models for Crop Yield Prediction Using Categorical and Numerical Agro-Meteorological Data
Keywords:
Crop Yield Prediction, Deep Learning, Artificial Neural Networks, Regression Models, Machine Learning, Random Forest, Gradient Boosting, Support Vector Regression, Linear Regression, Agricultural Data Analysis, Model Evaluation, Precision Agriculture, MAEAbstract
Accurate crop yield prediction plays a vital role in ensuring food security, optimizing agricultural planning, and enabling efficient resource allocation. With the increasing availability of agricultural datasets, machine learning and deep learning techniques have emerged as powerful tools for forecasting crop yields based on historical and agro-climatic data. This study presents a comprehensive comparative analysis of five prominent regression models—Deep Learning (Artificial Neural Networks), Linear Regression, Random Forest Regressor, Gradient Boosting Regressor, and Support Vector Regressor—for crop yield prediction. The dataset used in this study comprises a combination of categorical features (crop type, state, season, year) and numerical attributes (area and production), which were appropriately encoded and scaled for model training.
Model performance was rigorously evaluated using standard regression metrics: Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and Coefficient of Determination (R²). The results reveal that the deep learning model significantly outperformed all traditional regression approaches, achieving an R² score of 0.94 and a notably low RMSE of 227.99, indicating its superior capability in capturing complex, non-linear relationships in agricultural data. Random Forest and Gradient Boosting regressors also demonstrated robust performance with R² values of 0.88 and 0.84, respectively. In contrast, Linear Regression and Support Vector Regressor exhibited subpar predictive accuracy, particularly the SVR, which failed to generalize to the data (R² = -0.00).
This research highlights the efficacy of deep learning in enhancing crop yield prediction accuracy and underscores the limitations of simpler linear models in handling heterogeneous, high-dimensional agricultural data. The findings have practical implications for precision agriculture, enabling data-driven decision-making for farmers, agronomists, and policymakers. Future directions include incorporating meteorological and soil data, exploring temporal deep learning models such as LSTMs, and integrating explainable AI methods to interpret model predictions
Downloads
Metrics
References
S. Thirumal and R. Latha, "Automated Rice Crop Yield Prediction using Sine Cosine Algorithm with Weighted Regularized Extreme Learning Machine," 2023 7th International Conference on Intelligent Computing and Control Systems (ICICCS), Madurai, India, 2023, pp. 35-40.
R. J, V. K. G. Kalaiselvi, A. Sheela, D. S. D, and J. G, "Crop Yield Prediction Using Machine Learning Algorithm," 2021 4th International Conference on Computing and Communications Technologies (ICCCT), Chennai, India, 2021, pp. 611-616.
V. K, S. G, R. P. P, and R. M, "Enhancing Predictive Accuracy for Agricultural Crop Yields in Indian States Using Power Transformation in Machine Learning Models," 2024 10th International Conference on Advanced Computing and Communication Systems (ICACCS), Coimbatore, India, 2024, pp. 2403-2408.
N. M. Basavaraju, U. B. Mahadevaswamy, and S. Mallikarjunaswamy, "Design and Implementation of Crop Yield Prediction and Fertilizer Utilization Using IoT and Machine Learning in Smart Agriculture Systems," 2024 Second International Conference on Networks, Multimedia and Information Technology (NMITCON), Bengaluru, India, 2024, pp. 1-6.
M. Shilpa et al., "Enhancing Crop Yield and Growth Prediction Using IoT-Based Smart Irrigation Systems and Machine Learning Algorithms," 2024 Second International Conference on Networks, Multimedia and Information Technology (NMITCON), Bengaluru, India, 2024, pp. 1-5.
N. Santha Raju, R. Tamilkodi, V. C. Shekar, B. Jaya Bharathi, K. Dinesh Kumar, and Y. Sumanth, "AI-Powered Crop Suggestion, Yield Prediction, Disease Detection, and Soil Monitoring," 2024 3rd International Conference on Automation, Computing and Renewable Systems (ICACRS), Pudukkottai, India, 2024, pp. 1120-1124.
M. G. Ananthara, T. Arunkumar, and R. Hemavathy, "CRY — An Improved Crop Yield Prediction Model Using Bee Hive Clustering Approach for Agricultural Data Sets," 2013 International Conference on Pattern Recognition, Informatics and Mobile Engineering, Salem, India, 2013, pp. 473-478.
R. H. Meem and T. Noor Turna, "Crop Yields Prediction in Bangladesh: A Hybrid Machine Learning and DNN Approach," 2024 IEEE International Conference on Computing, Applications and Systems (COMPAS), Cox's Bazar, Bangladesh, 2024, pp. 1-6.
K. Keerthi, K. Tejaswi, S. Shalini, and K. Kishore Kumar, "Crop Yield Prediction: Leveraging Machine Learning for Sustainable Agriculture," 2024 2nd International Conference on Self Sustainable Artificial Intelligence Systems (ICSSAS), Erode, India, 2024, pp. 551-556.
A. k. Gajula, J. Singamsetty, V. C. Dodda, and L. Kuruguntla, "Prediction of Crop and Yield in Agriculture Using Machine Learning Technique," 2021 12th International Conference on Computing Communication and Networking Technologies (ICCCNT), Kharagpur, India, 2021, pp. 1-5.
G. B. Raj, J. S. Priya, R. D. Jadhav, V. P. S, A. K. Koshariya, and K. Sucharitha, "An Enhanced Extreme Learning Machine for Crop Yield Prediction," 2024 International Conference on Power, Energy, Control and Transmission Systems (ICPECTS), Chennai, India, 2024, pp. 1-4.
A. Tripathi et al., "Crop Yield Prediction using Systematic Review Process based Machine Learning Algorithm," 2024 2nd International Conference on Advances in Computation, Communication and Information Technology (ICAICCIT), Faridabad, India, 2024, pp. 1132-1136.
Y. Y. G, N. G, S. Joseph and J. Bhadra, "Yield Prediction of Rabi Crops in India Using XGBoost Regressor," 2025 3rd International Conference on Intelligent Data Communication Technologies and Internet of Things (IDCIoT), Bengaluru, India, 2025, pp. 2217-2222.
A. S. Terliksiz and D. T. Altýlar, "Use Of Deep Neural Networks For Crop Yield Prediction: A Case Study Of Soybean Yield in Lauderdale County, Alabama, USA," 2019 8th International Conference on Agro-Geoinformatics (Agro-Geoinformatics), Istanbul, Turkey, 2019, pp. 1-4.
G. Hariyani, A. Singh, P. Patil, V. Kothari and D. Javale, "Analysis on Crop Yield Prediction using various Ensemble Methods," 2024 8th International Conference on Computing, Communication, Control and Automation (ICCUBEA), Pune, India, 2024, pp. 1-6.
P. S. Bharathi, V. Amudha, G. Ramkumar, T. J. Nagalakshmi, N. Nalini and P. Jagadeesh, "An Experimental Analysis of Crop Yield Prediction using Modified Deep Learning Strategy," 2022 International Conference on Advances in Computing, Communication and Applied Informatics (ACCAI), Chennai, India, 2022, pp. 1-6.
A. Zainab, M. S. Boori and K. U. Din, "A Review of Crop Yield Prediction Models based on Crop Phenology Using Satellite Imagery and Environmental Data," 2024 X International Conference on Information Technology and Nanotechnology (ITNT), Samara, Russian Federation, 2024, pp. 1-5.
A. Sharma, A. Tamrakar, S. Dewasi and N. S. Naik, "Early Prediction of Crop Yield in India using Machine Learning," 2022 IEEE Region 10 Symposium (TENSYMP), Mumbai, India, 2022, pp. 1-6.
S. Thirumal and R. Latha, "Automated Hyperparameter Tuned Stacked Autoencoder based Rice Crop Yield Prediction Model," 2023 7th International Conference on Trends in Electronics and Informatics (ICOEI), Tirunelveli, India, 2023, pp. 14-18.
V. Patki and P. Wazurkar, "Application of Data Sampling for Crop Yield Prediction using Stochastic Gradient Descent Neural Networks," 2021 10th IEEE International Conference on Communication Systems and Network Technologies (CSNT), Bhopal, India, 2021, pp. 685-689.
P. Huo, Z. Ma, Z. He, K. Lu, H. Zhang and J. Tang, "Wide-area Crop Yield Prediction Based on Multi-Source Remote Sensing Data Fusion and Attention Net," 2024 7th International Conference on Pattern Recognition and Artificial Intelligence (PRAI), Hangzhou, China, 2024, pp. 322-326.
P. Saini and B. Nagpal, "Deep-LSTM Model for Wheat Crop Yield Prediction in India," 2022 Fifth International Conference on Computational Intelligence and Communication Technologies (CCICT), Sonepat, India, 2022, pp. 73-78.
A. Dhande and R. Malik, "Empirical Study of Crop-disease Detection and Crop-yield Analysis Systems: A Statistical View," 2022 International Conference on Emerging Smart Computing and Informatics (ESCI), Pune, India, 2022, pp. 1-4.
A. T, N. A. P, B. B. N and S. P, "Crop Recommendation and Yield Prediction using Machine learning and Deep learning models," 2024 International Conference on Integration of Emerging Technologies for the Digital World (ICIETDW), Chennai, India, 2024, pp. 1-6.
F. Shahrin, L. Zahin, R. Rahman, A. J. Hossain, A. H. Kaf and A. K. M. Abdul Malek Azad, "Agricultural Analysis and Crop Yield Prediction of Habiganj using Multispectral Bands of Satellite Imagery with Machine Learning," 2020 11th International Conference on Electrical and Computer Engineering (ICECE), Dhaka, Bangladesh, 2020, pp. 21-24..
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
You are free to:
- Share — copy and redistribute the material in any medium or format
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
Terms:
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.