Bridging Ayurveda and AI: Data Standardization for Improved Machine Learning Application
DOI:
https://doi.org/10.52783/jns.v14.3556Keywords:
Ayurveda, Machine Learning, Data Standardization, BERT, NLP, Knowledge RetrievalAbstract
Ayurveda integration with machine learning (ML) applications must be grounded on a standardized and organized dataset to handle the complexity and heterogeneity of traditional medical terminology. The present paper suggests a process of data standardization in response to the vagueness in Ayurvedic texts to ensure uniformity in disease, symptom, treatment, and dosha categorization. A pre-defined ontology translated raw Ayurvedic terms into standardized terms to improve data quality for ML training. To analyze the impact of standardization, different ML models—Naïve Bayes, CNN, and BERT—were trained on standardized data. The results show that the maximum classification accuracy (100%) was achieved by BERT, which demonstrates the effectiveness of contextual embeddings for Ayurvedic text classification. The findings demonstrate that standardization significantly improves the performance of models, improving knowledge retrieval and compatibility with modern healthcare systems. This research contributes to building robust, machine-usable Ayurvedic datasets for AI-based diagnosis and treatment recommendation in traditional medicine.
Downloads
Metrics
References
A. V. Arun, P. N. Balasaheb, J. V. Babasaheb, J. D. Kailas, K. R. Adinath, and N. D. Dadasaheb, “Artificial intelligence and challenges in Ayurveda pharmaceutics: A review,” Research Journal of Science and Technology, vol. 16, no. 3, pp. 237–244, 2024.
P. Bidve and S. Mishra, “Enhancing Ayurvedic Diagnosis using Multinomial Naive Bayes and K-modes Clustering: An Investigation into Prakriti Types and Dosha Overlapping,” arXiv preprint, arXiv:2310.02920, 2023.
G. Névéol, K. B. Zweigenbaum, S. R. Velupillai, W. Chapman, M. Suominen, and P. Savova, “Challenges and Opportunities in Natural Language Processing for Clinical Data,” Journal of Biomedical Semantics, vol. 9, no. 1, p. 1, 2018.
Jadhav Vikas, S., Wakale Ashwini, D. and Mane, S.R., Integration of machine learning in Ayurveda: An Indian traditional health science.
T. M. Nesari, “Artificial intelligence in the sector of Ayurveda: Scope and opportunities,” Int. J. Ayurveda Res., vol. 4, no. 2, pp. 57–60, 2023.
L. Bheemavarapu and K. U. Rani, “Machine learning models used for Prakriti identification using Prasna Pariksha in Ayurveda–A review,” Mathematical Statistician and Engineering Applications, vol. 72, no. 1, pp. 1942–1951, 2023.
H. Singh, S. Bhargava, S. Ganeshan, R. Kaur, T. Sethi, M. Sharma, M. Chauhan, N. Chauhan, R. Chauhan, P. Chauhan, and S. K. Brahmachari, “Big data analysis of traditional knowledge-based Ayurveda medicine,” Progress in Preventive Medicine, vol. 3, no. 5, p. e0020, 2018.
A. M. S., “Ayurveda Meets AI: How NLP Is Shaping the Future of Holistic Medicine,” Journal of Emerging Technologies and Innovative Research (JETIR), vol. 11, no. 6, p. JETIR2406A62, Jun. 2024.
H. Terdalkar, A. Bhattacharya, M. Dubey, and B. N. Singh, “Semantic Annotation and Querying Framework based on Semi-structured Ayurvedic Text,” arXiv preprint, arXiv:2202.00216, 2022
A. V. Arun, P. N. Balasaheb, J. V. Babasaheb, K. J. D. Kailas, A. K. R. Adinath, and D. N. Dadasaheb, “Artificial Intelligence and Challenges in Ayurveda Pharmaceutics: A Review,” Research Journal of Science and Technology, vol. 16, no. 3, pp. 237-244, 2024.
P. D. Gupta, “Pharmacogenetics, pharmacogenomics and ayurgenomics for personalized medicine: A paradigm shift,” Indian Journal of Pharmaceutical Sciences, vol. 77, no. 2, pp. 135–141, Mar.–Apr. 2015, doi: 10.4103/0250-474X.156543.
Y. D. Madgulwar and K. J. Shewalkar, “The intersection of Ayurveda and genomics: Exploring Ayurgenomics for personalized health solutions,” Journal of Ayurveda and Integrated Medical Sciences, vol. 9, no. 10, pp. 168–177, 2024.
Sanjay Gupta, Narasimha V, Vijaya Lakshmi A. Artificial Intelligence (AI) in Ayurveda: Its Application and Relevance. Ayushdhara [Internet]. 2025Jan.15 [cited 2025Mar.10];11(6):165-9.
Jyothi Raga P M, Vivek P, & Harinarayanan C M. (2023). Need of Standardization of Ayurveda Formulations. International Journal of Ayurveda and Pharma Research, 11(10), 40-45.
A. Chauhan, D. K. Semwal, S. P. Mishra, and R. B. Semwal, “Ayurvedic research and methodology: Present status and future strategies,” AYU (An International Quarterly Journal of Research in Ayurveda), vol. 36, no. 4, pp. 364–369, 2015.
Raghav Singh, “Ayurvedic Formulations and their Indications,” Kaggle,2023. https://www.kaggle.com/datasets/raghavdecoded/ayurvedic-formulations-and-their-indications
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
You are free to:
- Share — copy and redistribute the material in any medium or format
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
Terms:
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.