Bridging Ayurveda and AI: Data Standardization for Improved Machine Learning Application

Authors

  • Sudha.P
  • Akash .K. A
  • Sanjana. B. K
  • Thejaswini .G. K

DOI:

https://doi.org/10.52783/jns.v14.3556

Keywords:

Ayurveda, Machine Learning, Data Standardization, BERT, NLP, Knowledge Retrieval

Abstract

Ayurveda integration with machine learning (ML) applications must be grounded on a standardized and organized dataset to handle the complexity and heterogeneity of traditional medical terminology. The present paper suggests a process of data standardization in response to the vagueness in Ayurvedic texts to ensure uniformity in disease, symptom, treatment, and dosha categorization. A pre-defined ontology translated raw Ayurvedic terms into standardized terms to improve data quality for ML training. To analyze the impact of standardization, different ML models—Naïve Bayes, CNN, and BERT—were trained on standardized data. The results show that the maximum classification accuracy (100%) was achieved by BERT, which demonstrates the effectiveness of contextual embeddings for Ayurvedic text classification. The findings demonstrate that standardization significantly improves the performance of models, improving knowledge retrieval and compatibility with modern healthcare systems. This research contributes to building robust, machine-usable Ayurvedic datasets for AI-based diagnosis and treatment recommendation in traditional medicine.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

References

A. V. Arun, P. N. Balasaheb, J. V. Babasaheb, J. D. Kailas, K. R. Adinath, and N. D. Dadasaheb, “Artificial intelligence and challenges in Ayurveda pharmaceutics: A review,” Research Journal of Science and Technology, vol. 16, no. 3, pp. 237–244, 2024.

P. Bidve and S. Mishra, “Enhancing Ayurvedic Diagnosis using Multinomial Naive Bayes and K-modes Clustering: An Investigation into Prakriti Types and Dosha Overlapping,” arXiv preprint, arXiv:2310.02920, 2023.

G. Névéol, K. B. Zweigenbaum, S. R. Velupillai, W. Chapman, M. Suominen, and P. Savova, “Challenges and Opportunities in Natural Language Processing for Clinical Data,” Journal of Biomedical Semantics, vol. 9, no. 1, p. 1, 2018.

Jadhav Vikas, S., Wakale Ashwini, D. and Mane, S.R., Integration of machine learning in Ayurveda: An Indian traditional health science.

T. M. Nesari, “Artificial intelligence in the sector of Ayurveda: Scope and opportunities,” Int. J. Ayurveda Res., vol. 4, no. 2, pp. 57–60, 2023.

L. Bheemavarapu and K. U. Rani, “Machine learning models used for Prakriti identification using Prasna Pariksha in Ayurveda–A review,” Mathematical Statistician and Engineering Applications, vol. 72, no. 1, pp. 1942–1951, 2023.

H. Singh, S. Bhargava, S. Ganeshan, R. Kaur, T. Sethi, M. Sharma, M. Chauhan, N. Chauhan, R. Chauhan, P. Chauhan, and S. K. Brahmachari, “Big data analysis of traditional knowledge-based Ayurveda medicine,” Progress in Preventive Medicine, vol. 3, no. 5, p. e0020, 2018.

A. M. S., “Ayurveda Meets AI: How NLP Is Shaping the Future of Holistic Medicine,” Journal of Emerging Technologies and Innovative Research (JETIR), vol. 11, no. 6, p. JETIR2406A62, Jun. 2024.

H. Terdalkar, A. Bhattacharya, M. Dubey, and B. N. Singh, “Semantic Annotation and Querying Framework based on Semi-structured Ayurvedic Text,” arXiv preprint, arXiv:2202.00216, 2022

A. V. Arun, P. N. Balasaheb, J. V. Babasaheb, K. J. D. Kailas, A. K. R. Adinath, and D. N. Dadasaheb, “Artificial Intelligence and Challenges in Ayurveda Pharmaceutics: A Review,” Research Journal of Science and Technology, vol. 16, no. 3, pp. 237-244, 2024.

P. D. Gupta, “Pharmacogenetics, pharmacogenomics and ayurgenomics for personalized medicine: A paradigm shift,” Indian Journal of Pharmaceutical Sciences, vol. 77, no. 2, pp. 135–141, Mar.–Apr. 2015, doi: 10.4103/0250-474X.156543.

Y. D. Madgulwar and K. J. Shewalkar, “The intersection of Ayurveda and genomics: Exploring Ayurgenomics for personalized health solutions,” Journal of Ayurveda and Integrated Medical Sciences, vol. 9, no. 10, pp. 168–177, 2024.

Sanjay Gupta, Narasimha V, Vijaya Lakshmi A. Artificial Intelligence (AI) in Ayurveda: Its Application and Relevance. Ayushdhara [Internet]. 2025Jan.15 [cited 2025Mar.10];11(6):165-9.

Jyothi Raga P M, Vivek P, & Harinarayanan C M. (2023). Need of Standardization of Ayurveda Formulations. International Journal of Ayurveda and Pharma Research, 11(10), 40-45.

A. Chauhan, D. K. Semwal, S. P. Mishra, and R. B. Semwal, “Ayurvedic research and methodology: Present status and future strategies,” AYU (An International Quarterly Journal of Research in Ayurveda), vol. 36, no. 4, pp. 364–369, 2015.

Raghav Singh, “Ayurvedic Formulations and their Indications,” Kaggle,2023. https://www.kaggle.com/datasets/raghavdecoded/ayurvedic-formulations-and-their-indications

Downloads

Published

2025-04-12

How to Cite

1.
Sudha.P S, .K. A A, B. K S, .G. K T. Bridging Ayurveda and AI: Data Standardization for Improved Machine Learning Application . J Neonatal Surg [Internet]. 2025Apr.12 [cited 2025Sep.10];14(14S):183-9. Available from: https://www.jneonatalsurg.com/index.php/jns/article/view/3556