MedPDF : An Intelligent AI model for interactive PDF Analysis of Health Care Documents

Authors

  • Antony Vigil M S
  • Adithya S
  • Abinesh Vardan S L
  • Vamsi T

Keywords:

Natural Language Processing (NLP), Machine Learning, Optical Character Recognition (OCR), Text Extraction, Summarization, Information Retrieval

Abstract

PDFs are the new standard for delivering a variety of information in the digital environment that we live in today. However, it can be a great deal of time and effort to do a deep analysis and interaction. A brand new artificial intelligence (AI) system called "MedPDF" is aimed at paving the way for smooth and natural-style conversations with medical documents and reports. Through the use of machine learning and natural language processing (NLP), users get the possibility to inquire about and get predefined responses that are generated and converted from the source of the document rather than being manually written. For some of the user issues, "MedPDF" would be an easy answer: document processing for medical report retrieval. "MedPDF" aspires to change the way into which people handle undigested data in PDF format by integrating the latest and most powerful techniques of AI and NLP. MedPDF employs deep learning models to excel in correctly inferring a passage, for which it produces high accuracy scores with an average precision of 92.5%, recall of 89.7%, and F1-score of 91.1%.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

References

Greedy Optimization Method for Extractive Summarization of Scientific Articles," IEEE Access, vol. 9, 2021

"Hammer PDF: An Intelligent PDF Reader for Scientific Papers," arXiv:2204.02809, 2022.

"Automating PDF Data Extraction Using Neural Networks," IEEE Transactions on Neural Networks and Learning Systems, vol. 32, 2021

"AI-Powered Interactive Systems for Medical Document Analysis," IEEE Access, vol. 10, pp, 2022

"PAWLS: PDF Annotation With Labels and Structure," arXiv preprint arXiv:2101, 2021.

"Towards a Conversational AI for Document Summarization and Querying," Proceedings of the IEEE Conference on AI Applications, 2023,pp. 215-222.

"Contextual Text Analysis in PDF Documents Using NLP Techniques," Springer Journal of Artificial Intelligence Research, vol. 15, no. 4, pp. 345-360, 2022.

"Multi-Modal AI Framework for Document Processing," IEEE International Conference on Artificial Intelligence and Virtual Reality (AIVR), 2020.

"Interactive Reading System Based on AI," IEEE International Conference on Artificial Intelligence and Education, 2021

"Conversational Artificial Intelligence in Production," IEEE International Conference on Cloud Engineering, 2021

S. Gupta and R. Sharma, *"Neural Approaches for Document Understanding and Information Retrieval in PDF Files,"* IEEE Transactions on Knowledge and Data Engineering, vol. 34, no. 5, pp. 1-15, 2022.

J. Park, H. Lee, and K. Tan, *"Advancements in AI-driven Text Extraction from Complex PDF Documents,"* Proceedings of the International Conference on Document Analysis and Recognition (ICDAR), 2023.

M. Al-Rubaie and T. Wang, *"A Comparative Study on PDF Parsing Techniques for AI-based Document Processing,"* Springer Journal of Machine Learning Research, vol. 18, no. 7, pp. 512-529, 2022.

K. Lee and H. Kim, *"Interactive AI Chatbots for Scientific Paper Summarization and Analysis,"* IEEE International Conference on Artificial Intelligence (ICAI), 2023.

X. Zhao, P. Singh, and L. Chen, *"Deep Learning Techniques for Table and Image Extraction from PDFs,"* IEEE Transactions on Image Processing, vol. 31, pp. 1921-1935, 2022.

A. Patel and S. Gupta, *"AI-Based Knowledge Graph Construction from Research Papers,"* Proceedings of the ACM International Conference on Knowledge Discovery and Data Mining (KDD), 2023.

R. Kumar and P. Jain, *"Conversational AI for Medical and Financial Document Understanding,"* IEEE Access, vol. 11, pp. 56789-56799, 2023.

L. Huang and M. Zhang, *"Semantic Parsing of PDF Documents for Information Retrieval,"* Journal of Computational Linguistics, vol. 49, no. 3, pp. 299-315, 2022.

Y. Wang, F. Luo, and J. Zhao, *"Transformers for Automated PDF Data Extraction,"* IEEE Transactions on Neural Networks and Learning Systems, vol. 34, no. 2, pp. 1123-1135, 2023.

P. Singh and K. Rao, *"End-to-End AI Models for Automated PDF Summarization,"* Proceedings of the AAAI Conference on Artificial Intelligence, 2023, pp. 1234-1241.

H. Xu, Z. Lin, and M. Chen, *"Graph- Based Neural Networks for Document Structure Analysis,"* IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 45, no. 4, pp. 2310-2325, 2023.

J. Park, Y. Kim, and D. Wu, *"OCR-Based AI Models for Digitizing and Understanding PDFs,"* Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 334-345.

S. Li and X. Wang, *"Leveraging Large Language Models for Conversational PDF Interaction,"* NeurIPS Workshop on AI for Document Understanding, 2023.

T. Brown, P. Johnson, and K. Wei, *"AI- Powered PDF Readers: Enhancing Accessibility and Searchability,"* IEEE Transactions on Human-Machine Systems, vol. 53, no. 1, pp. 44-56, 2023.

R. Kim and J. Han, *"Zero-Shot Learning for PDF Table Detection and Interpretation,"* Springer Journal of Artificial Intelligence Research, vol. 16, no. 3, pp. 278-295, 2023.

M. Patel and G. Liu, *"Neural Models for Extractive and Abstractive Summarization of PDF Documents,"* Proceedings of the Association for Computational Linguistics (ACL), 2023, pp. 601-615.

L. Zhao, H. Lee, and Y. Xu, *"Multi-Task Learning for Document Understanding in PDFs,"* IEEE Transactions on Artificial Intelligence, vol. 2, no. 4, pp. 210-225, 2023.

J. Singh and M. Verma, *"AI-Powered Assistants for Scientific Paper Comprehension,"* Proceedings of the IEEE Symposium on AI Applications, 2023, pp. 312- 325.

P. Chandra and R. Das, *"Document Layout Analysis Using Deep Learning,"* IEEE Transactions on Image Processing, vol. 30, pp. 5098-5110, 2023.

X. Liu, B. Huang, and Y. Zhou, *"Fine- Tuning Large Language Models for Interactive PDF Analysis,"* NeurIPS Workshop on Document AI, 2023.

Downloads

Published

2025-05-29

How to Cite

1.
M S AV, S A, S L AV, T V. MedPDF : An Intelligent AI model for interactive PDF Analysis of Health Care Documents. J Neonatal Surg [Internet]. 2025May29 [cited 2025Sep.20];14(29S):61-8. Available from: https://www.jneonatalsurg.com/index.php/jns/article/view/6712