Chawla, Dipen, and Deven Chawla. “Scalable Large Language Model Inference in Cloud Ecosystems: Enterprise-Scale Performance Optimization and Resource-Aware Architectures”. Journal of Neonatal Surgery 14, no. 28S (September 23, 2025): 1124–1139. Accessed October 6, 2025. https://www.jneonatalsurg.com/index.php/jns/article/view/9206.