Chawla, D. ., and D. Chawla. “Scalable Large Language Model Inference in Cloud Ecosystems: Enterprise-Scale Performance Optimization and Resource-Aware Architectures”. Journal of Neonatal Surgery, vol. 14, no. 28S, Sept. 2025, pp. 1124-39, https://www.jneonatalsurg.com/index.php/jns/article/view/9206.