Chawla, D. . and Chawla, D. (2025) “Scalable Large Language Model Inference in Cloud Ecosystems: Enterprise-Scale Performance Optimization and Resource-Aware Architectures”, Journal of Neonatal Surgery. Lahore, Pakistan, 14(28S), pp. 1124–1139. Available at: https://www.jneonatalsurg.com/index.php/jns/article/view/9206 (Accessed: 6October2025).