1.
Chawla D, Chawla D. Scalable Large Language Model Inference in Cloud Ecosystems: Enterprise-Scale Performance Optimization and Resource-Aware Architectures. J Neonatal Surg [Internet]. 2025Sep.23 [cited 2025Oct.6];14(28S):1124-39. Available from: https://www.jneonatalsurg.com/index.php/jns/article/view/9206