The large-scale deployment of Large Language Models (LLMs) is constrained by significant energy consumption and operational costs, with inference accounting for up to 90% of the total energy footprint...
Large Language Models (LLMs)
Energy-Latency-Quality (ELQ) Optimization
Carbon-Aware Scheduling
Green AI / Sustainable AI
Efficient LLM Inference
Adaptive Inference Systems
SinoXiv