SinoXiv

The large-scale deployment of Large Language Models (LLMs) is constrained by significant energy consumption and operational costs, with inference accounting for up to 90% of the total energy footprint. Existing optimization methods typically address latency or memory in isolation, and frequently overlook energy efficiency, carbon impact, and how these interact with user-defined Service Level Agreements (SLAs) for quality and response time. This work presents Green Decoding, a novel co-optimization framework for LLM inference. Green Decoding formulates inference as a multi-objective optimization problem that minimizes a weighted function of Energy, Latency, and Quality (ELQ). A policy engine jointly tunes, on a per-request basis, a broad set of system parameters, including speculative decoding configurations, dynamic Key-Value (KV) cache policies, adaptive quantization tiers, and early-exit criteria. The framework introduces two key contributions: (1) a carbon-aware scheduler that uses real-time grid carbon intensity data to time-shift deferrable, non-interactive workloads to periods of cleaner energy, directly reducing CO2 emissions without violating SLAs, and (2) a safety-aware gating mechanism that uses runtime uncertainty and toxicity signals to limit aggressive, potentially quality-degrading optimizations, preserving model reliability. Across diverse workloads, Green Decoding establishes a more efficient ELQ Pareto frontier than the baselines. It achieves up to 35% energy reduction and 50% lower carbon emissions (gCO2e) compared to highly optimized static baselines, while strictly adhering to p95 latency and quality-proxy SLAs.
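To make the two central ideas in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes a weighted-sum ELQ cost over normalized energy, latency, and quality loss, and a carbon-aware scheduler that picks the lowest-carbon start slot that still finishes before a deadline. All names (`Config`, `elq_cost`, `pick_config`, `pick_start_slot`), the weights, and the numbers used with them are hypothetical; the abstract does not specify the exact cost function or scheduling rule.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass(frozen=True)
class Config:
    """Hypothetical inference configuration with estimated per-request metrics."""
    name: str
    energy_j: float    # estimated energy per request, in joules
    latency_ms: float  # estimated p95 latency, in milliseconds
    quality: float     # quality proxy in [0, 1], higher is better


def elq_cost(c: Config, weights: Tuple[float, float, float],
             e_ref: float, l_ref: float) -> float:
    """Assumed weighted-sum ELQ cost: normalized energy and latency plus quality loss."""
    w_e, w_l, w_q = weights
    return (w_e * c.energy_j / e_ref
            + w_l * c.latency_ms / l_ref
            + w_q * (1.0 - c.quality))


def pick_config(configs: Sequence[Config], latency_sla_ms: float,
                min_quality: float,
                weights: Tuple[float, float, float] = (0.5, 0.3, 0.2)) -> Config:
    """Choose the lowest-cost configuration among those that satisfy both SLAs."""
    feasible = [c for c in configs
                if c.latency_ms <= latency_sla_ms and c.quality >= min_quality]
    if not feasible:
        # Assumed fallback: no configuration meets the SLAs, so prefer quality.
        return max(configs, key=lambda c: c.quality)
    e_ref = max(c.energy_j for c in feasible)
    l_ref = max(c.latency_ms for c in feasible)
    return min(feasible, key=lambda c: elq_cost(c, weights, e_ref, l_ref))


def pick_start_slot(carbon_forecast: Sequence[float], deadline_slot: int,
                    duration_slots: int = 1) -> int:
    """Return the start slot with the lowest total forecast carbon intensity.

    The job occupies `duration_slots` consecutive slots and must finish
    before slot index `deadline_slot` (exclusive).
    """
    last_start = min(deadline_slot, len(carbon_forecast)) - duration_slots
    if last_start < 0:
        return 0  # the deadline cannot be met by shifting; start immediately
    return min(range(last_start + 1),
               key=lambda s: sum(carbon_forecast[s:s + duration_slots]))
```

With three illustrative configurations (a baseline, a quantized variant, and an aggressive early-exit variant that falls below the quality floor), `pick_config` selects the cheapest configuration that still meets both SLAs. `pick_start_slot` defers a batch job to the cleanest window in a forecast given in gCO2e/kWh per slot. A safety gate of the kind the abstract describes could be layered on top by raising `min_quality` when uncertainty or toxicity signals are high; that rule is also an assumption.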

Paper Details

Green Decoding: ELQ Co-Optimization and Carbon-Aware Scheduling for Efficient LLM Inference
