Engine-Agnostic Model Hot-Swapping for Cost-Effective LLM Inference

Authors

STOYANOV Radostin, SPIŠAKOVÁ Viktória, REBER Adrian, ARMOUR Wesley, COPIK Marcin, BRUNO Rodrigo

Year of publication: 2025
Type: Article in Proceedings