AutoMix: Automatically Mixing Language Models

This article introduces AutoMix, a method that optimizes the use of large language models (LLMs). AutoMix strategically routes queries to larger models based on the approximate correctness of outputs from smaller ones. This is achieved through a self-verification mechanism that estimates the reliability of its own outputs without requiring training. The study demonstrates that AutoMix surpasses established baselines in experiments, improving the incremental benefit per cost by up to 89%.

Publication date: 19 Oct 2023
Project Page: https://github.com/automix-llm/automix
Paper: https://arxiv.org/pdf/2310.12963

Post Views: 312

root

Exit mobile version

Please allow ads on our site

Looks like you're using an ad blocker. Please support us by disabling these ad blocker.

Press ESC to close

Share Article:

root

State Machine Frameworks for Website Fingerprinting Defenses: Maybe Not

An Emulator for Fine-Tuning Large Language Models using Small Language Models

Please allow ads on our site