ML-based LLM request classifier for cost-optimized routing (~2ms inference)
📰 Dev.to · André Bergan
I built a request classifier that decides which LLM tier a prompt needs before it's sent to a...
I built a request classifier that decides which LLM tier a prompt needs before it's sent to a...