AI Prompt Router System Design
An LLM request routing system that classifies each prompt by complexity and intent, then routes it to the most suitable model tier (small, medium, or large) to minimize cost while maintaining quality. It includes semantic caching, response evaluation, and automatic fallback to a larger model.
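
The flow described above (classify, check the cache, call a tier, evaluate, fall back) could be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the actual design: `classify`, `normalize`, `Router`, `stub_model`, `stub_evaluate`, the keyword hints, and the 0.7 threshold are all hypothetical. The classifier is a crude complexity heuristic that ignores intent, and an exact-match cache on normalized text stands in for semantic caching, which would more likely compare embeddings.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

TIERS = ["small", "medium", "large"]  # ordered from cheapest to most capable

# Hypothetical keyword hints; a real classifier would likely be a trained model.
COMPLEX_HINTS = ("prove", "derive", "refactor", "architecture", "step by step")


def classify(prompt: str) -> str:
    """Pick a starting tier from crude length and keyword heuristics."""
    text = prompt.lower()
    words = len(text.split())
    if words > 150 or any(hint in text for hint in COMPLEX_HINTS):
        return "large"
    if words > 30 or "explain" in text or "compare" in text:
        return "medium"
    return "small"


def normalize(prompt: str) -> str:
    """Cache key: lowercase with collapsed whitespace (exact match, not semantic)."""
    return " ".join(prompt.lower().split())


@dataclass
class Router:
    models: Dict[str, Callable[[str], str]]   # tier name -> function that calls the model
    evaluate: Callable[[str, str], float]     # (prompt, answer) -> quality score in [0, 1]
    threshold: float = 0.7                    # assumed minimum acceptable score
    cache: Dict[str, str] = field(default_factory=dict)
    trace: List[str] = field(default_factory=list)  # records which path each call took

    def route(self, prompt: str) -> str:
        key = normalize(prompt)
        if key in self.cache:
            self.trace.append("cache")
            return self.cache[key]
        start = TIERS.index(classify(prompt))
        answer = ""
        # Try the classified tier first, then escalate to larger tiers.
        for tier in TIERS[start:]:
            self.trace.append(tier)
            answer = self.models[tier](prompt)
            if self.evaluate(prompt, answer) >= self.threshold:
                break
        # If every tier scores below the threshold, the largest tier's answer is kept.
        self.cache[key] = answer
        return answer


def stub_model(name: str) -> Callable[[str], str]:
    """Stand-in for a real model call; returns a labeled placeholder answer."""
    return lambda prompt: f"[{name}] answer"


def stub_evaluate(prompt: str, answer: str) -> float:
    """Stand-in evaluator that always rejects the small tier, to force a fallback."""
    return 0.2 if answer.startswith("[small]") else 0.9


router = Router(models={t: stub_model(t) for t in TIERS}, evaluate=stub_evaluate)
first = router.route("What is 2+2?")      # starts at small, falls back to medium
second = router.route("what is   2+2?")   # served from the cache
print(first, second, router.trace)
```

With these stubs, the first request is classified as small, fails evaluation, and is retried on the medium tier; the second request differs only in case and spacing, so it is answered from the cache without calling any model. Starting at the classified tier and only escalating keeps the common case cheap, at the price of extra latency whenever a fallback is needed.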