Not every question needs the knowledge base
Reduce latency, avoid irrelevant answers, and cut GPU costs by skipping knowledge base retrieval for queries that are not really questions. Here is how pattern matching and semantic gates filter trivial queries before they hit the vector store.
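
To make the two-stage idea concrete, here is a minimal sketch of such a gate, under stated assumptions rather than a definitive implementation. The names `TRIVIAL_PATTERNS`, `CHITCHAT_EXAMPLES`, `embed`, and `needs_retrieval`, the example phrases, and the 0.6 threshold are all hypothetical. The bag-of-words `embed` function is only a stand-in so the example runs without dependencies; a real semantic gate would presumably use a sentence-embedding model instead.

```python
import math
import re
from collections import Counter

# Hypothetical regexes for messages that are pure greetings or acknowledgements.
TRIVIAL_PATTERNS = [
    re.compile(r"^(hi|hello|hey|good (morning|afternoon|evening))\b[\s!.,]*$", re.I),
    re.compile(r"^(thanks|thank you|thx|ok|okay|got it|bye|goodbye)\b[\s!.,]*$", re.I),
]

# Hypothetical chit-chat examples the semantic gate compares against.
CHITCHAT_EXAMPLES = [
    "how are you today",
    "nice to meet you",
    "that was helpful thanks a lot",
]


def embed(text):
    """Toy bag-of-words vector; an assumed stand-in for a real sentence encoder."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def needs_retrieval(query, threshold=0.6):
    """Return False when the query can skip the knowledge base."""
    q = query.strip()
    if not q:
        return False
    # Stage 1: cheap pattern matching catches obvious small talk.
    if any(p.match(q) for p in TRIVIAL_PATTERNS):
        return False
    # Stage 2: semantic gate rejects queries close to known chit-chat.
    qv = embed(q)
    if max(cosine(qv, embed(e)) for e in CHITCHAT_EXAMPLES) >= threshold:
        return False
    return True
```

With this sketch, `needs_retrieval("Hello!")` and `needs_retrieval("How are you today?")` return `False`, while `needs_retrieval("How do I rotate my API key?")` returns `True` and would go on to the vector store. Running the regex stage first keeps the common case cheap, since the embedding comparison only happens for messages the patterns do not catch.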