Platform Engineer at Standard Compute
Priya designs the model routing layer — the system that decides which LLM handles your request and why. She spent four years at a large language model provider working on inference optimization and model serving before joining Standard Compute. Her work ensures that when you send a request to the API, it lands on the right model at the right time, balancing quality, speed, and cost without you having to think about it.
Previously optimized inference serving for models with 100B+ parameters. Background in operations research and combinatorial optimization. Holds a master's in computer science.