Brix . Location : Palo Alto, CA Employment type : Full-time Salary : $250k - $350k USD/year + Equity Responsibilities
Oversee and optimize the LLM inference engine to ensure performance, scalability, and cost-efficiency. Collaborate with teammates to implement cutting-edge AI inference engines. Maintain high reliability and strong engineering standards. Qualifications
3+ years of experience in software engineering. Familiar with vLLM, quantization, and current techniques of LLM optimization. Bachelor or Master degree from a leading academic institution. Tech Stack
Front end: Python, Flutter, Dart Back end: Python, GCP, Redis, Kubernetes Interview process