We are seeking an enthusiastic **Senior AI Engineer** to join our team and contribute to the development of intelligent talent technology. This is a hands-on opportunity for those early in their career to gain experience with AI and machine learning in a real-world, enterprise SaaS environment. You’ll work alongside experienced engineers and learn how large language models, recommender systems, and other AI solutions enhance the workplace.
**In this role you will…**
+ Inference Runtime Optimization: Architect, optimize, and maintain high-performance, asynchronous orchestration layers (FastAPI) integrated with advanced inference servers (vLLM, Triton Inference Server) to host and serve open-source models efficiently. + Intelligent Routing & Caching: Build and maintain an advanced semantic caching layer and dynamic routing engine that evaluates...