Advance your career with NVIDIA as a Senior Engineer, focusing on GPU inference systems for AI. Drive optimization and collaboration while enhancing performance across large-scale models. In this crucial role, you will architect high-performance inference stacks and fine-tune NVIDIA's GPU solutions to achieve top productivity. Your expertise will significantly contribute to hitting industry benchmarks and implementing advanced GPU kernels within a multi-cloud environment.

Key Responsibilities:
• Develop and optimize vLLM features with cutting-edge GPU technology
• Benchmark and profile GPU kernels for enhanced efficiency
• Create robust tools for inference benchmarking methods
• Spearhead orchestration of large-scale inference deployments
• Publish innovative research to elevate machine learning systems

Requirements:
• Extensive background in computer science with advanced degree options
• Proficient in Python, C/C++, and GPU programming languages
• Str...