SGLang Jobs - May 2026

Sort by

Search

Location

Has Compensation Summary

Has Contact Info

Remote

Onsite

Tech Stack

Title

Reset

VLM Run

Posted 3 weeks, 4 days ago

Building the inference and orchestration layer for production Vision-Language Models. Focus on fast and ergonomic visual inference, reliable structured outputs, and observability for iteration. Shipped projects include Orion (visual agent for images…

Roles
Product ML Staff Engineer

Tech Stack
Python Rust vLLM SGLang Ollama

Locations

Santa Clara, CA (HQ)
NVIDIA

Posted 6 months, 3 weeks ago

NVIDIA | vLLM + SGLang | Deep Learning Inference | Remote (North America preferred) Hi everyone — I’m Akbar, Senior Manager of Deep Learning Inference Software at NVIDIA. I lead our engineering efforts around vLLM and SGLang, two of the most widely …

Roles
Engineering Manager DL Performance Software Engineer - LLM Inference Deep Learning Inference Inference Senior Deep Learning Software Engineer

Tech Stack
compiler/runtime kernel fusion runtime optimizations vLLM scheduling optimizations SGLang Blackwell continuous integration GPUs LLM inference distributed serving Hopper

Locations

Remote (North America preferred), Santa Clara, CA