Tech Job Alerts

  • VLM Run

    Posted 3 weeks, 4 days ago

    Building the inference and orchestration layer for production Vision-Language Models. Focus on fast and ergonomic visual inference, reliable structured outputs, and observability for iteration. Shipped projects include Orion (visual agent for images…

    Roles

    Product ML Staff Engineer

    Tech Stack

    Python, Rust, vLLM, SGLang, Ollama

    Locations

    Santa Clara, CA (HQ)

  • NVIDIA

    Posted 6 months, 3 weeks ago

    NVIDIA | vLLM + SGLang | Deep Learning Inference | Remote (North America preferred) Hi everyone — I’m Akbar, Senior Manager of Deep Learning Inference Software at NVIDIA. I lead our engineering efforts around vLLM and SGLang, two of the most widely …

    Roles

    Engineering Manager, DL Performance Software Engineer - LLM Inference, Deep Learning Inference, Inference, Senior Deep Learning Software Engineer

    Tech Stack

    compiler/runtime, kernel fusion, runtime optimizations, vLLM, scheduling optimizations, SGLang, Blackwell, continuous integration, GPUs, LLM inference, distributed serving, Hopper

    Locations

    Remote (North America preferred), Santa Clara, CA


© 2025 LVTD, LLC. All rights reserved.
