Roles

LLM Inference Optimization Engineer
MN-Core LLM Serving Engine Engineer

Compensation Summary

Visa and relocation support provided.

Locations

Tokyo, Japan (remote work within Japan available)

Description

Preferred Networks is an AI company based in Tokyo working across the stack: it designs in-house chips (the MN-Core series) and trains LLMs (the PLaMo series). Two roles are open:

(1) MN-Core LLM Serving Engine Engineer: build the software infrastructure to serve LLMs on the upcoming MN-Core L1000 inference accelerator.

(2) LLM Inference Optimization Engineer: improve the inference engine powering the company's API service and maintain PLaMo implementations in open-source projects such as vLLM.

Both roles require relocation to Japan, and the company provides visa and relocation support.
