AI Accelerator Software Distinguished Engineer- Framework Integration

Ampere Computing

Portland, OR, US

Onsite 2026-07-03

Announced salary

$279,500 - $440,000

Estimated net pay

$14,725 - $21,781

/month · 37% withheld

after tax & contributions · Single, no dependents

Your situation Children

Open in iampro arrow_forward Apply open_in_new

Job description

**Description** **Invent the future with us.** Ampere is a semiconductor design company for a new era, leading the future of computing with an innovative approach to CPU design focused on high\-performance, energy efficient AI compute. As a pioneer in the new frontier of energy efficient high\-performance computing, Ampere is part of the Softbank Group of companies driving sustainable computing for AI, Cloud, and edge applications. Join us at Ampere and work alongside a passionate and growing team \- we’d love to have you apply! **About the Role** As an AI Accelerator Software Distinguished Engineer – Framework Integration, you will lead end\-to\-end technical strategy and delivery for high\-performance deep learning inference across Ampere accelerator platforms. You will set direction for how major ML frameworks are enabled and optimized for our hardware, ensuring high throughput, low latency, and efficient compute/memory utilization for current and next\-generation AI workloads spanning data centers to edge. This role is distinguished by deep technical ownership, architecture leadership, and cross\-team influence—driving outcomes from performance modeling and integration strategy through production\-ready runtime and kernel behavior. **What You’ll Achieve:** * **Framework integration leadership (PyTorch / ONNX / llama.cpp)** Own and advance integration of major deep learning frameworks—PyTorch, ONNX, llama.cpp, and related tooling—into the Ampere deep learning accelerator backend, enabling robust execution of real\-world model graphs and operators. * **Full\-stack acceleration across the SW/HW execution path** Drive acceleration across the end\-to\-end stack, including (as applicable): * + inference serving and orchestration enablement + framework\-to\-runtime integration layers + compiler/graph lowering and optimization + runtime library and execution management + user\-mode execution paths a

On the map

map

See this employer on the map — Portland