Senior Software Engineer, LLM Performance at Parasail.ai | Hybrid Hired

About the role

Senior Software Engineer focusing on performance of LLMs and AI workloads on distributed infrastructure. Working with cutting-edge technologies and improving efficiency for enterprise applications.

Responsibilities

Add support for new LLMs, working across the stack from low-level GPU kernels to Kubernetes-based deployments.
Contribute to cutting-edge open-source LLM engines such as vLLM or SGLang to extend their capabilities and performance (e.g. use Python technologies to improve API servers or request schedulers).
Operate closer to the hardware, focusing on building and integrating solutions to boost performance and hardware utilization. For example, improve attention backends like FlashAttention or FlashInfer by contributing to their development and optimization, or by integrating their solutions into vLLM.
Improve LLM performance using advanced algorithmic solutions such as speculative decoding, quantization, or other state-of-the-art techniques. Understand the impact of such techniques in model quality.

Requirements

Expertise in GPU computing, including low-level platforms such as CUDA, ROCm, XLA, PyTorch, Jax, etc.
Background in performance analysis and optimization of AI/HPC workloads (e.g. profiling or theoretical analysis of Flops and bandwidth).
Experience in writing GPU kernels using technologies like CUDA, CUTLASS, Triton.
Strength in Python and C++.
Demonstrated contributions to open-source projects. Contributions to inference engines such as vLLM is a strong plus.
A production-oriented mindset emphasizing robust, scalable code suitable for enterprise-grade applications.
A relentless curiosity about cutting-edge AI technologies combined with a passion for solving complex problems.

Similar roles

Browse all Full Stack Engineer jobs

22 minutes ago

BO

Mid-Level Full Stack Developer, Cloud & AI

Boeing

Mid - Level Full Stack Developer at Boeing developing cloud - native solutions and AI - driven data analytics. Contributing to software design and deploying complex components in an Agile environment.

Hybrid Role

North Charleston United States Full Stack Engineer

$100,300 - $156,400 per year

29 minutes ago

TC

Software Engineer II – Live Pipeline

The Walt Disney Company

Software Engineer II for Disney Streaming services, building systems that power viewer playback. Collaborating with teams to implement scalable backend services for media platforms.

Hybrid Role

Glendale United States Full Stack Engineer

$117,500 - $157,500 per year

54 minutes ago

AD

Senior Full Stack Engineer

AND Digital

Senior Full Stack Engineer delivering full‑stack JavaScript/TypeScript solutions for AND Digital. Engaging with stakeholders and shaping technical approaches in a hybrid working environment.

Hybrid Role

Leeds United Kingdom Full Stack Engineer

1 hour ago

MI

Technical Consultant – Full Stack Developer

Mobilint, Inc.

Technical Consultant - Full Stack Developer at Daon, focusing on digital identity solutions and integration with existing systems. Collaborating with global teams and supporting mission - critical environments.

Hybrid Role

Dublin Ireland Full Stack Engineer

1 hour ago

FI

Mid-level Full Stack Developer

Fadami - Software & Innovation

Full Stack Developer at Fadami specializing in system integrations and responsive web development. Working with C#, ASP.NET, and various databases in a dynamic, delivery - oriented environment.

Hybrid Role

Rio de Janeiro Brazil Full Stack Engineer

1 hour ago

FI

Senior Full Stack Developer

Fadami - Software & Innovation

Senior Full Stack Developer at Fadami developing scalable solutions and ensuring code quality. Responsible for code reviews and mentoring fellow developers in innovative tech solutions.

Hybrid Role

Rio de Janeiro Brazil Full Stack Engineer

1 hour ago

TH

Lead Software Engineer

TheIncLab

Lead Software Engineer in R&D, guiding projects and teams through engineering challenges. Collaborating with stakeholders, architecting software solutions, and leading technical leadership.

Hybrid Role

McLean United States Full Stack Engineer

1 hour ago

GV

Software Engineer

GE Vernova

Software Engineer implementing microservices and user interfaces for application software at GE Vernova. Collaborating across teams to meet customer requirements and deliver high - quality solutions.

Onsite Role

Bengaluru India Full Stack Engineer

2 hours ago

MO

Embedded Software Architect

Mobileye

Software Architect developing state - of - the - art Imaging Radar solutions for ADAS and Autonomous Driving at Mobileye. Collaborating in a fast - paced environment to innovate technology in early stages.

Hybrid Role

Petah Tikva Israel Full Stack Engineer

2 hours ago

IS

Senior Engineering Leader – Track Engineering

Illinois Department of Human Services

Senior Engineering Leader for Permanent Way Engineering at Transport for London, ensuring reliability and safety of railway systems through engineering skills and stakeholder relationships.

Hybrid Role

London United Kingdom Full Stack Engineer

£72,000 per year