AI Software Development Engineer optimizing AI inference workloads including Large Language Models on Intel GPUs. Involves graph compilation, runtime execution, and kernel optimization.
Responsibilities
Optimize emerging AI inference workloads such as Large Language Models (LLMs) and Diffusion models on GPUs
Develop and optimize graph-based compilation flows (e.g., MLIR/LLVM) for neural network workloads
Write and tune performance-critical GPU kernels and runtime code in C++ or parallel programming languages
Identify and resolve bottlenecks across compiler, runtime, and kernel layers
Profile, benchmark, and characterize AI workloads to validate performance gains
Collaborate with hardware, driver, and framework teams on hardware/software co-optimization
Requirements
Bachelor's degree with 4+ years of relevant experience, OR Master's degree with 2+ years of relevant experience in Computer Science or a related field
Strong C++ development and debugging skills
Solid understanding of GPU architectures or AI accelerators
Hands-on experience with modern neural network architecture for inference on hardware accelerators
Preferred: PhD and 1+ years of relevant experience
Familiarity with OpenVINO or other AI inference frameworks
Knowledge of neural network optimization techniques and performance tradeoffs
Experience across multiple layers of the AI software stack, including AI inference engines or runtimes, graph compilers (e.g., MLIR/LLVM), GPU kernels or performance critical compute code
Staff Engineer leading a product team at Beamery, a transformational AI platform in HR technology. Designing scalable software and providing technical mentorship in a hybrid role.
iOS Engineer developing new financial services with Merpay, focusing on individual credit business in Japan. Collaborating with cross - functional teams to improve user experience and product quality.
Tech Lead managing development teams across mobile, web, and backend at Lotus's. Overseeing software solutions while ensuring technical excellence and high - quality code across projects.
Staff Engineer developing solutions with agile teams and mentoring junior engineers. Focused on leading development initiatives utilizing CI/CD, .NET, and web services.
Software Engineer developing and supporting client - server applications for gaming technology at Light & Wonder. Collaborating with teams to build reliable and scalable software solutions.
Associate Director role leading software development and team collaboration at RBC. Designing and building robust Java applications while mentoring a high - performing development team.
Senior Software Engineer in Mobility Engineering at WEX developing backend solutions for fleet management. Responsible for scalable system design and leadership in code quality and best practices.
Experienced AI - ML Engineer developing and implementing analytics solutions for aerospace applications at Boeing. Delivering cutting - edge R&D and high - quality engineering work in global markets.
Software Engineer developing domain - specific applications for industrial research at Uncountable. Focus on data integrations and automated data transfer routines in Python.
Working Student in Software Engineering at Uncountable, supporting scientific R&D for innovative materials companies in Europe. Collaborative role in Munich with flexibility in work hours.