Back to jobs
Senior Kernel Developer
Successfully
Req. VR-111416
Luxoft is looking for an AI software development engineer to develop ML kernels in the Triton kernel language. We are looking for an engineer who is passionate about optimizing Machine Learning GPU kernels and improving the performance of key applications and benchmarks. What you do directly impacts the performance of AMD GPUs and enables us to become a competitive solution for generative AI. Become a part of our high-impact and incredibly talented Triton kernels team.
Develop ML kernels for matrix multiplication, Flash Attention and other ML operators
Benchmark, perform competitive analysis and optimize kernels to improve performance
Collaborate with the GPU architecture team to improve future generations
Apply knowledge of software engineering best practices
Must have
Proficiency with C/C++
Proficiency in CUDA or HIP / ROCm or OpenCL programming
Solid understanding of parallel programming models, and optimization techniques
Strong problem-solving skills and the ability to work in a collaborative environment
Nice to have
Familiarity with models like LLama, Mixtral and Gemma is a plus
Knowledge of MLIR, LLVM and GPU assembly and GPU architecture is a plus
Familiarity with PyTorch or JAX
Languages
English: B2 Upper Intermediate
Seniority
Senior
Remote Brazil, Brazil
Req. VR-111416
C/C++
Automotive Industry
10/08/2025
Req. VR-111416
Apply for Senior Kernel Developer in Remote Brazil
*Indicates a required field