AI Infra Architecture
Req. VR-120987
We are seeking an experienced AI Infrastructure Architect with deep expertise in designing and operating scalable, secure, and high‑performance cloud environments for Generative AI and LLM workloads. This role is ideal for someone who combines strong AWS architectural skills with hands‑on experience in GPU compute, MLOps/LLMOps, and enterprise‑grade AI platform design.
You should bring extensive experience building cloud‑native AI infrastructure, optimizing large‑scale model training and inference environments, and designing complex AI systems with detailed technical specifications. You should also be able to collaborate closely with AI/ML and multidisciplinary teams to enable advanced GenAI capabilities and ensure seamless implementation.
Responsibilities
Design and implement scalable AWS infrastructure to support Generative AI and LLM workloads, including training, fine‑tuning, and inference.
Architect secure, high‑performance environments using AWS core services such as Amazon SageMaker, Amazon Bedrock, Amazon EKS, AWS Lambda, and related cloud‑native components.
Design GPU‑based compute environments (e.g., EC2 P‑series, G‑series) optimized for distributed training, fine‑tuning, and low‑latency inference.
Implement secure VPC architectures, private endpoints, IAM policies, encryption (KMS), and enterprise‑grade data governance controls.
Build and govern MLOps/LLMOps pipelines using SageMaker Pipelines, CodePipeline, and CI/CD best practices.
Architect RAG infrastructure, including vector databases (OpenSearch, Aurora PostgreSQL with pgvector) and scalable storage solutions (S3).
Establish monitoring and observability using CloudWatch, model monitoring tools, logging frameworks, and performance dashboards.
Optimize infrastructure for latency, autoscaling, high availability, and cost efficiency, leveraging Spot Instances, Savings Plans, and right‑sizing strategies.
Define disaster recovery (DR) and backup strategies across multi‑AZ and multi‑region AWS setups.
Implement Infrastructure as Code (IaC) using Terraform or CloudFormation for consistent, repeatable provisioning of AI environments.
Collaborate with AI/ML teams to support LLM fine‑tuning, prompt orchestration, inference endpoints, and model deployment workflows.
Stay current with AWS GenAI advancements, evaluating new services, architectural patterns, and best practices for enterprise adoption.
Must have
Extensive experience (typically 7+ years) in cloud architecture, infrastructure engineering, or platform engineering, with a strong focus on AWS.
Proven expertise designing and operating AI/ML and Generative AI infrastructure at scale.
Deep knowledge of AWS services relevant to AI workloads (SageMaker, Bedrock, EKS, EC2 GPU instances, Lambda, VPC, IAM, KMS, S3).
Hands‑on experience with GPU compute, distributed training, and high‑performance inference environments.
Strong understanding of MLOps/LLMOps practices, CI/CD pipelines, and model deployment workflows.
Experience architecting secure, compliant, and highly available cloud environments.
Proficiency with Infrastructure as Code (Terraform or CloudFormation).
Familiarity with vector databases, RAG architectures, and scalable data storage patterns.
Strong collaboration skills and the ability to work closely with AI/ML, DevOps, and engineering teams.
Excellent documentation and communication skills.
Nice to have
n/a
Languages
English: C1 Advanced
Seniority
Lead
London, United Kingdom of Great Britain and Northern Ireland
AI/ML
BCM Industry
13/02/2026