AI Infra Architecture

Apply
Apply

Share

successfully icon

Successfully

The vacancy has been successfully added to favorites

location icon

London, United Kingdom of Great Britain and Northern Ireland

specialization icon

AI/ML

lob icon

BCM Industry

date icon

13/02/2026

Req. VR-120987

Apply
Project description

We are seeking an experienced AI Infrastructure Architect with deep expertise in designing and operating scalable, secure, and high‑performance cloud environments for Generative AI and LLM workloads. This role is ideal for someone who combines strong AWS architectural skills with hands‑on experience in GPU compute, MLOps/LLMOps, and enterprise‑grade AI platform design.
You should bring extensive experience building cloud‑native AI infrastructure, optimizing large‑scale model training and inference environments, and collaborating closely with AI/ML teams to enable advanced GenAI capabilities.
You should bring strong experience in designing complex AI systems, creating detailed technical specifications, and collaborating across multidisciplinary teams to ensure seamless implementation.

Responsibilities
bullet icon

Design and implement scalable AWS infrastructure to support Generative AI and LLM workloads, including training, fine‑tuning, and inference.

bullet icon

Architect secure, high‑performance environments using AWS core services such as Amazon SageMaker, Amazon Bedrock, Amazon EKS, AWS Lambda, and related cloud‑native components.

bullet icon

Design GPU‑based compute environments (e.g., EC2 P‑series, G‑series) optimized for distributed training, fine‑tuning, and low‑latency inference.

bullet icon

Implement secure VPC architectures, private endpoints, IAM policies, encryption (KMS), and enterprise‑grade data governance controls.

bullet icon

Build and govern MLOps/LLMOps pipelines using SageMaker Pipelines, CodePipeline, and CI/CD best practices.

bullet icon

Architect RAG infrastructure, including vector databases (OpenSearch, Aurora PostgreSQL with pgvector) and scalable storage solutions (S3).

bullet icon

Establish monitoring and observability using CloudWatch, model monitoring tools, logging frameworks, and performance dashboards.

bullet icon

Optimize infrastructure for latency, autoscaling, high availability, and cost efficiency, leveraging Spot Instances, Savings Plans, and right‑sizing strategies.

bullet icon

Define disaster recovery (DR) and backup strategies across multi‑AZ and multi‑region AWS setups.

bullet icon

Implement Infrastructure as Code (IaC) using Terraform or CloudFormation for consistent, repeatable provisioning of AI environments.

bullet icon

Collaborate with AI/ML teams to support LLM fine‑tuning, prompt orchestration, inference endpoints, and model deployment workflows.

bullet icon

Stay current with AWS GenAI advancements, evaluating new services, architectural patterns, and best practices for enterprise adoption.

Skills

Must have

bullet icon

Extensive experience (typically 7+ years) in cloud architecture, infrastructure engineering, or platform engineering, with a strong focus on AWS.

bullet icon

Proven expertise designing and operating AI/ML and Generative AI infrastructure at scale.

bullet icon

Deep knowledge of AWS services relevant to AI workloads (SageMaker, Bedrock, EKS, EC2 GPU instances, Lambda, VPC, IAM, KMS, S3).

bullet icon

Hands‑on experience with GPU compute, distributed training, and high‑performance inference environments.

bullet icon

Strong understanding of MLOps/LLMOps practices, CI/CD pipelines, and model deployment workflows.

bullet icon

Experience architecting secure, compliant, and highly available cloud environments.

bullet icon

Proficiency with Infrastructure as Code (Terraform or CloudFormation).

bullet icon

Familiarity with vector databases, RAG architectures, and scalable data storage patterns.

bullet icon

Strong collaboration skills and the ability to work closely with AI/ML, DevOps, and engineering teams.

bullet icon

Excellent documentation and communication skills.

Nice to have

bullet icon

n/a

Other
seniority icon

Languages

English: C1 Advanced

seniority icon

Seniority

Lead

London, United Kingdom of Great Britain and Northern Ireland

Req. VR-120987

AI/ML

BCM Industry

13/02/2026

Req. VR-120987

Apply for AI Infra Architecture in London

*Indicates a required field

Under the terms of your specific consent or to perform our obligations under a contract with you, as applicable, we, Luxoft Holding Inc. will manually and electronically process your personal data, specifically your first name, last name, phone number, e-mail address and other data you provide us through this form.


Within this context, we process personal data only for the specific purpose(s) indicated in the individual consent language or other notices provided below.


We will – insofar as reasonably necessary for the purpose you have agreed to and within the scope of applicable laws – transfer your personal data to other entities within the Luxoft Group and to the group of third party recipients listed in our Privacy Notice. Such Recipients can be located outside the European Union (EU) and/or the European Economic Area (EEA) (“Third Countries”). The Third Countries concerned, e.g. the USA, may not have the level of data protection that you enjoy e.g. under the GDPR. This can result in disadvantages such as an impeded enforcement of data subjects’ rights, a lack of control over further processing and access by state authorities. You may only have limited legal remedies against this. Insofar our transfer of your personal data to recipients in Third Countries is not covered by an adequacy decision of the EU Commission, we achieve an adequate level of data protection as further detailed out in our Privacy Notice.


With your consent, we personalise marketing communications to you by way of carrying out marketing research analysis, analysing the surfing-behaviour of our website visitors and to adjust it to their detected tendencies, as well as to plan more efficient future marketing activities. This personalised marketing does not include any automated decision-making activities.


Further information on how we process personal data in general is available in our Privacy Notice. You may withdraw any given consent at any time. The withdrawal of your consent(s) will not affect the lawfulness of processing before its withdrawal. For any request in this context, please e-mail us at: DPO@luxoft.com.


Before uploading CV or any other information to this website, to learn more about your obligations and restrictions arising from the use of this website, please read our Terms of Use.