Project description

We are seeking a Data Architect to join our AI project team. In this role, you will design and implement the data architecture needed to support machine learning and AI solutions, including defining data models, storage patterns, and governance frameworks. You will ensure that data from various sources is well-organised, accessible, and AI-ready, working closely with data engineers and ML engineers to build robust data pipelines and maintain high data quality for analytics and model development.

Responsibilities

Data Modelling & Schema Design: Develop and maintain data models (conceptual, logical, and physical) that define how data is stored and related. This includes designing relational schemas, graph data models for knowledge graphs, and time-series data structures as needed, ensuring they accurately represent business entities and relationships. You will continually refine these models to meet AI use cases and evolving business requirements.

Data Storage Architecture: Define and implement data storage and management patterns that optimise data retrieval and analytics performance. This involves selecting or designing appropriate storage solutions (e.g. relational databases, NoSQL/graph databases, data warehouses, data lakes) and structuring them for scalability and fast access to large datasets used in AI projects. Ensure the architecture can handle structured and unstructured data and is cloud-ready for elasticity.

Data Pipelines & Integration: Build and oversee robust data pipelines (ETL/ELT processes) to integrate data from multiple sources into centralised platforms. You will design workflows to collect, transform, and load data into analytics repositories or feature stores, guaranteeing that AI models have consistent, well-prepared data to work with. This includes setting up stream processing for real-time data when required and automating pipeline orchestration for efficiency.

Data Governance & Quality: Establish and enforce data governance policies and standards. This means defining practices for data quality, data cleaning, and master data management, as well as setting security and privacy controls to protect sensitive information. You will ensure compliance with relevant data regulations and implement data security measures (e.g. access controls, encryption) and validation rules so that the data used in AI is trustworthy and compliant.

Metadata Management & Lineage: Implement frameworks for data metadata management and lineage tracking. This includes maintaining data catalogues or dictionaries that describe data meaning (possibly leveraging ontologies), and tools or processes to trace how data flows through pipelines and transformations. By providing transparency into data origins and transformations, you support model interpretability and enable troubleshooting of data issues, which is critical in AI development.

Collaboration with Engineering Teams: Work closely with data engineers, ML engineers, and data scientists to ensure the data architecture meets their needs. You will collaborate on designing data interfaces (e.g. APIs or query endpoints) and assist in shaping how data is used for features in machine learning. This role requires translating requirements between data teams and ML teams, and jointly resolving issues to streamline the path from raw data to AI insights.

Performance Optimisation & Scaling: Monitor the performance and scalability of the data infrastructure, and tune it as the AI project grows. Optimise database queries, indexing, and storage layouts for faster model training and inference data access. Plan for scale by leveraging cloud capabilities (compute, storage) and manage costs effectively, adjusting architectures (partitioning, caching, etc.) to maintain efficient, cost-effective operations as data volumes increase. You may also evaluate new technologies (e.g. distributed computing frameworks or new databases) and incorporate them to continually improve the architecture.

Skills

Must have

Education: Bachelor's degree in Computer Science, Information Systems, or a related field (or equivalent professional experience). An advanced degree is a plus but not required.

Experience: Approximately 3-5 years of experience in data architecture, data engineering, or a related data management role. A proven track record in designing data solutions and managing data schemas is expected.

Data Modelling & Databases: Strong proficiency in data modelling and database design. You should be comfortable creating ER diagrams and defining relational schema, as well as working with NoSQL databases (e.g. document or graph databases). Practical experience with SQL and at least one relational database is required, as well as deep knowledge of other data store types (especially graph databases) is highly needed.

Data Pipeline Development: Hands-on experience developing data pipelines and integration workflows. This includes proficiency in ETL/ELT tools or frameworks (or custom scripting with Python/SQL) to gather and transform data. You should understand how to optimise data flow and have experience with batch processing; experience with real-time streaming data (e.g. using Kafka or equivalent) is a plus.

Ontologies & Knowledge Graphs: Exposure to semantic data modelling, ontologies, or knowledge graph construction. Experience in structuring data with ontologies (e.g. using RDF/OWL standards) or implementing a knowledge graph to link datasets can be very beneficial, since it helps in creating a unified data vocabulary and enriches the context for AI models.

Cloud Data Platforms: Experience working with cloud-based data platforms or big data technologies. While our approach is cloud-agnostic, you should be familiar with concepts like data lakes, data warehouses, and distributed computing in a cloud environment (e.g. using AWS, Azure, or GCP services). The ability to design solutions that leverage cloud scalability and tools for storage and processing is important.

Data Governance & Security: Solid understanding of data governance principles and best practices. You should be knowledgeable about data privacy regulations and data protection techniques, ensuring compliance in how data is stored and used. Experience implementing data quality checks, defining data standards, and using or setting up metadata management tools will be useful.

Communication & Teamwork: Excellent communication skills with the ability to collaborate in cross-functional teams. You should be able to translate complex data architecture concepts into clear terms for project managers or stakeholders, and work closely with engineering teams to guide implementation. Problem-solving aptitude and a willingness to mentor junior data team members are also important in our collaborative environment.

Nice to have

AI/ML Project Involvement: Experience working on projects that involve AI or machine learning, where you partnered with data scientists or ML engineers. For example, having supported an ML model deployment by providing well-structured data and ensuring data reliability. This background will help you anticipate the needs of AI initiatives and design data architectures that facilitate model training and inference.

Data Governance Tools: Familiarity with data governance or data cataloguing tools (such as Collibra, Alation, or Apache Atlas) and lineage-tracking systems. Hands-on experience setting up or maintaining a data catalogue, documenting data definitions, or automating data lineage capture is a strong plus, as it shows ability to operationalise governance and transparency in data ecosystems.

Modern Data Architecture Patterns: Experience with modern data architecture concepts and patterns. This could include implementing or working with data lakehouse architectures (combining data lake flexibility with data warehouse performance), data mesh principles (decentralising data ownership to domain teams), or event-driven/streaming architectures. Familiarity with these approaches demonstrates adaptability and knowledge of cutting-edge solutions for handling complex data workflows.

Certifications: Relevant industry certifications are advantageous. Certifications such as AWS/Azure/GCP data engineering certifications, Certified Data Management Professional (CDMP), or other credentials in data architecture and cloud services show validated expertise and a commitment to staying current with technology developments. While not mandatory, they could strengthen your candidacy.

Other

Languages

English: C1 Advanced

Seniority

Senior

Related jobs

View all vacancies

AI engineer with Agentic experience

AI/ML

United States of America

Los Angeles

Senior MLOps Engineer / Data Scientist

AI/ML

India

Remote India

Senior AI Engineer

AI/ML

United States of America

Remote United States

London, United Kingdom of Great Britain and Northern Ireland

Req. VR-123709

AI/ML

BCM Industry

15/07/2026

Req. VR-123709

Apply for Data Architect in London

*Indicates a required field

First name

Last name

Your country

Your City

Phone

Specialization

Attach your resume (max file size 1MB; allowed extensions: doc, txt, pdf, docx) Upload resume

Number of years of experience in the specialization you are applying for

What are your monthly gross salary expectations?

Read Explanatory Notes

Under the terms of your specific consent or to perform our obligations under a contract with you, as applicable, we, Luxoft Holding Inc. will manually and electronically process your personal data, specifically your first name, last name, phone number, e-mail address and other data you provide us through this form.

Within this context, we process personal data only for the specific purpose(s) indicated in the individual consent language or other notices provided below.

We will – insofar as reasonably necessary for the purpose you have agreed to and within the scope of applicable laws – transfer your personal data to other entities within the Luxoft Group and to the group of third party recipients listed in our Privacy Notice. Such Recipients can be located outside the European Union (EU) and/or the European Economic Area (EEA) (“Third Countries”). The Third Countries concerned, e.g. the USA, may not have the level of data protection that you enjoy e.g. under the GDPR. This can result in disadvantages such as an impeded enforcement of data subjects’ rights, a lack of control over further processing and access by state authorities. You may only have limited legal remedies against this. Insofar our transfer of your personal data to recipients in Third Countries is not covered by an adequacy decision of the EU Commission, we achieve an adequate level of data protection as further detailed out in our Privacy Notice.

With your consent, we personalise marketing communications to you by way of carrying out marketing research analysis, analysing the surfing-behaviour of our website visitors and to adjust it to their detected tendencies, as well as to plan more efficient future marketing activities. This personalised marketing does not include any automated decision-making activities.

Further information on how we process personal data in general is available in our Privacy Notice. You may withdraw any given consent at any time. The withdrawal of your consent(s) will not affect the lawfulness of processing before its withdrawal. For any request in this context, please e-mail us at: DPO@luxoft.com.

Before uploading CV or any other information to this website, to learn more about your obligations and restrictions arising from the use of this website, please read our Terms of Use.

By clicking the Agree and Send button, you consent that Luxoft Holding Inc. can process your provided personal data for the purpose of processing your application to establish a potential employment, internship or relocation with Luxoft Holding Inc., a relevant affiliate of the Luxoft Group or Luxoft Group’s clients, as applicable. You consent to the retention of your personal data, with the purpose of contacting you for potential future employment opportunities. If you refer a friend, we will process your provided personal data of you and your referral to award you with a bonus for such recommendation. You can withdraw your consent at any time with effect for the future. Before you consent, please click on „Read these Explanatory Notes“ and view our Privacy Notice (e.g. for your data subject rights).

Note: Our system automatically converts each CV into our standardized format, enabling our HR teams to process applications more efficiently.

I consent that Luxoft Holding Inc. and the affiliates of the Luxoft Group can process my personal data for the purpose of sending me regular emails with information about Luxoft as an employer including job vacancies and (ii) marketing communications with information about recruitment-related events, organized webinars, conferences, as well as the Logeek Magazine and other similar publications. I can withdraw my consent at any time with effect for the future. These communications are personalised to my interests in accordance with our Privacy Notice.

Related Jobs

All jobs

MENU

Data Architect

Project description

Responsibilities

Skills

Other

Related jobs

AI engineer with Agentic experience

Senior MLOps Engineer / Data Scientist

Senior AI Engineer