Data Engineer

Apply
Apply

Share

successfully icon

Successfully

The vacancy has been successfully added to favorites

location icon

Mumbai, India

specialization icon

Data Science

lob icon

BCM Industry

date icon

05/03/2026

Req. VR-121455

Apply
Project description

We're looking for a Data Engineer with hands on experience in graph databases to design, build, and optimize data pipelines and knowledge graph solutions that power advanced analytics and discovery. You'll collaborate with data scientists, platform engineers, and product teams to model complex domains, integrate heterogeneous sources, and deliver queryable, scalable graph data products.

Responsibilities
bullet icon

Graph Data Modeling & Design

bullet icon

Design and implement property graphs and RDF/OWL-based knowledge graphs.

bullet icon

Develop schemas/ontologies, entity resolution and lineage strategies; define best practices for graph modeling, naming, and versioning.

bullet icon

Data Engineering & Integration

bullet icon

Build and maintain ETL/ELT pipelines to ingest, cleanse, transform, and load data into graph stores from APIs, files, RDBMS, event streams.

bullet icon

Implement batch and streaming integrations using tools such as Airflow, dbt, Kafka/Kinesis, Spark/Flink.

bullet icon

Optimize data quality, deduplication, key management, and incremental upserts into graphs.

bullet icon

Querying & APIs

bullet icon

Write advanced queries in Cypher, Gremlin, and/or SPARQL; tune queries and indexes for performance.

bullet icon

Expose graph capabilities via APIs/services (REST/GraphQL/GRANDstack) with robust governance, observability and caching.

bullet icon

Performance, Reliability & Security

bullet icon

Capacity planning, clustering, backups, and high availability for graph databases.

bullet icon

Monitoring/alerting (e.g., Prometheus/Grafana, CloudWatch), profiling and query plan analysis.

bullet icon

Apply security best practices: encryption, RBAC/ABAC, least privilege, secrets management, and data masking/Pii handling.

bullet icon

MLOps/Analytics Enablement (nice if applicable)

bullet icon

Support downstream analytics and graph algorithms (PageRank, community detection, embeddings) and integrate with ML pipelines.

bullet icon

DevOps & SDLC

bullet icon

Infrastructure-as-Code (Terraform, Bicep, CloudFormation), containerization (Docker, Kubernetes), and CI/CD for data/infra.

bullet icon

Documentation, code reviews, and contribution to data governance (catalogs, lineage, metadata).

Skills

Must have

bullet icon

Experience: 6 years in Data Engineering (or similar) with 2+ years focused on graph databases (property graph and/or RDF).

bullet icon

Graph DBs: Hands-on with at least one of:

bullet icon

Property Graph: Neo4j, AWS Neptune (Gremlin/Cypher).

bullet icon

RDF Triple Stores: Ontotext GraphDB, Apache Jena/Fuseki, Blazegraph, Stardog, Neptune (RDF).

bullet icon

Query Languages: Strong in Cypher and/or Gremlin; SPARQL if working with RDF/OWL.

bullet icon

Data Pipelines: Proficient with Airflow (or similar), Kafka/Kinesis, Spark or Flink; building robust ETL/ELT at scale.

bullet icon

Programming: Python (dataframes, APIs, CLI tooling); solid testing practices (pytest/pytest-bdd).

bullet icon

Cloud: Experience with AWS managed graph/datastores, storage, compute, and networking basics.

bullet icon

Performance & Ops: Indexing, memory/GC tuning, query plan analysis, partitioning/sharding concepts, HA/DR, backup/restore.

bullet icon

Security & Governance: Secrets management, IAM, network isolation, PII compliance; familiarity with data catalog/lineage tools.

bullet icon

Communication: Ability to translate domain knowledge into graph models and explain trade-offs to non technical stakeholders.

Nice to have

bullet icon

Knowledge Graphs & Semantics: RDFS, SHACL, ontology engineering, reasoning/inference, vocabulary alignment (SKOS).

bullet icon

Graph Algorithms & Embeddings: Neo4j Graph Data Science, NetworkX, PyTorch Geometric, vector DB integration.

bullet icon

Graph + Search: Integration with Elasticsearch/OpenSearch, hybrid search (BM25 + embeddings).

bullet icon

Data Modeling: Experience migrating from relational to graph; CDC patterns (Debezium), event-driven architectures.

bullet icon

Observability: OpenTelemetry, tracing for data services; data quality frameworks (Great Expectations).

bullet icon

Delivery: Experience with productizing graph APIs, caching layers, SLA/SLO management.

bullet icon

Regulatory: Familiarity with GDPR/CCPA, data retention, sovereignty considerations.

Other
seniority icon

Languages

English: C1 Advanced

seniority icon

Seniority

Senior

Mumbai, India

Req. VR-121455

Data Science

BCM Industry

05/03/2026

Req. VR-121455

Apply for Data Engineer in Mumbai

*Indicates a required field

Under the terms of your specific consent or to perform our obligations under a contract with you, as applicable, we, Luxoft Holding Inc. will manually and electronically process your personal data, specifically your first name, last name, phone number, e-mail address and other data you provide us through this form.


Within this context, we process personal data only for the specific purpose(s) indicated in the individual consent language or other notices provided below.


We will – insofar as reasonably necessary for the purpose you have agreed to and within the scope of applicable laws – transfer your personal data to other entities within the Luxoft Group and to the group of third party recipients listed in our Privacy Notice. Such Recipients can be located outside the European Union (EU) and/or the European Economic Area (EEA) (“Third Countries”). The Third Countries concerned, e.g. the USA, may not have the level of data protection that you enjoy e.g. under the GDPR. This can result in disadvantages such as an impeded enforcement of data subjects’ rights, a lack of control over further processing and access by state authorities. You may only have limited legal remedies against this. Insofar our transfer of your personal data to recipients in Third Countries is not covered by an adequacy decision of the EU Commission, we achieve an adequate level of data protection as further detailed out in our Privacy Notice.


With your consent, we personalise marketing communications to you by way of carrying out marketing research analysis, analysing the surfing-behaviour of our website visitors and to adjust it to their detected tendencies, as well as to plan more efficient future marketing activities. This personalised marketing does not include any automated decision-making activities.


Further information on how we process personal data in general is available in our Privacy Notice. You may withdraw any given consent at any time. The withdrawal of your consent(s) will not affect the lawfulness of processing before its withdrawal. For any request in this context, please e-mail us at: DPO@luxoft.com.


Before uploading CV or any other information to this website, to learn more about your obligations and restrictions arising from the use of this website, please read our Terms of Use.