Infrastructure Architect
Req. VR-120922
We are seeking an experienced Infrastructure Architect with deep expertise in designing and delivering end‑to‑end on‑premises AI and data‑center infrastructure. In this role, you will architect and guide implementations spanning GPU compute clusters, high‑performance storage, data‑center networking fabrics, and automation frameworks, ensuring the environment is secure, resilient, and optimized for AI/ML workloads.
This position requires both high‑level architectural vision and hands‑on design across compute, network, storage, and control‑plane platforms.
End‑to‑End Infrastructure Architecture
Own and evolve the reference architecture for on‑prem AI compute ecosystems, including GPU servers, accelerators, and DPUs.
Design GPU clustering strategies and partitioning models (MIG, MPS) for multi‑tenant training and inference workloads.
Define rack‑to‑control‑plane architecture, aligning hardware, storage, network fabric, and Kubernetes/OpenShift environments.
Data Center Physical & Logical Design
Develop hardware BOMs, rack elevations, cabling schematics, and power/cooling envelopes.
Ensure designs align with modern data‑center practices, including hot/cold aisle strategy, airflow optimization, and liquid‑cooling readiness.
High‑Performance Networking
Architect high‑performance data‑center fabrics such as spine‑leaf topologies, RoCEv2/InfiniBand, and high‑speed Ethernet (400G/800G).
Define network segmentation, QoS, and isolation strategies for multi‑tenant AI infrastructures.
Storage Architecture
Design scalable, high‑throughput storage solutions, including PowerScale, NVMe tiering, and object storage systems for AI/ML workloads.
Control Plane & Orchestration
Architect and harden Kubernetes/OpenShift control‑plane environments with HA topologies and GPU scheduling, ensuring Day‑0/1/2 operational readiness.
Capacity & Performance Engineering
Build capacity models covering GPU/CPU utilization, memory, storage I/O throughput, and network bandwidth aligned with model sizes and data‑ingestion patterns.
Must have
8+ years of experience in infrastructure architecture across compute, network, and storage domains.
Deep knowledge of:
GPU compute platforms, clustering, and partitioning (MIG, MPS).
High‑performance data‑center fabrics: spine‑leaf, RoCEv2/InfiniBand, 400G/800G Ethernet.
Scale‑out storage systems (PowerScale, NVMe, object storage).
Kubernetes/OpenShift control‑plane design and HA patterns.
Experience with data‑center physical design (power, cooling, cabling, thermal).
Strong automation background (PowerShell, Terraform, Ansible).
Expertise in capacity planning, performance engineering, and resilience design.
Nice to have
N/A
Languages
English: C1 Advanced
Seniority
Lead
Gurugram, India
IT Infrastructure Engineering
BCM Industry
11/02/2026