Lead Cloud Platform Engineer/Senior SRE
Req. VR-121266
Monitoring as a Platform is the first critical step toward a self-managed infrastructure. It includes capabilities such as real-time monitoring, intelligent networks, self-healing, and IoT, aimed at improving productivity, organizational agility, and employee experience.
Cloud Architecture & Infrastructure as Code (IaC)
Lead the design and implementation of highly available, multi-region AWS architectures with a primary focus on EKS (Elastic Kubernetes Service).
Extensive Terraform Tooling: Develop, maintain, and version-control modular Terraform templates to manage complex cloud resources, ensuring 100% of infrastructure is codified and reproducible.
Configuration Management: Utilize Ansible Playbooks for OS-level hardening, application configuration, and hybrid-cloud task automation.
Kubernetes Orchestration & Security
K8s Lifecycle Management: Manage the full lifecycle of EKS clusters, including upgrades, node group optimization, and cost management.
Security & Governance: Implement and enforce Kubernetes security best practices, including Service Accounts (IRSA), Network Policies, RBAC, and integrated Secrets Management (e.g., HashiCorp Vault or AWS Secrets Manager).
Containerization: Lead the effort to containerize complex legacy applications and optimize configuration patterns within Kubernetes.
CI/CD & SRE Automation
GitHub Actions Excellence: Design and optimize high-speed GitHub Actions workflows for automated testing, security scanning, and seamless deployments.
SRE Scripting: Develop advanced automation scripts (Python, Go, or Bash) to eliminate "toil," automate self-healing, and perform capacity planning.
Observability & Monitoring: Build and maintain comprehensive Grafana dashboards to monitor Pod/Service performance. Deploy and configure Beats agents (Filebeat, Metricbeat) as DaemonSets to ensure deep visibility into container logs and system metrics.
Must have
8+ years of professional experience in Cloud Engineering/DevOps, with at least 5 years of focused Kubernetes administration.
Cloud Mastery: Expert-level knowledge of AWS (VPC, IAM, EKS, RDS, Route53, S3, API Gateway, Lambda).
IaC: Expert in Terraform (Reusable modules, state management).
Automation: Strong proficiency in Ansible and SRE-focused scripting.
CI/CD: Deep experience with GitHub Actions.
Container Ecosystem: Expert knowledge of Docker, K8s networking, and the ELK/Beats stack.
Monitoring: Mastery of Grafana and Prometheus for performance tuning.
Certification: (Highly Preferred) CKA (Certified Kubernetes Administrator) or AWS Certified Solutions Architect - Professional.
Scripting Languages: Python, Shell Script
Architectural Vision: Ability to translate business requirements into scalable technical blueprints.
Mentorship: Proven track record of guiding junior and senior engineers through complex technical hurdles.
Incident Leadership: Experience leading Root Cause Analysis (RCA) and post-mortem discussions to improve system resilience.
Nice to have
Other AWS certifications are a plus
Languages
English: C1 Advanced
Seniority
Lead
Bengaluru, India
Software/System Architecture
HLS & Consumer industry
26/02/2026