The DevOps Engineer will design, build, and optimize cloud infrastructure for machine learning operations, manage CI/CD pipelines, and ensure reliability across systems.
We’re looking for a DevOps Engineer to help design, build, and optimize the cloud infrastructure powering our machine learning operations. You’ll play a key role in scaling AI models from research to production — ensuring smooth deployments, real-time monitoring, and rock-solid reliability across our Google Cloud Platform (GCP) environment.
You’ll work hand-in-hand with data scientists, ML engineers, and other DevOps experts to automate workflows, enhance performance, and keep our AI systems running seamlessly for millions of players worldwide.
What You’ll Do
- Manage, configure, and automate cloud infrastructure using tools such as Terraform and Ansible.
- Implement CI/CD pipelines for ML models and data workflows, focusing on automation, versioning, rollback, and monitoring with tools like Vertex AI, Jenkins, and DataDog.
- Build and maintain scalable data and feature pipelines for both real-time and batch processing using BigQuery, BigTable, Dataflow, Composer, Pub/Sub, and Cloud Run.
- Set up infrastructure for model monitoring and observability — detecting drift, bias, and performance issues using Vertex AI Model Monitoring and custom dashboards.
- Optimize inference performance, improving latency and cost-efficiency of AI workloads.
- Ensure overall system reliability, scalability, and performance across the ML/Data platform.
- Define and implement infrastructure best practices for deployment, monitoring, logging, and security.
- Troubleshoot complex issues affecting ML/Data pipelines and production systems.
- Ensure compliance with data governance, security, and regulatory standards, especially for real-money gaming environments.
What We’re Looking For
- 3+ years of experience as a DevOps Engineer, ideally with a focus on ML and Data infrastructure.
- Strong hands-on experience with Google Cloud Platform (GCP) — especially BigQuery, Dataflow, Vertex AI, Cloud Run, and Pub/Sub.
- Proficiency with Terraform (and bonus points for Ansible).
- Solid grasp of containerization (Docker, Kubernetes) and orchestration platforms like GKE.
- Experience building and maintaining CI/CD pipelines, preferably with Jenkins.
- Strong understanding of monitoring and logging best practices for cloud and data systems.
- Scripting experience with Python, Groovy, or Shell.
- Familiarity with AI orchestration frameworks (LangGraph or LangChain) is a plus.
- Bonus points if you’ve worked in gaming, real-time fraud detection, or AI-driven personalization systems.
Similar Jobs
Artificial Intelligence • Software • Industrial • Manufacturing
Design, build, and operate secure, tenant-isolated cloud infrastructure and CI/CD for an AI-native multi-tenant platform. Implement observability, policy-as-code, SLAs, incident response, and automate provisioning (Terraform/Helm). Support deployment, monitoring, and secure operations for ML/agent systems at scale.
Top Skills:
ArgocdAWSAzureCi/CdDockerDustEmbedding PipelinesEncryptionGCPGithub ActionsGitopsGke)Gpu ScalingGrafanaHelmIamImage ScanningKey ManagementKubernetes (EksLangchainLanggraphOpaOpentelemetryPineconePrometheusPulumiQdrantReactRegoRetrieval-Augmented Generation (Rag)Secrets RotationSentryTerraformTgiVector DbsVllmVpcWeaviate
Information Technology • Software
The DevOps Engineer will design and maintain cloud infrastructure, improve reliability, manage CI/CD pipelines, and automate provisioning while enhancing collaboration with software engineers and focusing on operational excellence.
Top Skills:
AWSAzureBashCi/CdDockerGCPGoKubernetesLinuxPythonTerraform
Fintech • Payments • Financial Services
Design, automate, and manage Azure cloud infrastructure for a marketplace. Lead Infrastructure-as-Code with Terraform, build reusable modules, maintain CI/CD pipelines (Azure DevOps), script operational workflows with Bash, containerize apps with Docker, manage cloud networking and WAF rules, and collaborate with development, security, and QA to ensure reliable, secure deployments.
Top Skills:
.NetAzureAzure DevopsBashDnsDockerGithub ActionsGitlabKubernetesLinuxNetwork Security Group (Nsg)Private NetworkingSubnetTerraformVnetWeb Application Firewall (Waf)
What you need to know about the Delhi Tech Scene
Delhi, India's capital city, is a place where tradition and progress co-exist. While Old Delhi is known for its rich history and bustling markets, New Delhi is defined by its modern architecture. It's clear the region places a strong emphasis on preserving its cultural heritage while embracing technological advancements, particularly in artificial intelligence, which plays a central role in shaping the city's tech landscape, fueled by investments in research and development.



