Design and optimize LLM systems, manage scalable infrastructure, implement CI/CD and automation, and ensure system reliability and compliance.
Company Description
👋🏼We're Nagarro
We are a Digital Product Engineering company that is scaling in a big way! We build products, services, and experiences that inspire, excite, and delight. We work at scale across all devices and digital mediums, and our people exist everywhere in the world (17500 experts across 36 countries, to be exact). Our work culture is dynamic and non-hierarchical. We are looking for great new colleagues. That is where you come in!
Job DescriptionREQUIREMENTS:
- Experience : 7.5+ Years
- 10-12 years in infrastructure, platform, DevOps, or MLOps roles
- Strong experience with cloud platforms (AWS/GCP/Azure) and Kubernetes
- Hands-on experience deploying and operating LLMs (OpenAI, Anthropic, open-source models)
- Proficiency with GPU infrastructure, model serving frameworks, and vector databases
- Strong programming skills in Python; experience with Bash/Go is a plus
- Experience with monitoring, logging, and performance tuning for distributed systems
- Preferred Qualifications
- Experience with LLM fine-tuning, RAG pipelines, and prompt/version management
- Familiarity with tools like Terraform, Helm, Argo, Ray, or similar
- Exposure to cost optimization strategies for large-scale AI systems
Responsibilities:
- Design and manage scalable infrastructure for training, fine-tuning, serving, and monitoring LLMs
- Build and maintain LLMOps pipelines (deployment, versioning, rollback, monitoring, evaluation)
- Optimize inference performance (latency, throughput, cost) across GPU/accelerator stacks
- Implement CI/CD, IaC, and automation for AI/ML workloads
- Ensure observability, reliability, and governance of LLM systems in production
- Collaborate with ML, platform, and product teams to operationalize AI solutions
- Manage security, compliance, and access control for model and data pipelines
Bachelor’s or master’s degree in computer science, Information Technology, or a related field.
Top Skills
Aws,Gcp,Azure,Kubernetes,Python,Bash,Go,Tensorflow,Pytorch,Terraform,Helm,Argo,Ray
Nagarro Gurugram, Haryana, IND Office
13, Sub. Major Laxmi Chand Rd, Maruti Udyog, Sector 18, Gurugram, Haryana, India, 122015
Similar Jobs
Artificial Intelligence • Information Technology • Machine Learning • Software • Virtual Reality • Analytics
Lead the development of enterprise-grade AI platforms, focusing on workflows, ML pipelines, and ensuring engineering quality while mentoring a team.
Top Skills:
AzureCompass AiGenaiMlPythonRag
Artificial Intelligence • Information Technology • Machine Learning • Software • Virtual Reality • Analytics
The Senior Staff Engineer will develop automation features, manage cloud infrastructure on AWS, and optimize CI/CD pipelines. Responsibilities include code reviews, documentation, and leveraging infrastructure as code tools like Terraform and Ansible.
Top Skills:
Ai Coding AssistantsAnsibleAWSBashDockerGitJenkinsKubernetesPythonTerraform
Artificial Intelligence • Information Technology • Machine Learning • Software • Virtual Reality • Analytics
Lead the architecture and implementation of Salesforce Consumer Goods Cloud solutions, ensuring best practices for trade promotions and integration with ERP systems while overseeing data governance and analytics.
Top Skills:
ApexAPIsCRMEinstein AnalyticsErpLwcMicrosoft D365OracleSalesforceSAPTableau
What you need to know about the Delhi Tech Scene
Delhi, India's capital city, is a place where tradition and progress co-exist. While Old Delhi is known for its rich history and bustling markets, New Delhi is defined by its modern architecture. It's clear the region places a strong emphasis on preserving its cultural heritage while embracing technological advancements, particularly in artificial intelligence, which plays a central role in shaping the city's tech landscape, fueled by investments in research and development.
