Design and optimize LLM systems, manage scalable infrastructure, implement CI/CD and automation, and ensure system reliability and compliance.
Company Description
👋🏼We're Nagarro
We are a Digital Product Engineering company that is scaling in a big way! We build products, services, and experiences that inspire, excite, and delight. We work at scale across all devices and digital mediums, and our people exist everywhere in the world (17500 experts across 36 countries, to be exact). Our work culture is dynamic and non-hierarchical. We are looking for great new colleagues. That is where you come in!
Job DescriptionREQUIREMENTS:
- Experience : 7.5+ Years
- 10-12 years in infrastructure, platform, DevOps, or MLOps roles
- Strong experience with cloud platforms (AWS/GCP/Azure) and Kubernetes
- Hands-on experience deploying and operating LLMs (OpenAI, Anthropic, open-source models)
- Proficiency with GPU infrastructure, model serving frameworks, and vector databases
- Strong programming skills in Python; experience with Bash/Go is a plus
- Experience with monitoring, logging, and performance tuning for distributed systems
- Preferred Qualifications
- Experience with LLM fine-tuning, RAG pipelines, and prompt/version management
- Familiarity with tools like Terraform, Helm, Argo, Ray, or similar
- Exposure to cost optimization strategies for large-scale AI systems
Responsibilities:
- Design and manage scalable infrastructure for training, fine-tuning, serving, and monitoring LLMs
- Build and maintain LLMOps pipelines (deployment, versioning, rollback, monitoring, evaluation)
- Optimize inference performance (latency, throughput, cost) across GPU/accelerator stacks
- Implement CI/CD, IaC, and automation for AI/ML workloads
- Ensure observability, reliability, and governance of LLM systems in production
- Collaborate with ML, platform, and product teams to operationalize AI solutions
- Manage security, compliance, and access control for model and data pipelines
Bachelor’s or master’s degree in computer science, Information Technology, or a related field.
Top Skills
Aws,Gcp,Azure,Kubernetes,Python,Bash,Go,Tensorflow,Pytorch,Terraform,Helm,Argo,Ray
Nagarro Gurugram, Haryana, IND Office
13, Sub. Major Laxmi Chand Rd, Maruti Udyog, Sector 18, Gurugram, Haryana, India, 122015
Similar Jobs
Artificial Intelligence • Information Technology • Machine Learning • Software • Virtual Reality • Analytics
The Senior Staff Engineer will lead AI platform development, design reusable frameworks, and manage MLOps for ML workloads, ensuring engineering quality and mentoring team members.
Top Skills:
AzureCompass Ai ServicesGenaiMachine LearningMlopsPython
Artificial Intelligence • Information Technology • Machine Learning • Software • Virtual Reality • Analytics
Lead the architecture and implementation of Salesforce Consumer Goods Cloud solutions, ensuring best practices for trade promotions and integration with ERP systems while overseeing data governance and analytics.
Top Skills:
ApexAPIsCRMEinstein AnalyticsErpLwcMicrosoft D365OracleSalesforceSAPTableau
Artificial Intelligence • Information Technology • Machine Learning • Software • Virtual Reality • Analytics
Seeking a Senior Staff Engineer with a focus on leadership and mentorship, capable of addressing complex client challenges and enhancing team capabilities.
What you need to know about the Delhi Tech Scene
Delhi, India's capital city, is a place where tradition and progress co-exist. While Old Delhi is known for its rich history and bustling markets, New Delhi is defined by its modern architecture. It's clear the region places a strong emphasis on preserving its cultural heritage while embracing technological advancements, particularly in artificial intelligence, which plays a central role in shaping the city's tech landscape, fueled by investments in research and development.
