Lead 24×7 global cloud operations: monitor infrastructure and alerts, resolve incidents and perform root cause analysis, troubleshoot compute/storage/networking, provision/manage cloud resources, support patching/backups, follow incident/change processes, document runbooks, and collaborate on automation and security best practices.
Description and Requirements
Responsibilities
1. Provide 24×7 global operational support for cloud services, including incident resolution and root cause analysis.
2. Monitor cloud infrastructure, system alerts, and performance dashboards.
3. Perform basic troubleshooting for compute, storage, networking, and cloud services.
4. Assist in provisioning and managing cloud resources (VMs, storage accounts, databases, etc.).
5. Support patching, backups, and routine maintenance tasks.
6. Follow incident management and change management processes.
7. Document standard operating procedures, runbooks, and recurring issues.
8. Collaborate with senior engineers to implement improvements and automation.
9. Ensure compliance with security guidelines and operational best practices.
10. Be a team player.
Experience
4 to 8 years of hands-on experience
Knowledge and Skills
Required Skills:
• Basic understanding of cloud platforms. Exposure to Azure preferred.
• Knowledge of Linux/Windows servers and basic networking concepts.
• Familiarity with monitoring tools (e.g., CloudWatch, Azure Monitor, Datadog, or similar).
• Good analytical and problem‑solving skills.
• Ability to follow SOPs and work in a structured operations environment.
Good-to-Have Skills:
• Exposure to scripting (PowerShell, Bash, or Python).
• Understanding of ITIL concepts (Incidents, Changes).
• Basic knowledge of CI/CD tools or automation frameworks.
Certifications (Preferred)
• Microsoft Azure AZ‑900 Certification (Fundamentals)
About MetLife
Recognized on Fortune magazine's list of the "World's Most Admired Companies" and Fortune World's 25 Best Workplaces™, MetLife, through its subsidiaries and affiliates, is one of the world's leading financial services companies, providing insurance, annuities, employee benefits, and asset management to individual and institutional customers. With operations in more than 40 markets, we hold leading positions in the United States, Latin America, Asia, Europe, and the Middle East.
Our purpose is simple: to help our colleagues, customers, communities, and the world at large create a more confident future. United by purpose and guided by our core values (Win Together, Do the Right Thing, Deliver Impact Over Activity, and Think Ahead), we're inspired to transform the next century in financial services. At MetLife, it's #AllTogetherPossible. Join us!
#BI-Hybrid
Top Skills
Azure
Azure Monitor
Bash
CI/CD
CloudWatch
Datadog
Linux
PowerShell
Python
Windows

