ZainTECH

Data & AI Operations Specialist

Posted 8 Days Ago
Remote
Hiring Remotely in India
Mid level
The Data & AI Operations Specialist leads technical operations for AI infrastructure, manages data pipelines, and oversees MLOps across multi-cloud environments, ensuring compliance and performance optimization.
The summary above was generated by AI

The Data & AI Operations Specialist serves as the Level 3 technical lead for the Artificial Intelligence and Data Platform estate. You will be responsible for the architecture, engineering, and advanced troubleshooting of AI infrastructure, data pipelines, and MLOps lifecycles across a multi-cloud environment (Azure and OCI).

Responsibilities:

AI Infrastructure & Platform Engineering

  • Design & Architecture: Maintain the monitoring architecture for AI/ML platforms and configure advanced dashboards in Grafana and Azure Monitor.
  • Environment Governance: Manage Azure Machine Learning (AML) workspace configurations, compute targets, and Databricks cluster lifecycles (including runtime versions and platform patching).
  • Resource Optimization: Oversee GPU resource allocation, reserved capacity, and cost-performance optimization to align with FinOps goals.
  • Security Integration: Ensure all AI services utilize private endpoints, VNET integration, and RBAC controls to protect sensitive citizen data.

Data Pipeline & ETL Management

  • Pipeline Engineering: Own the design, optimization, and remediation of Azure Data Factory (ADF) and Synapse pipelines.
  • Advanced Troubleshooting: Resolve complex bottlenecks related to authentication failures, data format changes, and ETL performance.
  • SOP Leadership: Author step-by-step Standard Operating Procedures (SOPs) for the L1 NOC team to handle routine monitoring and first-line triage.

MLOps & Model Lifecycle

  • Automation: Implement CI/CD pipelines for model training, testing, and deployment to AML endpoints.
  • Model Reliability: Configure data drift detection thresholds and automated retraining triggers.
  • Recovery Operations: Develop self-healing scripts and automated recovery runbooks for critical AI workflows.

Governance & Compliance

  • Audit Management: Implement and maintain audit logging for all AI decisions and model outputs, ensuring logs flow to the SIEM/vSOC.
  • Regulatory Alignment: Conduct quarterly AI governance reviews to ensure compliance with NESA standards and data privacy guidelines.

Requirements:

  • AI/ML Platforms: Deep expertise in Azure Machine Learning and Databricks.
  • Data Integration: Proficiency in Azure Data Factory and Synapse.
  • Infrastructure-as-Code (IaC): Experience with Terraform or ARM Templates for reproducible deployments.
  • Observability: Ability to use Dynatrace, Grafana, and Azure Monitor for deep-tier diagnostics.
  • Containerization: Knowledge of AKS, Istio Service Mesh, and KEDA.
  • ITIL Mastery: Strong understanding of ITIL-aligned Incident, Change, and Problem management.
  • Security Mindset: Familiarity with NESA standards and UAE data residency requirements.
  • Technical Writing: Ability to draft complex SOPs and to deliver Root Cause Analysis (RCA) documents within 48 hours of an incident.
  • Certifications: Microsoft Azure Data Scientist Associate or Azure AI Engineer Associate is highly preferred.

Top Skills

AKS
ARM Templates
Azure Data Factory
Azure Machine Learning
Azure Monitor
Databricks
Dynatrace
Grafana
Istio Service Mesh
KEDA
Synapse
Terraform
