3Pillar Global Jobs

Lead Data Engineer with AI experience

3Pillar Global

Lead Data Engineer with AI experience

Reposted 17 Days Ago

Be an Early Applicant

Remote

Hiring Remotely in India

Senior level

Remote

Hiring Remotely in India

Senior level

Lead Data Engineer to design, build, and operate production data pipelines, retrieval/vector infrastructure, semantic/feature stores, and ML/LLMOps foundations. Drive CI/CD, governance, monitoring, and agent/data APIs for RAG, LLM, and predictive model workloads.

The summary above was generated by AI

3Pillar is an AI transformation partner on a mission to help enterprises build the AI-native products and intelligent agents that will define the next era of business. With teams across North America, Europe, Latin America, and Asia, we work with the most ambitious companies in financial services, healthcare, media, and technology — helping them move faster, modernize boldly, and compete on their own terms. Our HelixAI platform and Helix Pods delivery model put our engineers at the center of real agentic transformation — doing work that is open, portable, and built to last. We are building the future of enterprise AI

We are looking Lead Data Engineer to build, operate, and continuously improve the
data pipelines, retrieval infrastructure, and ML/LLMOps foundations that power our AI
initiatives. The resource will work on turning reference architectures and data contracts
into robust, production-grade implementations that serve conversational AI assistants,
dashboard copilots, autonomous agents, RAG applications, and predictive ML models.

Key Responsibilities:

Data Pipeline Engineering : Build, test, and maintain production pipelines (batch & real-time) on Snowflake, PySpark, Delta Lake, and Kafka.

Implement data quality checks, schema validation, and alerting at every pipeline stage.

Migrate legacy ETL/DWH to cloud-native AWS/Azure architectures with measurable latency and cost improvements.

Maintain CI/CD pipelines: automated testing, deployment, rollback, and IaC (Terraform, GitHub Actions).

RAG, Vector & Retrieval Infrastructure: Build end-to-end retrieval infrastructure: document ingestion, embedding pipelines, vector store management (Pinecone, FAISS, ChromaDB, OpenSearch), and hybrid retrieval layers.

Implement chunking, metadata filtering, and re ranking — tuning for precision, recall, and latency.

Maintain data freshness and index consistency; instrument with context relevance and faithfulness metrics.

Semantic Layer & Knowledge Infrastructure: Implement and maintain business entity mappings, ontologies, and knowledge graphs (Neo4j) per Architect design.

Build and version the feature store and semantic data contracts serving both ML models and LLM applications.

Manage metadata, data lineage, and audit trail instrumentation across the platform.

ML/LLMOps Pipeline Support: Build ML data infrastructure: training curation, feature engineering, MLflow experiment tracking, dataset versioning.

Support LLM fine-tuning workflows — corpus curation, quality filtering, dataset formatting.

Implement automated evaluation pipelines: factual accuracy, hallucination detection, regression tracking.

Maintain production monitoring dashboards for pipeline health, model metrics, and alerting.

Agentic Data Infrastructure: Build and maintain data APIs, tool schemas, and memory/state stores that autonomous agents depend on.

Implement agent observability: capture inputs, retrieved context, tool calls, reasoning traces, and outputs.

Maintain text-to-SQL layers, semantic query interfaces, and context APIs for conversational AI consumers.

Governance, Security & Data Quality: Implement RBAC, attribute-based access, PII detection/masking, data classification, and audit logging.

Enforce data contracts and schema governance with automated breaking-change detection and versioned migrations.

Build data quality monitoring (completeness, freshness, consistency) with automated alerting and root-cause tooling.

Support compliance readiness: audit trails, data provenance, and regulatory documentation.

Qualifications:

7+ years data engineering using Cloud services
2+ years production AI/ML or LLM-era data infrastructure. Proven experience building production pipelines at scale — batch and streaming, Snowflake,AWS/Azure.
Deep expertise: Python, PySpark, Snowflake, Delta Lake, Kafka, Spark Structured Streaming.
Hands-on with vector stores, embedding pipelines, and retrieval infrastructure in production RAG environments.
Working knowledge of MLOps: MLflow, CI/CD for AI, automated evaluation, and production monitoring.
Strong grounding in data governance, quality frameworks, and compliance-
aligned engineering.

Technical Skills:

Primary skills: Python, SQL, PySpark, Kafka, Snowflake/DataBricks, Delta Lake, AWS (S3, Glue, Kinesis, EKS, Redshift), Docker, Kubernetes, GitHub Actions.
Secondary Skills : LangChain, LlamaIndex, LLM APIs (OpenAI, Bedrock, Claude, HuggingFace), Pinecone, FAISS, ChromaDB, OpenSearch, MLflow, FastAPI, Neo4j, LangGraph, prompt engineering, RLHF dataset prep, LLM fine-tuning workflows

What It's Like to Work at 3Pillar:

At 3Pillar, we create an environment where people can do their best work while maintaining a healthy work-life balance.

Flexibility & Well-being – Our remote-first approach gives you the flexibility to work where you perform best, while prioritizing your well-being and personal commitments.

Global Community – Collaborate with talented colleagues across the globe in a culture built on connection, support, and shared success.

Your Voice Matters – We foster open communication and multiple feedback channels, ensuring every employee has the opportunity to be heard and make an impact.

Growth & Development – Gain exposure to diverse clients, industries, and challenges that accelerate learning and career growth.

Our culture is guided by four core values: Collaboration, Outperform, Respect, and Evolve—the principles that shape how we work, grow, and succeed together.

Thank you,

Kiran Dhanak

Manager, Talent Acquisition

Noida, India

Similar Jobs

Motive

Senior Product Manager

An Hour Ago

Easy Apply

Remote

India

Easy Apply

Senior level

Artificial Intelligence • Fintech • Hardware • Information Technology • Sales • Software • Transportation

Own and roadmap fleet management product experiences for dispatchers, operators, and drivers. Translate customer pain into web and mobile workflows, define metrics, run betas, work with data scientists to build AI features from sensor data, and collaborate cross-functionally while gathering field feedback.

Top Skills: AIAutomotive SystemsIndustrial IotSQLTelematics

Atlassian

Machine Learning Engineer

5 Hours Ago

In-Office or Remote

Expert/Leader

Cloud • Information Technology • Productivity • Security • Software • App development • Automation

Lead design and implementation of advanced ML systems: architect models and services, run experiments and evaluations, deploy and scale models (MLOps), build RESTful ML APIs, and mentor junior ML engineers to integrate AI functionality across Atlassian products.

Top Skills: KerasLlmsMlopsNumpyPandasPythonPyTorchRestful ApisScikit-LearnTensorFlow

Legora

WORLD'S BEST GOLFING LAWYER (Global)

7 Hours Ago

In-Office or Remote

India

Mid level

Artificial Intelligence • Legal Tech • Software

Select the world’s best golf-playing practicing lawyer to attend the Legora Invitational in NYC on Sept 2, 2026. Play golf with leading lawyers and Ludvig Åberg, present a written, specific case about how you use Legora, demonstrate putting ability, and join discussions about the future of law and legal technology. Travel and accommodation covered.

What you need to know about the Delhi Tech Scene

Delhi, India's capital city, is a place where tradition and progress co-exist. While Old Delhi is known for its rich history and bustling markets, New Delhi is defined by its modern architecture. It's clear the region places a strong emphasis on preserving its cultural heritage while embracing technological advancements, particularly in artificial intelligence, which plays a central role in shaping the city's tech landscape, fueled by investments in research and development.

3Pillar Global

Lead Data Engineer with AI experience

3Pillar Global Noida, Uttar Pradesh, IND Office

Similar Jobs

Senior Product Manager

Machine Learning Engineer

WORLD'S BEST GOLFING LAWYER (Global)

What you need to know about the Delhi Tech Scene