Apiphany Logo

Apiphany

Associate Data Scientist

Posted 8 Days Ago
Remote
Hiring Remotely in India
Junior
Remote
Hiring Remotely in India
Junior
Prepare, clean, and validate structured and unstructured data for LLM-driven systems; build training datasets, support RAG and NL->SQL pipelines, perform data quality checks, and assist in data pipelines/APIs and model evaluation.
The summary above was generated by AI
Role Overview

We are seeking an Associate Data Scientist to support AI/ML engineering efforts by preparing, validating, and structuring data for LLM-driven systems. This is a hands-on role focused on real-world data processing, pipeline support, and model evaluation.

Key Responsibilities
  • Process and clean structured and unstructured data for AI/ML pipelines.

  • Prepare training-ready datasets for LLM fine-tuning and evaluation workflows.

  • Support RAG and NL→SQL systems through data preparation and validation.

  • Perform data quality checks and ensure completeness and consistency.

  • Assist in building and maintaining data pipelines and APIs (e.g., FastAPI).

  • Collaborate with engineering teams to troubleshoot and optimize data workflows.

Required Skills
  • 1–3 years of experience in data processing or data-focused roles.

  • Strong Python skills with experience in data libraries (Pandas, NumPy, Scikit-learn).

  • Experience supporting LLM workflows (fine-tuning, prompt engineering, evaluation).

  • Familiarity with structured (SQL) and unstructured text data.

  • Understanding of data preparation for AI/ML systems.

Nice to Have
  • Exposure to RAG pipelines, embeddings, or evaluation metrics.

  • Experience with ML frameworks (PyTorch/TensorFlow) and Docker-based workflows.

  • Experience with CI/CD pipelines for ML systems.

  • Familiarity with vector databases (e.g., Chroma) and reranking techniques.

  • Research exposure to Transformer-based architectures.

Top Skills

Python,Pandas,Numpy,Scikit-Learn,Sql,Fastapi,Llms

Similar Jobs

4 Hours Ago
Remote or Hybrid
Pune, Maharashtra, IND
Mid level
Mid level
Artificial Intelligence • Cloud • Information Technology • Sales • Security • Software • Cybersecurity
Contribute to the development and monitoring of ML and LLM-based security models, including data acquisition, model evaluation, and deployment on AWS infrastructure.
Top Skills: AWSBedrockCloudwatchGithub ActionsHuggingface TransformersJenkinsLambdaLangchainNumpyPandasPythonPyTorchS3SagemakerScikit-LearnTensorFlow
4 Hours Ago
Remote or Hybrid
Pune, Maharashtra, IND
Senior level
Senior level
Artificial Intelligence • Cloud • Information Technology • Sales • Security • Software • Cybersecurity
As a Senior Software Engineer in Test, you will ensure product quality through testing strategies, developing automation frameworks, and mentoring junior members.
Top Skills: AWSCucumberGoJavaJIRANunitPlaywrightPythonRobotframeworkSelenium
8 Hours Ago
Remote or Hybrid
4 Locations
Senior level
Senior level
Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
Lead the design and execution of knowledge management solutions within enterprise transformation programs, ensuring knowledge assets are captured and reused effectively.
Top Skills: AIAutomationBloomfireBusiness Process ManagementInformation ScienceKnowledge Management

What you need to know about the Delhi Tech Scene

Delhi, India's capital city, is a place where tradition and progress co-exist. While Old Delhi is known for its rich history and bustling markets, New Delhi is defined by its modern architecture. It's clear the region places a strong emphasis on preserving its cultural heritage while embracing technological advancements, particularly in artificial intelligence, which plays a central role in shaping the city's tech landscape, fueled by investments in research and development.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account