Pravāh Logo

Pravāh

Weather Data Scientist (Data Assimilation)

Reposted 3 Days Ago
Be an Early Applicant
In-Office
New Delhi, Delhi, IND
Senior level
In-Office
New Delhi, Delhi, IND
Senior level
The Weather Data Scientist will develop and implement data assimilation pipelines for weather forecasting, focusing on observational data quality and machine learning-driven forecasts.
The summary above was generated by AI
Weather Data Scientist: Data Assimilation


Working hours: The team is distributed across India and the US, so expect a few hours of evening overlap with US Pacific Time on most workdays.


Overview

About Pravāh

Pravāh is an AI lab building foundational intelligence for the electric grid. We apply modern machine learning to complex physical infrastructure problems spanning grid operations, weather, and geospatial systems.

Our work sits at the intersection of computer vision, physical systems, and large-scale ML, with deployments across utilities in the United States and India. We leverage multimodal data including satellite imagery, LiDAR, and street-level data to build high-fidelity representations of grid assets and their surroundings.

We are backed by Khosla Ventures, Pear VC, and Conviction - some of the most ambitious investors in Silicon Valley.

More about who we are, what we are building, and why we are excited: Website, Pravāh on Notion.


The role

We are hiring a Weather Data Scientist to advance the next generation of weather forecasting systems for India, with strong attention to observational data quality and geospatial consistency. You will work closely with machine learning and software engineers on three core threads:

1. Data assimilation: contribute hands-on to data assimilation for weather forecasting models.

2. ML-ready datasets: procure, process, and create ML-ready global and regional weather datasets at large scale (high volume, multi-source, long time horizons), with explicit focus on data-sparse regions.


What you'll work on

· Build and operate a cycling data assimilation pipeline for our operational forecasting models, and produce the high-resolution gridded products it enables downstream.

· Choose, deploy, and adapt a modern DA framework (e.g. JEDI/UFO, GSI, DART, PDAF) for our regional and global needs.

· Develop observation quality control, bias correction (VarBC), and thinning workflows that hold up at operational data volumes and degrade gracefully when feeds drop out.

· Contribute to AI-based data assimilation pipelines.

· Tailor weather prediction models to renewable-sector needs, particularly solar (GHI) and wind generation (100m winds).

· Assist in training AI-based weather prediction models.

· Work at the intersection of physics-based modeling and machine learning—hybrid physics–ML systems, learned parameterizations, and emulators.

Who you areRequired qualifications

· A master's or PhD in geophysical sciences, physics, applied mathematics, computer science, statistics, or a related field. A bachelor's degree with 3+ years of relevant research or operational experience is also acceptable.

· Demonstrated depth in data assimilation, evidenced by operational work, model contributions, research projects, publications, or technical reports.

· Hands-on experience across the DA toolkit: observation operators and error specification; variational (3D-/4D-Var) or ensemble (EnKF, LETKF, EDA) methods; cycling workflows and innovation statistics; and assimilation of satellite, radar, radiosonde, or station observations.

· Hands-on experience with at least one operational DA framework: JEDI/UFO, GSI, DART, PDAF, or an in-house equivalent, including building observation operators and forward models.

· Working knowledge of bias correction (VarBC), adaptive QC, and gross-error rejection.

· Experience contributing to or maintaining assimilation code, or holding responsibility in an operational or quasi-operational forecasting pipeline.

· Experience working with TB-scale, high-dimensional observational and modeling datasets (reanalysis, satellite, radar, weather-station, and sounding data) and the geospatial pipework (grids, reprojection, masks) around them.

· Hands-on experience with widely used reference datasets such as ERA5, MERRA-2, IMDAA, IMERG/GPM, and GOES/INSAT/Himawari.

· Practical experience on High Performance Computers (HPCs).

· Fluency in the modern geoscience Python stack—xarray, dask, zarr, netCDF.

· Experience building reproducible, production-grade pipelines.

· Excellent written and verbal communication, including the ability to explain technical work to both domain experts and cross-disciplinary collaborators.

Nice to have

· Prior work on projects specific to Indian geography.

· Familiarity with coupled earth-system models.

· Experience with any of: ensemble and probabilistic forecasting, regional downscaling, or subseasonal-to-seasonal (S2S) prediction.

· Experience working with operational forecasting agencies (IMD, NCMRWF, ECMWF, NOAA, etc.).

· Familiarity with AI-based weather prediction models and data assimilation techniques.

· Comfort using agentic AI tools to accelerate development.

· Publications in respected atmospheric, oceanic, or climate science venues.


What you'll gain

· Part of development of weather forecasting models deployed for real-time applications.

· Experience working on hard, open-ended problems at the intersection of AI and physical infrastructure.

· Exposure to how teams set priorities and push the frontier of AI weather prediction.

· Close collaboration with a deeply technical team.


Why this role

This role sits at the frontier of the AI weather revolution, applying modern machine learning to earth system modeling. The next decade of progress in weather and climate prediction will be built by scientists who understand the physics and the data and have learned to wield generative AI. You will work in data-sparse regions where data is heterogeneous, ground truth is incomplete, and progress requires both technical depth and first-principles thinking.

Similar Jobs

2 Hours Ago
Easy Apply
Remote or Hybrid
India
Easy Apply
Senior level
Senior level
Big Data • Cloud • Software • Database
Lead pre-sales technical efforts with partners and customers to design, demo, and deploy MongoDB-based solutions. Support account teams with discovery, proofs of value, architecture, performance tuning, and partner enablement. Serve as a trusted advisor, translate technical decisions into business value, mentor peers, and represent MongoDB at events and partner engagements.
Top Skills: Angular.JsApache KafkaAtlasAtlas Data LakeAtlas Full-Text SearchAtlas Stream ProcessingAtlas Vector SearchAWSCC#C++Cloud DevopsCloud ManagerCompassConnector For BiConnector For SparkCSSCursorDatabase BenchmarkingHTMLInfrastructure As CodeJavaKubernetesLovableMongoDBNode.jsOps ManagerPerformance TuningProfilingPythonReactSQLTypescriptV0VueWindsurf
6 Hours Ago
Remote or Hybrid
India
Senior level
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Design, build, and operate agentic LLM-powered workflows, autonomous agents, and RAG/vector retrieval systems. Own end-to-end delivery including CI/CD, IaC, observability, DevSecOps, and Salesforce integrations to productionize enterprise AI for GTM applications.
Top Skills: AgentcoreAgentforceAutogenAws BedrockCdkCopadoCrewaiDastDockerGithub ActionsGitopsJavaScriptJenkinsKubernetesLangchainLanggraphLightning Web ComponentsModel Context Protocols (Mcp)PgvectorPineconePlatform EventsPythonSalesforce ApexSastSemantic KernelService MeshSlackTerraformTypescriptVertex AiWeaviate
6 Hours Ago
Remote or Hybrid
India
Senior level
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Lead a distributed SRE team owning CI/CD platform reliability, automation, observability, and data infrastructure. Provide people management, technical direction, architecture input, operational excellence, and cross-team collaboration while driving automation, monitoring, and AI-assisted workflows.
Top Skills: AnsibleApache AirflowSparkAWSAzureBashBazelBitbucketChefDatadogGCPGitGithub ActionsGitlabGitlab CiGoGrafanaHumio/LogscaleJenkinsKafkaKubernetesNasNfsObject StorageOpensearchOraclePostgresPowershellPrometheusPulsarPuppetPythonRedisRedpandaSanSli/SloSplunkTerraformValleyVarnish

What you need to know about the Delhi Tech Scene

Delhi, India's capital city, is a place where tradition and progress co-exist. While Old Delhi is known for its rich history and bustling markets, New Delhi is defined by its modern architecture. It's clear the region places a strong emphasis on preserving its cultural heritage while embracing technological advancements, particularly in artificial intelligence, which plays a central role in shaping the city's tech landscape, fueled by investments in research and development.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account