Nokia Logo

Nokia

Senior Data Engineer

Reposted 5 Days Ago
Be an Early Applicant
Remote or Hybrid
Hiring Remotely in India
Senior level
Remote or Hybrid
Hiring Remotely in India
Senior level
Design and maintain scalable data pipelines, optimize storage solutions, ensure data quality, mentor junior engineers, and collaborate with various teams.
The summary above was generated by AI

We are seeking a highly skilled and experienced Senior Data Engineer to join our growing team in Bangalore, India. We operate a large-scale private cloud infrastructure spanning thousands of servers across multiple data centers, built on OpenStack, Kubernetes, Ceph, and VMware. In this role, you will design, build, and maintain scalable data pipelines that collect, process, and deliver data from across this infrastructure to power analytics, capacity planning, cost optimization, and AI/ML initiatives. You will collaborate closely with data scientists, platform engineers, SRE, and product teams to deliver robust, real-time, and batch data solutions.

Responsibilities
  • Design, develop, and maintain scalable data pipelines for ingestion, transformation, and delivery of structured and unstructured data

  • Build and optimize real-time streaming architectures using Apache Kafka and related ecosystem tools

  • Develop and manage ETL/ELT workflows using dbt (dbt Labs) to support analytics, reporting, and AI/ML model training

  • Implement data collection strategies from diverse infrastructure sources including OpenStack, Kubernetes, Ceph, VMware, and ServiceNow (Snow), as well as APIs, databases, and log files

  • Collaborate with AI/ML teams to build feature stores and prepare training datasets at scale

  • Ensure data quality, integrity, and governance through monitoring, validation, automated testing frameworks, and metadata management using DataHub

  • Implement and maintain data quality validation across pipelines (e.g. Great Expectations) to ensure correctness, completeness, consistency, and freshness of data at every stage

  • Optimize data storage and processing solutions within a private cloud environment (OpenStack, Ceph, Kubernetes)

  • Build and manage observability and monitoring solutions with strong emphasis on the ELK stack (Elasticsearch, Logstash, Kibana) and Prometheus as core platforms, complemented by OpenTelemetry for distributed tracing and telemetry collection

  • Mentor junior engineers and contribute to engineering best practices and technical documentation
     

Qualifications

You have:

  • Bachelor’s or master’s degree in computer science, Data Engineering, or a related field with 12+ years of professional experience and 6+yrs experience in data engineering or a closely related discipline. Strong expertise in data pipeline design, data modelling, and data manipulation at scale.

  • Strong hands-on experience with the ELK stack (Elasticsearch, Logstash, Kibana) and Prometheus — these are essential to the role.

  • Deep experience with SQL and NoSQL databases (PostgreSQL, MongoDB, Cassandra, etc.)

  • Hands-on experience with Apache Kafka (or equivalent streaming platforms such as Apache Pulsar)

  • Experience with dbt (dbt Labs) for data transformation, modelling, and testing

  • Experience with data quality frameworks (e.g. Great Expectations) and pipeline validation practices such as data contracts, automated testing, and anomaly detection

  • Solid knowledge of big data technologies such as Apache Spark, Hadoop, or Flink

  • Experience with open table formats, particularly Apache Iceberg, for large-scale data lakehouse architectures

  • Familiarity with private cloud platforms (OpenStack, VMware) and containerization (Docker, Kubernetes)

  • Experience with OpenTelemetry for instrumentation, distributed tracing, and telemetry data collection

Nice to have:

  • Proficiency in Python, Scala, or Java for data processing and automation

  • Experience building data infrastructure to support AI/ML workflows and model serving

  • Familiarity with LLM tooling, vector databases (e.g. Milvus), and AI data pipelines

  • Knowledge of data governance frameworks, compliance standards, and metadata platforms such as DataHub

  • Experience with orchestration tools such as Apache Airflow or Prefect. Experience collecting and processing data from Ceph storage clusters, OpenStack APIs, or VMware vCenter

  • Familiarity with ServiceNow (Snow) for CMDB, ITSM data extraction, and asset management reporting.

  • Contributions to open-source data engineering projects

Nokia Gurugram, Haryana, IND Office

Sector 62, , Ghata, Gurugram, Haryana, India, 122102

Similar Jobs

3 Days Ago
Remote or Hybrid
Senior level
Senior level
Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
The role involves designing and building cloud-based data solutions, managing data pipelines, ensuring data quality and performance optimization while leading a team in data engineering.
Top Skills: Aecorsoft/DatasphereAirflowBigQueryDatabricksDbtErwinGcp Cloud ServicesPl/SqlPostgres SqlPower BIPythonSQLTableau
3 Days Ago
In-Office or Remote
Senior level
Senior level
Cloud • Information Technology • Productivity • Security • Software • App development • Automation
The Senior Data Engineer will design data architecture, build scalable data pipelines, mentor junior engineers, collaborate with stakeholders, and optimize data solutions.
Top Skills: AirflowBitbucketDatabricksDynamo DbGitMongo DbPostgresPythonRedshiftScalaSparkSQL
26 Minutes Ago
Easy Apply
Remote or Hybrid
India
Easy Apply
Senior level
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
The role involves managing data pipelines in cybersecurity, collaborating with teams to implement solutions, and troubleshooting issues efficiently using Python and SQL.
Top Skills: APIsCloud LogsEdrPythonSIEMSQLUnified Vulnerability Management

What you need to know about the Delhi Tech Scene

Delhi, India's capital city, is a place where tradition and progress co-exist. While Old Delhi is known for its rich history and bustling markets, New Delhi is defined by its modern architecture. It's clear the region places a strong emphasis on preserving its cultural heritage while embracing technological advancements, particularly in artificial intelligence, which plays a central role in shaping the city's tech landscape, fueled by investments in research and development.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account