NextHire Consulting

Lumiq - Data Engineer

Posted 2 Days Ago
In-Office
Noida, Gautam Buddha Nagar, Uttar Pradesh, IND
Mid level
Design, build, and maintain scalable data pipelines and ETL/ELT workflows using PySpark and SQL. Orchestrate jobs with Apache Airflow, leverage AWS data services (S3, Glue, EMR, Redshift, Lambda, EC2), and work with Hadoop ecosystem components. Ensure data quality, transformation, cleansing, and performance optimization to support analytics and reporting while collaborating with data scientists and engineers.
The summary above was generated by AI
Job Title: Data Engineer

Experience: 3–6 Years
Location: Noida
Employment Type: Full-Time
Work Mode: Work from office (WFO), 5 days a week

Job Summary

We are looking for a skilled Data Engineer with 3–6 years of experience to design, build, and maintain scalable data pipelines and data processing systems. The ideal candidate has strong experience in PySpark, SQL, AWS services, and workflow orchestration tools such as Airflow, along with exposure to big data technologies such as Hadoop.

Key Responsibilities
  • Design, develop, and maintain scalable data pipelines for processing large datasets.

  • Build and optimize ETL/ELT workflows using PySpark and SQL.

  • Develop and manage data workflows using Apache Airflow for scheduling and orchestration.

  • Work with AWS data services to build robust and scalable data platforms.

  • Integrate and process data from multiple sources, including both structured and unstructured data.

  • Perform data transformation, cleansing, and aggregation to support analytics and reporting.

  • Optimize data processing jobs for performance, reliability, and scalability.

  • Collaborate with data scientists, analysts, and engineering teams to support data requirements.

  • Ensure data quality, governance, and security across pipelines.

Required Skills
  • Strong programming experience in PySpark and Python.

  • Strong knowledge of SQL and database concepts.

  • Hands-on experience with AWS services such as S3, Glue, EMR, Redshift, Lambda, or EC2.

  • Experience building data pipelines and ETL workflows.

  • Experience with Apache Airflow for workflow orchestration.

  • Knowledge of the Hadoop ecosystem (HDFS, Hive, Spark).

  • Experience handling large-scale data processing and distributed systems.

  • Understanding of data modeling and data warehousing concepts.

Good to Have Skills
  • Experience with Kafka or streaming data pipelines.

  • Experience with Docker or containerized environments.

  • Exposure to CI/CD pipelines and DevOps practices.

  • Experience with data lake architecture.

Education
  • Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field.

Similar Jobs

2 Days Ago
In-Office
Noida, Gautam Buddha Nagar, Uttar Pradesh, IND
Mid level
Artificial Intelligence • HR Tech • Professional Services • Software
Design, develop, and maintain scalable data pipelines using PySpark and Hadoop. Build and optimize ETL workflows, write complex SQL queries, manage AWS data services (S3, EMR, Glue, Redshift), ensure data quality and security, and collaborate with analysts and data scientists to troubleshoot and improve data processes.
Top Skills: AWS EMR, AWS Glue, AWS Redshift, AWS S3, Data Pipelines, ETL, Hadoop, PySpark, SQL
2 Days Ago
In-Office
Noida, Gautam Buddha Nagar, Uttar Pradesh, IND
Mid level
Artificial Intelligence • HR Tech • Professional Services • Software
Design, develop, and maintain scalable ETL/ELT data pipelines and architectures for analytics. Ensure data quality, integration, security, and performance. Collaborate with analysts and cross-functional teams, monitor production pipelines, troubleshoot issues, and implement data engineering best practices and documentation.
Top Skills: Airflow, AWS, Azure, CI/CD, Data Warehousing, Databricks, ETL/ELT, GCP, Git, Hadoop, Python, REST APIs, Spark, SQL
2 Days Ago
In-Office
Noida, Gautam Buddha Nagar, Uttar Pradesh, IND
Mid level
Artificial Intelligence • HR Tech • Professional Services • Software
Design, develop, and maintain scalable data pipelines and ETL/ELT processes. Build and manage workflow orchestration with Apache Airflow, write optimized SQL queries, monitor pipeline performance, ensure data quality and integrity, and collaborate with analysts and product teams to resolve workflow issues.
Top Skills: Apache Airflow, Data Warehousing, ELT, ETL, Relational Databases, SQL

