Citi

Senior PySpark Data Engineer - Assistant Vice President

In-Office
Pune, Mahārāshtra
Senior level
The Senior PySpark Data Engineer will design, develop, and maintain efficient data pipelines, collaborate with teams, and ensure data integrity and compliance.
The summary above was generated by AI

About the Role

We are seeking a highly skilled and experienced Senior PySpark Data Engineer to join our dynamic data engineering team. The ideal candidate will have a strong background in building and managing large-scale data processing systems and a proven track record of working with cutting-edge Big Data technologies. You will be responsible for designing, developing, and maintaining our data pipelines, ensuring they are efficient, reliable, and scalable to meet our growing business needs.

Key Responsibilities

  • Design, develop, and maintain robust, scalable, and high-performance data pipelines using PySpark.
  • Develop, schedule, and monitor complex data workflows using orchestration tools like Apache Airflow.
  • Collaborate with data scientists, analysts, and business stakeholders to understand data requirements and deliver high-quality data solutions.
  • Optimize and tune Spark jobs for performance and efficiency.
  • Implement data quality checks and ensure data integrity across all data pipelines.
  • Design and implement data models for optimal storage and retrieval.
  • Mentor junior data engineers and promote best practices in data engineering.
  • Ensure compliance with data governance and security policies.
  • Troubleshoot and resolve data-related issues in a timely manner.

Required Qualifications

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field.
  • 6+ years of professional experience in a data engineering role.
  • Extensive hands-on experience with PySpark and advanced Python programming skills.
  • Proven experience with Big Data ecosystems, including Cloudera and/or Databricks.
  • Hands-on experience with distributed query engines like Starburst (Trino/Presto).
  • Proficient in designing and managing complex workflows using scheduling tools, particularly Apache Airflow.
  • Strong expertise in SQL and experience with relational and non-relational databases.
  • Solid understanding of data warehousing concepts, ETL/ELT processes, and data modeling techniques.
  • Experience working in a Linux/Unix environment.
  • Experience with GitHub and CI/CD pipelines.

------------------------------------------------------

Job Family Group:

Technology

------------------------------------------------------

Job Family:

Applications Development

------------------------------------------------------

Time Type:

Full time

------------------------------------------------------

Most Relevant Skills

Please see the requirements listed above.

------------------------------------------------------

Other Relevant Skills

For complementary skills, please see above and/or contact the recruiter.

------------------------------------------------------

Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.

If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity, review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.

Top Skills

Apache Airflow
CI/CD Pipeline
Cloudera
Databricks
GitHub
PySpark
SQL
Starburst


