Kanerika Logo

Kanerika

Lead Data Engineer - Databricks

Posted 8 Days Ago
Be an Early Applicant
In-Office or Remote
Hiring Remotely in Telangana, IND
Senior level
In-Office or Remote
Hiring Remotely in Telangana, IND
Senior level
Lead design, development, and optimization of ETL/ELT data pipelines on Databricks using PySpark and Spark SQL. Implement Delta Lake, Unity Catalog, incremental ingestion (Auto Loader/streaming), Databricks Workflows, and performance tuning. Build reusable accelerators, participate in client technical discussions, mentor junior engineers, and resolve production issues while promoting engineering best practices.
The summary above was generated by AI

About Kanerika

Who we are:

Kanerika Inc. is a premier global software products and services firm that specializes in providing innovative solutions and services for data-driven enterprises. Our focus is to empower businesses to achieve their digital transformation goals and maximize their business impact through the effective use of data and AI.  We leverage cutting-edge technologies in data analytics, data governance, AI-ML, GenAI/ LLM and industry best practices to deliver custom solutions that help organizations optimize their operations, enhance customer experiences, and drive growth.

Awards and Recognitions

Kanerika has won several awards over the years, including:

·       CMMI Level 3 Appraised in 2024.

·       Best Place to Work 2023 by Great Place to Work

·       Top 10 Most Recommended RPA Start-Ups in 2022 by RPA today.

·       Frost & Sullivan India 2021 Technology Innovation Award for its Kompass composable solution architecture.

·        Kanerika has also been recognized for its commitment to customer privacy and data security, having achieved, ISO 9001, ISO 27701, SOC2, and GDPR compliances.

Working for us

Kanerika is rated 4.6/5 on Glassdoor, for many good reasons. We truly value our employees' growth, well-being, and diversity, and people’s experiences bear this out. At Kanerika, we offer a host of enticing benefits that create an environment where you can thrive both personally and professionally. From our inclusive hiring practices and mandatory training on creating a safe work environment to our flexible working hours and generous parental leave, we prioritize the well-being and success of our employees. Our commitment to professional development is evident through our mentorship programs, job training initiatives, and support for professional certifications. Additionally, our company-sponsored outings and various time-off benefits ensure a healthy work-life balance. Join us at Kanerika and become part of a vibrant and diverse community where your talents are recognized, your growth is nurtured, and your contributions make a real impact. See the benefits section below for the perks you’ll get while working for Kanerika.

Locations

We are located in Austin (USA), Singapore, Hyderabad, Indore and Ahmedabad (India).

Job Location: Hyderabad, Indore and Ahmedabad (India)



Requirements

Role:

As one of the founding members of Kanerika's Databricks delivery practice, you will be a hands-on builder responsible for designing, developing, and optimizing data pipelines on Databricks for client engagements. You'll work closely with the Practice Lead to translate architecture into working solutions, build reusable accelerators ahead of client demand, and help establish engineering best practices for the growing team.

Key Responsibilities

·       Design, build, and optimize ETL/ELT data pipelines on Databricks using PySpark and Spark SQL.

·       Build and maintain incremental ingestion pipelines using Auto Loader and structured streaming, including checkpoint management and schema evolution handling.

·       Implement and maintain Delta Lake tables (including Change Data Feed and liquid clustering), Unity Catalog structures, and Databricks Workflows for client and internal projects.

·       Build reusable accelerators, templates, and demo environments to support pre-sales and speed up future client delivery.

·       Collaborate with the Practice Lead/Architect on solution design for client engagements, providing engineering-level input on feasibility and effort.

·       Perform data quality checks, performance tuning, and cost optimization on Databricks clusters and jobs.

·       Participate in client-facing technical discussions as needed, including discovery sessions and technical walkthroughs.

·       Write clean, well-documented, and testable code following team engineering standards.

·       Mentor junior engineers as the team scales, and contribute to internal knowledge-sharing and best-practice documentation.

·       Troubleshoot and resolve production data pipeline issues across client environments.

·       Stay current with Databricks platform releases across Unity Catalog, Lakeflow Declarative Pipelines, and share learnings with the team through internal knowledge sessions and documentation.


Required Skills & Experience

·       5–9 years of data engineering experience, with at least 2 years of hands-on Databricks experience in production.

·       Strong proficiency in PySpark and/or Spark SQL for large-scale data processing.

·       Practical experience with Delta Lake, Unity Catalog, and Databricks Workflows/job orchestration.

·       Solid SQL skills and experience with data modeling for analytics/lakehouse architectures.

·       Experience with at least one major cloud platform (Azure, AWS, or GCP); Azure strongly preferred.

·       Experience with Python for data engineering tasks beyond Spark (scripting, automation, testing).

·       Familiarity with CI/CD practices for data pipelines (Git-based workflows, automated testing/deployment).

·       Strong debugging and performance-tuning skills for Spark jobs (partitioning, caching, cluster sizing).

·       Good communication skills and comfort working directly with client stakeholders when needed.

 

PEFERRED/ NICE TO HAVE

·       Databricks Certified Data Engineer Associate/Professional certification.

·       Experience with Lakeflow Declarative Pipelines (formerly Delta Live Tables), Lakeflow Connect for managed ingestion, MLflow for experiment tracking and model registry, Databricks SQL, and AI/BI Genie.

·       Exposure to data governance and security frameworks (row/column-level security, data masking).

·       Prior experience in a consulting/IT services environment delivering to multiple clients.

·       Familiarity with orchestration tools (Airflow) and ingestion tools (Fivetran, Kafka, Azure Data Factory).

What Success Looks Like

·       Within 3 months: Comfortable with Kanerika's delivery standards; has built or contributed to at least one reusable accelerator/demo asset.

·       Within 6 months: Independently delivering core engineering work on client engagement(s) with minimal oversight.

·       Within 12 months: Recognized as a go-to senior engineer on the team, mentoring newer hires and contributing to architecture decisions.

·       Leading databricks partnership upgrade to Gold level.



Benefits

Why join us?

·       Work with a passionate and innovative team in a fast-paced, growth-oriented environment.

·       Gain hands-on experience in content marketing with exposure to real-world projects.

·       Opportunity to learn from experienced professionals and enhance your marketing skills.

·       Contribute to exciting initiatives and make an impact from day one.

·       Competitive stipend and potential for growth within the company.

·       Recognized for excellence in data and AI solutions with industry awards and accolades.

Employee Benefits:

1. Culture:

        i.            Open Door Policy: Encourages open communication and accessibility to management.

       ii.            Open Office Floor Plan: Fosters a collaborative and interactive work environment.

     iii.            Flexible Working Hours: Allows employees to have flexibility in their work schedules.

     iv.            Employee Referral Bonus: Rewards employees for referring qualified candidates.

       v.            Appraisal Process Twice a Year: Provides regular performance evaluations and feedback.

2. Inclusivity and Diversity:

a.      Hiring practices that promote diversity: Ensures a diverse and inclusive workforce.

b.      Mandatory POSH training: Promotes a safe and respectful work environment.

3. Health Insurance and Wellness Benefits:

a.      GMC and Term Insurance: Offers medical coverage and financial protection.

b.      Health Insurance: Provides coverage for medical expenses.

c.       Disability Insurance: Offers financial support in case of disability.

4. Child Care & Parental Leave Benefits:

a.      Company-sponsored family events: Creates opportunities for employees and their families to bond.

b.      Generous Parental Leave: Allows parents to take time off after the birth or adoption of a child.

c.       Family Medical Leave: Offers leave for employees to take care of family members' medical needs.

5. Perks and Time-Off Benefits:

a.      Company-sponsored outings: Organizes recreational activities for employees.

b.      Gratuity: Provides a monetary benefit as a token of appreciation.

c.       Provident Fund: Helps employees save for retirement.

d.      Generous PTO: Offers more than the industry standard for paid time off.

e.      Paid sick days: Allows employees to take paid time off when they are unwell.

f.        Paid holidays: Gives employees paid time off for designated holidays.

g.       Bereavement Leave: Provides time off for employees to grieve the loss of a loved one.

 

6. Professional Development Benefits:

a.      L&D with FLEX- Enterprise Learning Repository: Provides access to a learning repository for professional development.

b.      Mentorship Program: Offers guidance and support from experienced professionals.

c.       Job Training: Provides training to enhance job-related skills.

d.      Professional Certification Reimbursements: Assists employees in obtaining professional certifications.

e.      Promote from Within: Encourages internal growth and advancement opportunities.

 

 



Similar Jobs

An Hour Ago
Remote or Hybrid
India
Senior level
Senior level
Digital Media • Information Technology • News + Entertainment
Design, develop, and maintain microservices-based Python applications and REST APIs. Implement asynchronous processing, ensure test coverage and code quality, deploy and operate services in Docker/Kubernetes, contribute to CI/CD pipelines, mentor junior engineers, collaborate with cross-functional teams, and optimize performance, scalability, and reliability.
Top Skills: CeleryDjangoDjango Rest FrameworkDockerFastapiGit/GitlabGitlab CiJenkinsKubernetesMongoDBNoSQLPostgresPythonRabbitMQRedisSQL
4 Hours Ago
Remote or Hybrid
India
Senior level
Senior level
Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
Lead business analysis for Risk Weighted Assets (RWA) and Counterparty Credit Risk (CCR) initiatives. Translate Basel III/IV and SA-CCR/IMM/EAD regulatory requirements into BRDs, FRDs, user stories, process flows and data mappings. Liaise with Market Risk, CCR, Regulatory Reporting, Finance, Quant and Technology teams to support capital calculations, RWA optimization and regulatory reporting for transformation programs.
4 Hours Ago
Remote or Hybrid
India
Senior level
Senior level
Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
Design and develop web applications and UI/UX solutions, collaborate with stakeholders to define technical solutions, implement Python and JavaScript components, evaluate visualization tools and recommend approaches, deliver quality work in Agile teams under tight deadlines.
Top Skills: JavaScriptPython

What you need to know about the Delhi Tech Scene

Delhi, India's capital city, is a place where tradition and progress co-exist. While Old Delhi is known for its rich history and bustling markets, New Delhi is defined by its modern architecture. It's clear the region places a strong emphasis on preserving its cultural heritage while embracing technological advancements, particularly in artificial intelligence, which plays a central role in shaping the city's tech landscape, fueled by investments in research and development.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account