Forbes Advisor Logo

Forbes Advisor

Data Research Engineer - Data Extraction (India - Remote)

Posted 23 Days Ago
Remote
Hiring Remotely in Mumbai, Maharashtra
Mid level
Remote
Hiring Remotely in Mumbai, Maharashtra
Mid level
As a Data Research Engineer, you will develop processes for data quality assurance, automate data validation, and perform data profiling. Your role involves acquiring data from various sources, creating Python scripts for ETL processes, and contributing to data quality checks. Collaboration with cross-functional teams and staying updated on industry trends are key responsibilities.
The summary above was generated by AI

Company Description

Forbes Advisor is looking for a Data Research Engineer - Data Extraction to join the Forbes Marketplace Performance Marketing team with a focus on supporting one of Forbes business verticals. If you're looking for challenges and opportunities similar to those of a start-up, with the benefits of an established, successful company read on.

We are an experienced team of industry experts dedicated to helping readers make smart decisions and choose the right products with ease. Marketplace boasts decades of experience across dozens of geographies and teams, including Content, SEO, Business Intelligence, Finance, HR, Marketing, Production, Technology and Sales. The team brings rich industry knowledge to Marketplace’s global coverage of consumer credit, debt, health, home improvement, banking, investing, credit cards, small business, education, insurance, loans, real estate and travel.

The Data Extraction Team is a brand new team who plays a crucial role in our organization by designing, implementing, and overseeing advanced web scraping frameworks. Their core function involves creating and refining tools and methodologies to efficiently gather precise and meaningful data from a diverse range of digital platforms. Additionally, this team is tasked with constructing robust data pipelines and implementing Extract, Transform, Load (ETL) processes. These processes are essential for seamlessly transferring the harvested data into our data storage systems, ensuring its ready availability for analysis and utilization.

A typical day in the life of a Data Research Engineer will involve acquiring and integrating data from various sources, developing and maintaining data processing workflows, and ensuring data quality and reliability. They collaborate with the team to identify effective data acquisition strategies and develop Python scripts for data extraction, transformation, and loading processes. They also contribute to data validation, cleansing, and quality checks. The Data Research Engineer stays updated with emerging data engineering technologies and best practices.


Job Description

Responsibilities

  • Develop methods and processes for data quality assurance (QA) to ensure accuracy, completeness, and integrity.
  • Define and implement data validation rules and automated data quality checks.
  • Perform data profiling and analysis to identify anomalies, outliers, and inconsistencies.
  • Assist in acquiring and integrating data from various sources, including web crawling and API integration.
  • Develop and maintain scripts in Python for data extraction, transformation, and loading (ETL) processes.
  • Stay updated with emerging technologies and industry trends.
  • Explore third-party technologies as alternatives to legacy approaches for efficient data pipelines.
  • Contribute to cross-functional teams in understanding data requirements.
  • Assume accountability for achieving development milestones.
  • Prioritize tasks to ensure timely delivery, in a fast-paced environment with rapidly changing priorities.
  • Collaborate with and assist fellow members of the Data Research Engineering Team as required.
  • Leverage online resources effectively like StackOverflow, ChatGPT, Bard, etc., while considering their capabilities and limitations.

Skills and Experience

  • Bachelor's degree in Computer Science, Data Science, or a related field. 
  • Strong proficiency in Python programming for data extraction, transformation, and loading.
  • Proficiency in SQL and data querying is a plus.
  • Knowledge of Python modules such as Pandas, SQLAlchemy, gspread, PyDrive, BeautifulSoup and Selenium, sklearn, Plotly.
  • Knowledge of web crawling techniques and API integration.
  • Knowledge of data quality assurance methodologies and techniques.
  • Familiarity with machine learning concepts and techniques.
  • Familiarity with HTML, CSS, JavaScript.
  • Familiarity with Agile development methodologies is a plus.
  • Strong problem-solving and analytical skills with attention to detail.
  • Creative and critical thinking.
  • Ability to work collaboratively in a team environment.
  • Good and effective communication skills.
  • Experience with version control systems, such as Git, for collaborative development.
  • Ability to thrive in a fast-paced environment with rapidly changing priorities.
  • Comfortable with autonomy and ability to work independently.

Perks:
● Day off on the 3rd Friday of every month (one long weekend each month)
● Monthly Wellness Reimbursement Program to promote health well-being
● Monthly Office Commutation Reimbursement Program
● Paid paternity and maternity leaves
● Group Medical Insurance
● Group Term Life Insurance (2.5X of the CTC)
● Group Personal Accident Insurance (3 X of the CTC)

Qualifications

Bachelor's degree in Computer Science, Data Science, or a related field.

Top Skills

Beautifulsoup
CSS
Gspread
HTML
JavaScript
Pandas
Plotly
Pydrive
Python
Selenium
SQL
Sqlalchemy

Similar Jobs

8 Days Ago
Remote
Mumbai, Maharashtra, IND
Mid level
Mid level
Insurance • Software • Energy • Financial Services
The Data Research Engineer will design and implement web scraping frameworks, develop Python scripts for data extraction and ETL processes, and ensure data quality and reliability. They will also collaborate with cross-functional teams to understand data requirements and contribute to the creation of efficient data pipelines.
20 Days Ago
Remote
India
Senior level
Senior level
Software
As the Data Science & Engineering Lead, you will guide a team in developing and deploying machine learning models, optimizing data systems, and providing actionable insights through AI advancements. Responsibilities include model architecture, MLOps pipelines, and mentoring junior members while utilizing cutting-edge technology to drive innovation.
Top Skills: AirbyteAirflowAnnSparkApi GatewayAWSAzureAzureAzure Ml StudioBatch ProcessingChain Of ThoughtCloudFormationCnnCudaDatabricksDatabricksDbtDeltaDynamoDBElasticsearchEmrEnsemble MethodsGanGlueGpu AccelerationGreat ExpectationsHuggingfaceIcebergKafkaKinesisLambdaLangchainLogistic RegressionModel Of AlignmentMongoDBMySQLNltkNumpyOlsOpenaiOpencvPandasPostgresPower BIPyTorchQuicksightResnetRetrieval-Augmented GenerationRnnSagemakerScikit-LearnSnowflakeSparkSQL ServerTableauTensorFlowTerraformTransformers
6 Hours Ago
Remote
Hybrid
India
Senior level
Senior level
Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
The Senior Data Analyst will work on delivering transformational projects within the banking and financial sectors, leveraging excellent analytical skills, data governance knowledge, and the ability to manage multiple priorities. Responsibilities include writing SQL queries, understanding data quality, and communicating effectively across stakeholders.
Top Skills: CmdHiveNote++PuttyScalaSQL

What you need to know about the Delhi Tech Scene

Delhi, India's capital city, is a place where tradition and progress co-exist. While Old Delhi is known for its rich history and bustling markets, New Delhi is defined by its modern architecture. It's clear the region places a strong emphasis on preserving its cultural heritage while embracing technological advancements, particularly in artificial intelligence, which plays a central role in shaping the city's tech landscape, fueled by investments in research and development.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account