Lead extraction and standardization of structured financial data from complex formats (PDF, HTML, XBRL/iXBRL), apply OCR and validation, automate data pipelines, collaborate across teams, and mentor junior analysts.
Job description EYDM is seeking a proactive, dynamic, and adaptable member to join our global, diverse, and inclusive team. Given the continuous growth and evolution of our products, we require candidates who not only embrace change but also possess the experience and capability to hit the ground running. Ideal team members will thrive in ambiguous environments, demonstrating curiosity and proactiveness while effectively contributing from day one. We are looking for an experienced Senior Data Analyst/Modeller to work within our EY.ai Data Marketplace (EYDM) Ingestion team, reporting into the Ingestion Product Manager. In this role, you will be responsible for working across teams, targeting complex data challenges and finding solutions that are efficient and cost effective, as well as compliant within EY policies. You will work closely with other Analysts and Engineers to deliver solutions, standards and problem-solving strategies. A core focus of this role is extracting, interpreting and standardising structured financial data from complex source formats including PDF documents, HTML, XBRL and iXBRL, with strong Python programming skills applied to data manipulation and automation. Your key responsibilities • PDF & Document Data Extraction: Lead the extraction and post-processing of structured financial data from PDF, HTML, and scanned documents, applying OCR tooling and validation logic to ensure accuracy and consistency at scale. • Data Extraction and Standardization: Lead initiatives to extract and standardise financial data from various formats, including XBRL and iXBRL, ensuring data accuracy and consistency. • Mentorship and Development: Provide guidance and support to junior analysts, fostering their growth in data analysis, programming, and statistical methodologies.
Similar Jobs
Information Technology • Consulting
Lead DataStage developer responsible for production support and enhancements: resolve incidents, perform root-cause analysis, implement CRs, develop and optimize DataStage jobs, tune Oracle SQL and UNIX processes, manage scheduling with Control-M, and follow SDLC and release procedures while interfacing with clients.
Top Skills:
Ci/CdControl-MIbm Infosphere DatastageLinuxOraclePl/SqlShell ScriptingSnowflakeSQLUnixVersion Control
Information Technology • Consulting
Lead extraction, post-processing and standardization of structured financial data from PDFs, HTML, XBRL and iXBRL using OCR and Python. Design scalable, validated ingestion workflows, collaborate across teams, deliver efficient compliant solutions, and mentor junior analysts.
Top Skills:
HTMLIxbrlOcrPdfPythonXbrl
Information Technology • Consulting
Design, build, and optimize ETL pipelines and SQL Server databases using SSIS and T-SQL. Architect OLTP and OLAP schemas, implement data validation and cleansing, optimize query performance, automate deployments with CI/CD and Autosys, and collaborate with stakeholders to translate requirements into scalable data solutions.
Top Skills:
AutosysBitbucketC#Ci/CdGitSql ProfilerSQL ServerSsisT-Sql
What you need to know about the Delhi Tech Scene
Delhi, India's capital city, is a place where tradition and progress co-exist. While Old Delhi is known for its rich history and bustling markets, New Delhi is defined by its modern architecture. It's clear the region places a strong emphasis on preserving its cultural heritage while embracing technological advancements, particularly in artificial intelligence, which plays a central role in shaping the city's tech landscape, fueled by investments in research and development.
