Speechify Logo

Speechify

Software Engineer, Data Infrastructure & Acquisition - Noida, India

Reposted 4 Days Ago
Be an Early Applicant
In-Office or Remote
Hiring Remotely in Noida, Gautam Buddha Nagar, Uttar Pradesh, IND
Senior level
In-Office or Remote
Hiring Remotely in Noida, Gautam Buddha Nagar, Uttar Pradesh, IND
Senior level
Responsible for data collection to support AI model training, managing cloud infrastructure, and collaborating with scientists to enhance data quality and throughput.
The summary above was generated by AI

The mission of Speechify is to make sure that reading is never a barrier to learning.

Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember more. Speechify’s text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named Speechify its 2025 Design Award winner for Inclusivity.  

Today, nearly 200 people around the globe work on Speechify in a 100% distributed setting – Speechify has no office. These include frontend and backend engineers, AI research scientists, and others from Amazon, Microsoft, and Google, leading PhD programs like Stanford, high growth startups like Stripe, Vercel, Bolt, and many founders of their own companies.

Overview

We're looking to hire for our Data side of our AI team at Speechify. This role is responsible for all aspects of data collection to support our model training operations. We are able to build high-quality datasets at petabyte-scale and low cost through a tight integration of infrastructure, engineering, and research work. We are looking for a skilled Software Engineer to join us.

What You’ll Do

  • Be scrappy to find new sources of audio data and bring it into our ingestion pipeline
  • Operate and extend the cloud infrastructure for our ingestion pipeline, currently running on GCP and managed with Terraform.
  • Collaborate closely with our Scientists to shift the cost/throughput/quality frontier, delivering richer data at bigger scale and lower cost to power our next-generation models.
  • Collaborate with others on the AI Team and Speechify Leadership to craft the AI Team’s dataset roadmap to power Speechify’s next-generation consumer and enterprise products.

An Ideal Candidate Should Have

  • BS/MS/PhD in Computer Science or a related field.
  • 5+ years of industry experience in software development.
  • Proficiency with bash/Python scripting in Linux environments
  • Proficiency in Docker and Infrastructure-as-Code concepts and professional experience with at least one major Cloud Provider (we use GCP)
  • Experience with web crawlers, large-scale data processing workflows is a plus
  • Ability to handle multiple tasks and adapt to changing priorities.
  • Strong communication skills, both written and verbal.

What we offer

  • A fast-growing environment where you can help shape the company and product.
  • An entrepreneurial-minded team that supports risk, intuition, and hustle.
  • A hands-off management approach so you can focus and do your best work.
  • An opportunity to make a big impact in a transformative industry.
  • Competitive salaries, a friendly and laid-back atmosphere, and a commitment to building a great asynchronous culture.
  • Opportunity to work on a life-changing product that millions of people use.
  • Build products that directly impact and support people with learning differences like dyslexia, ADD, low vision, concussions, autism, and more.
  • Work in one of the fastest-growing sectors of tech, the intersection of artificial intelligence and audio.

Think you’re a good fit for this job? 

Tell us more about yourself and why you're interested in the role when you apply.
And don’t forget to include links to your portfolio and LinkedIn.

Not looking but know someone who would make a great fit? 

Refer them! 

Speechify is committed to a diverse and inclusive workplace. 

Speechify does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.

Similar Jobs

9 Hours Ago
Easy Apply
Remote or Hybrid
India
Easy Apply
Senior level
Senior level
Big Data • Cloud • Software • Database
Lead pre-sales technical efforts with partners and customers to design, demo, and deploy MongoDB-based solutions. Support account teams with discovery, proofs of value, architecture, performance tuning, and partner enablement. Serve as a trusted advisor, translate technical decisions into business value, mentor peers, and represent MongoDB at events and partner engagements.
Top Skills: Angular.JsApache KafkaAtlasAtlas Data LakeAtlas Full-Text SearchAtlas Stream ProcessingAtlas Vector SearchAWSCC#C++Cloud DevopsCloud ManagerCompassConnector For BiConnector For SparkCSSCursorDatabase BenchmarkingHTMLInfrastructure As CodeJavaKubernetesLovableMongoDBNode.jsOps ManagerPerformance TuningProfilingPythonReactSQLTypescriptV0VueWindsurf
13 Hours Ago
Remote or Hybrid
India
Senior level
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Design, build, and operate agentic LLM-powered workflows, autonomous agents, and RAG/vector retrieval systems. Own end-to-end delivery including CI/CD, IaC, observability, DevSecOps, and Salesforce integrations to productionize enterprise AI for GTM applications.
Top Skills: AgentcoreAgentforceAutogenAws BedrockCdkCopadoCrewaiDastDockerGithub ActionsGitopsJavaScriptJenkinsKubernetesLangchainLanggraphLightning Web ComponentsModel Context Protocols (Mcp)PgvectorPineconePlatform EventsPythonSalesforce ApexSastSemantic KernelService MeshSlackTerraformTypescriptVertex AiWeaviate
13 Hours Ago
Remote or Hybrid
India
Senior level
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Lead a distributed SRE team owning CI/CD platform reliability, automation, observability, and data infrastructure. Provide people management, technical direction, architecture input, operational excellence, and cross-team collaboration while driving automation, monitoring, and AI-assisted workflows.
Top Skills: AnsibleApache AirflowSparkAWSAzureBashBazelBitbucketChefDatadogGCPGitGithub ActionsGitlabGitlab CiGoGrafanaHumio/LogscaleJenkinsKafkaKubernetesNasNfsObject StorageOpensearchOraclePostgresPowershellPrometheusPulsarPuppetPythonRedisRedpandaSanSli/SloSplunkTerraformValleyVarnish

What you need to know about the Delhi Tech Scene

Delhi, India's capital city, is a place where tradition and progress co-exist. While Old Delhi is known for its rich history and bustling markets, New Delhi is defined by its modern architecture. It's clear the region places a strong emphasis on preserving its cultural heritage while embracing technological advancements, particularly in artificial intelligence, which plays a central role in shaping the city's tech landscape, fueled by investments in research and development.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account