Pod Network Jobs

Site Reliability Engineer (APAC)

Pod Network

Site Reliability Engineer (APAC)

Reposted 10 Days Ago

In-Office or Remote

Hiring Remotely in India

Mid level

In-Office or Remote

Hiring Remotely in India

Mid level

Operate and improve the Pod platform: respond to incidents, investigate root causes, build automation and observability, design monitoring/alerting, reduce alert fatigue, and drive reliability improvements across production systems.

The summary above was generated by AI

Pod is building a next-generation decentralized exchange focused on fairness, performance, and user experience. We believe traders shouldn't have to choose between speed, simplicity, and fair treatment, so we're building an exchange that delivers all three while enabling entirely new kinds of financial markets.

Under the hood, Pod is powered by low-latency systems designed for fast settlement and strong guarantees around ordering, timing, and execution. These are challenging engineering problems, and the reliability of the platform depends on operating those systems safely and effectively at scale.

About the Role:

We're looking for our first Site Reliability Engineer to help operate, improve, and scale the reliability of the Pod platform.

You'll join a team of engineers who already share responsibility for production systems and participate in an established on-call rotation. From day one, you'll work closely with the broader engineering team while taking ownership of the tooling, processes, and operational practices that keep the platform running smoothly.

This is a hands-on role for someone who enjoys operating complex systems, investigating difficult production issues, and building the automation and infrastructure that turn reliability into a competitive advantage.

On Call:

You'll be responsible for platform health during Asian business hours as part of our existing engineering on-call rotation. There are no permanent overnight shifts, and you'll never be the sole person responsible for the platform—the rest of the rotation is covered by the wider team. Occasionally, you may flex outside your normal hours to help cover the schedule, but that's the exception rather than the rule.

What You’ll Do:

Respond to and resolve incidents:

Monitor the health and performance of the platform
Respond to production incidents and drive them through to resolution
Investigate failures, identify root causes, and coordinate fixes
Ensure issues are detected, understood, and addressed quickly

Improve platform reliability:

Identify recurring operational pain points and eliminate them
Improve software, deployment processes, and operational workflows
Participate in incident reviews and help drive preventative improvements
Contribute reliability-focused changes directly to production systems

Build observability and operational tooling:

Design and maintain dashboards, metrics, alerting, and monitoring systems
Improve signal quality while reducing alert fatigue
Build automation and internal tools that make the platform easier to operate
Help establish reliability best practices across the engineering organization

Qualifications:

Strong experience with Linux and cloud infrastructure
Experience operating and supporting production systems
Experience with Docker and containerized environments
Experience with observability and incident-management tools such as Grafana, Prometheus, PagerDuty, or similar
Ability to automate workflows using Rust, Python, Bash, or similar languages
Strong troubleshooting and debugging skills
A high degree of ownership and the ability to make sound decisions independently

Nice to Have:

Experience with distributed systems
Experience operating high-availability, low-latency services
Experience with CI/CD systems and deployment automation
Experience designing secure operational workflows and access controls
No prior blockchain or cryptocurrency experience is required.

What we offer:

Competitive compensation ($90k - $130k USD/year), plus a meaningful token/equity allocation
Real ownership and responsibility from day one as part of a small team
Work from wherever you are within the target timezone range (UTC+7 to UTC+1)
Occasional travel to Europe and elsewhere for team offsites

Similar Jobs

Careerflow.ai

Japanese Data Annotator

11 Minutes Ago

In-Office or Remote

India

Entry level

Artificial Intelligence • HR Tech • Software • Generative AI

Remote annotator reviews Japanese text and images, draws bounding boxes, answers structured questions, and writes short summaries per guidelines to train AI systems. On-the-job training provided; assessment required.

Kintsugi AI, Inc.

Technical Lead

32 Minutes Ago

In-Office or Remote

India

Senior level

Artificial Intelligence • Fintech • Software • Automation

Lead and mentor a Southeast Asia engineering team; ensure code quality and testing; drive project progress; define integrations strategy; design, build, and maintain third-party integrations; align engineering efforts with product and leadership.

Top Skills: Aws ServerlessFastapiLambdaPostgresPythonReact

ProspeX CRM

Head of Engineering - Custody

2 Hours Ago

In-Office or Remote

India

Senior level

Artificial Intelligence • Marketing Tech • Sales • Software

Lead and scale the engineering organization for a regulated digital-asset custody platform. Own hiring, org design, engineering operations (on-call, incidents, release hygiene), delivery against roadmaps, audit and regulator readiness (SOC 2, SAMA, ISO 27001), and cross-functional alignment. Amplify an existing small, specialized team to meet regulatory and institutional requirements while preserving culture and retention.

Top Skills: Aurora PostgresAWSBitcoinClickhouseEthereumGithub ActionsGoHsmKubernetesMpc/TssRustSolanaTemporalTerraformThreshold Signing ProtocolsTypescript

What you need to know about the Delhi Tech Scene

Delhi, India's capital city, is a place where tradition and progress co-exist. While Old Delhi is known for its rich history and bustling markets, New Delhi is defined by its modern architecture. It's clear the region places a strong emphasis on preserving its cultural heritage while embracing technological advancements, particularly in artificial intelligence, which plays a central role in shaping the city's tech landscape, fueled by investments in research and development.