Gigster

Staff SRE (Site Reliability Engineer)

Posted 3 Days Ago
Remote
47 Locations
Senior level
The Staff SRE will ensure the reliability, scalability, and performance of critical systems, drive infrastructure improvements, and mentor junior engineers while collaborating on various technological projects.
The summary above was generated by AI

Do you want to work on cutting-edge projects with the world’s best IT engineers? Do you wish you could control which projects to work on and choose your own pay rate? Are you interested in the future of work and how the cloud will form teams? If so, the Gigster Talent Network is for you.

Our clients rely on our Network in two main areas: Software Development and Cloud Services. In some cases, they need help building great new products; in others, they want our expertise in migrating, maintaining, and optimizing their cloud solutions.

At Gigster, whether working with entrepreneurs to realize ‘the next great vision’ or with Fortune 500 companies to deliver a big product launch, we build really cool enterprise software on cutting-edge technology.

We seek highly skilled and experienced Staff Site Reliability Engineers (SREs) interested in joining the Gigster Talent Network and being considered for exciting upcoming projects with one of our largest clients. As a member of the Gigster Network, you will have the chance to become a part of amazing teams where you'll be responsible for ensuring the reliability, scalability, and performance of our critical systems and services. As a Staff SRE, you will play a pivotal role in shaping infrastructure for our client and driving initiatives that improve the overall service quality.

Requirements:

System Design and Architecture:
    • Design, build, and maintain scalable and reliable infrastructure.
    • Collaborate with engineering teams to ensure systems are designed with reliability and scalability in mind.
    • Evaluate and integrate new technologies to enhance our infrastructure.
Monitoring and Incident Management:
    • Implement and maintain monitoring and alerting systems to detect and respond to issues promptly.
    • Lead incident response efforts, ensuring quick resolution and effective communication.
    • Conduct post-incident reviews and drive improvements based on findings.
Automation and Optimization - Reduce SRE Toil:
    • Architect and build automation projects from scratch (preferably in Python or Go) to reduce day-to-day SRE toil.
    • Create Bash scripts to automate manual activities such as upgrades, status checks, and deployments.
    • Develop and maintain infrastructure as code (IaC) using tools such as Terraform, Ansible, or similar.
    • Automate repetitive tasks and processes to improve efficiency and reduce manual intervention.
Collaboration and Mentorship:
    • Collaborate with cross-functional teams to deliver high-quality products and services.
    • Mentor and guide junior SREs and other team members.
    • Advocate for best practices in reliability engineering across the organization.
Continuous Improvement:
    • Drive initiatives to improve service reliability, capacity, and performance.
    • Participate in capacity planning and disaster recovery exercises.
    • Stay current with industry trends and emerging technologies.
Education and Experience:
    • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience).
    • At least 8 years of industry experience as a Software Engineer, SRE, or Platform Engineer.
    • At least 3 years of experience as a Platform Engineer or SRE.
    • Proven experience in managing large-scale, mission-critical infrastructure.
Technical Skills:
    • Deep understanding of Linux/Unix systems and networking.
    • Proficiency in at least one programming language (e.g., Python, Go, Java).
    • Intermediate to expert skill in Bash scripting.
    • Experience with cloud platforms (AWS, Azure, GCP), containers, and container orchestration (Docker, Kubernetes).
    • Strong knowledge of monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
    • Familiarity with CI/CD pipelines and tools (e.g., Jenkins, GitLab CI).
Soft Skills:
    • Excellent problem-solving skills and a proactive attitude.
    • Strong communication and collaboration skills.
    • Ability to work independently and as part of a team.
    • Demonstrated leadership and mentoring abilities.

Candidates must be able to work 8am to 5pm Pacific Time and be open to an on-call rotation.

Recruitment Process:
    • English Proficiency Assessment (25 mins)
    • Technical Assessment (45 mins)
    • Recruiter screen (30 mins)
    • Technical Interview (30-45 mins)

We strive to move efficiently from step to step so the recruitment process can be as fast as possible.

The Gigster Talent Network is a highly curated set of some of the best software developers in the world. It’s not easy to become part of this select network, but when you do, you will work amongst the best in Silicon Valley and around the world.

Our model is unique in the software development industry. We do the hard work of finding international clients and scoping their projects, and you get to choose from a large variety of ‘Gigs’. You can choose Gigs that fit your interests, from part-time, short-term projects to full-time, long-term, open-ended roles with our amazing clients! You will be eligible for diverse positions that will help you take your career to the next level.

All of our projects are for top-tier international companies and are delivered with the highest quality. Projects span a range of technologies and industries, so you will have the opportunity to be considered for amazing cutting-edge products that make a difference!

Are you ready to join the club?

Top Skills

Ansible
AWS
Azure
Bash
Docker
ELK
GCP
GitLab CI
Go
Grafana
Java
Jenkins
Kubernetes
Linux
Prometheus
Python
Terraform
Unix
