HireMeFast LLC Surprise, AZ

Site Reliability Engineer

HireMeFast LLC Surprise, AZ

1 week ago

Be among the first 25 applicants

See who HireMeFast LLC has hired for this role

No longer accepting applications

This is a remote position.

DISCLAIMER: This job posting is intended for active pooling of candidates who will become part of our talent pool. Your qualifications will be assessed against both current and future job openings. Should your application align with a role that corresponds to your skills and experience, and an opportunity arises, our recruitment team will reach out to you immediately. Please note that this does not guarantee immediate placement or contact. Additionally, we exclusively consider applications from individuals who are currently reside in the US/Canada during their application process.

Salary: $65,000 - $75,000 per annum

Experience Required: Minimum 1 year of project experience

About The Job

As an SRE, you'll troubleshoot and resolve technical issues, optimize performance, and establish reliability-based release management processes. The SRE role is the practical implementation of DevOps principles, where speed and stability are carefully balanced, and the team acts as versatile problem solvers, filling gaps in knowledge and expertise to ensure efficient software operations.

You will:

Apply SRE principles to maintain the reliability, availability, and performance of software systems.
Automate deployment processes, configuration management, and CI/CD pipelines to streamline software development and delivery.
Planned and assisted with the migration of Windows and Linux-based machines to containerized machines.
Plan and Assist with the overall Disaster Recovery (DR) of the infrastructure and operations (InfraOps).
Manage and maintain software infrastructure, ensuring proper configuration, security, and scalability.
Perform system administration tasks, monitor system performance, troubleshoot issues, and apply necessary fixes.
Act as a versatile problem solver, filling gaps in team knowledge and expertise to ensure smooth and efficient software operations.
Facilitate smooth team and project transitions, providing guidance, training, and support for development teams to manage their infrastructure independently.
Develop a reliability rating system to assess team and project performance, collecting and analyzing metrics to evaluate adherence to best practices.
Respond quickly and effectively to critical incidents, conducting post-incident reviews to identify root causes and implement preventive measures.
Develop and maintain automation tools and scripts to improve operational efficiency.
Identify performance bottlenecks and implement optimizations to enhance system response times and resource utilization.
Stay up to date with the latest industry trends, technologies, and best practices related to SRE, DevOps, and infrastructure management.
Collaborate effectively with cross-functional teams and communicate technical concepts and recommendations clearly to both technical and non-technical stakeholders.
Implement a reliability-based release management process, allowing teams with higher reliability scores to perform quick and frequent releases.
Proactively identify potential issues and implement preventive measures to reduce incidents and outages.
Implement observability practices to detect abnormal behaviors in the software and collect information for effective problem resolution.
Set and monitor critical metrics to gain insights into system reliability, including latency, traffic, errors, and saturation levels.
Establish Service-Level Objectives (SLOs) and measure Service-Level Indicators (SLIs) to assess the quality-of-service delivery and reliability.
Planned, participated, and managed on-call rotations to ensure prompt response to reported software issues.
Utilize incident response tools to categorize the severity of reported cases and handle them promptly.
Implement configuration management tools to automate software workflows and enhance team productivity.

Projects you could work on:

Implementing automated CI/CD pipelines for smooth software deployment.
Setting up and maintaining a reliable and scalable cloud infrastructure.
Designing and implementing the migration of physical machines to virtual machines.
Designing incident response procedures and post-incident review processes.
Developing automation tools to streamline repetitive tasks and improve team productivity.
Analyzing system performance metrics and optimizing resources for better efficiency.
Establishing observability practices to detect and resolve software issues proactively.
Defining SLOs and SLIs to assess service quality and reliability across projects.
Planning and managing on-call rotations to ensure timely issue resolution.
Configuring and maintaining software workflows using configuration management tools.

Seniority level
Entry level
Employment type
Full-time
Job function
Information Technology
Industries
Software Development

Referrals increase your chances of interviewing at HireMeFast LLC by 2x

See who you know

Get notified about new Site Reliability Engineer jobs in Surprise, AZ.

Similar jobs

Software Engineer (L4), Content Engineering

Software Engineer (L4), Content Engineering

Netflix

United States 2 weeks ago
Software Engineer

Software Engineer

Fieldguide

San Francisco, CA 3 weeks ago
Software Engineer (L4) - Consumer Engineering

Software Engineer (L4) - Consumer Engineering

Netflix

United States 1 week ago
Software Engineer

Software Engineer

Fay

San Francisco, CA 3 weeks ago
Entry Level Software Engineer (Remote)

Entry Level Software Engineer (Remote)

Engtal

United States 1 week ago
Software Engineer - Frontend

Software Engineer - Frontend

Ever

San Francisco, CA 3 weeks ago
Jr Web Developer (Entry Level)

Jr Web Developer (Entry Level)

Planned Systems International

United States 2 days ago
Software Engineer (L5) - Ads Platform Engineering

Software Engineer (L5) - Ads Platform Engineering

Netflix

United States 3 weeks ago
Software Engineer (LA Remote)

Software Engineer (LA Remote)

Prelim

Los Angeles, CA 3 weeks ago
Junior Software Engineer (Backend)

Junior Software Engineer (Backend)

Telcoin

Los Angeles, CA 2 days ago
Software Engineer

Software Engineer

Check

New York, NY 1 year ago
Software Engineer

Software Engineer

Check

San Francisco, CA 1 year ago
Software Engineer Internship

Software Engineer Internship

Databento

Chicago, IL 1 month ago
Software Developer 1

Software Developer 1

Oracle

United States 1 week ago
Software Engineer (Front End)

Software Engineer (Front End)

Edmunds

Santa Monica, CA 1 month ago
Software Engineer - Fullstack

Software Engineer - Fullstack

Ever

San Francisco, CA 3 weeks ago
Software Engineer (L5) - Experimentation Platform

Software Engineer (L5) - Experimentation Platform

Netflix

United States 2 weeks ago
Software Developer 1

Software Developer 1

Oracle

United States 4 days ago
Junior Front End Engineer

Junior Front End Engineer

minware

Denver, CO 1 week ago
Junior Python Developer

Junior Python Developer

Team Remotely Inc

Philadelphia, PA 3 days ago
Software Developer 1

Software Developer 1

Oracle

United States 1 week ago
Software Engineer I

Software Engineer I

GitHub

United States 5 days ago
Junior Front End Engineer

Junior Front End Engineer

minware

Philadelphia, PA 1 week ago
Full Stack Developer - Remote

Full Stack Developer - Remote

NDG

La Plata, MD 1 day ago
Junior Full Stack Web Developer

Junior Full Stack Web Developer

Patterned Learning Career

West Valley City, UT 3 days ago
Junior Front End Engineer

Junior Front End Engineer

minware

San Francisco, CA 1 week ago
Full Stack Software Engineer

Full Stack Software Engineer

CallRail

Atlanta, GA 1 day ago

Looking for a job?

Visit the Career Advice Hub to see tips on interviewing and resume writing.

View Career Advice Hub