Infrastructure Site Reliability Engineer

New York, NY Remote Full Time
POSTED ON 5/4/2024

Why you should join our team: 

  • Our work is transforming the way people in pain access support at their fingertips
  • Our work is innovative in the crisis response space
  • Our dynamic, fun, and diverse culture
  • Our meaningful cause, led by empathy and innovation
  • Our strong values at the center of all we do
  • Our commitment to diversity, equity and inclusion
  • Our commitment to engagement and belonging
  • Our commitment to our employees and their holistic wellbeing
  • Our  value of work/life balance
  • Our growth mindset and prioritize professional development
  • Our leaders who truly care

What you'll be doing:

At Crisis Text Line, the engineering, product, and design teams are commonly referred to as Build. The vision of the team is to: 

  • Deliver the most trusted, innovative, and easy-to-use Crisis Care Platform in the industry and drive unprecedented levels of growth for people in need worldwide.
  • Ensure that every user feels a sense of community on our platform, allowing us to build trust, and grow our impact.
  • Allow our volunteers to spend all their time supporting people in need in an environment with few constraints, and minimal time searching for supporting information, resources, or support.
  • Provide a services/API-first architecture based on federated sources of data and infuses predictive insights (ML and otherwise) in every aspect of our Platform and Experience.

Role:
As a Site Reliability Engineer (SRE) at Crisis Text Line, you will be responsible for designing, implementing, and maintaining our cloud infrastructure to ensure optimal performance, availability, and security. You will work closely with our engineering and operations teams to streamline our deployment processes, enhance our monitoring and alerting systems, and drive continuous improvements to our platform reliability. This role offers an exciting opportunity to leverage your expertise in AWS Fargate, CloudWatch alerting, and monitoring to support our mission-critical applications and services.

Responsibilities:

  • Implement, and maintain highly available, scalable, and secure infrastructure on AWS Fargate.
  • Develop and maintain CloudWatch alerting and monitoring configurations to proactively identify and resolve potential issues.
  • Collaborate with cross-functional teams to define and implement best practices for infrastructure as code (IaC), continuous integration/continuous deployment (CI/CD), and site reliability engineering (SRE) methodologies.
  • Participate in incident response and resolution, including troubleshooting complex system issues and implementing preventive measures to minimize downtime.
  • Automate repetitive tasks and processes to improve operational efficiency and reduce manual intervention.
  • Conduct performance tuning and optimization of infrastructure components to ensure optimal resource utilization and cost efficiency.
  • Stay up-to-date with emerging technologies and industry trends to drive innovation and continuous improvement.

Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or related field or equivalent experience.
  • Experience in site reliability engineering (SRE) or related roles, with a focus on cloud infrastructure management.
  • Hands-on experience with AWS services, particularly AWS Fargate, CloudWatch, and related tools.
  • Familiarity with infrastructure as code (IaC) tools such as Terraform or CloudFormation.
  • Strong scripting and automation skills using languages such as Python, Bash, or PowerShell.
  • Experience with container orchestration platforms such as Kubernetes or Amazon ECS.
  • Solid understanding of networking concepts, security best practices, and DevOps principles.
  • Excellent problem-solving skills and the ability to work effectively in a fast-paced, collaborative environment.
  • AWS certifications (e.g., AWS Certified Solutions Architect, AWS Certified DevOps Engineer) are a plus.

Reliable High-Speed Internet Required: Must have a stable high-speed internet connection to support seamless remote collaboration, virtual meetings, online job tasks, etc.

The full salary range for this position, across all United States geographies, is (Insert Range) per year. The upper portion of the salary range is typically reserved for existing employees who demonstrate strong performance over time. Starting salary will vary by location, qualifications, and prior experience; during the interview process, candidates will learn the starting salary range applicable for their location. We pay competitively in the tech-forward nonprofit space and offer a robust benefits package.

Only candidates in the following states will be eligible for employment: CA, CO, CT, FL, GA, HI, IL, IN, IA, MD, MA, MI, MN, MO, NJ, NM, NY, NC, OH, PA, TN, TX, UT, VA, WA.

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

Sign up to receive alerts about other jobs that are on the Infrastructure Site Reliability Engineer career path.

Click the checkbox next to the jobs that you are interested in.

Income Estimation: 
$112,564 - $133,311
Income Estimation: 
$134,729 - $184,670
Income Estimation: 
$137,320 - $164,791
Income Estimation: 
$126,607 - $163,214
Income Estimation: 
$142,670 - $173,703
Income Estimation: 
$143,745 - $179,489
Income Estimation: 
$149,803 - $188,396
Income Estimation: 
$85,527 - $128,251

Sign up to receive alerts about other jobs with skills like those required for the Infrastructure Site Reliability Engineer.

Click the checkbox next to the jobs that you are interested in.

  • Architecture Skill

    • Income Estimation: $78,839 - $97,557
    • Income Estimation: $90,722 - $115,517
  • Building Codes and Regulations Skill

    • Income Estimation: $56,001 - $87,864
    • Income Estimation: $58,943 - $80,353
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Not the job you're looking for? Here are some other Infrastructure Site Reliability Engineer jobs in the New York, NY area that may be a better fit.

Senior Infrastructure Site Reliability Engineer

Crisis Text Line, New York, NY

Site Reliability Engineer - Security Infrastructure

Palantir Technologies, New York, NY