What are the responsibilities and job description for the Site Reliability Engineer position at MadHive?
As a Site Reliability Engineer, you will have a key role in scaling our technology platforms and empowering our development teams.
What you’ll do:
- Code
- Guide and develop automation systems for continuous integration & deployment
- Manage deployments of services, including pre-release verification and communication
- Design and build infrastructure systems
- Interface across teams to codify and reliably test production builds
- Work closely with engineering, business, and product teams
- Actively participate in code and configuration reviews
- Create documentation for consumption by teammates and the larger engineering organization
- Be available to resolve critical issues that impact our high SLA production systems
Who you are:
- You are a software engineer with experience shipping high-availability systems
- You have experience with CI/CD systems. For example: GitHub Actions (preferred), Jenkins, Travis CI, Azure Pipelines, Buildkite, etc.
- You have experience orchestrating disparate systems to achieve a common goal
- You have experience with cloud services such as GCP, AWS or Azure
- You have experience working with a production environment with high uptime requirements and measurable SLAs
- You are familiar with container technologies such as Docker, Kubernetes or Mesos
- You prefer working in a dynamic environment, and are comfortable challenging the status quo
- You are familiar with configuration management, orchestration, and infrastructure-as-code tools such as Ansible or Terraform
- You have demonstrated the ability to deliver high-quality, on-time solutions that are reliable, scalable, and maintainable
- You are a strong communicator and have the ability to adjust to changing priorities
Senior Site Reliability Engineer
Kforce Technology Staffing -
Brooklyn, NY
Site Reliability Engineer (SRE)
Valstro -
New York, NY
Cloud Site Reliability Engineer
AYR Global IT Solutions Inc -
New York, NY