What are the responsibilities and job description for the Fabric SRE Generalist position at IBM?
Introduction
System Administrators are the gatekeepers to the many systems that run our company and our clients. As a System Admin with IBM, you will have the opportunity to provide high-value IT services and leverage our leading-edge technology portfolio in our global network. Your work has a direct impact on the day-to-day productivity of our business by ensuring integrity of, and access to, our most important resource: data.
Your Role and Responsibilities
IBM Cloud IaaS provides cloud infrastructure as a service from a growing number of data centers and network points of presence around the world. Our customers range from Web startups to global enterprises.
IBM Cloud is currently searching for a Site Reliability Engineer for our Infrastructure Fabric Tribe. Our SRE Team supports the internal systems that provide a best-in-class cloud infrastructure. SREs are responsible for monitoring and maintaining complex solutions to meet the needs of the business and our customers. We are seeking individuals with demonstrated initiative who take pride in their work.
Responsibilities:
Work Location: Dallas, Texas - Hybrid work from home IBM office
Individuals on the Fabric SRE Team are scheduled to work a specific shift to help provide 24/7 coverage.
This opening is for the team's second shift: 16:00 to 01:00 US Central
Required Technical and Professional Expertise
Preferred Technical and Professional Expertise
US Citizenship is required for this role.
System Administrators are the gatekeepers to the many systems that run our company and our clients. As a System Admin with IBM, you will have the opportunity to provide high-value IT services and leverage our leading-edge technology portfolio in our global network. Your work has a direct impact on the day-to-day productivity of our business by ensuring integrity of, and access to, our most important resource: data.
Your Role and Responsibilities
IBM Cloud IaaS provides cloud infrastructure as a service from a growing number of data centers and network points of presence around the world. Our customers range from Web startups to global enterprises.
IBM Cloud is currently searching for a Site Reliability Engineer for our Infrastructure Fabric Tribe. Our SRE Team supports the internal systems that provide a best-in-class cloud infrastructure. SREs are responsible for monitoring and maintaining complex solutions to meet the needs of the business and our customers. We are seeking individuals with demonstrated initiative who take pride in their work.
Responsibilities:
- Work with support and development teams to troubleshoot complex problems
- Actively respond to escalations and alerts
- Triage issues and determine the proper path for resolution
- Help perform and coordinate root cause analysis efforts for infrastructure impacting events
- Evaluate existing processes and identify opportunities for improvements
- Work with internal customers to create and deploy solutions for new and existing services
- Day to day maintenance and upkeep of existing systems infrastructure
Work Location: Dallas, Texas - Hybrid work from home IBM office
Individuals on the Fabric SRE Team are scheduled to work a specific shift to help provide 24/7 coverage.
This opening is for the team's second shift: 16:00 to 01:00 US Central
Required Technical and Professional Expertise
- One to two (1-2 ) years of Unix Administration in a professional setting
- Basic technical troubleshooting
- Ability to determine telemetry of an ongoing issue
- Basic security knowledge
- Experience with the CLI
- Unix basics
- Experience investigating incidents to determine a root cause
- Basic understanding of DNS
- Basic monitoring tool experience (Grafana, Graphite, Nagios)
- Cryptography awareness (http vs. https)
- Basic network knowledge
- Understand TCP and UDP
- Able to explain the OSI model
- Basic switching and routing knowledge
- Basic understanding of Load Balancing Hardware/Software
- Soft Skills
- Interface with internal and external customers
- Ability to coordinate efforts between different teams
- Good communication skills
- Detail oriented
- Self motivated
- Good team player
- Critical thinking
Preferred Technical and Professional Expertise
- One or more (1 ) years of programming in any language (ex. Bash, Ruby, Java, etc.)
- Able to do basic scripting
- RHCSA Certification
- Basic knowledge of Configuration Management Software (Chef, Ansible, etc.)
- Basic database knowledge (SQL, Oracle, Postrgress)
- Basic System Security Knowledge
US Citizenship is required for this role.
Salary : $1 - $1,000,000
SRE Lead
Hitachi Digital Services -
Dallas, TX
Product/Fabric Development Executive
The Apparel Group, Ltd -
Lewisville, TX
Generalist
James Hardie -
Waxahachie, TX