What are the responsibilities and job description for the Site Reliability Engineering position at Prohires?
Job Description
Job Role ; Site Reliability Engineering
Location: Nashville, TN 37238
- working hours: 9-17, 8-16, as per preference
- Hybrid work: 1-2 days from office
- must-have: AutoSys (platform support, not as user) or similar job scheduling tool, at least one scripting language
Responsibilities
provide SRE level support for Job Scheduling tool like AutoSys
handle complex issues and problem management
perform risk and security self-assessments
leverage automation and development technologies to execute operational stability and maintenance tasks
determine the reliability of our digital products, technology services, and the infrastructure that underpins them
minimize the risk and impact of failures by engineering operational improvements, such as predictive monitoring, auto scaling or self-healing
respond to production incidents to gain first-hand experience of operational hotspots and to identify the root causes of problems
collect and analyze operational data, define and monitor key metrics to identify and communicate areas for improvement
apply a broad range of engineering practices with a focus on reliability, from instrumentation, performance analysis, and log analytics to automated testing, deployment, and operations
ensure the quality, security, reliability, and compliance of our solutions by applying our digital principles and implementing both functional and non-functional requirements
undertake root cause analysis to deliver resolution to problems and recurring faults
involvement in SRE & automation activities, ensuring the technical deliveries are delivered on time as scheduled and resourced appropriately; and the correct level of technical information is available to ensure successful implementation
ongoing review, QA and influence of internal processes, technical approaches and component-level build processes
engineering of solutions for middleware inventory, compliance reporting, vulnerability management & entitlement server.
Skills
Must have
ideally 8 years of experience in a similar position focused on Job Scheduling tools such as AutoSys or equivalent products
Bachelor/Master's degree or equivalent focusing on Information Technology
operating system administrative skills like Unix, Linux, Windows
experience working with either Oracle, MS SQL Server, Azure SQL server databases
experience working in Cloud native solutions and DevOps
good knowledge in scripting and programming languages (Python/JavaScript/Shell/PowerShell)
knowledge in Chef, Puppet, Ansible, Kubernetes, IP Center or any automation technology
familiar with monitoring tools like Netcool, AppDynamics, Loom, Moogsoft, Splunk etc.
ability to work and adapt to an increasingly global operating model
ability to solve complex issues, good at problem statement analysis and solution design thinking
building proprietary tools from the scratch to mitigate weaknesses in incident management or software delivery
capable of understanding client needs and translating this into products and services
Salary : $50 - $60