Reliability Engineer analyzes and evaluates the reliability of products, equipment, components, and processes using engineering methodologies and tools. Develops the methods and measures utilized for reliability analysis based on product specifications, tolerances, or operating standards. Being a Reliability Engineer utilizes analysis techniques like FMEA, fault tree, and root cause analysis to identify problems. Oversees testing activities and reviews results. Additionally, Reliability Engineer creates risk-based failure mitigation plans. Proposes new or revised product designs, manufacturing processes and testing specifications that utilize best practices and will increase reliability. Requires a bachelor's degree in engineering. Typically reports to an engineering manager. The Reliability Engineer work is closely managed. Works on projects/matters of limited complexity in a support role. To be a Reliability Engineer typically requires 0-2 years of related experience. (Copyright 2024 Salary.com)
Roles and Responsibilities
In this role, you will:
Establish performance baseline, capacity thresholds, correlate events, and define monitoring/alerting criteria
Develop automated solutions to address potential problems before they result in a service interruption
Provide impact assessment and mitigation plan for changes going into the production environment
Investigate root cause of severe and systemic outages, identify corrective actions and apply across the enterprise
Develop availability measures that align with consumer experience to accurately assess the usability of crucial services
Build capacity models to baseline transactional load compared to resource performance and leverage data to predict overall system capacity while automating load placement to avoid outages
Identify thresholds for all critical links in the data path to quickly isolate where imbalances may result in potential outages
Analyze failure points in services to model risk level and resolution steps if failure occurs.
Assist in driving architecture enhancements into system to mitigate potential failure points.
Programmatically monitor for and remediate configuration drift of critical devices
Develop response plans to potential failure points and evaluate effectiveness during planned tests
Perform comprehensive operational health checks of the entire services to identify areas of concern and track activities to drive improvements at all levels of the architecture
Provide technical coaching and direction to more junior teammates
Required Qualification
Desired Characteristics
Technical Expertise:
Excellent knowledge of common operating systems (Unix/Linux, Windows)
Strong oral and written communication skills.
Demonstrated experience scripting or developing software and services for the cloud Python, Go, Java, Node.js, etc.
Extensive knowledge of network protocols: TCP/IP, SNMP, FTP, syslog, TFTP, etc.
Experience managing version control systems such as Git
Experience deploying and managing infrastructure on public clouds such as AWS or Azure
Experience using an automated configuration management system (OpenTofu, Pulumi, Chef, Ansible, etc.)
Strong organizational and project management skills
Strong analytical and problem resolution skills
Excellent knowledge of Network Management (SNMP, MIB)
Experience with configuring, customizing, and extending monitoring tools (Splunk, Grafana, Prometheus, etc.)
Excellent knowledge of TCP/IP networking, and inter-networking technologies (routing/switching, proxy, firewall, load balancing etc.)
AWS DevOps Certification a plus
Leadership:
Proactively engages with cross-functional teams to resolve issues and design solutions using critical thinking and analytics skills and best practices by actively incorporating input from various sources
Strong analytical and strong problem solving skills - effectively evaluates information/data to make decisions; anticipates obstacles and develops plans to resolve
Continuous improvement oriented – actively generates process improvements; champions and drives change initiatives
Ability to deliver results in a rapidly changing dynamic environment
Personal Attributes:
Emotional Intelligence, ability to influence up and out and the ability to work independently
Must be a team player with a strong desire to win
Passionate about continuously learning and able to quickly adapt and pivot to win in dynamic environment
Highly organized and efficient; able to balance competing priorities and execute accordingly
Strong oral and written communication skills
Did you know that research has shown that women [and people of color] are less likely to apply to a job unless they meet 100% of the requirements? GE Digital is committed to building an inclusive workplace where everyone feels like they belong and can bring their entire selves to work – this drives our innovation! Even if you don’t meet every one of the preferred criteria in this job description, we encourage you to still apply as you might be a fantastic fit for this role, or other currently open roles.
#LI-ES1
Additional Information
The salary range for this position is $104,000USD -$156,000USD. The specific salary offered to a candidate may be influenced by a variety of factors including the candidate’s experience, their education, and the work location. In addition, this position is eligible for a performance bonus. Available benefits include be Health, Medical, 401K, and Paid Leave.
Compensation Grade
LPB1GE offers a great work environment, professional development, challenging careers, and competitive compensation. GE is an Equal Opportunities Employer. Employment decisions are made without regard to race, color, religion, national or ethnic origin, sex, sexual orientation, gender identity or expression, age, disability, protected veteran status or other characteristics protected by law.
GE will only employ those who are legally authorized to work in the United States for this opening.
Relocation Assistance Provided: No
Clear All
0 Reliability Engineer jobs found in Taunton, MA area