Reliability Engineer jobs in Oakland, CA

Reliability Engineer analyzes and evaluates the reliability of products, equipment, components, and processes using engineering methodologies and tools. Develops the methods and measures utilized for reliability analysis based on product specifications, tolerances, or operating standards. Being a Reliability Engineer utilizes analysis techniques like FMEA, fault tree, and root cause analysis to identify problems. Oversees testing activities and reviews results. Additionally, Reliability Engineer creates risk-based failure mitigation plans. Proposes new or revised product designs, manufacturing processes and testing specifications that utilize best practices and will increase reliability. Requires a bachelor's degree in engineering. Typically reports to an engineering manager. The Reliability Engineer work is closely managed. Works on projects/matters of limited complexity in a support role. To be a Reliability Engineer typically requires 0-2 years of related experience. (Copyright 2024 Salary.com)

N
Site Reliability Engineer
  • NTT DATA Americas, Inc
  • San Leandro, CA FULL_TIME
  • Job Details

    Company Overview:
    Req ID: 278807
    NTT DATA Services strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now.
    We are currently seeking a Site Reliability Engineer to join our team in San Leandro, California (US-CA), United States (US).

    Job Description:
    5-10 years of experience in Production support/SRE teams with continued focus on improving Platform health
    Experience working in Micro service architecture.
    Hands-on Java coding exp and able to analyze and trouble shoot production issues by reading stack trace and exceptions.
    Familiar with Agile or other rapid application development practices
    Hands-on expertise in building monitoring dashboards and setting up alerts using Splunk.
    Hands-on experience in writing Oracle SQL queries and MongoDB queries.
    Experience with distributed (multi-tiered) systems, algorithms, and relational databases.
    Must have working knowledge of APM tools such as splunk, ELK, Grafana, Prometheus etc
    Knowledge & Exposure caching tools (Redis, memcache) or messaging tools such as MQ, Kafka is a plus
    Working knowledge of CICD is a plus - Source control like Git/Bitbucket , Continuous Integration - Jenkins / UCD Release etc .
    Ability to work with Engineering teams across the ecosystem such as Security , Networking & Infrastructure challenges which can impact platform health & resiliency.
    Shell Scripting / DevOps tools like Ansible with good knowledge of yaml file to write playbooks .
    Experience with distributed storage technologies like NFS as well as dynamic resource management frameworks PCF, Kubernetes / OpenShift.
    A proactive approach to spotting problems, areas for improvement, and performance bottlenecks.
    Expectations:
    You will be a core member of a SRE support team, will be utilizing the latest technology tools to write code, test cases, working with API specs and automate to maintain the resiliency, performance and availability of Digital Sales & Marketing platforms.
    Strong & relevant experience in supporting Web/API platforms built using Java/java script Stack (Spring/Spring boot, Javascript -Angular/react)
    Proficiency in dealing with Legacy infrastructure along with cloud infrastructure (on prem & 3rd party) such as PCF or Azure.
    Identifying opportunities to adopt to new technologies while improving the efficiency by removing toil and continues to drive efficiency & optimization.
    Proactive monitoring of app performance through splunk, App dashboards, App dynamics & Dynatrace etc.
    Represent Platform engineering teams during production outages and collaborate with engineering teams to resolve production outages. Collaborate with stake holders across engineering function to own/derive RCA & work towards permanent resolution.
    Plan, support, execute and comply with governance programs/processes in support of a strong control environment in your functional area. Leverage process documentation to improve operational controls and identify and remediate process deficiencies.
    Proactively identify, communicate, mitigate and escalate risk originating from non-compliance of processes, operational errors, and data integrity issues in all applicable processes.
    Ability to influence SRE practices with in and outside teams to enable a strong DevOps culture with in the organization
    Responsible for working with Engineering teams to maintain the SLAs & SLOs. Constantly looking out for opportunities to improve platform metrics & communicate the same to stakeholders.
    Tech Stack : Java/J2EE ( Spring, spring boot, python, shell scripting).
    Exposure and proficiency in different API styles such as SOAP, REST, Micro services etc.

    #indist
    #li-ist

    About NTT DATA Services:

    NTT DATA Services is a recognized leader in IT and business services, including cloud, data and applications, headquartered in Texas. As part of NTT DATA, a $30 billion trusted global innovator with a combined global reach of over 80 countries, we help clients transform through business and technology consulting, industry and digital solutions, applications development and management, managed edge-to-cloud infrastructure services, BPO, systems integration and global data centers. We are committed to our clients' long-term success. Visit nttdata.com or LinkedIn to learn more.

    NTT DATA Services is an equal opportunity employer and considers all applicants without regarding to race, color, religion, citizenship, national origin, ancestry, age, sex, sexual orientation, gender identity, genetic information, physical or mental disability, veteran or marital status, or any other characteristic protected by law. We are committed to creating a diverse and inclusive environment for all employees. If you need assistance or an accommodation due to a disability, please inform your recruiter so that we may connect you with the appropriate team.

    Where required by law, NTT DATA provides a reasonable range of compensation for specific roles. The starting pay range for this role is $71/hour. Actual compensation will depend on several factors, including the candidate's relevant experience, technical skills, and other qualifications. This position may also be eligible for incentive compensation based on individual and/or company performance

  • 1 Day Ago

P
Senior/Staff Reliability Engineer
  • Pyka
  • Oakland, CA FULL_TIME
  • Pyka is looking for an experienced reliability engineer to oversee the testing and validation of the aircraft’s subsystems. You will oversee all aspects of hardware test/validation. On a day-to-day ba...
  • 20 Days Ago

F
Engineer
  • Foss
  • Richmond, CA FULL_TIME
  • JOB PURPOSE/SUMMARY: The primary responsibility of this position is to ensure all vessel machinery, equipment and systems are operated, maintained and repaired in a safe and effective manner. Addition...
  • 20 Days Ago

W
Engineer
  • Wood Rodgers
  • Oakland, CA FULL_TIME
  • Are you looking for a career where you can nurture a positive working environment and help achieve better employee relations? Are you flexible, proactive, easy to approach, like ‘figuring things out’ ...
  • 2 Months Ago

S
Process Engineer
  • Skydio
  • Hayward, CA FULL_TIME
  • Skydio is seeking a talented multi-disciplinary engineer to define, implement, and continuously improve the final assembly processes for our next generation of products. In this role, your contributio...
  • 10 Days Ago

P
ATS Engineer
  • pacificgastest.valhalla.stage
  • San Ramon, CA FULL_TIME
  • Requisition ID # 79640 Job Category : Engineering / Science Job Level : Individual Contributor Business Unit: Electric Operations Job Location : San Ramon Department Overview The men and women of Elec...
  • 11 Days Ago

Filters

Clear All

Filter Jobs By Location
  • Filter Jobs by companies
  • More

0 Reliability Engineer jobs found in Oakland, CA area

R
Reliability Engineer
  • Russell Tobin
  • Cupertino, CA
  • Test Engineer (Governance Analyst ) Cupertino, CA 95014, US Duration: 06+ Months Salary range: $ 55-60.28/hr. Test Engin...
  • 4/19/2024 12:00:00 AM

O
Site Reliability Engineer, Research Platform
  • OpenAI
  • San Francisco, CA
  • About the team: Reliable services are what enables Open AI to train the best AI models in the world and to bring the pro...
  • 4/19/2024 12:00:00 AM

W
Reliability Engineer
  • Wipro
  • Cupertino, CA
  • Reliability Engineer Auston, TX or Cupertino, CA/Remote ok for locals Permanent Role Job Summary: A hardware reliability...
  • 4/18/2024 12:00:00 AM

S
Director, Database Reliability Engineering
  • SHEIN Technology LLC
  • San Francisco, CA
  • Job Title: Director, Database Reliability Engineering Reports to: Senior Director, Production Engineering Job Location: ...
  • 4/18/2024 12:00:00 AM

V
Site Reliability Engineer - Infrastructure
  • Verkada
  • San Mateo, CA
  • Who We Are Verkada is the largest cloud-based B2B physical security platform company in the world. Only Verkada offers s...
  • 4/16/2024 12:00:00 AM

G
Senior Site Reliability Engineer
  • Glocomms
  • San Jose, CA
  • Glocomms has partnered with a leading social media platform with over 2.5B monthly users worldwide. Responsibilities: De...
  • 4/15/2024 12:00:00 AM

S
Director, Database Reliability Engineering
  • Shein Technology Llc
  • San Francisco, CA
  • Job Title: Director, Database Reliability Engineering Reports to Senior Director, Production Engineering Job Location: L...
  • 4/15/2024 12:00:00 AM

A
Staff Site Reliability Engineer
  • Abbott Laboratories
  • Pleasanton, CA
  • Abbott is a global healthcare leader that helps people live more fully at all stages of life. Our portfolio of life-chan...
  • 4/2/2024 12:00:00 AM

Oakland is in the eastern region of the San Francisco Bay. In 1991 the City Hall tower was at 37°48′19″N 122°16′21″W / 37.805302°N 122.272539°W / 37.805302; -122.272539 (NAD83). (The building still exists, but like the rest of the Bay Area, it has shifted northwest perhaps 0.6 meters in the last twenty years.) The United States Census Bureau says the city's total area is 78.0 square miles (202 km2), including 55.8 square miles (145 km2) of land and 22.2 square miles (57 km2) (28.48 percent) of water. Oakland's highest point is near Grizzly Peak Blvd, east of Berkeley, just over 1,760 feet (...
Source: Wikipedia (as of 04/11/2019). Read more from Wikipedia
Income Estimation for Reliability Engineer jobs
$92,626 to $109,764
Oakland, California area prices
were up 4.5% from a year ago

Reliability Engineer in Appleton, WI
Michael Kehoe, staff site reliability engineer for LinkedIn describes the person best suited for the SRE role as someone who has a little bit of knowledge about everything.
December 29, 2019
Reliability Engineer in Dallas, TX
The main responsibility of an electrical reliability engineer is to test an electrical component's reliability over a period of time and in various conditions.
December 23, 2019
Reliability Engineer in Allentown, PA
So, the same engineers that are building applications are also the ones that are running them, scaling them, and dealing with incidents.
December 04, 2019