What are the responsibilities and job description for the Site Reliability Engineer position at Vertisystem Inc.?
Job Description
I am Sridhar from Vertisystem.
Please let me know if you are interested in the position below.
Site Reliability Engineer
Agoura Hills, CA On-site
Full-time
We are looking for a candidate with an engineering mindset who will:
Partner with DevOps teams to drive observability into our technology stacks and define metrics so that we can improve our I.T. service stability and meet business needs.
Collaborate with engineering and DevOps so that our services are scalable and resilient.
Create feedback for continual improvement and evolution of non-functional requirements.
Apply knowledge of DevOps tooling and partner with development teams to implement toolsets and train teams.
Job Responsibilities:
Release and deploy foundational services to enable WFS engineering to move faster with confidence.
Design and develop systems and processes that enable highly available and scalable systems.
Design, build and deliver software to dramatically improve the availability, scalability, latency, and efficiency of WFS’s services.
Achieve breakthroughs in systems throughput by identifying and eliminating bottlenecks.
Leverage technologies such as Python, Google Cloud Platform, Kubernetes, Terraform, Oracle, MySQL, Redis, ELK, and Snowflake to advance WFS's platform.
Champion best practices by actively collaborating with other teams in a culture that values whiteboarding and technical design review.
Contribute to the company as a subject matter expert in multiple areas, constantly pushing yourself to be a better engineer and to level up all your peers within your team and within WFS.
Mentor and pair with other WFS engineers to build better software by focusing on performance, self-healing systems, configuration as code, defensive programming, application security, etc.
Participate in periodic on-call duties with a focus on solving issues when they are discovered, preventing recurrences, and minimizing alert fatigue.
Prototype and advocate for architectural improvements to achieve breakthrough results in WFS systems' operational scalability and reliability.
Work together with service-facing engineers to release and deploy impactful code.
Perform quantitative analysis to understand and scale WFS systems and manage the cross-functional effort to resolve scalability issues.
Produce and advocate for preventative, upstream solutions with internal stakeholders and external vendors and dependencies.
Confidently make informed, data-driven decisions in a fast-paced environment with competing priorities.
Evangelize Site Reliability best practices across the engineering organization and community.
Qualifications:
2 years of experience working as an SRE, Systems Engineer, or DevOps Engineer.
Knowledge of Google Cloud Platform technical capabilities, resource provisioning and resource capacity management.
Experience in software engineering, the site reliability engineering (SRE) model, and DevOps paradigms.
Knowledge of security, including how security works, data encryption, etc.
Experience with DevOps tools, JIRA, ServiceNow, and Google Cloud Platform-based monitoring tools.
Experience with performance testing tools and SRE best practices.
Willing to work on a hybrid schedule.
Thanks & Regards,
Sridhar Reddy
Sr. Talent Acquisition Specialist
Email:
Contact no:
Salary: $120,000 - $140,000