What are the responsibilities and job description for the Staff DevOps Engineer position at Socure?
Who we are:
Founded in 2012, Socure is the leader in digital identity verification technology for Day Zero and beyond. Our predictive analytics platform applies artificial intelligence and machine learning techniques, with trusted online/offline data intelligence from email, phone, address, IP, device, velocity, and the broader internet, to verify identities in real-time. We have more than 400 customers across the financial services, gaming, telecom, and ecommerce industries, including three of the top five banks, seven top 10 issuers, three top MSBs, multiple tier-one payroll providers, the top credit bureau, and over 75 of the largest and most successful fintechs such as Varo Money, Public, Chime, and Stash. We are funded by some of the world's best investors and entrepreneurs including Accel, Scale Venture Partners, Commerce Ventures, Work-Bench, Sorenson, Citi Ventures, Wells Fargo Strategic Capital, Synchrony, MVB Bank, Santander InnoVentures, and Two Sigma Ventures.
Our trophy case includes numerous industry awards and accolades, including being named one of Forbes America’s Best Startup Employers 2021 as well as the Best New Technology Introduced over the Last 12 months – Data and Data Services at the 2020 American Financial Technology Awards (AFTAs), being ranked #70 on Deloitte’s Technology Fast 500™, getting listed as a Gartner Cool Vendor, and winning Finovate’s Award for Best Use of AI/ML, to name a few!
The only way we can further our mission of becoming the single, trusted source of identity verification and eliminating identity fraud is by building the best team on the planet. This is where you come in!
What the role is:
Socure is hiring three Staff DevOps Engineer roles on our DevOps team; the three areas of focus we are looking to fill are in the following areas:
- Storage
- Build Pipeline
- AWS Cloud
These individuals will play a driving role in shaping the foundational systems that Socure engineers use to develop and deploy their applications. We are looking for people who are motivated by the opportunity to solve scalability, reliability and security-related challenges in infrastructure. This role will have a major impact on the infrastructure and scalability of a high-growth company, and your contributions will make a difference across the entire company.
This role can be based remotely in the United States. You will report directly to our Senior Director of DevOps, Scott Pustay.
What you’ll do:
- Build the frameworks, automation and tooling to enable a culture of quality within Socure, and automate all aspects of the infrastructure for scale
- Participate in high-level architecture of large-scale distributed systems, promoting security, reliability and scalability in development, QA and production environments
- Build platforms to empower engineers (developers, data scientists, DevOps)
- Oversee and ensure the security of our cloud infrastructure, data, and internal services
- Be a technical leader and display leadership and vision for the future infrastructure at Socure, which may include pioneering new technologies/services to help meet growing challenges
- Collaborate with the Executive Team, Product Management, Architects, and existing engineering teams to design, develop, and publish software, processes, and workflows supporting a highly available, fault-tolerant SaaS platform
- Be a team leader who is responsible for the deliverables of the team
- Communicate objectives both within the team and to other teams
- Plan projects end-to-end; take ownership of the success of the project, delegate responsibilities mindfully, and foster engaging and productive discussions
- Take ownership of continuous improvement within the team, as we are continuously learning here at Socure
What you’ll bring:
- 5 years of experience building infrastructure on cloud hosted systems (AWS)
- Repeated successes in building scalable and resilient infrastructure
- Prior experience in software development ideal
- Experience with communication protocols, restful services, service-oriented architecture, distributed systems, and microservices
- A strong desire to automate the mundane
- Experience building processes and services that enable developers and enhance productivity
- Demonstrated expertise working with developers to understand their needs and ever changing challenges
- Experience in designing and architecting enterprise grade solutions and processes, and thorough understanding of the best tools to get the job done right
- Great communication skills—able to explain your solution and its impact to the team and non-technical folks so they can understand
- Proven experience prioritizing tasks, evaluating trade-offs, communicating deliverables, and organizing a small team in order to execute on projects
- Availability for on-call support on a rotating basis
- Comfortable working autonomously with minimal guidance
1. Storage:
We are looking for someone to lead our various storage related projects, think zero down-time database migrations, ensuring backups are working properly and are monitored, optimizing for cost and performance and availability, building tools that automate frequent requests around storage (new databases, etc.). You’ll be working with multiple databases: MySQL, Aurora, Postgresql, Redshift, and be expected to tune and monitor for performance and availability. You’ll work closely with our Data Science team to help build and optimize our EMR, and ETL services as well as our in-house vendor offerings; and you’ll work with our product engineers on our ElasticSearch, S3 and DynamoDB services.
- Incident response
- AWS RDS (mysql, postgresql, aurora)
- ElasticSearch
- Backup / recovery
- Vendor data
- PII data
- EMR
- Working with DS tools and humans
- Working with product eng tools and humans
- Performance and monitoring of data systems
- S3
- ETL
- Request automation (new RDS, ES, etc.)
- Elasticache
- DynamoDB
- Cost optimization
2. Build Pipeline:
You will own the build pipeline from engineering code commit to that very code being deployed and running in production. Initially you will work with Gitlab, Jenkins, and Elastic Beanstalk, but we expect you to bring new tools and solutions as well. You will be expected to monitor the health and performance of the tools and systems related to getting code into and running in production. You will also help us build the path from code to containerized deployments in an EKS environment. Time to release, number of successful releases, and uptime are your metrics; building a platform to empower developers to deploy and own their code is what gives you joy at work.
- From gitlab commit to code deployment
- Gitlab
- Elastic beanstalk
- Jenkins
- Performance and monitoring of gitlab, jenkins, and beanstalks
- Suggest new pipeline patterns for improved performance/productivity
- Maintain/monitor health of multiple environments (stage, ds, disaster, prod)
- Enhance self service automation to decrease release time
- Automate and templitize new microservice creation
- Incident response for area
- Feature branching support
- Creating/maintaining testing environments
- Mobile pipeline
- Maintenance of future environments: DIRT, pen-test, performance testing, etc.
- Cost optimization of resources
- Performance and monitoring of all environments
- Suggest infrastructure changes to optimize performance and/or cost
- Primarily serving product engineering, with some support to DS as it relates to product code
3. AWS Cloud:
This role will work as a Senior Generalist for all things AWS cloud related covering such areas as monitoring the overall health and availability, Networking, Capacity Planning, Load Testing, and any other area as needed. Experience within the public sector (US Government) is a big plus. You will be expected to to understand the nuts and bolts of multiple AWS OU deployment; aligning VPCs, security groups, and IAM policies into one beautifully functioning machine excites you. You will also work with our AWS enterprise support team on general trends and grand plans; think expanding an existing US based platform into multiple international regions.
- Work with public sector (US GOV) is a plus
- Incident response for area
- Sr. Generalist
- All things AWS cloud
- Networking
- VPCs
- Monitoring of overall health and availability
- Working with AWS on general trends
- Integrate teams and tools together cohesively
- Capacity planning
- Load testing
- Automation of general purpose tools/services (monitoring, logging, etc.)
- AWS Control Tower
- AWS IAM policy
- Being aware of new AWS tools to present as solutions to other teams
- Serving Security, GRC, product engineering, and other DevOps teams
- Multi-region support
- Multi AWS OU account support
- Region failover
Perks & Benefits:
- Competitive base salary
- Equity - every employee is a stakeholder in our upside
- Medical, dental and vision benefits for employees and their dependents
- Parental leave and fertility support
- Flexible PTO
- 401K with company match
- Stipend to supply your home office
- Annual professional development stipend
A Message on COVID-19:
Socure's number one priority is to safeguard the health and well-being of our team members, our families and our communities. During this unprecedented time, we are closely monitoring COVID-19 developments and updating our response plan quarterly. We are regularly soliciting feedback from our employees to help inform our return-to-office strategy. For our team members who loved going into the office, we are looking forward to meeting once again! But until then, we are striving to ensure that Socureans have the resources and support they need to excel from home. This includes a work-from-home stipend so you can build your home office and fun, virtual events so you can continue to feel connected to your coworkers.
We are an equal opportunity employer and value diversity of all kinds at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.