What are the responsibilities and job description for the Senior Data Engineer (Onsite) position at Spacee?
If you are a Data Engineer with experience, we are looking for you to join our Data and Computer Vision team to develop, maintain, and improve architecture and procedures to meet the needs of rapidly evolving computer vision and augmented reality projects. The ideal candidate is adept at ingesting data from a variety of sources, pre-processing that data, building data pipelines, leveraging open-source data engineering tools, and wrangling data, and enjoys building and optimizing data systems.
If you are humble, confident, transparent, and have a knack for developing and iterating on minimally lovable Data Engineering solutions, then we’d like to meet you.
Top Reasons to Work with Us:
We are doing groundbreaking work in the Computer Vision and Augmented Reality industries. Our Data and CV stack is still developing, so there is a lot of opportunity for you to make a high impact.
What you will be doing:
- You will use a wide variety of data engineering and reliability engineering methods and open-source tools to help Spacee execute on our mission.
- You will build tools that leverage your specialty skillset so that data scientists and computer vision engineers can focus on what they’re best at.
- You will be working with and mixing batch & streaming data at a variety of scales and volumes.
- You will collaborate with many types of stakeholders and functional teams to understand and document their data needs and the data structures their work produces.
- You will use data summarization and visualization techniques to debug data quality issues.
Core Responsibilities
- Build, create, and maintain the infrastructure required for optimal extraction, loading, and transformation (ELT/ETL) of data from a wide variety of data sources.
- Design/manage data structures.
- Design/manage OLAP databases to provide a powerful and flexible set of relations that will support BI dashboards and optimize queries within them.
- Take primary responsibility for building, maintaining, and improving the team’s CI/CD pipeline and message/event-based designs.
- Work with stakeholders throughout the organization to assist with data-related technical issues, support their infrastructure needs, and identify opportunities for leveraging company data to drive business solutions.
- Assess the effectiveness, accuracy, and quality of new data sources and data gathering techniques.
- Combine a variety of data sets to meet functional/non-functional business requirements.
- Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability and cost efficiency, etc.
- Work with data, analytics, and computer vision experts to strive for greater functionality in our data systems.
About You
- You are self-motivated with strong project management and organizational skills. You do more than execute on work planned by others: your expertise means we want you to identify the work that needs to be done and to propose it clearly and convincingly.
- You have solid written communication skills and interest in writing clear documentation.
- You have strong problem-solving skills with an emphasis on product development.
- You enjoy learning, researching, mastering, and implementing new approaches and technologies. But you temper your enthusiasm with a clear-eyed evaluation of opportunity costs and the value of implementation relative to the implementation cost.
- You are egalitarian when it comes to tools and technologies. What matters to you is that others can succeed, and you are supportive of them even if their choices differ from your individual preference.
- You gain satisfaction from helping other people and have a passion for the developer experience of data scientists and computer vision engineers.
Required Qualifications
- Fluent in Python.
- Strong analytical skills related to transforming semi-structured and unstructured datasets into structured data suitable for further analysis.
- 2 years of experience in a Data Engineer role, or equivalent.
- At least two of:
  - Comfort with Docker, Dockerizing code, and deploying Dockerized code into a production environment.
  - Functional SQL knowledge and experience working with relational databases, including comfort with DDL, joins, window functions, stored procedures, etc.
  - Experience with CI/CD; we currently use GitLab.
  - Experience with at least one programmatic orchestration tool for batch or stream data processing; we currently use Apache Airflow.
Bonus Qualifications
- Familiar and comfortable with testing and packaging Python code.
- Experience and familiarity with a variety of OLAP databases, data warehouses, and data lake approaches/technologies is desirable; we currently use Snowflake and aim for dimensional modeling to support Power BI.
- Experience with Docker, Kubernetes, Terraform, or other SRE/DevOps tooling.
- Experience building and optimizing data pipelines, architectures, and data sets, including experience working with streaming data sources; we currently use Kafka (Confluent) and Spark (Databricks).
- Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
- You have previously built processes that support data transformation, data structures, metadata, dependency, and workload management.
- Experience supporting and working with cross-functional teams in a dynamic environment.
- Experience with Cloud services; we use Azure.
- Experience coordinating with programmers who write in data-oriented scripting languages: Python, R, Julia, etc.
What's In It for You
- Competitive Salary
- Health, vision, dental, and legal insurance
- 401(k) and stock options
- Flexible DTO
So, if you are a Data Engineer or are interested in becoming one, please apply today!
Applicants must be authorized to work in the U.S.
Spacee, Inc. is proud to be an Equal Opportunity Employer.
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability, protected veteran status, or any other characteristic protected by law.
We are not accepting resumes from agencies. No fees will be earned unless authorized for this specific search.