Senior Data Engineer

Sparibis
Washington, DC Remote Full Time
POSTED ON 9/20/2024

Location: 100% Remote

Years’ Experience: 10 years

Education: Bachelor’s in IT related field

Work Authorization: Must show that applicant is legally permitted to work in the United States.

Clearance: Applicants must be able to meet the requirements to obtain an Public Trust security clearance. NOTE: United States Citizenship is required to be eligible to obtain this security clearance.

Key Skills:

  • 10 years of IT experience focusing on enterprise data architecture and management
  • Experience with Databricks required
  • 8 years experience in Conceptual/Logical/Physical Data Modeling & expertise in Relational and Dimensional Data Modeling
  • Experience with Great Expectations or other data quality validation frameworks
  • Experience with ETL and ELT tools such as SSIS, Pentaho, and/or Data Migration Services
  • Advanced level SQL experience (Joins, Aggregation, Windowing functions, Common Table Expressions, RDBMS schema design, Postgres performance optimization)
  • Experience with AWS environment, CI/CD pipelines, and Python (Python 3)

 

 

Responsibilities

 

 

  • Plan, create, and maintain data architectures, ensuring alignment with business requirements
  • Obtain data, formulate dataset processes, and store optimized data
  • Identify problems and inefficiencies and apply solutions
  • Determine tasks where manual participation can be eliminated with automation.
  • Identify and optimize data bottlenecks, leveraging automation where possible
  • Create and manage data lifecycle policies (retention, backups/restore, etc)
  • In-depth knowledge for creating, maintaining, and managing ETL/ELT pipelines
  • Create, maintain, and manage data transformations
  • Maintain/update documentation
  • Create, maintain, and manage data pipeline schedules
  • Monitor data pipelines
  • Create, maintain, and manage data quality gates (Great Expectations) to ensure high data quality
  • Support AI/ML teams with optimizing feature engineering code
  • Expertise in Spark/Python/Databricks, Data Lake and SQL
  • Create, maintain, and manage Spark Structured Steaming jobs, including using the newer Delta Live Tables and/or DBT
  • Research existing data in the data lake to determine best sources for data
  • Create, manage, and maintain ksqlDB and Kafka Streams queries/code
  • Data driven testing for data quality
  • Maintain and update Python-based data processing scripts executed on AWS Lambdas
  • Unit tests for all the Spark, Python data processing and Lambda codes
  • Maintain PCIS Reporting Database data lake with optimizations and maintenance (performance tuning, etc)
  • Streamlining data processing experience including formalizing concepts of how to handle lake data, defining windows, and how window definitions impact data freshness.

 

 

Qualifications

 

 

  • 10 years of IT experience focusing on enterprise data architecture and management
  • Experience in Conceptual/Logical/Physical Data Modeling & expertise in Relational and Dimensional Data Modeling
  • Experience with Databricks, Structured Streaming, Delta Lake concepts, and Delta Live Tables required
    • Additional experience with Spark, Spark SQL, Spark DataFrames and DataSets, and PySpark
    • Data Lake concepts such as time travel and schema evolution and optimization
    • Structured Streaming and Delta Live Tables with Databricks a bonus
  • Experience leading and architecting enterprise-wide initiatives specifically system integration, data migration, transformation, data warehouse build, data mart build, and data lakes implementation / support
    • Advanced level understanding of streaming data pipelines and how they differ from batch systems
    • Formalize concepts of how to handle late data, defining windows, and data freshness
    • Advanced understanding of ETL and ELT and ETL/ELT tools such as SSIS, Pentaho, Data Migration Service etc
    • Understanding of concepts and implementation strategies for different incremental data loads such as tumbling window, sliding window, high watermark, etc.
    • Familiarity and/or expertise with Great Expectations or other data quality/data validation frameworks a bonus
    • Understanding of streaming data pipelines and batch systems
    • Familiarity with concepts such as late data, defining windows, and how window definitions impact data freshness
  • Advanced level SQL experience (Joins, Aggregation, Windowing functions, Common Table Expressions, RDBMS schema design, Postgres performance optimization)
    • Indexing and partitioning strategy experience
  • Debug, troubleshoot, design and implement solutions to complex technical issues
  • Experience with large-scale, high-performance enterprise big data application deployment and solution
  • Understanding how to create DAGs to define workflows
  • Familiarity with CI/CD pipelines, containerization, and pipeline orchestration tools such as Airflow, Prefect, etc a bonus but not required
  • Architecture experience in AWS environment a bonus
    • Familiarity working with Kinesis and/or Lambda specifically with how to push and pull data, how to use AWS tools to view data in Kinesis streams, and for processing massive data at scale a bonus
    • Experience with Docker, Jenkins, and CloudWatch
    • Ability to write and maintain Jenkinsfiles for supporting CI/CD pipelines
    • Experience working with AWS Lambdas for configuration and optimization
    • Experience working with DynamoDB to query and write data
    • Experience with S3
  • Knowledge of Python (Python 3 desired) for CI/CD pipelines a bonus
    • Familiarity with Pytest and Unittest a bonus
  • Experience working with JSON and defining JSON Schemas a bonus
  • Experience setting up and management Confluent/Kafka topics and ensuring performance using Kafka a bonus
    • Familiarity with Schema Registry, message formats such as Avro, ORC, etc.
    • Understanding how to manage ksqlDB SQL files and migrations and Kafka Streams
  • Ability to thrive in a team-based environment
  • Experience briefing the benefits and constraints of technology solutions to technology partners, stakeholders, team members, and senior level of management

 

 

About Sparibis

 

Sparibis LLC is a professional solution firm that Clients rely on to access the best talent to drive their business success.

 

Sparibis is an equal opportunity employer that values diversity at all levels. All individuals, regardless of personal characteristics, are encouraged to apply.

For Employer
Looking for Real-time Job Posting Salary Data?
Keep a pulse on the job market with advanced job matching technology.
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Senior Data Engineer?

Sign up to receive alerts about other jobs on the Senior Data Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$120,127 - $151,678
Income Estimation: 
$140,280 - $175,744
Income Estimation: 
$90,513 - $119,405
Income Estimation: 
$116,438 - $150,865
Income Estimation: 
$122,709 - $158,630
Income Estimation: 
$82,945 - $106,440
Income Estimation: 
$90,513 - $119,405
Income Estimation: 
$102,195 - $139,353
Income Estimation: 
$145,302 - $183,482
Income Estimation: 
$167,098 - $214,563
Income Estimation: 
$116,438 - $150,865
Income Estimation: 
$145,302 - $183,482

Sign up to receive alerts about other jobs with skills like those required for the Senior Data Engineer.

Click the checkbox next to the jobs that you are interested in.

  • SAP Asap Methodology Skill

    • Income Estimation: $151,295 - $200,741
    • Income Estimation: $151,578 - $207,146
  • Backup/Recovery Skill

    • Income Estimation: $114,095 - $148,824
    • Income Estimation: $113,527 - $144,047
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Sparibis

Sparibis
Hired Organization Address Washington, DC Full Time
Location: 100% remote Years’ Experience: 10 years development experience Education: Bachelor degree in Computer Science,...
Sparibis
Hired Organization Address Washington, DC Full Time
Location: 100% remote Years’ Experience: 8 Years Education: Bachelor’s degree required Work Authorization: United States...
Sparibis
Hired Organization Address Washington, DC Full Time
Location: 100% remote Years’ Experience: 4 years development experience Education: Bachelor degree required Security Cle...
Sparibis
Hired Organization Address Washington, DC Full Time
Location: 100% remote Years’ Experience: 7 years development experience Education: Bachelor degree required Security Cle...

Not the job you're looking for? Here are some other Senior Data Engineer jobs in the Washington, DC area that may be a better fit.

Data engineer

Data Engineer Jobs, Mc Lean, VA

Senior Desktop Engineer

Koniag Data Solutions, LLC, Washington, DC