Staff HPC Engineer

ASRC Federal
Travis Field, CA Full Time
POSTED ON 5/2/2024 CLOSED ON 7/16/2024

What are the responsibilities and job description for the Staff HPC Engineer position at ASRC Federal?

ASRC Federal is searching for a Staff HPC Engineer to support InuTeq LLC at NASA Ames, CA.

ASRC Federal InuTeq provides High Performance Computing services throughout the HPC lifecycle for computational requirements, architecture, acquisition, and operations to federal government customers. Our employees embrace innovation and are committed to a culture of continuous, standards-driven process improvement and assimilation of industry best practices. We are seeking to fill a role that primarily provides development for supercomputing batch scheduling, with secondary support for supercomputing systems administration, on our NASA NACS High Performance Computing (HPC) contract.

Summary: The successful candidate will be an active supporting member of the ASRC Federal team reporting directly to the Manager of the Application Performance and Productivity (APP) group and matrixed directly to the Supercomputing Systems Team Manager.

An individual at this skill level should have demonstrated extensive experience working with common HPC batch schedulers (e.g., PBS, Slurm, or Moab/Torque) while helping users of HPC resources resolve the various issues they might have getting applications to run efficiently. This individual should demonstrate experience installing, maintaining, and upgrading HPC systems. The individual, along with the entire HPC team, will be engaged in the day-to-day operations and support of the HPC resources. Activities may include system patching, OS upgrades, deploying new systems, writing scripts, and troubleshooting system issues on the HPC system. The ability to interact with users to determine symptoms, and then reproduce their issues to isolate the causes, is a critical skill for this work. There will also be activities in testing, benchmarking, user tool scripting, and analyzing trouble tickets to find patterns indicating system or user education issues.

Duties And Responsibilities

  • Designs, deploys, and maintains production HPC clusters with over 2,000 nodes, InfiniBand interconnect, and 100 petabytes of data storage.
  • Writes and shepherds scalable feature designs through the entire software development process, from requirements and use cases to release.
  • Designs and develops scripts for system administration, monitoring and usage reporting.
  • Modifies existing software to correct errors and/or improve performance.
  • Designs and develops scripts for system regression and performance testing (file systems (Lustre), scheduler (PBS), interconnect (HDR/NDR, Slingshot), high availability, etc.).
  • Troubleshoots, isolates and resolves application, system and other technical problems (hardware, software, and network).
  • Understands research use cases, researches and deploys new technologies, defining cost, performance and other trade-offs.
  • Manages and maintains tools for configuration management (HPCM, Ansible, and Git), resource management, scheduling, and all necessary aspects of HPC in accordance with best practices.
  • Researches, deploys and manages networking and security infrastructure, including development of policies and procedures.
  • Assists in developing and writing proposals and publications.
  • Creates and provides clear documentation.
  • Mentors junior staff and cross-trains peers.
  • Provides after-hours and weekend support as required.
  • Performs moderate supercomputing system administration that contributes to:
    • Day-to-day operations of the Linux HPC clusters and storage systems
    • Proactive monitoring, analysis, and correction of system issues
    • Development of scripts to automate repetitive tasks or tools to enhance support of the HPC systems
    • System performance analysis and tuning
    • Building, installing, and supporting user-requested software
    • Supporting evaluation and assessment of new HPC technology
    • Resolving user-reported issues and managing support ticket requests in Remedy

Requirements

  • Bachelor’s degree in computer science or related field
  • Strong computer science background with in-depth systems-level knowledge in operating systems and networking
  • A minimum of 5 years of experience administering HPC systems and scheduling software (PBS, Slurm, or Moab/Torque)
  • A minimum of 5 years of experience in systems programming in heterogeneous, multi-platform HPC environments
  • Strong ability to analyze, debug and maintain the integrity of an existing code base
  • Demonstrated equivalent of 5 years of Linux/UNIX user support experience and hands-on experience with administration of Linux systems
  • Experience working with HPC applications and proficiency in at least one of C, C++, or Fortran
  • Superior scripting skills and excellent attention to detail; proficiency in at least one of Python, Perl, or Bash
  • Strong ability to interact with customers to understand needs, elicit requirements, and get feedback on prototype solutions
  • Excellent communication and people skills; excellent time management and organizational skills
  • Experience with system configuration management tools (e.g., Puppet, Chef, Ansible)
  • Experience with revision control software (e.g., CVS, SVN, Git)
  • Track record of delivering commercial-quality software on schedule through multiple release cycles
  • Proficiency at technical writing

Preferred Skills

  • Proficiency with analysis and problem-solving skills for debugging and optimization of applications
  • Familiarity/proficiency with OpenMP and Message Passing Interface (MPI) programming
  • Experience with Lustre and InfiniBand
  • Experience with cloud technologies (AWS, Azure, GCP), OpenStack or Kubernetes is a plus
This job has expired.