HPC Automation Engineer
·
Parallel Works
·
Remote
DescriptionThe Role
This position involves development and enhancement of the Parallel Works platform, which runs on all three leading public clouds: AWS, Azure and Google Cloud. You’ll be responsible for developing automations which extend the existing platform and its cluster deployment processes. You will become an expert in interacting with HPC job schedulers, cloud APIs, workflow management frameworks, and container systems to create responsive, reliable and easy-to-use deployments of cloud clusters.

Parallel Works provides developers the rare opportunity to interact with cutting edge cloud and on-premises technologies. We are very interested in hearing from individuals with experience with one or more of the following specialties: architecting high performance computing clusters, parallel file systems, high throughput networking, and GPU clusters.
RequirementsRequired Experience ● BSci in Computer Science or related field.
● 5+ years of programming experience. ● 3+ years of automation experience using Terraform, Ansible, and/or Linux shell and Python scripting ● Deploying containerized applications on Kubernetes, Docker and Singularity ● Thorough knowledge of the Linux operating system and services. ● Familiarity with command-line shell interpreters and scripting techniques ● Network service programming including tunneling and port forwarding. ● Git revision control, branching and merging techniques using GitHub. ● Experience programming in TypeScript, Python, or Golang ● Experience with parallel profiling and debugging Desired Experience ● Advanced degree (MSci or PhD) in Computer Science ● Experience automating on-premise deployments ● Database experience with SQL and JSON/NoSQL databases, ideally PostgreSQL and MongoDB ● High speed data movement and parallel IO tools and techniques ● Architecting high performance computing clusters - on cloud and/or on premises ● Experience deploying parallel file systems (Lustre, BeeGFS/BeeOND, GPFS) ● Maintain up-to-date knowledge of cloud resources and design patterns in AWS, GCP, and Azure ● Familiarity with HPC schedulers such as PBS and Slurm
Company DescriptionAbout Us Do you like high performance computing and empowering researchers? Parallel Works is a Chicago-based startup whose SaaS platform makes high performance computing (HPC) workflows easy, fast and collaborative on hybrid cloud and in supercomputing environments. The platform acts as middleware between wide ranges of computing resources and allows organizations to leverage their resource, billing and user hierarchies in a uniform way. Our customers range from small and nimble startups, to government agencies, to global manufacturing companies. The interview process includes a panel interview and an assessment designed to highlight your relevant skills. Parallel Works is an Affirmative Action, Equal Opportunity Employer. As part of our standard hiring process for new employees, employment with Parallel Works will be contingent upon successful completion of a comprehensive technical evaluation and a background check.
·
·
2022-11-08
Event Type
Job Posting
TimeWednesday, 16 November 202210am - 3pm CST
Location
Back To Top Button