HPC Storage Systems Analyst
Lawrence Berkeley National Laboratory
Berkeley, CA 94720
DescriptionHPC Storage Systems Analyst - 97184
Division: NE-NERSC

Lawrence Berkeley National Lab’s (LBNL, https://www.lbl.gov/) NERSC Division has an opening for an HPC Storage Systems Analyst to join the team.

In this exciting role, you will join the Storage Systems Group which is made up of system engineers and programmers providing NERSC’s 300 petabyte High Performance Storage System and center-wide, parallel file systems. Our storage systems are utilized by more than 8,000 scientists who use NERSC to perform unclassified, scientific research across a wide range of disciplines, including climate modeling, research into new materials, simulations of the early universe, high energy physics and a host of other scientific endeavors.

In addition to system operation responsibilities, you will lead and contribute to ongoing efforts to develop a storage strategy and plan the associated architecture and integration with NERSC’s HPC infrastructure. You will lead and contribute to evaluation of existing and emerging High Performance Computing (HPC), as well as AI/ML, storage systems, including analyzing the performance characteristics of leading-edge DOE Office of Science workloads and workflows on these systems. This position requires knowledge of storage system architectures as well as associated interconnects and networks. The HPC Storage Systems Analyst will also work with peers at other leading HPC facilities and vendor engineering teams to evaluate emerging storage technologies and define future directions for deployment.

You will also participate in regular cross-team efforts to integrate our storage systems with NERSC’s computational and networking infrastructure, troubleshoot performance issues at scale, and develop innovative solutions to continuously optimize operational and user productivity.

What You Will Do:
• Monitor, administer, and optimize NERSC’s distributed parallel file systems, block storage arrays, and auxiliary Linux-based storage servers.
• Analyze, troubleshoot, and resolve complex problems that arise in NERSC's production storage hardware, software systems, storage networks and systems that utilize NERSC storage systems.
• Participate in the planning and execution of cross-team maintenance activities, upgrades, and deployments at scale.
• Provide off-hours emergency support in a shared, on-call rotation for a subset of NERSC storage systems.

Additional Responsibilities as needed:
• Contribute to evaluation and benchmarking of existing and emerging storage systems.
• Measure and analyze the performance of NERSC's evolving workloads on current and future storage systems.
• Propose remedies to identified bottlenecks via tuning and/or architectural improvement with comprehensive understanding of any trade-offs in design, cost, and operational effects.
• Prepare timely reports, papers, and lectures describing significant results for dissemination within NERSC and throughout the broader HPC research community.
• Provide technical conceptual guidance to other group members and management and suggest directions for investigation.
• Participate in the NERSC decision-making process for acquisition of new HPC storage systems.
• Participate in NERSC’s outreach activities through written documents, presentations and developing peer to peer contacts with other professionals in the HPC field.
• Conduct research related to NERSC’s interests and ensure that any relevant research outcomes are deployed in production at NERSC.
• Proactively seek opportunities to collaborate with researchers, operators, and vendors across the global HPC community to apply the best ideas and solutions to solving NERSC's technical challenges.

Want to learn more about Berkeley Lab's Culture, Benefits and answers to FAQs?
Please visit: https://recruiting.lbl.gov/

• This is a full-time, career appointment, exempt (monthly paid) from overtime pay.
• This position may be subject to a background check. Any convictions will be evaluated to determine if they directly relate to the responsibilities and requirements of the position. Having a conviction history will not automatically disqualify an applicant from being considered for employment.
• Work will be primarily performed at Lawrence Berkeley National Lab, 1 Cyclotron Road, Berkeley, CA.

How To Apply
Apply directly online at and follow the on-line instructions to complete the application process.

Based on University of California Policy - SARS-CoV-2 (COVID-19) Vaccination Program and U.S Federal Government requirements, Berkeley Lab requires that all members of our community obtain the COVID-19 vaccine as soon as they are eligible. As a condition of employment at Berkeley Lab, all Covered Individuals must Participate in the COVID-19 Vaccination Program by providing proof of Full Vaccination or submitting a request for Exception or Deferral. Visit covid.lbl.gov (https://covid.lbl.gov/) for more information.

Berkeley Lab is committed to Inclusion, Diversity, Equity and Accountability (IDEA, https://diversity.lbl.gov/ideaberkeleylab/) and strives to continue building community with these shared values and commitments. Berkeley Lab is an Equal Opportunity and Affirmative Action Employer. We heartily welcome applications from women, minorities, veterans, and all who would contribute to the Lab's mission of leading scientific discovery, inclusion, and professionalism. In support of our diverse global community, all qualified applicants will be considered for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, age, or protected veteran status.

Equal Opportunity and IDEA Information Links:
Know your rights, click here (https://www.dol.gov/agencies/ofccp/posters) for the supplement: Equal Employment Opportunity is the Law and the Pay Transparency Nondiscrimination Provision (https://www.dol.gov/sites/dolgov/files/ofccp/pdf/pay-transp_%20English_formattedESQA508c.pdf) under 41 CFR 60-1.4.
RequirementsWhat is Required: • Bachelor’s degree and a minimum of eight years of computing or storage experience; or six years and a Master’s degree; or equivalent experience. • Demonstrated expertise of programs used for performance evaluation (e.g. IOR, fio, SPECstorage). • Experience with the use of script languages and system utilities such as configure, Perl, Python, UNIX shell scripts, and “make.” • Experience leading technical projects in a highly collaborative team environment. • Strong understanding of Linux fundamentals including file systems, networking, and virtual memory management. • Understanding of file system internals, prior work developing storage systems, or experience troubleshooting and optimizing parallel I/O. • Knowledge of storage system and computer architecture used in HPC. • Working knowledge of parallel storage technologies such as distributed storage systems, parallel file systems, object stores, hierarchical storage management, storage networking, and/or relevant hardware technologies. • Proven record of working effectively in a team, seeing projects through to completion, meeting deadlines, interacting with users, and thorough documentation of contributions. • Familiarity with industry-standard benchmark programs and methods. • A demonstrated ability to lead technical efforts with teams of people. • Ability to write and present technical talks at conferences and other venues. • Excellent written and oral communication skills. • Strong organizational skills and ability to effectively manage priorities across many projects ranging from immediate problem resolution to long-term strategic planning. • Ability to work effectively and collaboratively on a team, as well as give and receive constructive feedback to foster communication and trust.
Event Type
Job Posting
TimeWednesday, 16 November 202210am - 3pm CST
Back To Top Button