HPC SW Cloud QA Engineer - Specialist
Zerto
This role has been designed as ‘Onsite’ with an expectation that you will primarily work from an HPE office.
Who We Are:
Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today’s complex world. Our culture thrives on finding new and better ways to accelerate what’s next. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career, our culture will embrace you. Open up opportunities with HPE.
Job Description:
High Performance Computing, AI and Labs is a critical element of HPE. We are focused on delivering innovative solutions that accelerate our customers’ digital transformation, enabling them to tackle their complex, data-intensive workloads. By combining deep expertise with the development of the world’s most cutting-edge, high-performance supercomputers, we are defining the next era of computing and delivering valuable insight and innovation.
What you’ll do:
Responsibilities:
We are looking for an experienced cloud development engineer to work on our HPC CSM Manageability solution.
The role involves designing, implementing, and maintaining our HPC CSM manageability platform hosted on Kubernetes infrastructure.
The position requires in-depth expertise in cloud-native technologies, particularly Kubernetes, along with a strong background in automation and DevOps practices.
A good understanding of security for cloud-native applications is expected.
- Test Planning & Execution
  - Design, implement, and execute comprehensive test plans for the CSM platform, including functional, regression, integration, and performance testing.
  - Validate HPC system management capabilities such as node provisioning, monitoring, workload orchestration, and system upgrades.
- Automation Development
  - Develop automated test suites using Python, Bash, and CI/CD frameworks to ensure rapid and repeatable test execution.
  - Integrate automated testing into the development pipeline to support continuous delivery.
- Defect Tracking & Reporting
  - Identify, document, and track defects; work with engineering teams to resolve issues.
  - Provide clear, reproducible test cases and logs to aid in troubleshooting.
- Performance & Scalability Validation
  - Perform stress testing and scale testing on large HPC clusters.
  - Monitor and analyze system metrics to assess stability under load.
What you need to bring:
Education and Experience Required:
- Bachelor's degree preferred, or an Associate degree in a technical field, with 8-12 years of working experience in related fields desired.
- Technical Skills
  - Strong understanding of Linux (RHEL, SLES, Ubuntu) system administration.
  - Experience with Kubernetes, containers (Docker/Podman), and networking fundamentals.
  - Proficiency in scripting languages (Python, Bash) for automation.
  - Familiarity with HPC architectures, job schedulers (Slurm, PBS Pro), and workload management concepts.
- Testing Expertise
  - Experience with test automation frameworks (e.g., pytest, Robot Framework, Jenkins CI/CD).
  - Hands-on experience in system-level testing, API testing, and performance validation.
- Tools & Platforms
  - Familiarity with Git, Jira, Confluence, and defect tracking workflows.
  - Experience with monitoring and log analysis tools (Grafana, Prometheus, ELK stack) is a plus.
What We Can Offer You:
Health & Wellbeing
We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing.
Personal & Professional Development
We also invest in your career because the better you are, the better we all are. We have specific programs catered to helping you reach any career goals you have — whether you want to become a knowledge expert in your field or apply your skills to another division.
Unconditional Inclusion
We are unconditionally inclusive in the way we work and celebrate individual uniqueness.
Let's Stay Connected:
Follow @HPECareers on Instagram to see the latest on people, culture and tech at HPE.
Job: Engineering
Job Level: TCP_03
HPE is an Equal Employment Opportunity/Veterans/Disabled/LGBT employer. We do not discriminate on the basis of race, gender, or any other protected category, and all decisions we make are made on the basis of qualifications, merit, and business need. Our goal is to be one global team that is representative of our customers, in an inclusive environment where we can continue to innovate and grow together.
Hewlett Packard Enterprise is EEO Protected Veteran/Individual with Disabilities.
HPE will comply with all applicable laws related to employer use of arrest and conviction records, including laws requiring employers to consider for employment qualified applicants with criminal histories.