IBM ESS Storage Support Engineer

We are seeking a contractor to provide day-to-day remote support for an ageing 4.5 PB IBM ESS storage environment supporting HPC clusters within a data center setting. The engagement is focused on operational support, incident resolution, system health monitoring, filesystem support, and critical escalation coverage outside standard business hours. The IBM ESS Storage Support Engineer will be responsible for the ongoing support and maintenance of the storage environment. This includes monitoring, troubleshooting, break-fix support, routine health checks, and performance tuning of IBM Spectrum Scale / GPFS in an HPC environment. The role also requires participation in weekend on-call support for critical Severity 1 issues.

Location:

Brazil, Mexico

Responsibilities:

  • Provide day-to-day remote operational support for a 4.5 PB IBM ESS storage environment 
  • Monitor system health, alerts, capacity, and overall storage performance 
  • Investigate, troubleshoot, and resolve storage-related incidents and service issues 
  • Provide break-fix support and coordinate escalations where required 
  • Support IBM Spectrum Scale / GPFS in an HPC environment 
  • Assist with tuning and troubleshooting of parallel filesystem workloads 
  • Carry out routine health checks and preventative maintenance activities 
  • Support SAN-related troubleshooting, including fabric connectivity and multipathing 
  • Participate in weekend on-call support for Severity 1 incidents only 
  • Maintain clear documentation of incidents, resolutions, and operational recommendations 

What we expect:

  • Proven hands-on experience with IBM ESS storage systems 
  • Experience supporting IBM Spectrum Scale / GPFS in HPC environments 
  • Good understanding of parallel filesystem workloads and HPC job patterns 
  • Experience with SAN environments, including Brocade or Cisco fabrics, zoning, LUN masking, and multipathing 
  • Strong Linux administration skills, particularly RHEL and SUSE 
  • Working knowledge of the AIX command line 
  • Fluent English, both written and spoken 
  • Availability to work during EST business hours 
  • Ability to provide weekend on-call support for Severity 1 incidents 

Nice to have:

  • Experience supporting petabyte-scale storage environments 
  • Background in data centre or scientific computing environments 
  • Experience with performance tuning of GPFS / Spectrum Scale 
  • Familiarity with supporting ageing infrastructure and maintaining operational stability 
  • Experience in incident management and root cause analysis 

We offer:

  • Highly competitive compensation 
  • Flexible working hours
  • Continuous education, mentoring, and professional development programs
  • Strong management and tech expertise 
  • 12-month contract with possible extension

This field is required
This field is required
The maximum number of characters is 500

PDF / DOC / DOCX / RTF / Max 10 Mb

If you don't see an open position that suits your skills stack and/or professional background but you are interested in working with us — please send your CV to career@quantori.com. We will try to find something special and interesting for you!