UNIX/Linux Support Engineer (On-site)
We are looking for an on-site L2 UNIX/Linux Support Engineer to provide both remote and on-site operational support for the enterprise’s UNIX/Linux infrastructure. The environment includes approximately 1,500 servers (600 physical and 900+ virtual) running RHEL 6–9, Ubuntu 20/22/24, AIX, HP-UX, and Solaris on Dell, IBM, and Cisco UCS hardware. This position covers incident management, patching, monitoring, automation, and cluster administration, as well as occasional onsite tasks such as hardware inspection, vendor coordination, and recovery operations.
Location:
Lindhurst or Middletown
Responsibilities:
- Monitor infrastructure alerts and tickets through ServiceNow and TrueSight.
- Diagnose and resolve filesystem, CPU, memory, and agent issues across Linux and Unix platforms.
- Restart services and daemons, clean up logs or disk space, manage processes, and apply configuration corrections.
- Perform standard operational changes, including agent reinstalls, disk extensions, and configuration updates.
- Support VMware ESX hosts (Dell/Cisco UCS), perform VM health checks, validate resource and datastore usage.
- Monitor and troubleshoot GPFS (IBM Spectrum Scale) and LSF workload schedulers.
- Provide basic OpenShift and RStudio support for system availability.
- Execute and enhance Ansible Tower and Red Hat Satellite workflows for patching and configuration management.
- Maintain and improve Puppet modules and shell scripts for operational efficiency.
- Ensure alerts are validated; tickets are updated and resolved in accordance with SLAs.
- Perform hardware interventions, including disk replacements, NIC reseating, and console access.
- Coordinate hardware replacements with Dell, IBM, Cisco, and HP vendors.
- Validate data center connectivity, participate in DR testing, and ensure accurate asset documentation.
- Participate in the on-call rotation approximately once per month, providing after-hours support for critical production incidents.
- Serve as an escalation point for critical system alerts during off-hours to ensure service continuity.
- Coordinate with data center or vendor teams for emergency actions when needed.
What we expect:
- 3–6 years of experience in enterprise Linux/Unix administration and support.
- Strong experience in administering RHEL 6–9 and Ubuntu 20/22/24, with working knowledge of legacy AIX, HP-UX, and Solaris systems.
- Confident in managing Dell PowerEdge, IBM Frame, and Cisco UCS hardware, including iDRAC/iLO operations and rack-level maintenance.
- Hands-on experience with VMware ESX (console operations, vMotion, datastore validation).
- Familiarity with clustered storage environments such as GPFS (Spectrum Scale) and workload scheduling with IBM LSF.
- Automation experience with Ansible Tower, Red Hat Satellite, Puppet, and shell scripting
- Practical experience with ServiceNow for incident/change management and TrueSight/PATROL for monitoring.
- A good understanding of networking fundamentals,such as VLANs, bonding, and interface validation.
- Red Hat Certified System Administrator (RHCSA) or equivalent certification
- ITIL v4 Foundation certification.
- Strong analytical, documentation, and communication skills.
- Eligibility for compliance-based background checks (PHI exposure).
- Availability for working from 9 AM – 5 PM EST (On-site).
We offer:
- Competitive compensation
- Remote or office work
- Flexible working hours
- Healthcare benefits: medical insurance and paid sick leave
- Continuous education, mentoring, and professional development programs
- A team with an excellent tech expertise
- Certifications paid by the company
If you don't see an open position that suits your skills stack and/or professional background but you are interested in working with us — please send your CV to career@quantori.com. We will try to find something special and interesting for you!