Key Responsibilities:
- Build, provision, and maintain Linux (RHEL/CentOS/Ubuntu) and AIX servers, including domain/LDAP integration, security baseline enforcement, and CIS hardening.
- Execute patching cycles using yum/dnf (Linux) and SUMA/NIM (AIX); validate application services post-patch and record evidence for compliance.
- Monitor OS and VM health (syslog, journal, AIX errpt, performance counters) and resolve faults; implement and tune monitoring thresholds and dashboards to reduce alert noise.
- Troubleshoot identity integration (SSSD, PAM, Kerberos), SSH key issues, and name resolution; investigate CPU, memory, I/O, and network performance bottlenecks using sar, iostat, vmstat, and nmon.
- Manage TLS/certificate bindings for Unix-hosted services; enforce least-privilege access, SSH key lifecycle controls, and secure service account usage.
- Manage and administer (the primary backup solution) across the Unix estate: maintain backup agents, validate policy coverage, and run periodic restore tests; capture and document DR evidence; participate in DR tests (site failover, recovery) and document outcomes.
- Track EOL OS and hardware versions; plan and coordinate in-place upgrades or platform migrations, managing risk with application and business stakeholders.
- Lead incident response and root-cause analysis for Unix server-related issues; drive lasting remediation and produce post-incident reports.
- Oversee, maintain, and support the Hypervisor/Virtualisation layer for Unix workloads (VMware vSphere/ESXi and IBM PowerVM/LPAR) across on-premises data centres and VMC on AWS.
- Maintain and support Unix HA Clusters across on-premises data centres (HACMP/PowerHA on AIX; Pacemaker/Corosync on Linux), both on-premises and in VMC on AWS.
- Support application teams with prerequisites, port configurations, NFS/storage dependencies, and service integrations; collaborate with network, storage, and cloud teams.
- Keep server runbooks, diagrams, and CMDB attributes current and accurate; maintain comprehensive documentation of architectures, configurations, and identity integrations.
- Provide weekly operational KPIs (availability, incidents, patch compliance, backup success) and present findings in governance, reporting, and service review meetings.
- Ensure alignment with SLA requirements; lead the response to audit findings, ensuring timely remediation and evidenced closure.
- Manage vendor relationships and coordinate with third-party Unix support providers (IBM, Red Hat, hardware OEMs); oversee secure decommissioning of EOL hardware.
- Mentor and develop engineers, fostering skill development and knowledge sharing; manage team workload and capacity across concurrent workstreams, maintain and prioritise the team's Jira ticket queue, and organise the On-Call rota for the Unix area.
- Drive continuous improvement in Unix server management processes and stay current with Linux and AIX technology roadmaps and emerging best practices.
- Support business continuity and disaster recovery planning for Unix server environments.
Experience & Qualifications:
- 10+ years of enterprise Linux and/or AIX administration and engineering, with at least 2 years in a team lead or managerial role: including team workload management, ticket queue prioritisation (Jira), and hands-on development of engineers.
- Demonstrated experience managing large-scale Unix estates across hybrid on-premises and cloud environments, covering RHEL/CentOS/Ubuntu and IBM AIX.
- Hands-on experience with VMware vSphere/ESXi (for Linux workloads) and IBM PowerVM/LPAR (for AIX), including HA cluster management (HACMP/PowerHA, Pacemaker).
- Proven track record delivering complex Unix OS migrations, EOL remediations, and platform consolidation projects on time and within risk tolerance.
- Deep experience with Linux/AIX identity integration: SSSD, PAM module configuration, Kerberos realm binding, LDAP/AD integration, and SSH key lifecycle management.
- Hands-on experience with patch orchestration on both Linux (yum/dnf, Red Hat Satellite) and AIX (SUMA/NIM), including troubleshooting failed updates at scale.
- Familiarity with TLS/PKI infrastructure for Unix-hosted services and certificate lifecycle management.
- Background supporting or leading audit and compliance activities (ISO 27001, SOC 2, PCI-DSS, CIS or equivalent) with evidenced remediation.
- Proficiency with ITSM and project tracking platforms — Jira (primary) and ServiceNow or equivalent — for incident management, ticket queue management, CMDB, and change control.
What We Offer at Arab Bank
At Arab Bank, we offer a purpose-driven and inclusive environment where innovation, continuous learning, and employee wellbeing are at the core. We are proud to welcome individuals of all generations, genders, and backgrounds, valuing the diverse perspectives that strengthen our culture and contribute to our success.