Job Overview:
The Data Center Operations Engineer is responsible for the day-to-day operations and optimization of our data center facilities. This role involves ensuring the reliability, availability, and performance of all data center systems and infrastructure. The ideal candidate will have extensive experience in managing data center environments, including servers, networking equipment, power, and cooling systems. They will be proactive in identifying potential issues and implementing solutions to maintain high service levels.
Key Responsibilities:
- Data Center Monitoring: Continuously monitor the data center environment to ensure the optimal operation of servers, network devices, power systems, and cooling equipment.
- Maintenance and Upkeep: Perform routine inspections and follow up on preventive maintenance for data center equipment, including servers, networking equipment, electrical systems, UPS systems, cooling systems, and backup generators.
- Incident Management: Respond to and resolve data center incidents, including equipment failures, network outages, and power issues, in a timely manner to minimize downtime and service disruptions.
- Capacity Planning: Analyze data center capacity requirements, including power, cooling, and space, and recommend upgrades or changes to meet future demands.
- Infrastructure Management: Oversee the installation, configuration, and decommissioning of data center hardware, including servers, network equipment, and storage devices.
- Environmental Controls: Monitor and maintain environmental controls such as temperature, humidity, and airflow, keeping them within acceptable ranges to prevent equipment overheating or damage.
- Documentation and Reporting: Maintain accurate documentation of data center layouts, equipment inventories, incident logs, and maintenance activities. Prepare regular reports on data center performance and utilization.
- Compliance and Safety: Ensure compliance with industry standards, company policies, and regulatory requirements related to data center operations. Enforce safety protocols to protect personnel and equipment.
- Vendor Coordination: Coordinate with vendors and service providers for equipment maintenance, repairs, and upgrades, ensuring minimal disruption to data center operations.
Qualifications:
- Education: Bachelor’s degree in Computer Engineering, Information Technology, or a related field preferred. Relevant certifications (CDCP, CCNA) are a plus.
- Experience: 3-5 years of experience in data center operations, IT infrastructure management, or a similar role.
Technical Skills:
- Proficiency in operating and managing data center hardware, including servers, routers, switches, and storage systems.
- Familiarity with data center infrastructure management (DCIM) tools.
- Strong understanding of network protocols, firewalls, and security practices.
- Experience with environmental monitoring and control systems (HVAC, power distribution, etc.).
- Proficiency in AutoCAD.
Soft Skills:
- Strong analytical and problem-solving skills.
- Ability to work independently and manage multiple tasks simultaneously.
- Excellent communication and teamwork skills.