Data Center Operations Engineer

The Data Center Operations Engineer provides support in data center equipment installation, logging data regarding installed corporate server base, developing procedures for server installation, racking, un-racking, de-commissioning hardware and cable patching from server through to server farm switches. He/ She manages the data center performance and operations. He monitors data volume and performs troubleshooting of non routine or novel issues with little precedence whenever required. He is required to be on standby with on-call availability with varied shifts including nights, weekends and holidays to resolve data center related incidents. He works in a team setting and is proficient in database administration, infrastructure concepts and database management related tools and techniques required by the organization. He is also familiar with the relevant software platforms on which the database is deployed. The Data Center Operations Engineer is able to quickly and effectively solve issues as they arise. He is able to methodically identify the cause of the issue, evaluate it and develop a solution in collaboration with the team. He is able to communicate effectively and displays high service level standards.

Skills and Competencies

Technical Skills & Competencies

Business Needs Analysis
Proficiency Level
"Elicit and analyze business requirements from key stakeholders and assess relevant solutions and their potential impact"
3
Business Continuity
Proficiency Level
"Implement business continuity and contingency procedures and exercises"
4
Cyber and Data Breach Incident Management
Proficiency Level
"Troubleshoot incidents, escalate alerts to relevant stakeholder, and analyze root causes and implications of incidents "
3
"Develop incident management procedures and synthesize incident-related analyses to distill key insights, resolve incidents and establish mitigating and preventive solutions "
4
Data Center Facilities Management
Proficiency Level
Identify ideal environmental conditions for operations and restore data center performance against security and service level requirements
3
Disaster Recovery Management
Proficiency Level
Identify and implement recovery solutions to support disaster recovery strategies
4

Generic Skills & Competencies

Communication
Proficiency Level
"Articulate and discuss ideas and persuade others to achieve common outcomes "
Intermediate
Interpersonal Skills
Proficiency Level
Detect and decipher emotions of others to manage interpersonal relationships in social situations.
Intermediate
Problem Solving
Proficiency Level
Identify easily perceivable problems and follow given guidelines and procedures to solve the problems.
Basic
Service Orientation
Proficiency Level
Exceed customer needs and expectations and handle service challenges with a positive mindset. Demonstrate an understanding of the organisation’s service vision, mission and values.
Basic
Teamwork
Proficiency Level
Facilitate work team activities, provide assistance and support needed by team members and promote ownership and commitment among team members to work goals to improve team performance.
Intermediate

Critical Work Functions and Key Tasks

Manage the set-up of the data center

• Conduct technical feasibility studies to determine viability, cost, time required and 
compatibility with organisational needs and requirements 
• Explore new concepts and ideas in data centre facilities and equipment 
• Review and communicate requirements to senior stakeholders 
• Analyse designs to ensure compliance with business requirements, predicted cooling, structural and operational concerns 
• Conduct short- and long-term planning to meet organisation’s requirements and business needs

Manage data centre performance and operations

• Oversee compliance with security policies, procedures and protocols 
• Develop documentation, training and guidance procedures for the management of data center operations 
• Identifies best practices in data center operations and management for adoption 
• Ensure compliance with security policies, procedures and protocols 
• Evaluate services provided by vendors and recommend changes 
• Recommend enhancements to improve availability and performance 
• Analyse data center facilities’ bandwidth, capacity requirements and system inter-dependencies 
• Optimize the interfaces between the IT equipment and data center

Manage data center-related incidents and business continuity

• Develop a disaster recovery plan for data centre operations 
• Oversee the execution of disaster recovery drills and exercises 
• Analyse incidents to determine patterns and propose recommendations to prevent future occurrences 
• Simulate incidents to diagnose and resolve escalated data centre-related incidents
• Oversee resolution of data centre-related incidents involving vendors

Oversee service level agreements and service improvements

• Manage the development of service-level objectives and targets 
• Monitor service-level objectives to ensure that requirements are met or exceeded 
• Develop client satisfaction metrics and service procedures 
• Propose recommendations to improve performance and client satisfaction

More Information

Related Occupations

Get yourself a new skill

In this Path

Coming soon...