SysOps Engineer

The SysOps Engineer is responsible for the configuration, reliability and efficiency of systems. He/She optimizes the capacity and performance of infrastructure, using knowledge of coding and scripting to automate the resolution of recurring issues and elimination of tasks, as well as enabling scalable and distributed systems. He also supports system installation and upgrades, performs continuous monitoring of infrastructure and ensures security and compliance in leveraging cloud platforms. He possesses a high level of proficiency in scripting and programming languages. He is familiar with cloud platforms, scaling and management of infrastructure. He works well with a variety of internal and external stakeholders. He is able to work on an on-call and shift basis, with the ability to prioritize effectively and operate under pressure. The SysOps Engineer enjoys hands-on problem-solving and is driven by investigating challenging, complex problems. He is a resourceful and self-directed individual who performs independently with minimal guidance. He is also an analytical thinker who demonstrates strong interpersonal skills in cross-team collaboration.

Skills and Competencies

Technical Skills & Competencies

Agile Coaching
Proficiency Level
"Coach teams in the conduct of Agile practices and the implementation of Agile methodologies and practices in the organization"
4
Applications Development
Proficiency Level
"Plan the application development process, program applications and secure features, applying suitable debugging techniques to resolve complex errors "
4
Applications Integration
Proficiency Level
"Oversee end-to-end process of application integration, determining suitable middleware and testing procedures and resolving issues that arise"
4
Budgeting
Proficiency Level
"Prepare business unit’s operational budgets "
3
Business Agility
Proficiency Level
"Lead the implementation of operational initiatives to enhance business agility "
4

Generic Skills & Competencies

Problem Solving
Proficiency Level
Anticipate potential problems beyond the current scope and apply higher order problem solving tools and techniques to turn problems into opportunities.
Advanced
Service Orientation
Proficiency Level
Anticipate customers needs and expectations, and elicit feedback from customers to improve service. Build relationships with customers to create and sustain customer loyalty.
Intermediate
Resource Management
Proficiency Level
Deepen insights into the planning, allocation and deployment of resources to anticipate needs. Plan the allocation and deployment of resources efficiently and effectively.
Intermediate
Teamwork
Proficiency Level
Contribute to a positive and cooperative working environment by fulfilling own responsibilities and providing support to co-workers to achieve team goals.
Basic
Sense Making
Proficiency Level
Interpret data to uncover patterns and trends between various sources of data.
Intermediate

Critical Work Functions and Key Tasks

Develop infrastructure architecture and standards

• Develop processes and standards for system or application reliability in areas of availability, performance, latency, capacity, 
emergency response, capacity planning, change management, security and monitoring 
• Translate business needs into cloud architectural requirements 
• Design scalable, robust systems using cloud architecture 
• Create procedures and documentation for site reliability and incident management 

Configure and deploy infrastructure

• Build and run large-scale, massively distributed and fault-tolerant systems 
• Perform provisioning of cloud resources 
• Configure infrastructure environment for software development and prototyping 
• Conduct pre-deployment testing of systems to ensure reliability 
• Implement operational cost control mechanisms for cloud infrastructure 
• Identify and resolve deployment issues

Monitor infrastructure and resolve issues

• Oversee configuration of operational systems to ensure alignment with technical and security requirements 
• Conduct measurement and monitoring of overall performance, system health, system availability, and latency 
• Provide proactive updates or alerts on infrastructure availability to relevant stakeholders 
• Address gaps in performance or availability based on identified metrics 
• Carry out testing and release procedures to ensure rigour of infrastructure and services 
• Resolve service operation issues and prevent recurrence using automation 
• Perform regular tuning of infrastructure and services 

Automate infrastructure operations and optimise performance

• Conduct capacity planning for cloud infrastructure and systems performance analysis 
• Identify opportunities to enhance operational workflows, systems and processes through automated deployment 
• Develop tools and scripts to automate deployments and optimise performance 
• Create an operating environment for monitoring, alerting, self-healing and automated recovery

Embed scalability into infrastructure

• Devise strategies and roadmap for scaling of infrastructure operations 
• Design and write code for scalable systems 
• Scale systems through automation to manage recurring tasks 
• Propose suggestions to enhance infrastructure architecture

Get yourself a new skill

In this Path

Coming soon...