We are on the lookout for a skilled Site Reliability Engineer to join our dynamic team in Durban. You will play a crucial role in ensuring the reliability and performance of our systems and services.
Responsibilities
Evaluate and improve system architecture for reliability.
Facilitate training and knowledge sharing within the engineering team.
Maintain accurate documentation for systems and procedures.
Lead efforts in disaster recovery testing and planning.
Identify opportunities for system automation and recommend optimization strategies.
Requirements
Education
Bachelor's degree in Computer Science or related field
Experience
5+ years of experience in IT operations or software engineering
Technical Skills
Linux
AWS
Kubernetes
Python
CI/CD
Soft Skills
Problem-solving
Collaboration
Adaptability
Certifications
AWS Certified Solutions Architect
Certified Kubernetes Administrator
Languages
English: Fluent
Advantageous
Experience with Terraform: Familiarity with infrastructure as code using Terraform.
Experience with monitoring tools: Usage of tools such as Prometheus and Grafana to monitor system performance.
Benefits
Competitive salary based on experience
Health insurance and wellness programs
Flexible working hours
Opportunity for remote work during certain projects
Company Culture
Innovation: We encourage an innovative spirit within our teams to drive impactful solutions.
Collaboration: Teamwork and open communication are the cornerstones of our work culture.
Emphasis on Growth: We support our employees' growth through ongoing training and development opportunities.
Status: Open
Other Jobs in Information Technology (IT) and Software Development