Senior Site Reliability Engineer Full-time Job
3 days ago Engineering Dubai 47 views Reference: 37461Job Details
Responsibilities Will Include
Assist in the design and continuously improve our team’s processes, tools and solutions used to build, deploy, monitor, maintain and scale production systems
Assist in the design and improve our monitoring, alerting and remediation solutions with focus on proactively identifying and addressing production issues
Collaborate with platform, support and dev teams for events such as production releases, change management and incident management
Participate in the on-call rotation for critical system alerts
Work in shifts in order to cover an extended time frame including evenings and weekends
Investigate and lead efforts to remediate critical operational production issues
Would be great if you brought this to the role
Minimum Requirements
Excellent communication skills (writing and speaking) in English
In-depth understanding of production management principles for distributed systems
3-5 years experience working with Infrastructure as Code and cloud provisioning tools
3-5 years experience working in operations teams managing production environments
3-5 years experience of utilizing and writing in languages such as Bash, Python, JavaScript and/or Go, or equivalents
3-5 years experience of general Linux experience
Hands-on experience with AWS, experience with Azure is a significant plus