Contract
Posted on 25 April 25 by Chelsea Kelly
We are seeking an experienced Senior Site Reliability Engineer (SRE) to lead the design and implementation of observability and automation solutions within a complex cloud environment. This hands-on role will focus on enhancing system reliability, performance, and scalability by leveraging modern monitoring, scripting, and infrastructure automation tools.
Roles and Responsibilities
Design and deploy robust observability solutions using industry-standard monitoring tools (e.g., Dynatrace).
Automate monitoring and infrastructure tasks using Terraform, PowerShell, and Ansible.
Support end-to-end monitoring and alerting across cloud-based services, with a focus on Microsoft Azure.
Develop Infrastructure as Code (IaC) solutions to improve efficiency and consistency in deployments.
Collaborate with cross-functional teams to identify system inefficiencies and implement scalable solutions.
Provide hands-on technical leadership in operational excellence and automation best practices.
Troubleshoot complex infrastructure and application issues in production environments.
Qualifications and Skills
5+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Automation roles.
Proven expertise in scripting and automation using PowerShell, Terraform, and Ansible.
Strong hands-on experience with Azure or other major cloud platforms (AWS, GCP).
Solid understanding of observability and monitoring tools, preferably Dynatrace.
Experience building and maintaining Infrastructure as Code (IaC) in production environments.
Strong problem-solving skills and the ability to work in fast-paced, collaborative teams.
Excellent communication and documentation skills.