- Contract
- Anywhere
Site Reliability Engineer (SRE) – Automation & Cutover
Â
We are seeking a Site Reliability Engineer with a strong automation background who understands production cutovers and service onboarding in large-scale enterprise environments.
Â
This role suits an SRE who has worked closely with release, platform, and application teams during go-lives, migrations, and cutover events, and who applies automationfirst thinking to improve reliability and operational efficiency.
Â
Key Responsibilities
• Support production readiness, deployments, and cutover activities for critical
services
• Design and build automation for infrastructure, monitoring, and operational
workflows
• Partner with application and platform teams in service design, onboarding, and
transition to production
• Monitor, troubleshoot, and stabilize production environments, driving
continuous improvement
• Contribute to reliability practices around availability, resilience, and incident
reduction
Â
Required Experience:
• Strong background as an SRE or senior production support engineer
• Proven exposure to production cutovers, go-lives, or large-scale migrations
• Hands-on automation experience using tools such as Terraform, Ansible, or
similar
• Scripting experience (Python, Bash, PowerShell, etc.)
• Experience with monitoring / observability tools (TruSight, Prometheus, Datadog, or equivalent)
Nice to Have (Not Mandatory)
• Exposure to TSO / Application Owner / SDDA-style environments
• Familiarity with BMC tooling (TruSight, Server Automation)
• Financial services or other regulated enterprise experience
