US
0 suggestions are available, use up and down arrow to navigate them
PROCESSING APPLICATION
Hold tight! We’re comparing your resume to the job requirements…

ARE YOU SURE YOU WANT TO APPLY TO THIS JOB?
Based on your Resume, it doesn't look like you meet the requirements from the employer. You can still apply if you think you’re a fit.
Job Requirements of Sr. Site Reliability Engineer:
-
Employment Type:
Contractor
-
Location:
Celebration, FL (Onsite)
Do you meet the requirements for this job?
Sr. Site Reliability Engineer
Careers Integrated Resources Inc
Celebration, FL (Onsite)
Contractor
Job Title: Sr. Site Reliability Engineer
Location: Celebration, FL 34747 (Onsite at least 1x per week, typically Wednesdays; subject to change)
Duration: 12 Months
Schedule: Potential on-call rotation
Position Overview
- The Sr. Site Reliability Engineer (SRE) will be responsible for ensuring the stability, reliability, and performance of enterprise systems.
- This role involves leading incident retrospectives, troubleshooting critical issues, driving long-term improvements, and supporting the design and automation of lower environments to enable reliable release and deployment activities.
- The SRE will work closely with infrastructure, application, and security teams to promote a DevOps culture and enhance operational excellence.
Key Responsibilities
- Lead incident retrospectives, coordinate troubleshooting efforts, and identify root causes of system failures.
- Develop and implement long-term solutions and interim mitigations to improve availability and reduce recovery time.
- Design, build, and manage lower environments to support release, deployment, and testing activities.
- Automate infrastructure, operations, monitoring, and deployment processes across Windows, Linux, and Kubernetes platforms.
- Create observability solutions leveraging telemetry, monitoring, and logging tools to ensure proactive issue detection and resolution.
- Collaborate with cross-functional teams to ensure system stability, security, performance, and capacity management.
- Apply SDLC, ITIL, and industry best practices to incident and problem management for continuous improvement.
- Provide expert-level troubleshooting and support for application- and infrastructure-related incidents.
- Promote DevOps principles, foster collaboration among engineers and developers, and drive adoption of automation and reliability practices.
- Stay current with emerging technologies and recommend improvements to architecture, processes, and tools.
- Lead and participate in technical projects to ensure successful delivery of reliable, scalable systems.
- Partner with Security Operations to implement and maintain secure solutions.
- Maintain accurate system documentation and knowledge-sharing practices.
Required Qualifications
- Bachelor’s degree in Computer Science, Information Systems, Engineering, or related field, or equivalent work experience.
- Minimum 5 years of experience in Site Reliability Engineering, DevOps, Systems Administration, or related roles.
- Proficiency in systems administration across Windows, Linux, and Kubernetes environments.
- Strong troubleshooting expertise in distributed systems, networking, performance, and security.
- Experience with cloud platforms (AWS, Azure, Google Cloud).
- Hands-on experience with CI/CD tools (GitLab, Ansible, Azure DevOps).
- Proficiency in configuration management and infrastructure-as-code tools (Terraform, Ansible, Chef).
- Applied understanding of observability and monitoring tools.
- Strong scripting and programming skills in languages such as Python, Go, Java, Rust, Perl, Ruby, PowerShell, or C/C++.
- Experience working in Agile environments and applying SDLC and ITIL practices.
- Proven ability to lead technical initiatives and collaborate across teams.
Preferred Qualifications
- Experience designing and managing lower environments to support software release and deployment.
- Knowledge of AWX or similar automation tools.
- Experience working in highly regulated or large-scale enterprise environments.
- Demonstrated ability to mentor peers and drive adoption of SRE and DevOps best practices.
- Familiarity with emerging data engineering tools and methodologies.
Get job alerts by email.
Sign up now!
Join Our Talent Network!