Senior Site Reliability Engineer (SRE)
Location – Charlotte, NC (Hybrid, 3 days a week)
Contract
• 4+ years of application production support in complex, high-availability environments, including incident response and problem management with strong root cause discipline.
• 4+ years of hands-on automation and configuration management experience (Ansible preferred, or a similar tool), plus strong scripting skills (Python, Bash, PowerShell, or similar).
• 4+ years of Linux administration (RHEL preferred) and/or Windows Server administration supporting enterprise production workloads.
• 4+ years of Git-based version control practices, including pull requests and peer review, with a focus on repeatability and code quality.
• 4+ years of experience with monitoring and observability tools such as Splunk, AppDynamics, and ThousandEyes (alerts and dashboards).
• 4+ years of experience developing and/or supporting web applications (preferably Java).
• Working experience with infrastructure-as-code concepts, including modular design and environment consistency.
• Experience implementing SRE operating practices (reliability metrics, reduction of manual toil, continuous improvement via post-incident learnings).
• Experience supporting common middleware platforms and shared services, with the ability to build automation patterns that standardize operations and reduce manual intervention.
• Familiarity with enterprise observability and operational practices (service health dashboards, alert engineering, actionable telemetry).
• Exposure to responsible AI usage in operations (security, validation, accuracy and appropriate guardrails for automation/agents).
• Strong cross-functional communication skills; experience operating in regulated environments.
Skill Matrix
| Skills | Years of Experience | Candidate Self Rating (Scale of 1 to 5) |
|---|---|---|
| SRE | | |
| Ansible | | |
| Python | | |
| Linux | | |
| Java | | |
| OpenShift | | |

Thanks & Regards
Preeti Upadhyay
Sr. Executive Recruiter
E-mail: Preeti@tekintegral.com