
Role: Senior Azure Site Reliability Engineer
Locations: Warren, NJ (Fully Onsite)
Duration: 12+ month contract
Note: Candidates must be in the office 5 days every week. Local candidates
or candidates from nearby states only.
Job Description:
• Strong hands-on experience with Azure cloud-based services.
• Experience assessing complex cloud solutions, including redundancy,
load balancing, and fault tolerance.
• Experience with load and chaos testing tools and processes.
Job Duties and Responsibilities:
• Identify and eliminate single points of failure (SPOFs) to improve system reliability.
• Conduct Failure Modes and Effects Analysis (FMEA) to identify potential failure modes and their impacts.
• Develop mitigation strategies to enhance system resiliency.
• Assess and maintain fault-tolerant architectures using redundancy,
load balancing, and automated failover.
• Use load testing and chaos testing tools.
• Collaborate with development, operations, and security teams.
• Provide guidance on best practices for resiliency and reliability.
Qualifications:
• Bachelor’s degree in IT, Computer Science, or related field.
• Knowledge of SPOF and FMEA methodologies.
• 10+ years of experience in IT infrastructure management, focusing on
resiliency and chaos engineering.
• Experience in designing and maintaining fault-tolerant architectures.
• Understanding of observability, tracing, and telemetry tools.
• Proficiency in root cause analysis and incident management.
• Excellent analytical and problem-solving skills.
• Strong communication and interpersonal skills for effective
collaboration.
Must Have List:
• Azure cloud-based services
• Redundancy
• Load balancing
• Fault tolerance
• Load testing tools
• Chaos testing tools
• Single Points of Failure (SPOF)
• Failure Modes and Effects Analysis (FMEA)
• Fault-tolerant architectures
• Automated failover
• Observability tools
• Tracing tools
• Telemetry tools
• Root cause analysis
• Incident management
To apply for this job, email your details to ingit@empowerprofessionals.com