A recognized services company is seeking an experienced AWS Cloud Site Reliability Engineer (SRE) to support a federal program focused on maintaining the reliability, scalability, and performance of mission-critical systems hosted in Amazon Web Services (AWS).
About the Opportunity:
- Telework Eligible (Remote)
- U.S. Citizenship Required | Ability to Obtain High Risk Public Trust (6C)
- Active High Risk Public Trust or Secret Clearance Preferred
Responsibilities:
- Design, implement, and manage Infrastructure as Code (IaC) using AWS CloudFormation, Terraform, or Helm to automate provisioning, deployment, and scaling
- Develop and maintain proactive monitoring and alerting solutions (CloudWatch, Prometheus) to ensure system health and SLA compliance
- Analyze AWS environments to enhance performance, reduce latency, and improve cost efficiency
- Implement and maintain AWS security best practices aligned with NIST, FedRAMP, and agency standards
- Participate in on-call rotations; conduct root-cause analysis and corrective actions for production issues
- Automate and streamline release processes with AWS CI/CD, GitLab CI/CD, or Jenkins, ensuring audit and change-control compliance
- Collaborate with Development, QA, Security, and Operations teams to deliver reliable, compliant, and secure systems for federal stakeholders
Qualifications:
- 5+ years of experience as an SRE, DevOps Engineer, or Cloud Engineer supporting AWS environments
- Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience)
- Proficiency in scripting languages such as Python and Bash
- Experience with IaC and CI/CD tools (CloudFormation, Terraform, Helm, AWS CodePipeline, Jenkins, GitLab CI/CD)
- Hands-on experience with containerization and orchestration (ECS, EKS, Docker, Kubernetes)
- Strong understanding of Agile and DevSecOps methodologies
Desired Skills:
- Relevant AWS or DevOps certifications



