Site Reliability Engineer
Fort Meade, Maryland
Onsite
Direct Hire
$190k - $210k
We're seeking a Site Reliability Engineer to support secure, mission-critical systems at Fort Meade. You'll ensure uptime and performance of AI-powered cyber applications in closed AWS environments, working closely with remote engineering teams and on-site stakeholders.
You’ll be responsible for ensuring the performance, reliability, and operational readiness of advanced AI-powered applications running in secure, closed-cloud AWS environments. You’ll support a powerful microservices-based platform used for cyber operations, working closely with engineering teams and government stakeholders to keep systems running smoothly and securely, even in the high-pressure context of real-time cyber warfare.
Key Responsibilities
Requirements
Nice to Have
You’ll be responsible for ensuring the performance, reliability, and operational readiness of advanced AI-powered applications running in secure, closed-cloud AWS environments. You’ll support a powerful microservices-based platform used for cyber operations, working closely with engineering teams and government stakeholders to keep systems running smoothly and securely, even in the high-pressure context of real-time cyber warfare.
Key Responsibilities
Requirements
Nice to Have
You’ll be responsible for ensuring the performance, reliability, and operational readiness of advanced AI-powered applications running in secure, closed-cloud AWS environments. You’ll support a powerful microservices-based platform used for cyber operations, working closely with engineering teams and government stakeholders to keep systems running smoothly and securely, even in the high-pressure context of real-time cyber warfare.
Key Responsibilities
- Maintain and troubleshoot Docker-based microservices in secure AWS enclaves
- Build monitoring, logging, and alerting systems
- Automate deployments using IaC tools (Terraform, CloudFormation)
- Lead incident response and root cause analysis
- Support real-time cyber ops systems in high-security environments
Requirements
- 5+ years in SRE, DevOps, or cloud operations
- Strong AWS (EC2, ECS, VPC), Docker, and Linux experience
- Proficient in scripting (Python, Bash, or Go)
- Familiar with secure/air-gapped environments and compliance
- TS/SCI clearance and ability to work on-site full-time
Nice to Have
- Experience with Neo4j, GraphQL, NATS, or AI/ML systems
- AWS/Kubernetes certifications
- Background in defense or cyber operations
You’ll be responsible for ensuring the performance, reliability, and operational readiness of advanced AI-powered applications running in secure, closed-cloud AWS environments. You’ll support a powerful microservices-based platform used for cyber operations, working closely with engineering teams and government stakeholders to keep systems running smoothly and securely, even in the high-pressure context of real-time cyber warfare.
Key Responsibilities
- Maintain and troubleshoot Docker-based microservices in secure AWS enclaves
- Build monitoring, logging, and alerting systems
- Automate deployments using IaC tools (Terraform, CloudFormation)
- Lead incident response and root cause analysis
- Support real-time cyber ops systems in high-security environments
Requirements
- 5+ years in SRE, DevOps, or cloud operations
- Strong AWS (EC2, ECS, VPC), Docker, and Linux experience
- Proficient in scripting (Python, Bash, or Go)
- Familiar with secure/air-gapped environments and compliance
- TS/SCI clearance and ability to work on-site full-time
Nice to Have
- Experience with Neo4j, GraphQL, NATS, or AI/ML systems
- AWS/Kubernetes certifications
- Background in defense or cyber operations