Site Reliability Engineer

Hybrid
Mid-level
🇺🇸 United States
Technology

The Red Gate Group is seeking a Site Reliability Engineer to support DTRA. In this hybrid role, which combines on-site and remote work, you’ll improve system resilience and efficiency for the DoD by building robust infrastructure. Drawing on your expertise in Kubernetes, Ansible, AWS cloud migration, and Cloudera, you’ll build in redundancy, implement monitoring tools, and automate processes to reduce toil. The position also offers the opportunity to guide junior engineers and expand your own knowledge while contributing to innovative cloud migration solutions.

Responsibilities:

  • Develop resilient infrastructure for the DoD.
  • Implement monitoring tools and automate routine tasks.
  • Build or modify Ansible playbooks with Bash scripts.
  • Troubleshoot and resolve issues related to CI/CD pipeline failures.
  • Collaborate with application development teams across the software development life cycle.

Requirements

Required Skills & Qualifications

  • Active TS/SCI
  • 5+ years of experience working in Linux environments
  • 5+ years of experience troubleshooting, triaging, and resolving CI/CD pipeline failures or slowness in production enterprise environments
  • Experience developing enterprise cloud-native solutions involving Kubernetes, Docker, Cloudera, AWS, Jenkins, or RHEL systems
  • Experience working with application development teams across the software development life cycle and solving complex problems in a collaborative team environment
  • Ability to build or modify Ansible playbooks with Bash scripts
  • Active DoD 8570 Level II Security Certification, including Security+

Desired Skills & Qualifications

  • Experience with Python and Go, Microservices, Serverless, MLOps, AIOps, Cloudera, and Kubernetes
  • Experience with Big Data stack using Hadoop, Spark, Accumulo or MongoDB, and Solr or Elasticsearch
  • Experience with software development processes and code management tools
  • Experience with declarative Infrastructure as Code tools, including Puppet, Terraform, and Ansible
  • Experience with GitOps and CI/CD tools, including ArgoCD, GitLab CI, or Jenkins
  • Excellent verbal and written communication skills

Red Gate Group

At RED GATE, we do everything we can to serve our clients using the right technical skills, unique methodologies, best practices, and integrated technology to help clients implement bold solutions.

Consulting
Technology
Defense
Defense Contracting
