Site Reliability Engineer SRE Job at Expert Technology Services, Washington DC

bEN4Q01FeTI2OG9YbThYVVVKM0JacTVmR2c9PQ==
  • Expert Technology Services
  • Washington DC

Job Description

is seeking a Site Reliability Engineer for a high-impact role with a premier client based in Washington, DC . In this position, you will bridge the gap between development and operations by applying a software engineering mindset to system administration and infrastructure. You will be responsible for ensuring the scalability, performance, and high availability of cloud-based services across AWS and Azure environments. By leveraging Infrastructure-as-Code, advanced observability with Dynatrace, and SRE principles like error budgets and SLOs, you will drive operational excellence and lead incident response efforts for mission-critical applications.

Key Responsibilities
  • Deployment & Automation: Architect and manage CI/CD pipelines (GitHub Actions, AWS CodePipeline) and automate global infrastructure using Terraform, CloudFormation, or CDK.
  • Performance & Capacity: Drive cost-optimization initiatives, manage auto-scaling thresholds, and execute resiliency/performance testing to ensure system durability.
  • Incident Management: Act as a primary on-call responder using ITIL frameworks and ServiceNow; develop Root Cause Analysis (RCA) documentation and maintain knowledge bases.
  • Observability & Monitoring: Implement distributed tracing and optimize monitoring via Dynatrace and Kibana to create advanced dashboards and anomaly detection.
  • Reliability Engineering: Define and monitor SLIs and SLOs while managing error budgets to balance feature velocity with system stability.
  • Security & Compliance: Oversee service accounts, manage digital certificates, and execute rapid remediation for security incidents.
Qualifications
  • Education: Bachelor's degree in Computer Science, Engineering, or a related technical field.
  • Experience: 2 to 4 years of professional experience in SRE, DevOps, or Infrastructure roles.
  • Cloud Proficiency: Practical, hands-on experience with both AWS and Azure platforms.
  • Technical Skills: Mid-level proficiency in Python (or similar scripting languages) and configuration management tools like Ansible.
  • Containerization: Solid understanding of Docker and orchestration via Kubernetes or ECS.
  • Infrastructure Fundamentals: Strong knowledge of Linux systems, networking protocols, and both Relational/NoSQL database architectures.
  • Soft Skills: Excellent written and verbal communication skills with the ability to manage competing priorities independently.
  • Flexibility: Ability to participate in a production on-call rotation, including work outside standard business hours.

Required Skills :

Basic Qualification :

Additional Skills :

This is a high PRIORITY requisition. This is a PROACTIVE requisition

Background Check : No

Drug Screen : No

Job Tags

Similar Jobs

Stallion Infrastructure Services

Dispatcher Job at Stallion Infrastructure Services

 ...Role Summary: The Dispatcher will act as a communication point for all calls: coordinate requests, transmit messages and track vehicles. This individual would enable different parties to communicate well by ensuring the accurate and timely transmission of information... 

peeps - the social club

Co-Founder: Build a Flutter Social App to Help Teens Meet Job at peeps - the social club

 ...Flutter umsetzen und dabei kreative Freiheiten genieen. Erfahrungen in der App-Programmierung sind von Vorteil. Das Team bietet flexible Arbeitszeiten und ein tolles Netzwerk. Wenn du Lust hast, die Einsamkeit zu bekmpfen und junge Menschen zu untersttzen, dann bewirb... 

Radical Renovations LLC

Tile Installer Job at Radical Renovations LLC

 ...Looking for an experienced Tile installer with an exceptional work ethic to join our team for a full-time position: Continuous work throughout the year. Must have experience in with large format and standard tile, different types of grouts including epoxy, Schluter... 

Trimerge Construction Group LLC

Commercial Tile & Resilient Flooring Installer Job at Trimerge Construction Group LLC

 ...Commercial Tile & Resilient Flooring Installer - Traveling Position Are you a skilled craftsperson, looking to grow your skill and career? At Trimerge Construction Group, we believe in empowering our people to reach their full potential while delivering world-class... 

Medix™

Clinical Review & Appeals Nurse - 249736 Job at Medix™

 ...Clinical Review & Appeals Nurse Medix Healthcare Tempe, AZ 85288 Monday-Friday, FULL TIME $71,000 - 106,000 (Pay differs for an LPN) Qualifications/Requirements ~ Active and unrestricted Arizona Registered Nurse (RN) license ~3+ years of clinical experience...