Site Reliability Engineer (SRE) Job at Expert Technology Services, Washington DC

  • Expert Technology Services
  • Washington DC

Job Description

Expert Technology Services is seeking a Site Reliability Engineer for a high-impact role with a premier client based in Washington, DC. In this position, you will bridge the gap between development and operations by applying a software engineering mindset to system administration and infrastructure. You will be responsible for ensuring the scalability, performance, and high availability of cloud-based services across AWS and Azure environments. By leveraging Infrastructure-as-Code, advanced observability with Dynatrace, and SRE principles like error budgets and SLOs, you will drive operational excellence and lead incident response efforts for mission-critical applications.

Key Responsibilities
  • Deployment & Automation: Architect and manage CI/CD pipelines (GitHub Actions, AWS CodePipeline) and automate global infrastructure using Terraform, CloudFormation, or CDK.
  • Performance & Capacity: Drive cost-optimization initiatives, manage auto-scaling thresholds, and execute resiliency/performance testing to ensure system durability.
  • Incident Management: Act as a primary on-call responder using ITIL frameworks and ServiceNow; develop Root Cause Analysis (RCA) documentation and maintain knowledge bases.
  • Observability & Monitoring: Implement distributed tracing and optimize monitoring via Dynatrace and Kibana to create advanced dashboards and anomaly detection.
  • Reliability Engineering: Define and monitor SLIs and SLOs while managing error budgets to balance feature velocity with system stability.
  • Security & Compliance: Oversee service accounts, manage digital certificates, and execute rapid remediation for security incidents.

Qualifications
  • Education: Bachelor's degree in Computer Science, Engineering, or a related technical field.
  • Experience: 2 to 4 years of professional experience in SRE, DevOps, or Infrastructure roles.
  • Cloud Proficiency: Practical, hands-on experience with both AWS and Azure platforms.
  • Technical Skills: Mid-level proficiency in Python (or similar scripting languages) and configuration management tools like Ansible.
  • Containerization: Solid understanding of Docker and orchestration via Kubernetes or ECS.
  • Infrastructure Fundamentals: Strong knowledge of Linux systems, networking protocols, and both relational and NoSQL database architectures.
  • Soft Skills: Excellent written and verbal communication skills with the ability to manage competing priorities independently.
  • Flexibility: Ability to participate in a production on-call rotation, including work outside standard business hours.

This is a high-priority, proactive requisition.

Background Check: No

Drug Screen: No
