Senior Site Reliability Engineer
ECS Tech Inc
Fairfax, Virginia, United States (On Site)
Original Job Summary
ECS Tech Inc is seeking a talented Senior Site Reliability Engineer to work remotely on the next-generation Continuous Diagnostics and Mitigation (CDM) Cyber data solution. This role is part of the Cybersecurity and Infrastructure Security Agency’s (CISA) initiative to enhance federal network security.
Role & Responsibilities
- Define, implement and grow the SRE practice.
- Ensure reliability, availability and performance of critical production environments.
- Design and maintain logging, monitoring and alerting systems using Elastic and other tools.
- Conduct root cause analyses and manage incident response.
- Collaborate with cross-functional teams to integrate SRE practices into the development lifecycle.
Qualifications
- US citizenship with ability to obtain Public Trust Suitability.
- 6+ years of SRE experience with hands-on observability, logging, monitoring, and alerting.
- 3+ years of experience with cloud platforms (AWS GovCloud preferred) and coding (Python, Bash, etc.).
- Strong knowledge of microservices, containerization, and orchestration tools (Docker, Kubernetes).
- Experience working in a SAFe (Scaled Agile Framework) environment.
Key skills/competencies
- SRE
- Reliability
- Monitoring
- Elastic
- AWS
- Kubernetes
- Python
- DevOps
- Cybersecurity
- SAFe
How to Get Hired at ECS Tech Inc
🎯 Tips for Getting Hired
- Research ECS Tech Inc's culture: Study its mission, values, and recent news.
- Tailor your resume: Highlight SRE and cloud experience.
- Showcase technical skills: Emphasize observability and automation.
- Prepare for situational questions: Be ready to show how you solved problems in past roles.
📝 Interview Preparation Advice
Technical Preparation
- Review logging and monitoring tool documentation.
- Practice coding in Python and writing Bash scripts.
- Study AWS GovCloud and cloud platform basics.
- Familiarize yourself with container orchestration using Kubernetes.
Behavioral Questions
- Describe a challenging incident response experience.
- Explain how you work with a team in a fast-paced environment.
- Discuss how you have handled continuous improvement initiatives.
- Share examples of cross-functional collaboration.