SRE - Site Reliability Engineering

IT / Software Development & Related
102+ Applicants
Posted: 1 year ago
Experience: 0-15 years
Reston, Virginia, USA
Work from Office

Job Description

We are seeking a highly skilled and experienced Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a strong background in cloud platforms, DevOps practices, and modern software development frameworks. The SRE will play a critical role in designing, building, and maintaining highly scalable, fault-tolerant, and secure cloud infrastructure while ensuring operational excellence, high availability, and reliability.
1. Cloud Infrastructure & Automation:
Design, implement, and manage cloud-based infrastructure using platforms like AWS, Azure, or Google Cloud Platform.
Utilize Infrastructure-as-Code (IaC) tools such as Terraform, CloudFormation, and Ansible to automate deployments and configurations.
Build robust automation for anomaly detection, toil reduction, recovery, and self-healing, and optimize cloud costs.
2. DevSecOps & CI/CD:
Apply a deep understanding of DevSecOps principles to build and maintain CI/CD pipelines using tools like GitLab, Jenkins, SonarQube, Nexus, Artifactory, and Docker.
Implement security best practices, including IAM roles, RBAC, vulnerability remediation, and SAST/DAST/SCA tools.
3. Observability & Incident Management:
Design and implement monitoring, logging, and distributed tracing solutions using tools like AWS CloudWatch, Splunk/SignalFx, Dynatrace, and OpenTelemetry.
Lead root cause analysis, blameless postmortems, and proactive incident management to minimize MTTR and MTTD.
Define and monitor SLOs, SLIs, and error budgets to ensure system reliability.
4. Microservices & API Management:
Architect and manage microservices, serverless computing, and RESTful APIs.
Ensure fault tolerance and resilience using design patterns like Circuit Breaker, Retry, Timeout, and Bulkhead.
5. Chaos Engineering & Resiliency:
Conduct chaos engineering experiments using tools like AWS FIS and Chaos Toolkit.
Perform resiliency assessments using AWS Resilience Hub and implement self-healing solutions.
6. Database & Application Support:
Manage and optimize database technologies such as PostgreSQL, MongoDB, DynamoDB, Oracle, and Redshift.
Provide production support, including incident response, problem management, and runbook creation. Participate in on-call rotations.
7. Collaboration & Communication:
Collaborate with cross-functional teams to implement shift-left testing practices (BDD, TDD, Unit, Regression).
Create and maintain architecture diagrams, knowledge articles, and disaster recovery plans.
Communicate effectively with stakeholders and demonstrate strong relationship management skills.
Required Skills & Qualifications:
Expertise in cloud platforms (AWS, Azure, or Google Cloud Platform) and container orchestration.
Proficiency in programming/scripting languages such as Python, Java, Node.js, Bash, and PowerShell.
Strong knowledge of database technologies (e.g., PostgreSQL, MongoDB, DynamoDB, Oracle, Redshift).
Experience with DevOps tools (Jenkins, Docker, Nexus, Artifactory) and build tools (Maven, Gradle).
Familiarity with AI/ML integrations, event-driven architectures, and distributed systems.
Expertise in observability, logging, and monitoring tools (AWS CloudWatch, Splunk, Dynatrace, OpenTelemetry).
Strong understanding of security practices, including IAM, RBAC, and vulnerability management.
Experience with chaos engineering, resiliency assessments, and disaster recovery planning.
Proficiency in performance testing tools (JMeter, LoadRunner) and capacity planning.

Skills

Python, DevOps, DevOps Tools, Distributed Systems, Java, Node.js, Scripting Languages, Software Development, Testing, RESTful API, Oracle, Cloud


Important dates & deadlines

Application Deadline: 06 Jun 25, 07:27 PM IST
