Sr SRE

Quantiphi Analytics Solutions Private Limited

IT / Software Development & Related

102+ Applicants

Posted: 3 months ago

5-8 years

Bengaluru / Bangalore, Karnataka

work from office

Job Description

While technology is the heart of our business, a global and diverse culture is the heart of our success. We love our people and take pride in offering them a culture built on transparency, diversity, integrity, learning and growth.
If working in an environment that encourages you to innovate and excel, not just in your professional life but in your personal life too, interests you, you would enjoy your career with Quantiphi!

Role: Site Reliability Engineer
Experience Level: 5-8 Years
Work location: Bangalore, Mumbai (Hybrid)

As a Site Reliability Engineer, you'll be responsible for ensuring the reliability, performance, and scalability of a serverless platform. You'll work on improving system observability, automating operational tasks, optimizing resource utilization, and maintaining our stringent SLOs while balancing cost efficiency. This role requires deep technical expertise in distributed systems and cloud infrastructure, and a passion for operational excellence.

What You'll Do:

Ensure Platform Reliability: Own the availability, latency, performance, and efficiency of NG-SIEM platform services
Build Automation & Tooling: Design and implement automation solutions for deployment, monitoring, incident response, and capacity planning to reduce toil and improve operational efficiency
Monitor & Optimize: Develop comprehensive observability solutions using metrics, logs, and traces to proactively identify and resolve performance bottlenecks and reliability issues
Incident Management: Lead incident response efforts, conduct blameless post-mortems, and drive continuous improvement initiatives to prevent recurrence
Capacity Planning: Analyze system performance data and growth trends to forecast infrastructure needs and ensure the platform scales efficiently with customer demand
SLO/SLA Management: Define, measure, and maintain Service Level Objectives and error budgets to balance feature velocity with reliability requirements
Cost Optimization: Implement strategies to optimize cloud resource utilization and reduce operational costs while maintaining performance and reliability standards
Collaborate Cross-Functionally: Partner with engineering teams to improve system design for reliability, influence architectural decisions, and embed SRE best practices
On-Call Participation: Participate in on-call rotation to provide 24/7 support for critical production systems
Documentation: Create and maintain runbooks, operational procedures, and technical documentation to enable team scalability

What You'll Need:

Experience in Site Reliability Engineering, DevOps, or similar roles supporting large-scale distributed systems in production environments
Strong programming skills in at least one language (Go) for automation and tooling development
Deep cloud expertise with hands-on experience in at least one major cloud platform (AWS or GCP) including compute, storage, networking, and managed services
Distributed systems knowledge: Understanding of distributed system design patterns, consistency models, fault tolerance, and scalability principles
Infrastructure as Code: Proficiency with IaC tools (Terraform) and configuration management (Ansible, Chef, Puppet)
Container orchestration: Experience with Kubernetes, Docker, Podman and container-based deployment patterns
Observability expertise: Hands-on experience with monitoring and observability tools (Prometheus, Grafana)
CI/CD pipelines: Experience building and maintaining continuous integration and deployment pipelines
Incident management: Proven track record of managing high-severity incidents and implementing preventive measures
Data-driven approach: Ability to analyze system metrics and logs to identify trends, anomalies, and optimization opportunities
Communication skills: Excellent verbal and written communication abilities for remote collaboration across global teams

Bonus Points:

Massive scale experience: 3+ years owning systems handling over 1 trillion requests per day or more than 10 PB of data per day
Multi-cloud experience: Hands-on work with hybrid or multi-cloud environments
Database expertise: Deep knowledge of distributed databases, data lakes, or SIEM platforms (ClickHouse, Redis, MySQL)
Security background: Exposure to cybersecurity, threat intelligence, or security operations
Networking expertise: Advanced understanding of network protocols, load balancing, and CDN technologies

Skills

Cybersecurity, DevOps, Distributed Systems, Kubernetes, Cloud

About Company

Quantiphi is a global provider of AI-driven solutions and services. They help businesses leverage the power of artificial intelligence to drive innovation and improve efficiency.

Important dates & deadlines

Application Deadline

10 May 26, 06:37 PM IST
