Lead Support Analyst - Shared Services and Production Management, Information Technology

CLSA

IT / Software Development & Related

102+ Applicants

Posted: 1 week ago

8-10 years

Pune, Maharashtra

work from office

Job Description

Key Areas of Responsibilities

Own and support monitoring and SRE operations, ensuring system reliability, availability, and performance.
Build, enhance, and maintain monitoring solutions using ITRS Geneos, Prometheus, VictoriaMetrics, Elasticsearch, and Grafana.
Develop, optimize, and maintain alerting rules, dashboards, and observability pipelines.
Troubleshoot and resolve complex issues during major incidents, providing clear and timely communication.
Administer and troubleshoot Linux servers (RHEL 7/8/9), including upgrades, configuration, patching, and maintenance, and determine appropriate monitoring requirements for system changes.
Analyze logs, investigate issues, and perform fault finding to identify performance exceptions.
Collaborate with engineering, application, and infrastructure teams to improve system resilience, stability, security, efficiency, and scalability.
Contribute to automation strategies, deployment processes, and continuous operational improvements.

Participate in on‑call rotations, including off‑hours and scheduled weekend support.
Participate in Disaster Recovery (DR) and Business Continuity Planning (BCP) drills.
Continuously research and adopt modern monitoring and SRE tools and practices.

Requirements

Bachelor's degree in computer science or engineering.
Minimum 8 years' experience within IT / investment banking.
Strong experience with monitoring and observability platforms, including ITRS Geneos, Prometheus, VictoriaMetrics, Elasticsearch, Grafana, and Kibana.
Hands-on experience building and implementing Prometheus pipelines, including exporters, scraping configurations, relabelling, metric routing, and integrations with long‑term storage (e.g., VictoriaMetrics).
Experience building and maintaining Logstash pipelines, including ingestion, parsing, filtering, enrichment, and routing of logs into Elasticsearch.

Ability to design, build, and maintain Grafana and Kibana dashboards for metrics, logs, and performance analytics across distributed systems.
Solid understanding of metrics, logging, alerting, dashboards, and observability pipelines.
Strong Linux administration skills (RHEL 7/8/9), including troubleshooting, upgrades, configuration, patching, and performance optimization.
Good understanding of SRE principles, high availability, scalability, incident management, and Disaster Recovery (DR) / Business Continuity Planning (BCP) activities.
Experience with automation (e.g., Bash, Python, Ansible, CI/CD tools) is an advantage.
Understanding of networking fundamentals, performance tuning, and troubleshooting distributed systems.
Prior experience in Production Support, SRE, Monitoring Engineering, or Shared Services Operations with participation in on‑call rotations, including after-hours and weekend support.
Strong analytical, problem‑solving and communication skills with the ability to work collaboratively under pressure.
Self-motivated, adaptable and able to prioritize, learn continuously and manage multiple responsibilities effectively.
Fluent in English.

Skills

Python, Distributed Systems, Linux

Important dates & deadlines

Application Deadline

06 Jul 26, 04:03 PM IST
