Please click on the Apply to verify the status of jobs posted more than 15 days ago, as they may have expired. Similar Jobs
Job Description
Company Vision
We are making finance simple. For millions in India.
Groww is on a mission to democratize access to financial services for millions of Indians responsibly. We are a customer-first company. We believe in crafting the best and most delightful user experience for our customers. And we leverage first principle thinking and technology to solve problems at scale. If this excites you, join us.
About Groww
Groww is India’s fastest growing investment platform.. Our long term vision to become the trusted money partner of ~100M Indians with core products in investment & banking. Groww was founded in 2016 by alums from Flipkart, ICICI, IITB, IITD, BITs Pilani and is backed by marquee investors like Tiger Global, Sequoia Capital, Ribbit Captial, and Y Combinator.
Our Values
We take pride in our values and hold ourselves to a high standard on:
- Radical customer centricity
- Simplicity in our products and personalities
- Transparency on everything
- Long term thinking
- Ownership driven culture
Job Requirement
We are looking for a Data Reliability Engineer/Sr. Data Site Reliability Engineer to help us build and enhance big data platforms to achieve availability, scalability and operational effectiveness. The right individual will embrace the opportunity to tackle challenging problems and use their influence to drive continual improvement. You will also work on the cutting edge technology leveraging Kafka, Burrow, DataProc, DataHub, Spark, Flink, Kubernetes, Prometheus, Airflow, Hadoop, Hbase, Hive, Nifi etc.
Roles and Responsibilities
Managing availability, performance, capacity and security of big data infrastructure and applications like DataProc, DataHub, Spark, Flink, Airflow etc.- Building and implementing observability for applications health/performance/capacity.
- Optimising On-call rotations and processes.
- Documenting "tribal" knowledge.
- Managing Infra-platforms like - DataProc - DataHub - Kafka - Kafka Connect - Kubernetes - Flink - Spark - Other Data Infrastructure
- Providing help in onboarding new big data applications with the production readiness review process.
- Developing tools to manage Big Data infrastructure at scale.
- Providing reports on services SLO/Error Budgets/Alerts and Operational Overhead.
- Working with Dev and Product teams to define SLO/Error Budgets/Alerts.
- Working with Dev team to have in depth understanding of the big data application architecture and it's bottlenecks.
- Identifying observability gaps in big data services, infrastructure and working with stake owners to fix it.
- Managing Outages and doing detailed RCA with developers and identifying ways to avoid that situation.
- Managing/Automating upgrades of the infrastructure services.
- Automate toil work.
Looking to get Placed? Try our Placement Guarantee Plan
Experience & Skills
3-5+ Years of experience as a Big Data Reliability Engineer on large scale Spark, DataProc and Kafka Infrastructure. A collaborative spirit with the ability to work across disciplines to influence, learn, and deliver.
- A deep understanding of computer science, software development, and networking principles.
- Demonstrated experience with languages, such as Python, Java, Golang etc.
- Extensive experience with Linux administration and good understanding of the various linux kernel subsystems (memory, storage, network etc).
- Extensive experience in DNS, TCP/IP, UDP, GRPC, Routing and Load Balancing.
- Expertise in GitOps, Infrastructure as a Code tools such as Terraform etc will be a plus.
- Expertise of Amazon Web Services (AWS), Google Cloud (GCP) and/or other relevant Cloud Infrastructure solutions like Microsoft Azure.
- Experience in managing and deploying containerised environments using Docker, Kubernetes is a plus.
- Experience with multiple data-stores is a plus (MySQL, PostgreSQL, Aerospike, Mongo, Scylla, Cassandra, Elasticsearch).
If an employer asks you to pay any kind of fee, please notify us immediately. Jobaaj does not charge any fee from the applicants and we do not allow other companies also to do so.
About Company
Groww offers a vibrant and dynamic environment for its employees, breaking away from the conventional corporate setup with its innovative and playful work culture. Emphasizing teamwork, creativity, and responsibility, Groww careers are not just about hard work but also about enjoying the process and learning from both successes and failures. With a commitment to democratizing finance in India, Groww is a place where every idea counts and employees are encouraged to experiment, fail, and learn, without the fear of judgment. The platform is responsible to over 50 million users daily, offering a wide range of job opportunities across various departments such as Business, Compliance, Engineering, Finance, Growth, and more, catering to experienced professionals looking to make a significant impact in the fintech sector
Important dates & deadlines?
Application Deadline
14 Nov 22, 12:00 AM IST
Similar Jobs
View All

