Data Engineer

Department Icon Data Science Analytics & Machine Learning
149+ Applicants
Posted: 2 years ago
4-6 years
Bengaluru, Karnataka, India
Work From Office

Posted: 2 years ago
|
Applicants: 149+
Job Description
About Company
Similar Jobs
Please verify your account first! Send OTP

Please click on the Apply to verify the status of jobs posted more than 15 days ago, as they may have expired. Similar Jobs

Job Description

Responsibilities
  • Collaborate with product design and engineering to develop an understanding of needs.
  • To capture and process billions of academic and behavioral attribute events on a daily basis.
  • To build a scalable ETL/ELT pipeline to cleanse the data to be consumed in a coherent way.
  • To store and retrieve an aggregated summary of the academic and behavioral attributes.
  • To design, build, and operate a data lake for all the critical data at Embibe.
  • To build a scalable event-driven pipeline for real-time analyses.
  • To build a scalable ETL/ELT processing pipeline for coherent analysis.
  • To implement the right level of access management for different roles.
  • To ensure Personal information protection (PIP) for students, parents, teachers, school, etc.
  • Develop and improve the current data architecture, data quality, monitoring, and data availability.
  • Collaborate with Data Scientists to implement advanced analytics algorithms that exploit our rich data sets for statistical analysis, prediction, clustering, and machine learning.
  • Help continually improve ongoing reporting and analysis processes, simplifying self-service support for customers.
  • Keep up to date with advances in big data technologies and run pilots to design the data architecture to scale with the increased data sets of customer experience on AWS.
  • Dashboards and Data Requests - Service Layer: To support easy-to-use plug-and-play data visualizations for content, users, devices, application and infrastructure logs, clickstream engagement, and company metrics.
Requirements
  • B. E. /B. Tech/M. E. /M. S. /M. Tech. or higher in Computer Science or related technical field, or equivalent practical experience.
  • Minimum 7 years of solid object-oriented programming experience in Java/Scala, Python, or other JVM languages.
  • Expertise in building large data warehouses and/ or data lakes and/ or lake house architecture.
  • Looking to get Placed? Try our Placement Guarantee Plan

    Knowledge of distributed systems and data architecture (lambda)- design and implement batch and stream data processing pipelines, knows how to optimize the distribution, partitioning, and MPP of high-level data structures.
  • 4 years of functional programming using Scala/Python must have experience in writing effective unit tests using Pytest/Scala test/Junit.
  • 6 years of experience in big data technologies like Hadoop, Spark, Spark Streaming, HBase, and Hive.
  • 4 years of Structured Streaming using Spark.
  • 4 years of experience in workflow management tools like Oozie/Airflow/ Azkaban/Luigi.
  • 4 years of experience in interfacing with messaging buses like Apache Kafka/ Rabbit MQ.
  • 4 years of experience in systems design, algorithms, and distributed systems.
  • 4 years of experience in all steps of the software engineering process including testing, continuous integration/continuous delivery, automated deployments, and monitoring.
  • 4 years of experience in building microservices with Spring Boot.
  • Big Plus: Experience with MS Azure (Data Bricks, Data Factory, Data Lake, Cosmos DB, ADX, Synapse, Event Hub, PowerBI, Synapse, SQL Server).
This job was posted by Shivani Singh from Embibe.

Skills

AnalyticsData ArchitectureData ProcessingData QualityEtlMachine LearningPythonQualityReportingSqlStatistical Analysis

If an employer asks you to pay any kind of fee, please notify us immediately. Jobaaj does not charge any fee from the applicants and we do not allow other companies also to do so.

About Company

Embibe, operated by Individual Learning Pvt Ltd, is an online test preparation portal. Offers mock tests for engineering exams, medical exams, banking along with foundation courses for grades 8-10. Provides personalized, guided practice and score improvement recommendations based on analytics for every user. Claims 23% score improvement in just one test and 92% predictability between exam scores and their score.

Important dates & deadlines?

Application Deadline

23 Dec 23, 06:44 AM IST

Similar Jobs

View All
Loading...
Bag Logo
Jobaaj
Don't Miss out any Updates

Subscribe now for the latest job alerts
and never miss an update

Job Alert
Google hiring for Specific Roles Apply Now!
1 min ago
New Opportunity
Amazon is hiring freshers Apply Now!
5 min ago
Featured Jobs
Microsoft opening 50+ positions Apply Now!
10 min ago

Data Engineer

Share with