Data Engineer ( Spark, Scala & Python )

Department Icon Data Science Analytics & Machine Learning
149+ Applicants
Posted: 2 years ago
5-7 years
India
Work From Office

Posted: 2 years ago
|
Applicants: 149+
Job Description
About Company
Similar Jobs
Please verify your account first! Send OTP

Please click on the Apply to verify the status of jobs posted more than 15 days ago, as they may have expired. Similar Jobs

Job Description

Greetings from Lucidspire Private Limited!!

JOB OPPORTUNITY WITH OUR CLIENT!

DATA ENGINEER - Spark , Python , Scala

Experience : 5+ yrs

Location : Offshore

Notice period : immediate joiners to 30 days

:

We are looking for a highly skilled and motivated Spark Data Engineer to join our team. The ideal candidate will have a strong background in Apache Spark, data ingestion, data processing, and data integration, and will be responsible for developing and maintaining our dynamic data ingestion framework using the Spark framework. The candidate should have expertise in building scalable, high-performance, and fault-tolerant data processing pipelines using Spark, and be able to optimize Spark jobs for performance and scalability. The candidate should also have experience in designing and implementing data models, handling data errors, implementing data quality and validation processes, and integrating Spark applications with other big data technologies in the Hadoop ecosystem.

Responsibilities:

Develop and maintain a dynamic data ingestion framework using Apache Spark

Implement data ingestion pipelines for batch processing and real-time streaming using Sparks data ingestion APIs

Design and implement data models using Sparks DataFrame and Dataset APIs

Optimize Spark jobs for performance and scalability, including caching, broadcasting, and data partitioning techniques

Implement error handling and fault tolerance mechanisms to handle data errors, processing failures, and system failures in Spark applications

Implement data quality and validation processes, including data profiling, data cleansing, and data validation rules using Sparks data processing and data validation APIs

Integrate Spark applications with other big data technologies in the Hadoop ecosystem, such as Hadoop, Hive, HBase, Kafka, and others

Ensure data security by implementing data encryption, data masking, and data access controls in Spark applications

Use version control systems, such as Git, for source code management, and implement DevOps practices, such as continuous integration, continuous delivery, and automated deployments, in Spark application development workflows

Qualifications:

Bachelors or Masters degree in Computer Science, Data Engineering, or a related field

Strong proficiency in Apache Spark, including Spark Core, Spark SQL, Spark Streaming, and Spark MLlib, with multiple production developments and deployment experience.

Proficiency in either Scala or Python programming languages, with knowledge of functional programming concepts

Looking to get Placed? Try our Placement Guarantee Plan

Experience in developing and maintaining dynamic data ingestion frameworks using Spark

Experience in data processing, data integration, and data modeling using Sparks DataFrame and Dataset APIs

Knowledge of performance optimization techniques in Spark, including caching, broadcasting, and data partitioning

Experience in implementing error handling and fault tolerance mechanisms in Spark applications

Knowledge of data quality and validation techniques using Sparks data processing and data validation APIs

Familiarity with other big data technologies in the Hadoop ecosystem, such as Hadoop, Hive, HBase, Kafka, etc.

Experience in implementing data security measures in Spark applications, such as data encryption, data masking, and data access controls

Strong problem-solving skills and ability to troubleshoot and resolve issues related to Spark applications

Proficiency in using version control systems, such as Git, and implementing DevOps practices in Spark application development workflows

Excellent communication and collaboration skills, with the ability to work effectively in a team-oriented environment

Skills

Data CleansingData IntegrationData ModelingData ProcessingData ProfilingData QualityData ValidationDesigningPythonQualitySql

If an employer asks you to pay any kind of fee, please notify us immediately. Jobaaj does not charge any fee from the applicants and we do not allow other companies also to do so.

About Company

SPI is a global leader in the Distribution and Specialty Fabrication of insulation products for Thermal, Acoustic, Fire Protection and Refractory applications. We service Commercial, Industrial, Marine and OEM markets. From the first contact to project completion, you’ll benefit from our extensive product offering, superior service, and value.

Important dates & deadlines?

Application Deadline

30 Sep 23, 06:11 PM IST

Similar Jobs

View All
Loading...
Bag Logo
Jobaaj
Don't Miss out any Updates

Subscribe now for the latest job alerts
and never miss an update

Job Alert
Google hiring for Specific Roles Apply Now!
1 min ago
New Opportunity
Amazon is hiring freshers Apply Now!
5 min ago
Featured Jobs
Microsoft opening 50+ positions Apply Now!
10 min ago

Data Engineer ( Spark, Scala & Python )

Share with