Data Engineer AI/ML

Department Icon Data Science Analytics & Machine Learning
149+ Applicants
Posted: 1 week ago
3-6 years
Gurugram, Haryana
work from office

Posted: 1 week ago
|
Applicants: 150+
Job Description
Similar Jobs
Please verify your account first! Send OTP

Job Description

Data Engineer / ML Engineer

NOTE - Apply only if youre a immediate joiner

Location - Gurugram (Onsite)

Salary Budget - Upto 25 LPA

Key Responsibilities

  • Design, build, and maintain scalable data pipelines (batch + streaming) using Spark, Hadoop, and other Apache ecosystem tools.
  • Develop robust ETL workflows for large-scale data ingestion, transformation, and validation.
  • Work with Cassandra, Data Lakes, and distributed storage systems to handle large-volume datasets.
  • Write clean, optimized, and modular Python code for data processing, automation, and machine learning tasks.
  • Utilize Linux environments for scripting, performance tuning, and data workflow orchestration.
  • Build and manage web scraping pipelines to extract structured and unstructured data from diverse sources.
  • Collaborate with ML/AI teams to prepare training datasets, manage feature stores, and support model lifecycle.
  • Implement and experiment with LLMs, LangChain, RAG pipelines, and vector database integrations.
  • Assist in fine-tuning models, evaluating model performance, and deploying ML models into production.
  • Optimize data workflows for performance, scalability, and fault tolerance.
  • Document data flows, transformation logic, and machine learning processes.
  • Work cross-functionally with engineering, product, and DevOps teams to ensure reliable, production-grade data systems.

Requirements

  • 36 years of experience as a Data Engineer, ML Engineer, or similar role.
  • Strong expertise in Advanced Python (data structures, multiprocessing, async, clean architecture).
  • Solid experience with:
  • Looking to get Placed? Try our Placement Guarantee Plan

  • Apache Spark / PySpark
  • Hadoop ecosystem (HDFS, Hive, Yarn, HBase, etc.)
  • Cassandra or similar distributed databases
  • Linux (CLI tools, shell scripting, environment management)
  • Proven ability to design and implement ETL pipelines and scalable data processing systems.
  • Hands-on experience with data lakes, large-scale storage, and distributed systems.
  • Experience with web scraping frameworks (BeautifulSoup, Scrapy, Playwright, etc.).
  • Familiarity with LangChain, LLMs, RAG, vector stores (FAISS, Pinecone, Milvus), and ML workflow tools.
  • Understanding of model training, fine-tuning, and evaluation workflows.
  • Strong problem-solving skills, ability to deep dive into complex data issues, and write production-ready code.
  • Experience with cloud environments (AWS/GCP/Azure) is a plus.

Skills

PythonEtlData ProcessingMachine LearningData EngineerMl EngineerAiMlAi/ml

If an employer asks you to pay any kind of fee, please notify us immediately. Jobaaj does not charge any fee from the applicants and we do not allow other companies also to do so.

Important dates & deadlines?

Application Deadline

03 Apr 26, 04:47 PM IST

Similar Jobs

View All
Loading...
Bag Logo
Jobaaj
Don't Miss out any Updates

Subscribe now for the latest job alerts
and never miss an update

Job Alert
Google hiring for Specific Roles Apply Now!
1 min ago
New Opportunity
Amazon is hiring freshers Apply Now!
5 min ago
Featured Jobs
Microsoft opening 50+ positions Apply Now!
10 min ago

Data Engineer AI/ML

Share with