Data Engineer - AI-Assisted Data Ingestion Platform

Department Icon Data Science Analytics & Machine Learning
149+ Applicants
Posted: 1 month ago
3-5 years
Bengaluru / Bangalore, Karnataka
work from office

Posted: 1 month ago
|
Applicants: 149+
Job Description
Similar Jobs
Please verify your account first! Send OTP

Please click on the Apply to verify the status of jobs posted more than 15 days ago, as they may have expired. Similar Jobs

Job Description

We are looking for a Data Engineer to join our team and help build an AI-powered data ingestion platform. The platform collects data from multiple sources and loads it into analytical databases that power our B2B analytics product, BevGenie, where users can interact with their data using natural language.

In this role, you will work closely with a senior data engineer and use AI coding assistants (Claude Code) to build, improve, and maintain production-grade data pipelines. You will be responsible for extracting, cleaning, transforming, and loading data from both structured and unstructured sources.

Key Responsibilities:

  • Design, build, and maintain scalable data ingestion pipelines

  • Extract data from websites, APIs, documents (PDFs), emails, and databases

  • Parse, clean, and transform structured and unstructured data

  • Implement data validation, quality checks, and error handling

  • Handle missing, duplicate, and malformed data

  • Ensure pipelines are reliable and idempotent

  • Orchestrate workflows using Dagster or similar tools

  • Load data into PostgreSQL, Supabase, Snowflake, and MongoDB

  • Collaborate with senior engineers and cross-functional teams


Requirements
  • 35 years of experience in Python-based development or data engineering

  • Strong Python fundamentals (functions, classes, logging, error handling)

  • Experience working with APIs and common data formats (JSON, CSV, XML)

  • Strong SQL skills including joins, aggregations, CTEs, and window functions

  • Understanding of relational database concepts, schema design, and indexing

  • Basic knowledge of web scraping and data extraction techniques

  • Familiarity with handling structured and unstructured data

  • Understanding of data quality, validation, and pipeline reliability

  • Good problem-solving and analytical skills

  • Strong communication and collaboration skills

  • Willingness to learn new tools and technologies

Nice to Have:

  • Experience with Dagster, Airflow, or Prefect

    Looking to get Placed? Try our Placement Guarantee Plan

  • Experience with web scraping tools (BeautifulSoup, Scrapy, Playwright)

  • Experience with PDF extraction tools

  • Exposure to AWS services (S3, Lambda, Glue)

  • Knowledge of dimensional modeling or analytics data modeling

  • Any experience using AI/LLMs for data extraction


Benefits
  • Work closely with and learn from a senior data engineer

  • Hands-on experience building real production data pipelines

  • Exposure to modern AI-assisted development workflows

  • Opportunity to work with diverse data sources and technologies

  • Remote-first and flexible work environment

  • Competitive compensation

  • High-impact role in a growing product-focused team

  • Skills

    Data ValidationPythonData ModelingData ExtractionSnowflakeData EngineerAnalyticsAiSql

    If an employer asks you to pay any kind of fee, please notify us immediately. Jobaaj does not charge any fee from the applicants and we do not allow other companies also to do so.

    Important dates & deadlines?

    Application Deadline

    02 Jun 26, 04:06 PM IST

    Similar Jobs

    View All
    Loading...
    Bag Logo
    Jobaaj
    Don't Miss out any Updates

    Subscribe now for the latest job alerts
    and never miss an update

    Job Alert
    Google hiring for Specific Roles Apply Now!
    1 min ago
    New Opportunity
    Amazon is hiring freshers Apply Now!
    5 min ago
    Featured Jobs
    Microsoft opening 50+ positions Apply Now!
    10 min ago

    Data Engineer - AI-Assisted Data Ingestion Platform

    Share with