Engineer (Web Scraping), MeasureOne

Department Icon IT / Software Development & Related
102+ Applicants
Posted: 1 month ago
3-5 years
Ahmedabad, Gujarat
work from office

Posted: 1 month ago
|
Applicants: 102+
Job Description
About Company
Similar Jobs
Please verify your account first! Send OTP

Please click on the Apply to verify the status of jobs posted more than 15 days ago, as they may have expired. Similar Jobs

Job Description

Who are we

MeasureOne, the leading consumer-permissioned data exchange platform, transforms the way in which businesses access and use consumer data. MeasureOne empowers organizations to access a wide range of trusted consumer data while prioritizing privacy and consent.Through MeasureOnes platform, businesses can confidently and reliably integrate and verify consumer information such as income, employment, education, and student enrollment. MeasureOne offers flexible implementation options for businesses to easily leverage consumer-permission data, from a developer-friendly API to third-party integrations. MeasureOne is headquartered in San Francisco.

What we are looking for

The ideal candidate will possess hands-on expertise in designing and deploying advanced web scraping solutions, leveraging Node.js and other technologies. A significant focus will be on overcoming bot detection challenges, building scalable and resilient scraping systems, and ensuring the efficiency and scalability of data acquisition pipelines. This is a highly technical, hands-on role ideal for someone passionate about solving complex scraping and infrastructure challenges.

Things you will be doing

Advanced Web Scraping:

  • Develop and maintain high-performance scraping systems using Node.js, Python, or other relevant technologies.
  • Handle JavaScript-heavy and asynchronous content using tools like Puppeteer, Playwright, or custom solutions in Node.js.
  • Implement advanced bot detection bypass techniques, including:
  • CAPTCHA solving using automation, AI/ML, or third-party services.
  • Advanced proxy management and IP rotation strategies.
  • User-agent, cookie, and header spoofing.
  • Build robust error-handling mechanisms to adapt to changes in website structures or anti-scraping measures.

Bot Detection and Anti-Scraping Expertise:

  • Analyze and reverse-engineer advanced bot detection systems and anti-scraping mechanisms, including rate-limiting, behavioral analysis, and fingerprinting.
  • Design and implement techniques to bypass WAFs (Web Application Firewalls) and server-side protections using Node.js libraries and tools.
  • Monitor, log, and analyze bot detection patterns to ensure system adaptability.
  • Create innovative solutions to blend scraping traffic with legitimate user behavior.

Infrastructure and Networking:

  • Architect and maintain scalable infrastructure using containerization tools like Docker and orchestration platforms such as Kubernetes.
  • Leverage cloud platforms (AWS, GCP, Azure) for distributed scraping and data acquisition.
  • Utilize Node.js and related tools to optimize network configurations for high-throughput scraping, including proxy and load balancer configurations.
  • Automate deployment and scaling of scraping systems using CI/CD pipelines.

Performance and Optimization:

  • Ensure optimal performance of scraping systems by reducing latency and optimizing resource utilization.
  • Develop robust monitoring and logging systems to track and troubleshoot issues in real time.
  • Optimize pipelines for scalability, fault tolerance, and high availability.

Compliance and Security:

  • Ensure adherence to legal, ethical, and regulatory standards (e.g., GDPR, CCPA) for all scraping activities.
  • Looking to get Placed? Try our Placement Guarantee Plan

  • Safeguard data acquisition systems from detection, blocking, and external threats.
  • Respect websites terms of service while implementing efficient scraping solutions.

Skills you need to succeed in this role

Technical Skills:

  • 3+ years of hands-on experience in web scraping or data engineering.
  • Expertise in Node.js for building and optimizing scraping systems.
  • Deep expertise in handling advanced bot detection systems and anti-scraping mechanisms.
  • Strong knowledge of programming languages such as Python and JavaScript.
  • Advanced understanding of networking concepts, including HTTP/HTTPS protocols, WebSockets, DNS, and API integrations.
  • Experience with containerization tools (Docker) and orchestration platforms (Kubernetes).
  • Proficiency in cloud platforms (AWS, GCP, Azure) for scalable data acquisition pipelines.
  • Familiarity with tools like Puppeteer, Playwright, Scrapy, or Selenium.

Problem-Solving Expertise:

  • Proven ability to reverse-engineer anti-bot measures such as CAPTCHA, IP blocks, and fingerprinting.
  • Strong debugging and optimization skills for network and scraping pipelines.

Skills

PythonDebuggingJavascriptKubernetesNode.jsDeveloperApiCloud

If an employer asks you to pay any kind of fee, please notify us immediately. Jobaaj does not charge any fee from the applicants and we do not allow other companies also to do so.

About Company

MeasureOne is a pioneer in data and analytics, transforming the way consumers can access and share their data with lenders. MeasureOne is a data and analytics firm specializing in data-driven insight for the higher education finance industry.

Important dates & deadlines?

Application Deadline

12 May 26, 05:55 PM IST

Similar Jobs

View All
Loading...
Bag Logo
Jobaaj
Don't Miss out any Updates

Subscribe now for the latest job alerts
and never miss an update

Job Alert
Google hiring for Specific Roles Apply Now!
1 min ago
New Opportunity
Amazon is hiring freshers Apply Now!
5 min ago
Featured Jobs
Microsoft opening 50+ positions Apply Now!
10 min ago

Engineer (Web Scraping), MeasureOne

Share with