10x Devs
Sousse

Web Scraping / Data Extraction Intern (pre-hire)

Tags: pre-hire, scraping

Posted 1 day ago

Internship
⏱️ 2 months
💼 Remote
Partner

Job Description

We are looking for a Web Scraping Intern passionate about data extraction and automation. You will work closely with our technical team to scrape e-commerce websites, collect structured data, and maintain reliable scraping pipelines using Python.

This internship is ideal for someone motivated, curious, and eager to gain hands-on experience with real-world scraping challenges and production environments.

Responsibilities

  • Develop and maintain web scrapers using Python
  • Scrape e-commerce websites and extract structured data (prices, products, availability, etc.)
  • Work with API endpoints (REST/JSON) for data collection and integration
  • Identify and use CSS selectors and XPath expressions efficiently
  • Handle pagination, dynamic content, and data normalization
  • Debug and improve scraper reliability and performance
  • Collaborate with the team to understand data requirements and improve extraction logic
  • Document scraping logic and workflows clearly
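
To give candidates a rough idea of the pagination and normalization work listed above, here is a minimal Python sketch. It is an illustration only, not the team's actual pipeline: the paginated JSON API is stubbed in memory, and the names `FAKE_PAGES`, `fetch_page`, `normalize_price`, `scrape_all`, and the `next_page` field are all hypothetical.

```python
from decimal import Decimal
from typing import Callable, Iterator

# Hypothetical paginated JSON responses, stubbed in memory so the sketch runs offline.
FAKE_PAGES = {
    1: {"items": [{"name": " Laptop ", "price": "1 299,00 TND", "in_stock": "yes"}],
        "next_page": 2},
    2: {"items": [{"name": "Mouse", "price": "45.5", "in_stock": "no"}],
        "next_page": None},
}


def fetch_page(page: int) -> dict:
    """Stand-in for an HTTP GET that returns parsed JSON (assumption)."""
    return FAKE_PAGES[page]


def normalize_price(raw: str) -> Decimal:
    """Drop currency text and spaces; treat a comma as the decimal mark.

    Assumes the input never uses a comma or dot as a thousands separator.
    """
    cleaned = "".join(ch for ch in raw if ch.isdigit() or ch in ",.")
    return Decimal(cleaned.replace(",", "."))


def scrape_all(fetch: Callable[[int], dict], start: int = 1) -> Iterator[dict]:
    """Follow `next_page` links until the API reports no further page."""
    page = start
    while page is not None:
        payload = fetch(page)
        for item in payload["items"]:
            yield {
                "name": item["name"].strip(),
                "price": normalize_price(item["price"]),
                "available": item["in_stock"] == "yes",
            }
        page = payload.get("next_page")


products = list(scrape_all(fetch_page))
```

Passing the fetch function as a parameter keeps the pagination loop testable without network access; in a real scraper it would wrap an HTTP client call with retries and rate limiting.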

Required Skills

  • Good knowledge of Python
  • Understanding of HTML, CSS selectors, and XPath
  • Experience working with API endpoints
  • Basic knowledge of Git & GitHub
  • Motivation, autonomy, and attention to detail

Nice to Have (not mandatory)

  • Experience scraping e-commerce websites
  • Knowledge of techniques to bypass basic blocking (rate limits, headers, proxies, etc.)
  • Experience with headless browsers (Playwright, Selenium)
  • Understanding of anti-bot mechanisms
  • Personal projects or GitHub portfolio related to scraping or automation

What We Offer

  • Real production experience with real scraping challenges
  • Mentorship and technical guidance
  • Opportunity to work on meaningful data projects
  • Flexible working environment (remote)
  • Unpaid internship, with a strong possibility of being hired if performance and results meet expectations