We are looking for a Web Scraping Intern who is passionate about data extraction and automation. You will work closely with our technical team to scrape e-commerce websites, collect structured data, and maintain reliable scraping pipelines using Python.
This internship is ideal for someone motivated, curious, and eager to gain hands-on experience with real-world scraping challenges and production environments.
Responsibilities
- Develop and maintain web scrapers using Python
- Scrape e-commerce websites and extract structured data (prices, products, availability, etc.)
- Work with API endpoints (REST/JSON) for data collection and integration
- Write efficient, robust CSS selectors and XPath expressions
- Handle pagination, dynamic content, and data normalization
- Debug and improve scraper reliability and performance
- Collaborate with the team to understand data requirements and improve extraction logic
- Document scraping logic and workflows clearly
Required Skills
- Good knowledge of Python
- Understanding of HTML, CSS selectors, and XPath
- Experience working with API endpoints
- Basic knowledge of Git & GitHub
- Motivation, autonomy, and attention to detail
Nice to Have (not mandatory)
- Experience scraping e-commerce websites
- Knowledge of techniques to handle basic blocking and rate limits (custom headers, proxies, etc.)
- Experience with headless browsers (Playwright, Selenium)
- Understanding of anti-bot mechanisms
- Personal projects or GitHub portfolio related to scraping or automation
What We Offer
- Hands-on production experience with real scraping challenges
- Mentorship and technical guidance
- Opportunity to work on meaningful data projects
- Flexible, remote working environment
- Unpaid internship, with a strong possibility of being hired if performance and results meet expectations