Project overview: Design and build a high‑performance full‑text search engine capable of crawling, indexing, and retrieving information efficiently. The system includes a web crawler to gather data, an indexing engine using inverted indexes, and a ranking algorithm to return the most relevant results.
Objectives:
- Build a robust web crawler to traverse links and extract content from diverse web pages or local documents
- Implement an inverted index structure that answers search queries near‑instantaneously across large datasets
- Develop a ranking algorithm that orders results by relevance, based on keyword frequency and link authority
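For illustration only, the indexing and ranking objectives above can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the project's prescribed design: the function names (tokenize, build_index, search), the sample documents, and the per‑document authority weights are all hypothetical, and the score is a plain term‑frequency sum multiplied by an assumed authority value.

```python
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Build an inverted index: term -> {doc_id: term frequency}."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for term, freq in Counter(tokenize(text)).items():
            index[term][doc_id] = freq
    return index


def search(index, query, authority=None):
    """Rank documents by summed term frequency of the query terms.

    `authority` is an assumed, precomputed {doc_id: weight} mapping standing
    in for link authority; documents without an entry get weight 1.0.
    """
    authority = authority or {}
    scores = defaultdict(float)
    for term in tokenize(query):
        for doc_id, freq in index.get(term, {}).items():
            scores[doc_id] += freq * authority.get(doc_id, 1.0)
    # Highest score first; ties broken by doc_id for a stable order.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical sample corpus.
docs = {
    "a": "search engines index the web",
    "b": "the crawler feeds the index, and the index feeds search",
    "c": "ranking uses link authority",
}
index = build_index(docs)
results = search(index, "index search")
boosted = search(index, "index search", authority={"a": 2.0})
```

Only documents that contain a query term are ever scored, which is what keeps lookups fast as the corpus grows. A production engine would typically replace the raw frequency sum with a scheme such as TF‑IDF or BM25 and derive authority from the link graph.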
Technologies:
- Languages: Python, Java, or C++ (core engine)
- Search core: Elasticsearch, Apache Solr, or custom Lucene‑based implementation
- Storage: NoSQL
- Front‑end: React or Vue.js
Profile and requirements:
- Bac+3 to Bac+5 in Software Engineering, Data Science, or Backend Systems (Bac+4/Bac+5 preferred; engineering cycle or master’s degree)
- Strong backend/software engineering foundations and interest in information retrieval, indexing, and ranking
Internship details:
- Duration: 4–6 months
- Mode: Remote
Application package:
- Updated CV
- One‑page cover letter stating the project reference code (PFE‑05) and your relevant skills and experience
Selection process:
1) Screening of application package and pre‑selection
2) Interview: short technical and product‑focused discussion
3) Validation: tripartite agreement between Student, University, and IMEDRA SPHERE
Subject line for email: Application PFE – PFE‑05 – [Full Name]
📧 To apply:
administration@imedrasphere.com