ShopeeFood_Web_Scraping is a Python project designed to scrape restaurant and food-related information from ShopeeFood Vietnam. The goal is to automate data collection for purposes such as food trend analysis, pricing research, and restaurant performance tracking.
This project focuses on collecting and organizing data from ShopeeFood with the following key outputs:
- Restaurant Metadata
- Name, address, categories, coordinates, and delivery info
- Menu Information
- Dishes, prices, descriptions, availability
- Ratings and Reviews
- Average rating, number of reviewers, tags (e.g., "tasty", "fast delivery")
- Python 3.x – core programming language
- Requests – for sending HTTP requests
- Selenium
- Pandas – for processing and exporting structured data
- JSON / CSV – for storing results
- Time / Random – for delay logic to avoid getting blocked
SHOPEEFOOD_WEB_SCRAPING/
├── .gitattributes
├── links_shopee.txt
├── README.md
├── restaurant_shopefood.csv
└── shopeefood_webscraping.ipynb
Format: CSV
Example CSV Output:
- The
get_links.py
script sends requests to category/location-based ShopeeFood URLs. - Parses pagination pages to gather all restaurant page URLs.
- The
get_restaurant_info.py
script loops through restaurant URLs. - Extracts key fields like name, location, menu, prices, ratings, etc.
- Cleans and structures the data for export.
- Data is stored in both
.json
(raw) and.csv
(tabular) formats. - Files are saved into the
/output
directory for later use or analysis.
- Market research on food delivery platforms
- Menu and pricing trend analysis
- Clustering restaurants by cuisine, rating, or delivery zones
- Building dashboards for F&B performance by city or district