The program aimed to extract product data from the Tokopedia marketplace website based on specified keywords using web scraping techniques. Selenium with JavaScript-enabled selectors was utilized to extract the data due to the dynamic elements on the website. The extracted data included product name, price, location, rating, number of items sold, and details link, which were essential for data analysis and market research. The data was saved in both CSV and JSON formats for further processing and analysis.
URL: https://www.tokopedia.com/
- Python 3
- WebDriver
Selenium Documentation: https://www.selenium.dev/documentation/
pip install selenium
pip3 install selenium
pip install tqdm
pip3 install tqdm
This code is intended for educational purposes. Please respect privacy, copyright and code and data terms of use. Read more about Tokopedia here.
- Do not retrieve unnecessary data
- Do not use the collected data for illegal purposes
- Maintain the confidentiality of the data collected
- Documenting data scraping times
Last scraping date: 9 February 2023