Skip to content

Latest commit

 

History

History
45 lines (32 loc) · 1.45 KB

README.md

File metadata and controls

45 lines (32 loc) · 1.45 KB

Tokopedia-Scraper

Overview

The program aimed to extract product data from the Tokopedia marketplace website based on specified keywords using web scraping techniques. Selenium with JavaScript-enabled selectors was utilized to extract the data due to the dynamic elements on the website. The extracted data included product name, price, location, rating, number of items sold, and details link, which were essential for data analysis and market research. The data was saved in both CSV and JSON formats for further processing and analysis.

URL: https://www.tokopedia.com/

Website: Screenshot 2023-02-09 at 21 49 34

Prerequisite

  • Python 3
  • WebDriver

Official Docs

Selenium Documentation: https://www.selenium.dev/documentation/

Installation

Installing Selenium

pip install selenium
pip3 install selenium

Installing tqdm

pip install tqdm
pip3 install tqdm

Disclaimer

This code is intended for educational purposes. Please respect privacy, copyright and code and data terms of use. Read more about Tokopedia here.

Rules

  1. Do not retrieve unnecessary data
  2. Do not use the collected data for illegal purposes
  3. Maintain the confidentiality of the data collected
  4. Documenting data scraping times

Website Version

Last scraping date: 9 February 2023