Scraping TripAdvisor with Python 2020 *

A Python implementation of TripAdvisor web scraping with Selenium, written for the new 2020 version of the website.

There are two scripts:

"restaurants_scraper.py" to scrape restaurants
"things_to_do_scraper.py" to scrape hotels, attractions and monuments

The Python code is commented; write to me if you have doubts. If you have a slow connection and run into problems with the code, try increasing the number of seconds passed to the time.sleep() calls.
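As an alternative to raising fixed time.sleep() delays, a polling helper can wait only as long as needed. The following is a minimal sketch and is not taken from the scripts; the name wait_until and its parameters are hypothetical.

```python
import time


def wait_until(condition, timeout=10.0, poll_interval=0.5):
    """Call `condition` repeatedly until it returns a truthy value.

    Returns that value, or raises TimeoutError once `timeout` seconds
    have passed without a truthy result.
    """
    deadline = time.monotonic() + timeout
    while True:
        result = condition()
        if result:
            return result
        if time.monotonic() >= deadline:
            raise TimeoutError(f"condition not met within {timeout} seconds")
        # Sleep briefly between checks instead of one long fixed pause.
        time.sleep(poll_interval)
```

With a Selenium driver, the condition could be something like lambda: driver.find_elements("xpath", some_xpath), where some_xpath is a placeholder for whatever element the page must show before scraping continues. Selenium also ships its own WebDriverWait class for the same purpose.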

Features implemented:

A click function that opens the "More" button of each review
A click function that moves to the next page
A CSV file with the date, the score, the title and the full text of each review
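The CSV output described above could be written along these lines. This is a sketch under assumptions, not the code in the scripts: the function name save_reviews, the header names in FIELDS, and the append option are all hypothetical.

```python
import csv

# Column names are an assumption; the README only lists the four fields.
FIELDS = ["date", "score", "title", "review"]


def save_reviews(csv_path, reviews, append=False):
    """Write (date, score, title, review) tuples to `csv_path`.

    With append=False the file is overwritten and a header row is written.
    With append=True the rows are added to the end of an existing file.
    """
    mode = "a" if append else "w"
    # newline="" lets the csv module handle line endings, so reviews that
    # contain commas or line breaks are quoted correctly.
    with open(csv_path, mode, newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        if not append:
            writer.writerow(FIELDS)
        writer.writerows(reviews)
```

Calling save_reviews once per scraped page with append=True keeps already collected reviews on disk if a later page fails.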

How to use:

First approach: download the Python file, open it and edit the default fields (CSV file path, number of pages, TripAdvisor URL), then run it.
Second approach: download the file and launch it directly from the terminal, passing in this order:
- the path of the CSV file where the reviews will be stored
- the number of pages you want to scrape
- the URL of the TripAdvisor page you want to scrape

Command to paste into the terminal: python3 path_to_downloaded_script/things_to_do_scraper.py desktop/reviews.csv 50 https://www.tripadvisor.com/Attraction_Review-g187791-d192285-Reviews-Colosseum-Rome_Lazio.html
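The three positional arguments in the command above, with a fallback to defaults when they are omitted, could be handled as in the sketch below. It is an illustration only: the scripts may read the arguments differently, and parse_args and the DEFAULT_* values are hypothetical placeholders.

```python
import argparse

# Placeholder defaults for illustration; the real defaults are the fields
# edited inside the scripts and are not reproduced here.
DEFAULT_CSV = "reviews.csv"
DEFAULT_PAGES = 10
DEFAULT_URL = (
    "https://www.tripadvisor.com/"
    "Attraction_Review-g187791-d192285-Reviews-Colosseum-Rome_Lazio.html"
)


def parse_args(argv=None):
    """Parse: csv_path, num_pages, url (all optional, in this order)."""
    parser = argparse.ArgumentParser(description="Scrape TripAdvisor reviews")
    parser.add_argument("csv_path", nargs="?", default=DEFAULT_CSV,
                        help="CSV file where the reviews will be stored")
    parser.add_argument("num_pages", nargs="?", type=int, default=DEFAULT_PAGES,
                        help="number of review pages to scrape")
    parser.add_argument("url", nargs="?", default=DEFAULT_URL,
                        help="TripAdvisor page to scrape")
    # argv=None makes argparse read sys.argv[1:], as in a normal launch.
    return parser.parse_args(argv)
```

Making every argument optional keeps both approaches working: launching the script with no arguments uses the edited defaults, while the terminal command overrides them.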

What I used:

Python 3.8.2
Selenium 3.141.0
Safari 14.0.1
Visual Studio Code 1.51.1
MacBook Pro 13" M1 2020 with macOS Big Sur 11.0.1

*This activity has been supported by a grant from the Project IDEHA - PON "Ricerca e Innovazione" 2014-2020 - Innovation for Data Elaboration in Heritage Areas, Azione II

giuseppegambino/Scraping-TripAdvisor-with-Python-2020
