Web-scraping-and-API-in-Python

In this repository, you will learn in depth about:

How to use APIs such as the EDAMAM API, GitHub API and iTunes API.
- Initial setup and registration
- Passing parameters
- Testing invalid input
- Investigating output
- Structuring and exporting data
- Sending GET and POST requests
- Pagination
- Extracting results from multiple pages
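
The pagination topics above could be sketched roughly as follows. This is a minimal illustration, not the notebook's code: the function name `fetch_all_pages`, the `page`/`per_page` query parameters (the convention the GitHub REST API uses) and the stop-on-empty-page rule are assumptions. The `session` argument is expected to be a `requests.Session()` or any object with a compatible `get` method.

```python
def fetch_all_pages(session, url, params=None, per_page=30, max_pages=10):
    """Collect items from a paginated JSON endpoint, one page at a time.

    Assumes the endpoint returns a JSON list per page and accepts
    `page` / `per_page` query parameters (as the GitHub REST API does).
    """
    items = []
    for page in range(1, max_pages + 1):
        # Merge the caller's parameters with the paging parameters.
        query = dict(params or {}, page=page, per_page=per_page)
        response = session.get(url, params=query, timeout=10)
        response.raise_for_status()  # surface HTTP errors, e.g. from invalid input
        batch = response.json()
        if not batch:  # an empty page means there is nothing left to fetch
            break
        items.extend(batch)
    return items
```

With the requests library installed, a call might look like `fetch_all_pages(requests.Session(), url)`, where `url` is a list endpoint such as a user's repositories on the GitHub API. The `max_pages` cap is a safety limit so a misbehaving endpoint cannot loop forever.
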
Building a currency converter using an exchange rates API.
- Extracting data on currency exchange rates
- Handling JSON
- Obtaining historical exchange rates
- Extracting data from a time period
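
As a rough sketch of the JSON handling and conversion steps listed above: the payload shape (`base` plus a `rates` mapping), the sample numbers and the `convert` helper are all assumptions made for illustration, not the API response or code used in the notebook.

```python
import json

# Hypothetical payload in the {"base": ..., "rates": {...}} shape that many
# exchange-rate APIs return; the numbers are made up for illustration.
SAMPLE_PAYLOAD = '{"base": "EUR", "date": "2020-01-02", "rates": {"USD": 1.25, "GBP": 0.8}}'

def convert(amount, source, target, payload):
    """Convert `amount` from `source` to `target` using rates quoted against payload['base']."""
    rates = dict(payload["rates"])
    rates[payload["base"]] = 1.0  # the base currency is worth exactly 1 of itself
    if source not in rates or target not in rates:
        raise ValueError(f"unknown currency: {source!r} or {target!r}")
    # Go through the base currency: source -> base -> target.
    return amount / rates[source] * rates[target]

payload = json.loads(SAMPLE_PAYLOAD)  # JSON text -> Python dict
print(convert(100, "EUR", "USD", payload))
print(convert(50, "USD", "GBP", payload))
```

Routing every conversion through the base currency means one response is enough to convert between any two currencies it lists.
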
How to download files with requests.
- Naive downloading
- Streaming the download to a file
- Writing to a file
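
The difference between naive and streamed downloading can be sketched as below. A naive download reads `response.content`, which loads the whole file into memory; streaming writes it chunk by chunk. The helper name `save_stream` and the chunk size are assumptions for illustration; `response` is expected to be a requests response obtained with `stream=True`.

```python
def save_stream(response, path, chunk_size=8192):
    """Write a streamed response to `path` chunk by chunk; return the bytes written."""
    written = 0
    with open(path, "wb") as fh:
        for chunk in response.iter_content(chunk_size=chunk_size):
            if chunk:  # skip empty keep-alive chunks
                fh.write(chunk)
                written += len(chunk)
    return written

# Usage with requests (a real network call, so it is left as a comment;
# `url` is a placeholder for the file you want to download):
# import requests
# with requests.get(url, stream=True, timeout=30) as response:
#     response.raise_for_status()
#     save_stream(response, "downloaded_file.bin")
```
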
Using the BeautifulSoup library.
- Making a GET request and creating the soup
- Exporting the HTML to a file
- Searching and Navigating HTML tree
- Extracting the text
- Extracting data from HTML tree and nested tags
- Searching by attributes
- Processing links and multiple links at once
- Scraping multiple pages automatically
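
A few of the BeautifulSoup operations listed above, sketched on a small inline page. The HTML snippet, its tag names and its attribute values are hypothetical and stand in for a page that would normally come from a GET request.

```python
from bs4 import BeautifulSoup

# Hypothetical page, inlined so the sketch runs without a network request.
HTML = """
<html><body>
  <h1 id="title">Sample page</h1>
  <div class="links">
    <a href="/page/1">First</a>
    <a href="/page/2" class="external">Second</a>
  </div>
</body></html>
"""

soup = BeautifulSoup(HTML, "html.parser")

# Searching by attribute and extracting the text.
title = soup.find("h1", id="title").get_text(strip=True)

# Processing multiple links at once.
hrefs = [a["href"] for a in soup.find_all("a")]

# Searching by class (note the trailing underscore in `class_`).
external = soup.find("a", class_="external").get_text()

# Navigating into a nested tag: the first <a> inside the <div>.
nested = soup.find("div", class_="links").a.get_text()

print(title, hrefs, external, nested)
```
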
Scraping Rotten Tomatoes
- Choosing a parser among html.parser and lxml
- Finding an element containing all the data
- Extracting the title, year and score of each movie (including preprocessing and cleaning)
- Extracting adjusted score, synopsis, critics consensus (plus 2 ways of text processing), directors and cast info
- Representing the data in structured form and exporting the data
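
The preprocessing and cleaning step for title, year and score might look something like the sketch below. The raw string formats ("Title (YYYY)" and "NN%"), the sample values and the `parse_movie` helper are assumptions for illustration; the actual markup and text on the site may differ.

```python
import re

# Assumed raw format: optional whitespace, a title, then the year in parentheses.
TITLE_YEAR = re.compile(r"^\s*(?P<title>.+?)\s*\((?P<year>\d{4})\)\s*$")

def parse_movie(raw_title, raw_score):
    """Turn scraped strings like ' Example Movie (1999) ' and ' 87% ' into clean fields."""
    match = TITLE_YEAR.match(raw_title)
    if match is None:
        raise ValueError(f"unexpected title format: {raw_title!r}")
    return {
        "title": match["title"],
        "year": int(match["year"]),
        "score": int(raw_score.strip().rstrip("%")),  # ' 87% ' -> 87
    }

# Made-up input, not real scraped data.
print(parse_movie("  Example Movie (1999)\n", " 87% "))
```

A list of such dictionaries is already in structured form and can be passed to a tool such as pandas for export.
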
Scraping HTML tables with the help of pandas
- Extracting tables with Beautiful Soup
- Using pandas to extract tables
Exploring the requests-html library.
- Searching for elements and text
- Using CSS selectors to select elements based on ID, class, tag name and other attributes
- Combining different filters together into a compound selector
- Incorporating tag hierarchy
- Scraping data generated by JavaScript via Asynchronous sessions

Files in this repository:
- BeautifulSoup - Navigating tree, extracting data from HTML tree and nested tags, searching by attributes, processing links, scraping multiple pages, .ipynb
- Creating a simple currency converter.ipynb
- Downloading files with requests.ipynb
- EDAMAM API - Initial setup, registration, sending a POST request and testing invalid input.ipynb
- GitHub API - Pagination.ipynb
- README.md
- Scraping HTML Tables with the help of Pandas.ipynb
- Scraping Rotten Tomatoes.ipynb
- Using the requests-html library.ipynb
- iTunes API - Passing parameters, investigating output, structuring and exporting data.ipynb

vishnukanduri/Web-scraping-and-API-in-Python
