A news aggregator combines a web crawler with a web application, and both can be implemented in Python. The web application collects data (news articles) from multiple websites and presents it in one location.
Front-End Development | Back-End Development |
---|---|
HTML | Python |
CSS | Django |
JavaScript | - |
- Django Framework
- BeautifulSoup
- requests
- Django Framework: Django is a high-level Python web framework that enables fast development of secure websites. Built by experienced developers, it takes care of many of the difficult tasks of web development, so the user can focus on writing the application without worrying about the underlying tools. Django is free and open source, and it has a large community.
- BeautifulSoup: Beautiful Soup is a Python package for parsing HTML and XML documents, including documents with malformed markup such as non-closed tags (the name comes from "tag soup"). It creates a parse tree for each parsed page that can be used to extract data from HTML, which is useful for web scraping.
- requests module: Requests is a widely used third-party Python library for making HTTP requests to a specified URL. It is worth learning whether you work with REST APIs or web scraping. A request to a URI returns a response, and Requests provides built-in functionality for managing both the request and the response.
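To make the roles of the two libraries concrete, here is a minimal sketch of how they are commonly combined. The function names `fetch_html` and `extract_links` and the sample HTML are illustrative assumptions, not part of the original project, and the demonstration runs on an inline snippet so that no network access is needed.

```python
import requests
from bs4 import BeautifulSoup


def fetch_html(url, timeout=10):
    """Download a page and return its HTML text (hypothetical helper)."""
    response = requests.get(url, timeout=timeout)
    response.raise_for_status()  # raise an exception for 4xx/5xx responses
    return response.text


def extract_links(html):
    """Return (text, href) pairs for every <a> tag that has an href."""
    # "html.parser" is the parser that ships with Python's standard library.
    soup = BeautifulSoup(html, "html.parser")
    return [(a.get_text(strip=True), a["href"]) for a in soup.find_all("a", href=True)]


# Offline demonstration: invented markup whose closing </div> is deliberately missing.
SAMPLE_HTML = (
    '<div><h2><a href="/a">First story</a></h2>'
    '<h2><a href="/b">Second story</a></h2>'
)
links = extract_links(SAMPLE_HTML)
print(links)
```

In a real run, `extract_links(fetch_html(url))` would chain the two steps. Passing a `timeout` is a reasonable precaution, since `requests.get` otherwise waits indefinitely for a slow server.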
Step 1: Setting Up the Project
Step 2: Fetching Web Page Content
Step 3: Parsing HTML Content with Beautiful Soup
Step 4: Extracting News Articles
Step 5: Displaying the Aggregated News
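The steps above might be sketched as follows, assuming the dependencies were installed in Step 1 with `pip install django requests beautifulsoup4`. Everything named here is a placeholder: the `SOURCES` URLs and CSS selectors do not describe real site layouts, the helper names are hypothetical, and `render` is a plain-text stand-in for what a Django view and template would do in Step 5.

```python
from urllib.parse import urljoin

import requests
from bs4 import BeautifulSoup

# Hypothetical sources: URLs and selectors are placeholders, not real site layouts.
SOURCES = {
    "Example News": {"url": "https://news.example.com/", "selector": "h2 a"},
    "Example Tech": {"url": "https://tech.example.com/", "selector": "h3.title a"},
}


def fetch_html(url):
    """Step 2: download a page with requests."""
    response = requests.get(url, timeout=10)
    response.raise_for_status()
    return response.text


def extract_articles(html, base_url, selector):
    """Steps 3 and 4: parse the HTML and pull out title/link pairs."""
    soup = BeautifulSoup(html, "html.parser")
    articles = []
    for link in soup.select(selector):
        href = link.get("href")
        title = link.get_text(strip=True)
        if href and title:
            # Resolve relative links against the page they came from.
            articles.append({"title": title, "url": urljoin(base_url, href)})
    return articles


def aggregate(sources, fetch=fetch_html):
    """Collect articles from every source, skipping sources that fail to load."""
    collected = []
    for name, cfg in sources.items():
        try:
            html = fetch(cfg["url"])
        except requests.RequestException:
            continue
        for article in extract_articles(html, cfg["url"], cfg["selector"]):
            collected.append({"source": name, **article})
    return collected


def render(articles):
    """Step 5: plain-text stand-in for a Django template."""
    return "\n".join(f"[{a['source']}] {a['title']} - {a['url']}" for a in articles)


# Offline demo: canned HTML stands in for the network.
CANNED = {
    "https://news.example.com/": '<h2><a href="/world/1">World story</a></h2>',
    "https://tech.example.com/": (
        '<h3 class="title"><a href="https://tech.example.com/ai">AI story</a></h3>'
    ),
}
articles = aggregate(SOURCES, fetch=CANNED.__getitem__)
print(render(articles))
```

Passing the fetch function as a parameter keeps the parsing and aggregation logic testable without network access. In a Django project, the list returned by `aggregate` would typically be handed to a template through a view's context rather than printed.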