An interface that extracts information from the web through scraping and summarizes the given data.

AAC-Open-Source-Pool/Text-Summarization-and-information-extraction

Introduction

  • In an era of information overload, the "Text Summarization and Information Extraction Tool" gives users an efficient way to process textual content. It combines Natural Language Processing (NLP) with web scraping to summarize text and retrieve information from both user-provided text and online sources such as Wikipedia articles.
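To make the idea of summarization concrete, here is a minimal sketch of frequency-based extractive summarization using only the Python standard library. It is an illustration under stated assumptions, not the project's implementation: the function names `split_sentences` and `summarize`, the tiny stop-word list, and the scoring rule are all hypothetical, and the project itself may rely on spaCy or a different approach entirely.

```python
import re
from collections import Counter

# Tiny illustrative stop-word list (an assumption; NLP libraries ship fuller lists).
STOP_WORDS = {"a", "an", "the", "is", "are", "of", "to", "and", "in",
              "it", "that", "on", "for", "with", "as", "by", "this"}


def split_sentences(text):
    """Naively split text after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def summarize(text, max_sentences=2):
    """Return the highest-scoring sentences, kept in their original order."""
    sentences = split_sentences(text)

    # Count how often each non-stop-word occurs in the whole text.
    words = [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOP_WORDS]
    freq = Counter(words)

    def score(sentence):
        # A sentence scores the sum of the frequencies of its words;
        # stop words are absent from `freq`, so they contribute 0.
        return sum(freq[t] for t in re.findall(r"[a-z']+", sentence.lower()))

    # Rank sentence indices by score, then restore document order.
    ranked = sorted(range(len(sentences)), key=lambda i: score(sentences[i]), reverse=True)
    keep = sorted(ranked[:max_sentences])
    return " ".join(sentences[i] for i in keep)


if __name__ == "__main__":
    sample = "Cats sleep a lot. Cats hunt mice at night. Dogs bark."
    print(summarize(sample, max_sentences=1))
```

Summing word frequencies favors longer sentences; dividing the score by sentence length is a common adjustment if that bias matters.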

Requirements

To run the code, make sure the following are installed on your local machine.

  • Python 3.11.x
  • Streamlit 1.27.x
  • spacy 3.6.x
  • requests 2.31.x
  • pdfplumber 0.10.x
  • beautifulsoup4 4.12.x
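One way to confirm that the listed libraries are present is a small check built on the standard library's `importlib.metadata`. This is a sketch, not part of the repository: the `REQUIRED` mapping and the `check_requirements` helper are hypothetical names, and the distribution names are assumed to match the PyPI package names.

```python
from importlib.metadata import PackageNotFoundError, version

# Expected version prefixes, taken from the requirements listed above.
# The distribution names are assumed to be the PyPI package names.
REQUIRED = {
    "streamlit": "1.27",
    "spacy": "3.6",
    "requests": "2.31",
    "pdfplumber": "0.10",
    "beautifulsoup4": "4.12",
}


def check_requirements(required=REQUIRED):
    """Return a dict mapping each package name to a short status string."""
    report = {}
    for name, prefix in required.items():
        try:
            installed = version(name)
        except PackageNotFoundError:
            report[name] = "missing"
            continue
        # Accept an exact match or any patch release under the prefix.
        ok = installed == prefix or installed.startswith(prefix + ".")
        report[name] = f"ok ({installed})" if ok else f"mismatch ({installed})"
    return report


if __name__ == "__main__":
    for name, status in check_requirements().items():
        print(f"{name}: {status}")
```

The check only reports status; it does not install or upgrade anything.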

How to use

  1. Clone this repo:
     git clone https://github.com/AAC-OSP/Text-Summarization-and-information-extraction
  2. Install the required libraries listed under Requirements.
  3. Execute the Python script.
  4. Star this repo if you liked it 😄

Contribution

This section provides instructions and details on how to submit a contribution via a pull request. It is important to follow these guidelines to make sure your pull request is accepted.

  1. Before proposing changes to this project, read the project's readme.md file to understand the philosophy and motive behind it. The pull request should align with the philosophy and motive of the project's original author.
  2. Write your changes in the same programming language that the project uses. The versions of the programming language and of any libraries used should also match the original code.
  3. Document the changes you are proposing. The documentation should include any problems you have noticed in the code, the changes you would like to propose, the reasons for these changes, and sample test cases. These topics are a minimum and not an exhaustive list; include anything else that is relevant.
  4. Submit a pull request, following Git etiquette.
