Retrieval Augmented Generation (RAG) Demonstrator

Green-AI Hub Mittelstand

About The Project

This demonstrator was developed by the German Research Center for Artificial Intelligence (DFKI) as part of the Green-AI Hub Mittelstand, which is funded by the German Federal Ministry for the Environment, Nature Conservation, Nuclear Safety and Consumer Protection and managed by "Zukunft - Umwelt - Gesellschaft" (ZUG).

We developed this software to show SMEs of all kinds that LLMs can be deployed for custom use cases without relying on large, and sometimes expensive, cloud-hosted LLMs. It also demonstrates that deploying an LLM does not have to incur the large carbon footprint of a dedicated deep-learning server: the software runs on a current MacBook Pro.

Features

RAG implementation using Apple's MPS back end, or CUDA (upcoming) for NVIDIA GPUs
Web interface for easy interaction with the RAG system
Interface for adding custom Wikipedia articles to the knowledge base
Customizable knowledge base for retrieval
Performance metrics and comparisons
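
The back-end choice named in the first feature can be sketched as follows. This is a minimal illustration that assumes the demonstrator runs on PyTorch (the text above names only the `mps` and `cuda` back ends); the helper name `pick_device` is hypothetical and not taken from the repository.

```python
def pick_device() -> str:
    """Return the name of the best available accelerator, falling back to the CPU."""
    try:
        import torch  # assumption: PyTorch is the underlying framework
    except ImportError:
        return "cpu"

    # Apple Silicon GPUs are exposed through the Metal Performance Shaders back end.
    mps = getattr(torch.backends, "mps", None)
    if mps is not None and mps.is_available():
        return "mps"
    # NVIDIA GPUs are exposed through CUDA.
    if torch.cuda.is_available():
        return "cuda"
    return "cpu"


if __name__ == "__main__":
    print(f"Using device: {pick_device()}")
```

The function only reports a device name; moving a model to that device is left to the calling code.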

Why RAG?

Retrieval Augmented Generation offers several advantages over traditional fine-tuning approaches, particularly from a sustainability perspective:

Reduced computational resources: RAG doesn't require retraining the entire model, significantly reducing the computational power needed compared to fine-tuning.
Lower energy consumption: With less computation required, RAG consumes less energy, making it a more environmentally friendly option.
Easier updates: The knowledge base can be updated independently of the model, allowing for more frequent and efficient information updates without the need for resource-intensive retraining.
Smaller carbon footprint: The combination of reduced computation and energy usage results in a smaller overall carbon footprint for RAG systems.
Scalability: RAG can handle larger and more diverse knowledge bases without the increase in model size and training time associated with fine-tuning.

By choosing RAG over fine-tuning, we can create more sustainable AI solutions that are both powerful and environmentally responsible.
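
To make the "easier updates" point concrete, here is a toy sketch of the retrieval step. It is not the demonstrator's retrieval code: it swaps in a plain bag-of-words cosine similarity over an in-memory list, and the class `ToyKnowledgeBase` and its methods are hypothetical names chosen for illustration.

```python
import math
import re
from collections import Counter
from typing import List


def tokenize(text: str) -> List[str]:
    """Lower-case the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


class ToyKnowledgeBase:
    """In-memory stand-in for a real vector store."""

    def __init__(self) -> None:
        self.docs: List[str] = []

    def add(self, doc: str) -> None:
        # Adding knowledge is a data operation; no model weights change.
        self.docs.append(doc)

    def retrieve(self, query: str, k: int = 1) -> List[str]:
        # Rank all documents by similarity to the query and keep the top k.
        q = Counter(tokenize(query))
        ranked = sorted(self.docs, key=lambda d: cosine(q, Counter(tokenize(d))), reverse=True)
        return ranked[:k]


kb = ToyKnowledgeBase()
kb.add("The demonstrator was developed by DFKI as part of the Green-AI Hub Mittelstand.")
kb.add("RAG retrieves documents from a knowledge base and adds them to the prompt.")
kb.add("The demonstrator can be deployed on a current MacBook Pro.")

top = kb.retrieve("Which laptop can the demonstrator be deployed on?")
print(top[0])
```

A production system would typically replace the word counts with learned embeddings, but the update path stays the same: new documents are added to the store while the model remains unchanged.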

Getting Started

Clone this repository, change into its directory in your terminal, and follow the steps below.

Prerequisites

Install the required libraries into your local Python environment:

pip install -r requirements.txt

If you want to use the DeepL translation service for improved answer quality, insert your API key at line 202 of app.py.
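
One possible alternative to editing the source is to read the key from the environment, which keeps it out of version control. The sketch below is an assumption, not how `app.py` currently works: the variable name `DEEPL_API_KEY` and the helper `load_deepl_key` are hypothetical.

```python
import os
from typing import Optional


def load_deepl_key(env_var: str = "DEEPL_API_KEY") -> Optional[str]:
    """Return the DeepL API key from the environment, or None if it is not set."""
    key = os.environ.get(env_var, "").strip()
    return key or None


if __name__ == "__main__":
    if load_deepl_key() is None:
        print("No DeepL key found in the environment.")
    else:
        print("DeepL key loaded from the environment.")
```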

Starting the software

After installing the required packages, start the demonstrator as follows.

Navigate to the repository with your terminal.
Start the application:
```
python3 app.py
```
If the start succeeds, the terminal shows the URL and port at which the web interface is available.

Please note that the first start-up requires an internet connection and can take some time, because the LLM has to be downloaded first. The downloaded files take up several gigabytes of disk space.

Usage

How to Use the LLM RAG Demonstrator

The LLM RAG Demonstrator provides a web interface for comparing the outputs of a standard LLM (large language model) with those of an LLM enhanced by RAG (Retrieval-Augmented Generation). Follow the steps below to use the demonstrator:

Access the Demonstrator:
- Open your web browser and navigate to the URL shown in the terminal at start-up.
User Input:
- In the center of the page, you will find a text box labeled "User Input".
- Enter your query or text prompt in this text box. This can be a question or a statement that you want the models to respond to.
Submit the Query:
- After entering your input, click the green "Submit" button located below the text box.
View Outputs:
- Once you submit your query, the page will display two sections for outputs:
  - LLM Default Output: This section shows the response generated by the standard LLM.
  - LLM RAG Output: This section shows the response generated by the LLM enhanced with Retrieval-Augmented Generation.
- Both outputs will appear in designated boxes labeled "Output...".
Compare Results:
- Compare the outputs from both sections to analyze the differences in responses.
- The RAG-enhanced output should ideally provide more accurate and contextually relevant information due to the integration of retrieved documents or data.
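
The comparison described above comes down to sending the same query to the model twice, once on its own and once together with retrieved context. The following is a minimal sketch under stated assumptions: `generate` is a hypothetical callable standing in for the LLM, and the prompt wording and function names are illustrative rather than taken from `app.py`.

```python
from typing import Callable, Dict, List


def build_default_prompt(query: str) -> str:
    """Prompt for the standard LLM: the query alone."""
    return f"Question: {query}\nAnswer:"


def build_rag_prompt(query: str, passages: List[str]) -> str:
    """Prompt for the RAG path: retrieved passages are placed before the query."""
    context = "\n".join(f"- {p}" for p in passages)
    return f"Use the context to answer.\nContext:\n{context}\nQuestion: {query}\nAnswer:"


def compare(query: str, passages: List[str], generate: Callable[[str], str]) -> Dict[str, str]:
    """Run the same query through both paths and return the two answers."""
    return {
        "default": generate(build_default_prompt(query)),
        "rag": generate(build_rag_prompt(query, passages)),
    }


def echo_model(prompt: str) -> str:
    """Stand-in for a real LLM: reports how many prompt lines it received."""
    return f"(model saw {len(prompt.splitlines())} prompt lines)"


result = compare(
    "Where can the demonstrator run?",
    ["The demonstrator can be deployed on a current MacBook Pro."],
    echo_model,
)
for name, answer in result.items():
    print(f"{name}: {answer}")
```

With a real model in place of `echo_model`, the only difference between the two answers is the extra context in the second prompt, which is what the two output boxes in the web interface let you inspect.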

Developers

The overall concept of the demonstrator was developed by Cornelius Wolff, while the front end, the Wikipedia feature and the retrieval process were developed by Leonie Grafweg.

Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement". Don't forget to give the project a star! Thanks again!

Fork the Project
Create your Feature Branch (git checkout -b feature/AmazingFeature)
Commit your Changes (git commit -m 'Add some AmazingFeature')
Push to the Branch (git push origin feature/AmazingFeature)
Open a Pull Request

License

Distributed under the MIT License. See the LICENSE file for more information.

Contact

Green-AI Hub Mittelstand - info@green-ai-hub.de

Project Link: https://github.com/Green-AI-Hub-Mittelstand/LLM-Demonstrator

