Some months ago, as a proof of concept and weekend project, I created a Q&A chatbot based on GPT-3.5 for Optimizely's support documentation.
I am now sharing that hacky code because, with some tweaks, I think it can serve as a base for other scenarios.
The data collection is arguably the most difficult step. Here, it is done automatically and recursively by collecting all internal links of http://support.optimizely.com/ using linkchecker (see the sketch below).
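For illustration, here is a minimal runnable stand-in for that step using requests and Beautiful Soup instead of linkchecker; the starting URL is the one above, while the function name and the page limit are my own assumptions:

```python
from urllib.parse import urldefrag, urljoin

import requests
from bs4 import BeautifulSoup

START_URL = "https://support.optimizely.com/"

def collect_internal_links(start_url: str, limit: int = 500) -> set[str]:
    """Breadth-first crawl that keeps only links living under start_url."""
    seen, queue = set(), [start_url]
    while queue and len(seen) < limit:
        url = queue.pop(0)
        if url in seen:
            continue
        seen.add(url)
        try:
            resp = requests.get(url, timeout=10)
        except requests.RequestException:
            continue  # skip unreachable pages
        soup = BeautifulSoup(resp.text, "html.parser")
        for a in soup.find_all("a", href=True):
            # Resolve relative links and drop #fragments before queueing.
            link = urldefrag(urljoin(url, a["href"])).url
            if link.startswith(start_url) and link not in seen:
                queue.append(link)
    return seen
```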
Then, for each URL found, the content of interest (i.e., an HTML element with a given class, assumed to be the same on every page) is scraped with Selenium and Beautiful Soup, cleaned, and stored in a local FAISS vector database, roughly as in the sketch below.
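A hypothetical sketch of the scraping step: render a page with Selenium and extract the text of one HTML class with Beautiful Soup. The class name "article-body" is an assumption; use whatever class wraps the content on your pages.

```python
from bs4 import BeautifulSoup
from selenium import webdriver

def scrape_page(driver: webdriver.Chrome, url: str,
                content_class: str = "article-body") -> str:
    """Render url in the browser and return the cleaned text of one element."""
    driver.get(url)
    soup = BeautifulSoup(driver.page_source, "html.parser")
    node = soup.find(class_=content_class)
    if node is None:
        return ""
    # Basic cleaning: flatten to plain text and collapse runs of whitespace.
    return " ".join(node.get_text(separator=" ").split())

driver = webdriver.Chrome()  # assumes a Chrome driver is available locally
try:
    text = scrape_page(driver, "https://support.optimizely.com/")
finally:
    driver.quit()
```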
Each pair (cleaned text, source URL) is then used together with the GPT-3.5-family model text-davinci-003 to answer questions with sources via LangChain, as sketched below.
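A hedged sketch of that last step, using 0.0.x-era LangChain imports to match the Pipfile (class and module names have moved in later releases). The example page dict and question are made up; the `{"source": url}` metadata is what lets the chain cite its sources, and the pickle/write_index dance was a common way to persist a FAISS store at the time.

```python
import pickle

import faiss
from langchain.chains import VectorDBQAWithSourcesChain
from langchain.embeddings import OpenAIEmbeddings
from langchain.llms import OpenAI
from langchain.text_splitter import CharacterTextSplitter
from langchain.vectorstores import FAISS

# Placeholder input: {url: cleaned page text}, e.g. from the scraping sketch.
pages = {"https://support.optimizely.com/example": "cleaned page text ..."}

# Split each page into chunks, remembering the source URL of every chunk.
splitter = CharacterTextSplitter(chunk_size=1000, chunk_overlap=100)
texts, metadatas = [], []
for url, page_text in pages.items():
    for chunk in splitter.split_text(page_text):
        texts.append(chunk)
        metadatas.append({"source": url})

# Requires the OPENAI_API_KEY environment variable to be set.
store = FAISS.from_texts(texts, OpenAIEmbeddings(), metadatas=metadatas)

# Persist the store locally; the raw faiss index is written separately
# because it is not directly picklable.
faiss.write_index(store.index, "docs.index")
store.index = None
with open("faiss_store.pkl", "wb") as f:
    pickle.dump(store, f)
store.index = faiss.read_index("docs.index")

# Question answering with sources over the vector store.
chain = VectorDBQAWithSourcesChain.from_llm(
    llm=OpenAI(model_name="text-davinci-003", temperature=0),
    vectorstore=store,
)
result = chain({"question": "How do I create an experiment?"})
print(result["answer"], result["sources"])
```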
The architecture:
The main prerequisites (detailed in the Pipfile) are:
- Python 3.10
- LangChain 0.0.55
How to run:
- Clone this repo
- Open a terminal within that folder
- Run `pip install pipenv`
- Run `pipenv install` to install the dependencies from the Pipfile
- Run `pipenv shell`
- Run `python build_db.py`
- Finally, run `streamlit run bot.py` to interact with the bot
You need to customize `build_db.py` (see the sketch after this list):
- add your OpenAI API key
- define your starting URLs
- define any additional URL filtering rules
- define the HTML class that contains the text at those URLs
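For illustration, a hypothetical set of such values; all names here are assumptions, not the actual variables in `build_db.py`:

```python
import os

os.environ["OPENAI_API_KEY"] = "sk-..."           # your OpenAI key
START_URLS = ["https://support.optimizely.com/"]  # where the crawl begins

def keep_url(url: str) -> bool:
    # Example of an additional filtering rule: skip direct file downloads.
    return not url.endswith(".pdf")

CONTENT_CLASS = "article-body"  # HTML class wrapping the page text (assumed)
```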
You need to customize `bot.py`:
- add your OpenAI API key
Since the data collection relies on web scraping, please use this carefully and in accordance with the applicable legal regulations.