
LLamaIndex Integration #12

Open · wants to merge 6 commits into main

Conversation

@gallegi gallegi commented Nov 22, 2024

  • LlamaIndex integration using workflows (see the sketch after this list)
  • Continue conversations (ask follow-up questions)
  • RAG pipeline: allows dropping in multiple PDF, DOC, and TXT files and asking related questions
  • Fix text cut-off in the chat box
  • Display Markdown in the chat box
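
For reference, a minimal sketch of the llama_index workflow pattern this PR builds on (llama-index-core 0.12.x style; ChatWorkflow and SetupEvent are hypothetical names here, the PR defines its own events and steps):

from llama_index.core.workflow import (
    Context,
    Event,
    StartEvent,
    StopEvent,
    Workflow,
    step,
)

class SetupEvent(Event):
    pass

class ChatWorkflow(Workflow):
    @step
    async def setup(self, ctx: Context, ev: StartEvent) -> SetupEvent:
        # Store frequently used variables on the shared context.
        await ctx.set("query", ev.get("query", ""))
        return SetupEvent()

    @step
    async def answer(self, ctx: Context, ev: SetupEvent) -> StopEvent:
        query = await ctx.get("query")
        # Retrieval and the LLM call would go here in the real pipeline.
        return StopEvent(result=f"Answer for: {query}")

# Usage: result = await ChatWorkflow(timeout=60).run(query="hello")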

self.node_processor = SimilarityPostprocessor(similarity_cutoff=0.3)
self.llm = llm

def udpate_index(self, files: Optional[Set[str]] = set()):
Member

Should it be update_index?
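
If so, a possible corrected signature, which also avoids the mutable default (a sketch, not the PR's code):

from typing import Optional, Set

def update_index(self, files: Optional[Set[str]] = None):
    # Normalize None to an empty set instead of sharing a mutable default.
    files = files if files is not None else set()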


@step
async def setup(self, ctx: Context, ev: StartEvent) -> SetupEvent:
# set frequetly used variables to context
Member

frequently?

messages=[{"role": "user", "content": message}], stream=stream
)

import asyncio
Member

Move import to top?

@@ -9,21 +10,31 @@ class ProcessingThread(QThread):
update_signal = pyqtSignal(str)
finished_signal = pyqtSignal()

-def __init__(self, model, prompt, image=None):
+def __init__(self, model, prompt, lookup_files=set(), image=None):
Member

Using set() as a default parameter value can be dangerous because it creates a mutable default argument, which is a common Python pitfall. The same set object will be shared across all instances of the class. Instead, use None and create the set inside the method:

def __init__(self, model, prompt, lookup_files=None, image=None):
    self.lookup_files = set() if lookup_files is None else lookup_files
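
To illustrate the pitfall with a hypothetical helper (not from this PR): the default set is created once, at function definition time, so state leaks across calls:

def track(event, seen=set()):
    seen.add(event)
    return seen

track("a")  # {'a'}
track("b")  # {'a', 'b'} <- 'a' leaked in from the previous call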

document_icon = "llama_assistant/resources/document_icon.png"

# for RAG pipeline
embed_model_name = "BAAI/bge-base-en-v1.5"
Member

Add a TODO: Make it configurable next time.
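
As a sketch of what "configurable" could look like (assuming the project keeps using llama_index's HuggingFaceEmbedding; the settings dict is hypothetical):

from llama_index.embeddings.huggingface import HuggingFaceEmbedding

# TODO: Make the embedding model configurable.
DEFAULT_EMBED_MODEL = "BAAI/bge-base-en-v1.5"

def load_embed_model(settings: dict) -> HuggingFaceEmbedding:
    # Fall back to the current hard-coded default when no setting is given.
    model_name = settings.get("embed_model_name", DEFAULT_EMBED_MODEL)
    return HuggingFaceEmbedding(model_name=model_name)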

@@ -7,4 +8,8 @@ huggingface_hub==0.25.1
openwakeword==0.6.0
pyinstaller==6.10.0
ffmpeg-python==0.2.0
llama-index-core==0.12.0
Member

Add the new requirements to pyproject.toml so the package is installable from PyPI.

@vietanhdev
Member

Error due to missing package:

 File "/opt/homebrew/Caskroom/miniforge/base/envs/la/lib/python3.11/site-packages/llama_index/core/readers/file/base.py", line 67, in _try_loading_included_file_formats
    raise ImportError("`llama-index-readers-file` package not found")
ImportError: `llama-index-readers-file` package not found
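
(Likely resolved by installing llama-index-readers-file, the package named in the error, and adding it to the requirements.)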

@vietanhdev
Member

vietanhdev commented Nov 23, 2024

Error while inference:

 File "/opt/homebrew/Caskroom/miniforge/base/envs/la/lib/python3.11/site-packages/llama_cpp/llama_chat_format.py", line 289, in _convert_text_completion_chunks_to_chat
    for i, chunk in enumerate(chunks):
  File "/opt/homebrew/Caskroom/miniforge/base/envs/la/lib/python3.11/site-packages/llama_cpp/llama.py", line 1269, in _create_completion
    raise ValueError(
ValueError: Requested tokens (2387) exceed context window of 2048

File:
sockets.txt

Question:

Is socket supported in AnyLearning?

Fix:

This works after updating all context lengths (2048) to 4096. The answer was based on the content of the text file. Good job!

TODO: Make it configurable.
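
For reference, a sketch of how the larger context length would be passed with llama-cpp-python (n_ctx is that library's parameter; the model path is a placeholder):

from llama_cpp import Llama

# TODO: Make the context length configurable instead of hard-coding it.
llm = Llama(model_path="path/to/model.gguf", n_ctx=4096)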

@vietanhdev
Member

Implementing #13.

@vietanhdev vietanhdev changed the title from "Update features" to "LLamaIndex Integration" on Nov 23, 2024