GitHub - txmxthy/OasisLLM: Offline LLM Query with Vector DB

Enabling Offline Local Querying

About

Oasis transforms your files into vector embeddings, and enables offline querying. We use Chroma to store the embeddings, and LangChain to query them. There are a variety of open source models which can be configured to work with Oasis, the license of the models may vary. Models from Hugging Face are used by default.

You can run everything offline after the initial setup, with an assortment of compatible models available for selection to be made available offline.

The ingestion process uses LangChain
Oasis uses a local LLM to process and understand questions from which it can generate a response.
By using the pretrained models you are able to leverage the cutting edge models available online, and apply a large corpus of your own data to enable top-tier context aware and domain specific performance.
- This context is taken from multiple sources across the local vector database via a similarity search.

Ingestion:

Currently only .txt, .csv, and .pdf files are supported with more to come. Place your files in the /data/ingestion/ folder, and run ingest.py to absorb them into the local vector database. Processing the files may take some time depending on the number of words/tokens in the files.

Database

The database is kept at /data/chroma/ - for a fresh start, delete the subdir index and run ingest.py again.

Querying:

Run main.py to start the program and select the query option. Loading the model may take some time. Once it is loaded the shell will prompt you for a question, enter your question and submit it with enter.

The model will then search the local vector database for the most relevant documents, and use them as context to answer your question. The time taken scales with GPU power.

The model will then print the answer as well as asking if the user would like to see its references. You can choose to see the references before proceeding or not. You are then brought back to the question interface to submit another question.

For a good query, be specific and give context!

Installation

Install Oasis with Git

  git clone https://github.com/txmxthy/oasis.git

Usage

To run oasis, ensure you have the pre-requisites installed, and you have followed the steps in docs/setup.md, then run main.py

  python main.py

Demo

Gif of Oasis in action to come

Contributing

Contributions are always welcome!

See contributing.md for ways to get started. Please adhere to this project's code of conduct.

-- Credit to PrivateGPT for the approach!

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
.github/workflows		.github/workflows
Oasis Logos		Oasis Logos
data		data
docs		docs
utilities		utilities
.env		.env
.gitignore		.gitignore
ingestion.py		ingestion.py
main.py		main.py
oasis.py		oasis.py
readme.md		readme.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Enabling Offline Local Querying

About

Ingestion:

Database

Querying:

Installation

Usage

Demo

Contributing

About

Releases

Packages

Languages

txmxthy/OasisLLM

Folders and files

Latest commit

History

Repository files navigation

Enabling Offline Local Querying

About

Ingestion:

Database

Querying:

Installation

Usage

Demo

Contributing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages