This AI Chatbot Starter is designed to help developers find the information they need to debug their issues.
It should answer customer questions about the products or services specified.
- Clone the repository
- Make sure you have Python 3.11 installed
- Get required Astra credentials from Anant
- Get Langflow id and flow id from Astra langflow
Now follow the steps below to get the chatbot up and running.
Documentation (provided as a list of web urls in the config.yml
) can be ingested into your Astra DB Collection. Follow these steps:
-
Install the requirements using the following command:
pip install -r requirements.txt pip uninstall openai # if fails use sudo pip uninstall openai pip install openai==0.28
-
Obtain your OpenAI API Key from the OpenAI Settings page.
-
Create a
config.yml
file with the values required. Here you specify both the list of pages to scrape, as well as the list of rules for your chatbot to observe. For an example of how this can look, take a look at eitherconfig.yml.example_datastax
, orconfig.yml.example_pokemon
. -
Create a
.env
file & add the required information. Add the OpenAI Key from Step 1 as the value ofOPENAI_API_KEY
. The Astra and OpenAI env variables are required, while the others are only needed if the respective integrations are enabled. For an example of how this can look, take a look at.env_example
. -
From the root of the repository, run the following command. This will scrape the pages specified in the
config.yml
file into text files within theoutput
folder of yourai-chatbot-starter
directory.PYTHONPATH=. python data/scrape_site.py
-
From the root of the repository, run the following command. This will store the embeddings for the scraped text in your AstraDB instance.
pip install --upgrade openai PYTHONPATH=. python data/compile_documents.py
-
From the root of the repository, run the following command. Process requires, Playlist.yaml as input and looks like below
This will get video ids using playlist ids or play list url's specified in the playlist_ids.yaml
. These video ids will be appended to video_ids.yaml
file present under ai-chatbot-starter
directory.
pip uninstall openai # if fails use sudo pip uninstall openai
pip install openai==0.28
PYTHONPATH=. python data/playlist.py
- From the root of the repository, run the following command. Process require video_ids.yaml as input and looks like below.
This will scrape videos specified in the video_ids.yaml
file into text files within the video_output
folder of your ai-chatbot-starter
directory.
pip uninstall openai # if fails use sudo pip uninstall openai
pip install openai==0.28
PYTHONPATH=. python data/scrape_videos.py
- From the root of the repository, run the following command. This will store the embeddings for the scraped videos to your AstraDB instance.
pip install --upgrade openai
PYTHONPATH=. python data/compile_documents.py "video_output"
- Load Products / Ecommerce data. Update config.yml file with sites from which data to be loaded. Below is the look and feel of product section in configuration file.
From the root of the repository, run the below commands. This will crawal through all urls specified under "products" and save in ecommerce_sites
folder under ai-chatbot-starter
directory.
pip uninstall openai # if fails use sudo pip uninstall openai
pip install openai==0.28
PYTHONPATH=. python data/ecommerce_crawler_generic.py
From the root of the repository, run the below commands. This will store the embeddings for the product information to your AstraDB instance.
pip install --upgrade openai
PYTHONPATH=. python data/compile_documents.py ecommerce_sites
- Load Recipes data. Update config.yml file with sites from which data to be loaded. Below is the look and feel of product section in configuration file.
From the root of the repository, run the below commands. This will crawal through all urls specified under "recipes" and save in recipes_sites
folder under ai-chatbot-starter
directory.
pip uninstall openai # if fails use sudo pip uninstall openai
pip install openai==0.28
PYTHONPATH=. python data/recipe_crawler_generic.py
From the root of the repository, run the below commands. This will store the embeddings for the recipe information to your AstraDB instance.
pip install --upgrade openai
PYTHONPATH=. python data/compile_documents.py recipe_sites
If you have Docker installed, you can run the app using the following command:
-
Build the docker image using the following command:
docker build -t docker_aibot --no-cache .
-
Run the docker image using the following command:
docker run -p 5555:5555 docker_aibot
-
You can test an example query by running:
python scripts/call_assistant.py "<your_query_here>"
Alternatively, you can run the app normally using the following steps:
-
Install the requirements using the following command:
pip install -r requirements.txt
-
Run the app using the following command:
uvicorn app:app --host 0.0.0.0 --port 5555 --reload
-
You can test an example query by running:
python scripts/call_assistant.py "<your_query_here>"
-
To run the chainlit front-end:
python -m chainlit run chainlit_app.py -w
-
Login to Astra portal:
use https://accounts.datastax.com/session-service/v1/login url for login
-
Select Langflow that need to be started and click on "Playground". In this case "Vector Store RAG-AI Chat Bot"
-
Key-in your query in text box below and press "Arrow" button
-
Login to Astra portal:
use https://accounts.datastax.com/session-service/v1/login url for login
-
Click on "New Flow" button. This would display flow templates
-
Click on "Blank Flow". Blank flow will be displayed as below with title "Untitled document"
-
Click on / button and select import option as shown below. Select "Integrate-Product-Recipe-Video.json" from langflow folder
-
Following would be displayed after import. Update title from "Untitled Document" to suitable name.
Below screen would be displayed.
Click on "Python API" tab and make a note of below varables. Langflow_ID, flow_id, and Base_api_Url. Use previously obtained Application_token
- From command line, use below git commands to diploy application.
git clone https://github.com/Anant/ai-chatbot-starter
cd ai-chatbot-starter
git switch feature/brandaid
cd langflow-chat-ui-chainlit
Rename .env_example to .env Update all variables using the information obtained above/earlier
- Use below docker-compose command to build and host application locally.
docker-compose build
docker-compose up -d