The objective is to run tests against models served by the InstructLab inference server and by watsonx, using their REST APIs.
This repository is related to two blog posts about InstructLab:
- Setup of InstructLab for fine-tuning
- Fine-tune a model with InstructLab
The repository contains automation for a question-answering use case with LLM models. The user provides the input questions in an Excel file, and the automation generates an Excel output file with various information from the run. The LLM models run locally in InstructLab and on watsonx.ai in the IBM Cloud.
Note: You can find the currently supported foundation models for InstructLab on the InstructLab-compatible foundation models page.
Here is a simplified overview of the input, output, and configuration of the test framework:
- We use a Python application with command-line parameters to invoke the REST APIs of watsonx and InstructLab and send all the questions (see the sketch after the column table below).
- We use shell scripts to save the various configurations used to invoke the Python application.
- Input: Excel file with a `question` column
- Output: Excel file with the result columns (see the following table)
The following table contains the columns for the output Excel file.
| question | prompt | generated_text | model_id | model_version | generated_token_count | input_token_count | stop_reason |
|---|---|---|---|---|---|---|---|
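For orientation, here is a minimal, hypothetical sketch of what the `run_experiment.py` entry point could look like. Only the command-line flags are taken from the invocation in the shell script further below; the function structure and the use of pandas/openpyxl are assumptions and may differ from the actual repository code.

```python
import argparse
import pandas as pd  # reading/writing .xlsx files also requires the openpyxl package

# Column layout of the output Excel file, as listed in the table above.
COLUMNS = ["question", "prompt", "generated_text", "model_id", "model_version",
           "generated_token_count", "input_token_count", "stop_reason"]

def main():
    parser = argparse.ArgumentParser(description="Run a question-answering test")
    parser.add_argument("--inputfile", required=True)   # Excel file with a 'question' column
    parser.add_argument("--outputfile", required=True)  # Excel file for the results
    parser.add_argument("--inference", required=True,
                        choices=["watsonx", "instructlab"])
    args = parser.parse_args()

    # Load all questions from the input Excel file.
    questions = pd.read_excel(args.inputfile)["question"].tolist()
    rows = []
    for question in questions:
        # Call the selected inference REST API (see the sketches further below)
        # and map the response fields onto the output columns, e.g.:
        # rows.append({"question": question, "prompt": prompt, **response_fields})
        pass
    pd.DataFrame(rows, columns=COLUMNS).to_excel(args.outputfile, index=False)

if __name__ == "__main__":
    main()
```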
- Environment variables in the `.env` file for the Python application:
```bash
# IBM Cloud
export IBMCLOUD_APIKEY=
export IBMCLOUD_URL="https://iam.cloud.ibm.com/identity/token"

# Watsonx
export WATSONX_URL="https://us-south.ml.cloud.ibm.com/ml/v1/text/generation"
export WATSONX_VERSION=2023-05-29
export WATSONX_PROJECT_ID=XXXXXX
export WATSONX_MIN_NEW_TOKENS=1
export WATSONX_MAX_NEW_TOKENS=300
export WATSONX_LLM_NAME=ibm/granite-13b-chat-v2
export WATSONX_PROMPT_FILE="$(pwd)/prompts/prompt-granite.txt"

# InstructLab
export INSTRUCTLAB_URL="http://127.0.0.1:8000/v1/completions"
export INSTRUCTLAB_PROMPT_FILE="$(pwd)/prompts/prompt-clean.txt"
export INSTRUCTLAB_MAX_NEW_TOKENS=300
```
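To show how these variables are consumed, the following is a sketch (not the repository's actual code) of the two watsonx-side REST calls using the `requests` library: the IAM token exchange against `IBMCLOUD_URL`, followed by the text-generation request against `WATSONX_URL`.

```python
import os
import requests

def get_iam_token() -> str:
    # Exchange the IBM Cloud API key for a short-lived bearer token.
    resp = requests.post(
        os.environ["IBMCLOUD_URL"],
        data={
            "grant_type": "urn:ibm:params:oauth:grant-type:apikey",
            "apikey": os.environ["IBMCLOUD_APIKEY"],
        },
    )
    resp.raise_for_status()
    return resp.json()["access_token"]

def generate_watsonx(prompt: str) -> dict:
    # 'prompt' is typically built from the question and the template file
    # referenced by WATSONX_PROMPT_FILE.
    url = f'{os.environ["WATSONX_URL"]}?version={os.environ["WATSONX_VERSION"]}'
    body = {
        "model_id": os.environ["WATSONX_LLM_NAME"],
        "project_id": os.environ["WATSONX_PROJECT_ID"],
        "input": prompt,
        "parameters": {
            "min_new_tokens": int(os.environ["WATSONX_MIN_NEW_TOKENS"]),
            "max_new_tokens": int(os.environ["WATSONX_MAX_NEW_TOKENS"]),
        },
    }
    resp = requests.post(url, json=body,
                         headers={"Authorization": f"Bearer {get_iam_token()}"})
    resp.raise_for_status()
    # The response carries model_id/model_version plus a results list whose
    # entries contain generated_text, the token counts, and the stop reason.
    return resp.json()
```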
The following code is the content of a shell script that runs the test against a model on watsonx. We define the model and the prompt file, set the required input parameters, and specify the output location and file name.
```bash
#!/bin/bash
echo "##########################"
echo "# 0. Load environments"
source ./venv/bin/activate
source .env
export WATSONX_LLM_NAME=ibm/granite-13b-chat-v2
export WATSONX_PROMPT_FILE="$(pwd)/prompts/prompt-clean.txt"

echo "##########################"
echo "# 1. Set input and output path."
export OUTPUT_PATH="$(pwd)/output"
export INPUT_PATH="$(pwd)/input"

echo "##########################"
echo "# 2. Set input and output filename."
export RUN_OUTPUT_FILENAME="${OUTPUT_PATH}/experiment_granite_clean_$(date +%Y-%m-%d_%H-%M-%S).xlsx"
export RUN_INPUT_FILENAME="${INPUT_PATH}/questions.xlsx"

echo "##########################"
echo "# 3. Set inference."
export RUN_INFERENCE="watsonx"

echo "##########################"
echo "# 4. Run experiment"
python3 run_experiment.py --inputfile "${RUN_INPUT_FILENAME}" --outputfile "${RUN_OUTPUT_FILENAME}" --inference "${RUN_INFERENCE}"
```
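An equivalent run against InstructLab only differs in the configuration (the `RUN_INFERENCE` value and the prompt file). On the Python side, the local InstructLab server exposes an OpenAI-compatible completions API; here is a minimal sketch of that call, assuming a server is already running at `INSTRUCTLAB_URL` (again illustrative, not the repository's actual code):

```python
import os
import requests

def generate_instructlab(prompt: str) -> dict:
    # POST to the local InstructLab server, which exposes an
    # OpenAI-compatible completions API at INSTRUCTLAB_URL.
    resp = requests.post(
        os.environ["INSTRUCTLAB_URL"],
        json={
            "prompt": prompt,
            "max_tokens": int(os.environ["INSTRUCTLAB_MAX_NEW_TOKENS"]),
        },
    )
    resp.raise_for_status()
    # OpenAI-style response: the generated text is in choices[0]["text"].
    return resp.json()
```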