Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[EPIC] Watsonx.ai Release Dev Tracker #1

Closed
heyselbi opened this issue Jun 19, 2023 · 0 comments
Closed

[EPIC] Watsonx.ai Release Dev Tracker #1

heyselbi opened this issue Jun 19, 2023 · 0 comments
Assignees
Labels
tracker Non-completable ticket; used for tracking work at a high level

Comments

@heyselbi
Copy link

heyselbi commented Jun 19, 2023

Goal 1: Integration of Watsonx.ai components into ODH

Req. 1: [P0] The following components must be included out-of-the-box in RHODS at a GA support level

--> Caikit (Compositional AI Kit) serving runtime
--> TGIS (Text Generation Inference Service)
--> KServe, Service Mesh, Serverless

Goal 2: Seamless updates to models and Caikit

Req. 2: [P0] Users must be able to deploy an updated version of a foundation model

For example, if using a curated IBM model, when IBM releases a new model version, users must be able to deploy the updated model version. The same scenario applies to foundation models from other sources such as Hugging Face.

Req. 4: [P0] The system must support the ability to update the Caikit serving runtime version without impacting actively served models

For example, the upstream version will be updated and need to incorporate in RHODS as appropriate without impacting deployed models. A new RHODS release must not break model serving functionality.

Goal 3: Monitoring & Metrics

Req. 5: [P0] Users must be able to access all applicable metrics for any deployed model

Metrics required:
--> number of inference requests over defined time period
--> Avg. response time over defined time period
--> number of successful / failed inference requests over defined time period
--> Compute utilization (CPU, GPU, Memory)

Goal 4: Support for HF models

Req. 3: [P0] The product must support the ability to deploy foundation models from Hugging Face using OOTB capabilities.

To be clear, we’re not looking to support the actual models themselves, but rather the ability to deploy/serve the models. If a customer has an issue with the actual model, that is outside the support scope.

Goal 5: HTTP and gRPC endpoints

Req. 6: [P0] Enable users to create & access http and grpc endpoints for model serving routes

@heyselbi heyselbi added the tracker Non-completable ticket; used for tracking work at a high level label Jun 22, 2023
@heyselbi heyselbi changed the title KServe Onboarding Dev Tracker Watsonx.ai Release Dev Tracker Jul 19, 2023
@heyselbi heyselbi changed the title Watsonx.ai Release Dev Tracker [EPIC] Watsonx.ai Release Dev Tracker Jul 19, 2023
@heyselbi heyselbi self-assigned this Aug 2, 2023
israel-hdez referenced this issue in israel-hdez/kserve Aug 15, 2023
Add requirements.txt file to support CPaaS
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
tracker Non-completable ticket; used for tracking work at a high level
Projects
Status: No status
Status: No status
Status: Done
Development

No branches or pull requests

1 participant