Helm chart - copy models from NFS storage to attached storage #10

bd-g · 2024-05-31T18:26:29Z

Proposed changes

The AWS and GCP default configurations have the Engine Pods read models from shared network-attached storage. This can increase disk read latency and, in turn, the latency of any request that triggers a model load. Streaming requests are particularly sensitive to this.

There should be an option to copy models from NFS onto the Pod's attached host storage to reduce model read latency. The copy could run once at startup, and the Pod could optionally poll NFS for updated models.
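
To make the idea concrete, here is a minimal sketch of the copy-then-poll behaviour in Python. It is not an implementation from the chart: the function names `sync_models` and `poll`, the paths, and the 60-second interval are all hypothetical, and comparing file size and modification time is just one assumed way to detect an updated model.

```python
import shutil
import time
from pathlib import Path


def sync_models(src: Path, dst: Path) -> list[Path]:
    """Copy files from src (e.g. the NFS mount) to dst (local disk).

    A file is copied when it is missing in dst, or when its size differs
    or the source has a newer modification time. Returns the relative
    paths that were copied.
    """
    copied = []
    for src_file in sorted(p for p in src.rglob("*") if p.is_file()):
        rel = src_file.relative_to(src)
        dst_file = dst / rel
        src_stat = src_file.stat()
        if dst_file.exists():
            dst_stat = dst_file.stat()
            # Compare whole seconds: timestamp precision can differ
            # between a network filesystem and a local one.
            unchanged = (
                dst_stat.st_size == src_stat.st_size
                and int(dst_stat.st_mtime) >= int(src_stat.st_mtime)
            )
            if unchanged:
                continue
        dst_file.parent.mkdir(parents=True, exist_ok=True)
        # Copy under a temporary name, then rename, so a process reading
        # dst never observes a partially written model file.
        tmp = dst_file.with_name(dst_file.name + ".tmp")
        shutil.copy2(src_file, tmp)  # copy2 also copies the modification time
        tmp.replace(dst_file)
        copied.append(rel)
    return copied


def poll(src: Path, dst: Path, interval_s: float = 60.0) -> None:
    """Run the initial copy, then re-sync forever at a fixed interval."""
    while True:
        sync_models(src, dst)
        time.sleep(interval_s)
```

In a Pod, the one-off `sync_models` call would map naturally to something that runs before the Engine starts, and `poll` to a long-running helper next to it. Note that this sketch never deletes models that were removed from NFS; whether that is desirable is an open design question.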

bd-g added the ✨ enhancement label May 31, 2024
