Use container right now #15

Merged: 1 commit merged into main from ramalama-run-serve on Jul 31, 2024
Conversation

ericcurtin (Collaborator)

ramalama run/serve right now require the container; it has the version of llama.cpp that works.

Long-term we may be able to remove this.

ericcurtin self-assigned this on Jul 31, 2024
ericcurtin requested a review from rhatdan on Jul 31, 2024 at 13:28
ericcurtin force-pushed the ramalama-run-serve branch 2 times, most recently from fced804 to d40b962, on Jul 31, 2024 at 13:39
ramalama (outdated)
@@ -342,6 +342,11 @@ def main(args):
conman = select_container_manager()
ramalama_store = get_ramalama_store()

if conman:
rhatdan (Member)

Ok so we default to running a container for use with the server.

ericcurtin (Collaborator, Author)

Sounds good to me. The other problem is that we need to add:

pip install "huggingface_hub[cli]==0.24.2"

to the install script but that's no biggie.

ericcurtin (Collaborator, Author)


Re-pushed; it now does the re-exec-in-a-container thing for just run/serve and installs the huggingface dependency in the install script.
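
For context, here is a minimal sketch of what re-execing run/serve inside a container could look like. The conman and ramalama_store names come from the diff above; the image name, mount path, and helper bodies are assumptions for illustration, not the PR's actual code.

```python
import os
import shutil
import sys


def select_container_manager():
    # Prefer podman, fall back to docker; return None if neither is on PATH.
    for manager in ("podman", "docker"):
        if shutil.which(manager):
            return manager
    return None


def reexec_in_container(conman, ramalama_store, args):
    # Hypothetical image name and mount point, purely for illustration.
    cmd = [
        conman, "run", "--rm", "-it",
        "-v", f"{ramalama_store}:/var/lib/ramalama",
        "quay.io/ramalama/ramalama:latest",
        "ramalama",
    ] + args
    # Replace the current process with the containerized invocation.
    os.execvp(cmd[0], cmd)


if __name__ == "__main__":
    conman = select_container_manager()
    store = os.path.expanduser("~/.local/share/ramalama")
    # Only run/serve need the container; other subcommands stay on the host.
    if conman and len(sys.argv) > 1 and sys.argv[1] in ("run", "serve"):
        reexec_in_container(conman, store, sys.argv[1:])
```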

rhatdan (Member) commented on Jul 31, 2024

LGTM, merge when tests pass.

ericcurtin force-pushed the ramalama-run-serve branch 2 times, most recently from 180a93c to 47c3d24, on Jul 31, 2024 at 13:59
@@ -215,6 +215,7 @@ def list_cli(ramalama_store, args):


funcDict["list"] = list_cli
funcDict["ls"] = list_cli
ericcurtin (Collaborator, Author) commented on Jul 31, 2024

Ollama has ls as an alias for list, so I'm trying to keep with "norms" from other tools, at least when it's easy.
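
As a rough illustration of the alias pattern: the dispatch code below is an assumption about how funcDict gets consumed, not taken from the PR.

```python
import sys


def list_cli(ramalama_store, args):
    # Stand-in for the real implementation: list models in the local store.
    print(f"listing models in {ramalama_store}")


funcDict = {}
funcDict["list"] = list_cli
funcDict["ls"] = list_cli  # alias, matching Ollama's "ls"


def dispatch(command, ramalama_store, args):
    try:
        funcDict[command](ramalama_store, args)
    except KeyError:
        sys.exit(f"ramalama: unknown command '{command}'")
```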

ericcurtin force-pushed the ramalama-run-serve branch 2 times, most recently from 34635dd to 2f3cc71, on Jul 31, 2024 at 14:24
ericcurtin (Collaborator, Author)

I think it should work this time; I just wanted to get a test in to prevent run/serve breakage again.

If we can get the llama-cpp-python library working well, it might help reduce dependencies on containers.

But these LLM environments can get complex in general.
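
For reference, a container-free path with llama-cpp-python might look roughly like the sketch below; the model path is a placeholder and this only shows the library's basic completion API, not anything from this PR.

```python
# pip install llama-cpp-python
from llama_cpp import Llama

# Placeholder path: any local GGUF model file works here.
llm = Llama(model_path="./models/model.Q4_K_M.gguf", n_ctx=2048)

out = llm("Explain container images in one sentence.", max_tokens=64)
print(out["choices"][0]["text"].strip())
```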

ramalama run/serve right now require the container, it has the version of llama.cpp that works.

Long-term we may be able to remove this.

Signed-off-by: Eric Curtin <ecurtin@redhat.com>
ericcurtin merged commit 9172fbc into main on Jul 31, 2024
3 checks passed
ericcurtin deleted the ramalama-run-serve branch on Jul 31, 2024 at 14:45
ericcurtin (Collaborator, Author)

Merging; people might try this out in the next few minutes, and I'd hope they don't install the broken version.

ericcurtin (Collaborator, Author)

We should create a stable branch soon.
