Implement ramalama run/serve #8
Conversation
```diff
@@ -68,9 +68,9 @@ def run_curl_command(args, filename):
         sys.exit(e.returncode)


-def pull_ollama_manifest(ramalama_store, manifests, accept, registry_head, model_tag):
+def pull_ollama_manifest(repos_ollama, manifests, accept, registry_head, model_tag):
```
Why not just call it repos? No need to keep repeating _ollama.
My thinking was that I will shortly bring in huggingface support again, and there will likely be a "repos_hf" alongside "repos_ollama". The directory is:
~/.local/share/ramalama/repos/ollama
and shortly there will be:
~/.local/share/ramalama/repos/huggingface
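A small sketch of that layout as a helper function (repo_path is hypothetical, not from this PR):

```python
import os

def repo_path(store, registry):
    # Hypothetical helper, not from this PR: one subdirectory per
    # registry under the store, e.g.
    #   ~/.local/share/ramalama/repos/ollama
    #   ~/.local/share/ramalama/repos/huggingface
    return os.path.join(store, "repos", registry)
```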
I think you should embed the location of the store inside the functions, and not force it to be different.
Once we have multiple pull_manifests() functions, we can start to consolidate functionality and get as much reuse as possible.
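One way that consolidation could look, as a rough sketch (the generic pull_manifest and its body are illustrative, not this PR's code):

```python
import os

def pull_manifest(registry, manifests, accept, registry_head, model_tag):
    # Illustrative only: one generic function that embeds the store
    # location, so callers never pass registry-specific paths around.
    repos = os.path.join(os.path.expanduser("~/.local/share/ramalama"),
                         "repos", registry)
    os.makedirs(repos, exist_ok=True)
    # ... registry-specific fetch logic would go here ...

def pull_ollama_manifest(manifests, accept, registry_head, model_tag):
    return pull_manifest("ollama", manifests, accept,
                         registry_head, model_tag)
```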
ramalama (outdated)
```diff
 command = sys.argv[1]
 if command == "pull" and len(sys.argv) > 2:
-    pull_cli(ramalama_store + "/repos/ollama",
-             ramalama_store + "/models/ollama", sys.argv[2])
+    pull_cli(ramalama_store, sys.argv[2])
```
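The simplified call site works if pull_cli derives the subpaths itself; a minimal sketch of that idea (not the PR's actual function body):

```python
def pull_cli(store, model):
    # Sketch under the assumption that pull_cli derives the
    # registry-specific paths from the single store argument.
    repos_ollama = store + "/repos/ollama"
    models_ollama = store + "/models/ollama"
    # ... pull the manifest and blobs into repos_ollama,
    # then link/copy the model into models_ollama ...
```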
```
switch command {
case "pull":
case "run":
}
```
Do you have commands that handle 2 options?
Yeah, there will be. For example:
ramalama list/ls
which lists all models downloaded and ready to use; it's coming.
Will change to switch; apparently Python calls it match for some reason.
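For reference, the dispatch written with Python's match statement (a minimal sketch; pull_cli and ramalama_store are from the diff above, the error handling is illustrative, and this form needs Python >= 3.10):

```python
import sys

command = sys.argv[1]
match command:
    case "pull":
        pull_cli(ramalama_store, sys.argv[2])
    case "run" | "serve":
        # run/serve handling would go here
        ...
    case _:
        print(f"Unknown command: {command}", file=sys.stderr)
        sys.exit(1)
```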
How would we feel about going back to if/elif/else? The Python switch equivalent, match, is only available from Python 3.10 (2021) onwards, and the macOS build is failing here; I'm guessing the python3 version on macOS is older.
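The portable if/elif form, which works on any Python 3 (same illustrative error handling as the match sketch above):

```python
import sys

command = sys.argv[1]
if command == "pull" and len(sys.argv) > 2:
    pull_cli(ramalama_store, sys.argv[2])
elif command in ("run", "serve") and len(sys.argv) > 2:
    # run/serve handling would go here
    ...
else:
    print(f"Unknown command: {command}", file=sys.stderr)
    sys.exit(1)
```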
Going back to if/elif/else fixed it anyway.
Force-pushed from 569718d to 1bd8647.
Now we can run "ramalama run/serve granite-code"; if not using a container, one must at least build/install llama.cpp. Added huggingface support. Signed-off-by: Eric Curtin <ecurtin@redhat.com>
OK, merging; we can clean up later.