Skip to content

LLM inference in C/C++, with Nexa AI's support for audio language model and swift binding

License

Notifications You must be signed in to change notification settings

NexaAI/llama.cpp

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Llama cpp

The original Llama cpp implementation is available at here.

To build this project:

make clean
cmake -B build
cmake --build build --config Release -j 24

Checking in Your Code Changes

To commit your local code changes and push them to your repository, use the following steps:

  1. Stage all changes:

    git add .
  2. Commit the changes with a descriptive message:

    git commit -m "Describe your changes here"
  3. Push the changes to your master branch:

    git push origin master

Syncing with Upstream Changes

To pull the latest changes from the upstream repository (ggerganov/llama.cpp), follow these steps:

  1. Add the upstream repository if you haven't done so already:

    git remote add upstream https://github.com/ggerganov/llama.cpp
  2. Fetch the latest changes from the upstream repository:

    git fetch upstream
  3. Merge the upstream changes into your local master branch:

    git merge upstream/master
  4. If necessary, commit the merge (if there were any conflicts to resolve):

    git commit -m "Merge from upstream"
  5. Push the merged changes to your origin/master branch:

    git push origin master

Error handling

If you got below error for kompute:

fatal: cannot chdir to '../../../ggml/src/kompute': No such file or directory

You can fix it by running below command:

git reset ggml_llama/src/kompute

About

LLM inference in C/C++, with Nexa AI's support for audio language model and swift binding

Resources

License

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C++ 55.3%
  • C 22.6%
  • Cuda 7.3%
  • Python 6.7%
  • Metal 2.6%
  • Objective-C 2.3%
  • Other 3.2%