llama : NvAPI performance state change support #8116

sasha0552 · 2024-06-25T17:44:08Z

Note: I will continue this PR a bit later

Related: #8084

Reference implementation

TODO:

- Implement performance state switching functions
- Place performance state switching calls in a common function before/after inference start/end
  - Switch only if Pascal GPU(s) present
- Compile only if ~~CUDA~~ enabled
  - Enable by default if CUDA enabled, otherwise disable
- Log performance state changes and library loading status
- Synchronize pstate changes between n instances of llama.cpp on a single GPU
- Clean up temporary/debug code
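
A minimal sketch of how the "common function before/after inference" item could look, under stated assumptions. None of these names come from the PR: `gpu_info`, `pstate_setter`, `pstate_manager` and `pstate_guard` are hypothetical, and the actual NvAPI call is hidden behind a caller-supplied callback. Pascal GPUs are detected by compute capability major version 6. The reference count only coordinates users inside one process; synchronizing separate llama.cpp processes would need some cross-process mechanism that is not shown here.

```cpp
#include <functional>
#include <mutex>
#include <utility>
#include <vector>

// Hypothetical description of one GPU. Only the compute capability major
// version is needed here (Pascal devices report 6.x).
struct gpu_info {
    int id;
    int cc_major;
};

// Hypothetical hook that performs the real switch (for example via NvAPI).
// Arguments: GPU id, then true for the high-performance state and false
// for the low-power state.
using pstate_setter = std::function<void(int, bool)>;

class pstate_manager {
public:
    pstate_manager(const std::vector<gpu_info> & gpus, pstate_setter setter)
        : setter_(std::move(setter)) {
        // Remember only Pascal GPUs; other GPUs are never touched.
        for (const auto & g : gpus) {
            if (g.cc_major == 6) {
                pascal_ids_.push_back(g.id);
            }
        }
    }

    // The first active user raises the performance state.
    void begin_inference() {
        std::lock_guard<std::mutex> lock(mutex_);
        if (active_++ == 0) {
            apply(true);
        }
    }

    // The last active user lowers it again.
    void end_inference() {
        std::lock_guard<std::mutex> lock(mutex_);
        if (active_ > 0 && --active_ == 0) {
            apply(false);
        }
    }

private:
    void apply(bool high) {
        for (int id : pascal_ids_) {
            setter_(id, high);
        }
    }

    pstate_setter    setter_;
    std::vector<int> pascal_ids_;
    std::mutex       mutex_;
    int              active_ = 0;
};

// RAII guard: switches up on construction and back down on destruction,
// so early returns and exceptions still restore the low-power state.
class pstate_guard {
public:
    explicit pstate_guard(pstate_manager & m) : m_(m) { m_.begin_inference(); }
    ~pstate_guard() { m_.end_inference(); }
    pstate_guard(const pstate_guard &) = delete;
    pstate_guard & operator=(const pstate_guard &) = delete;

private:
    pstate_manager & m_;
};
```

A caller would create one `pstate_guard` at the top of whatever function wraps a decode call. With overlapping guards, only the first one triggers the switch up and only the last one triggers the switch down.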

Alternative implementations (just thoughts):

- A separate daemon process?
- Add options in main/server/etc to allow calling processes before/after inference? (probably the simplest solution)
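
To make the second alternative concrete, here is a rough sketch of running external commands around inference. It assumes two hypothetical options, `before_inference` and `after_inference`, holding shell command strings; these names are illustrative and do not exist in llama.cpp. The commands are run with `std::system`, so their interpretation depends on the platform shell.

```cpp
#include <cstdlib>
#include <string>

// Hypothetical pair of user-supplied commands. An empty string disables a hook.
struct lifecycle_hooks {
    std::string before_inference;
    std::string after_inference;
};

// Runs one hook through the system shell.
// Returns true if the hook is disabled or the shell reported status 0.
inline bool run_hook(const std::string & cmd) {
    if (cmd.empty()) {
        return true;
    }
    return std::system(cmd.c_str()) == 0;
}

// Runs `inference` between the two hooks.
// If the "before" hook fails, inference is skipped and false is returned.
// The "after" hook still runs when inference throws, so the GPU is not
// left in its high-performance state.
template <typename F>
bool with_hooks(const lifecycle_hooks & hooks, F && inference) {
    if (!run_hook(hooks.before_inference)) {
        return false;
    }
    try {
        inference();
    } catch (...) {
        run_hook(hooks.after_inference);
        throw;
    }
    return run_hook(hooks.after_inference);
}
```

The appeal of this design is that llama.cpp would not need to link against NvAPI at all: the pstate logic lives in whatever program the user points the hooks at.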

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

sasha0552 · 2024-07-26T03:58:34Z

Superseded by nvidia-pstated.

mirh · 2024-09-15T03:11:38Z

I mean.. is it though?
I see why a datacenter GPU that doesn't even have a display output could call it a day with just a rough loop that checks for "any activity at all", but for the best results you would seemingly need some kind of "token cycle awareness".

Besides, a very nice feature that could also be integrated with direct NvAPI support is bus activity monitoring.
It's supposedly not super accurate, but printing a note when PCIe is busy more than 50 or 70 percent of the time could help many people quickly diagnose when a model is slow not because it is inherently slow, but because a non-trivial number of layers depends on RAM swapping.
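
As a rough illustration of that suggestion, the sketch below takes a window of PCIe bus utilization samples (in percent) and produces a note when the average exceeds a threshold. How the samples are obtained is deliberately left out; `pcie_note`, its parameters and the message text are hypothetical, and the 70 percent default simply mirrors the upper figure mentioned above.

```cpp
#include <cstdio>
#include <numeric>
#include <string>
#include <vector>

// Returns a diagnostic note if the average bus utilization over the sampled
// window is above threshold_pct, and an empty string otherwise.
// samples_pct holds utilization readings in the range 0..100.
inline std::string pcie_note(const std::vector<int> & samples_pct,
                             double threshold_pct = 70.0) {
    if (samples_pct.empty()) {
        return "";
    }
    const double mean =
        std::accumulate(samples_pct.begin(), samples_pct.end(), 0.0) /
        static_cast<double>(samples_pct.size());
    if (mean <= threshold_pct) {
        return "";
    }
    char buf[160];
    std::snprintf(buf, sizeof(buf),
                  "note: PCIe bus was busy %.0f%% of the time; "
                  "some layers may be served from system RAM",
                  mean);
    return std::string(buf);
}
```

Averaging over a window rather than reacting to single readings is one way to tolerate the limited accuracy mentioned above.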

sasha0552 added 2 commits June 25, 2024 17:42

llama : NvAPI performance state change support

450eafc

fix ci

a7e1725

sasha0552 force-pushed the nvapi-pstate-ch branch from 974c7ba to a7e1725 June 25, 2024 18:20

fix ci

89f7645

mofosyne added the "Review Complexity : Medium" label (generally requires more time to grok, but manageable at beginner to medium expertise level) Jun 25, 2024

github-actions bot added the "build" label (compilation issues) Jun 25, 2024

sasha0552 added 3 commits June 27, 2024 07:10

make shared library containing nvapi

b4b2d96

Merge remote-tracking branch 'upstream/master' into nvapi-pstate-ch

d0f71b5

minor fixes

2432c6f

sasha0552 force-pushed the nvapi-pstate-ch branch from 2ab89b7 to 2432c6f June 27, 2024 07:26

sasha0552 added 4 commits June 27, 2024 07:36

reformat code

720de00

implement pstate switching

7cdad3a

minor fixes

742597e

fix win ci

5e20dfc

sasha0552 force-pushed the nvapi-pstate-ch branch from 9e0b2f6 to 5e20dfc June 27, 2024 16:39

sasha0552 mentioned this pull request Jul 25, 2024

common : support for lifecycle scripts #8689

Closed

8 tasks

sasha0552 closed this Jul 26, 2024
