A way to continue training #83

Open · Garshishka opened this issue Apr 21, 2024 · 5 comments
Labels: enhancement (New feature or request)

Comments

@Garshishka

Considering that this program can train on CPU and on low-VRAM cards, how about adding a way or a parameter to continue training from a saved splat.ply? Is this even feasible?

pierotofy added the enhancement (New feature or request) label on Apr 21, 2024
@pierotofy
Owner

pierotofy commented Apr 21, 2024

I don't see why not.

  1. Modify savePly to store the current step count (in a PLY header comment, maybe)
  2. Read the PLY back into the tensors (the reverse of savePly) and read the step count back (both sketched below)
  3. Resume from the previous step count.
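A very rough, untested sketch of what 1 and 2 could look like; the helper names are invented for illustration and don't correspond to the project's actual savePly:

```cpp
// Hypothetical sketch only: write the current step into a PLY header
// comment and parse it back when resuming. Helper names are invented;
// the real savePly/loadPly will look different.
#include <fstream>
#include <sstream>
#include <string>

// Step 1: emit an extra "comment step <N>" line inside the PLY header.
// PLY allows free-form comment lines, so existing readers just ignore it.
void writeStepComment(std::ofstream &out, int step) {
    out << "comment step " << step << "\n";
}

// Step 2: scan the header for that comment while loading the file back.
// Returns 0 (train from scratch) when no step comment is found.
int readStepComment(std::ifstream &in) {
    std::string line;
    int step = 0;
    while (std::getline(in, line) && line != "end_header") {
        std::istringstream ss(line);
        std::string keyword, tag;
        if (ss >> keyword >> tag && keyword == "comment" && tag == "step") {
            ss >> step;
        }
    }
    return step;
}
```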

For a numerically exact resume, one would also need to dump the optimizer state, but I don't think that would matter much for the end result.
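If it ever did turn out to matter, libtorch can serialize the optimizer alongside the splat file; roughly like this, with a placeholder path and assuming the trainer keeps a torch::optim::Adam instance around:

```cpp
// Sketch of persisting optimizer state with libtorch's archive support.
// "optimizer" stands in for whatever Adam instance the trainer holds;
// the path is a placeholder.
#include <torch/torch.h>
#include <string>

void saveOptimizer(torch::optim::Adam &optimizer, const std::string &path) {
    torch::save(optimizer, path); // stores moment buffers and per-param step counts
}

void loadOptimizer(torch::optim::Adam &optimizer, const std::string &path) {
    // The optimizer must already be constructed over the same parameter
    // tensors (in the same order) before its state is loaded back.
    torch::load(optimizer, path);
}
```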

We'd welcome a pull request for this. Interested?

@Garshishka
Author

I would if I could :(
But C++ and ML are unknown territory for me.

@stefvfx

stefvfx commented Apr 28, 2024

I think it would be very useful.

@Itox001

Itox001 commented Jul 6, 2024

+1 for this feature. Currently I can only reasonably train ~3000 iterations before the memory leak on MPS devices exhausts my RAM. I am hoping that stopping and resuming the training would reset this, allowing me to train for longer.

@eloquentarduino

+1. I'm not a C++ guy so I can't help here.
