This repository has been archived by the owner on Jun 2, 2023. It is now read-only.
@jds485 pointed out to me today that the container and container docs for this repository are out of date. We set up the container a while back and no one really switched to using it from their conda environments. I think that's partly because folks had different environment set-ups depending on whether they wanted TF or PyTorch, and also because the Snakemake workflow for training models in parallel on TG is set up around conda environments rather than Singularity. Best practice would probably be to have an up-to-date container (or maybe even two, one for TF and one for PyTorch?), but if we're going to update/maintain them just to let them fall by the wayside like the first one did, I'm not sure it's worth the effort. Thoughts on this? @jesse-ross I'm sure you have a hot take 😉!
If conda environments are working for folks, then that is probably OK for working purposes. However, conda doesn't address system dependencies. I could envision different versions of something like BLAS, or different versions of the NVIDIA toolkits, potentially causing issues.
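As a rough illustration of the kind of system-level drift I mean, here is a minimal sketch that records a few of these versions so two machines can be compared. The function names (`run_if_available`, `numpy_build_config`, `collect_system_info`) are hypothetical and not part of this repository; it assumes `nvidia-smi` and `nvcc` are on the PATH when a GPU toolkit is installed, and it simply reports `None` for anything it can't find.

```python
import contextlib
import io
import json
import platform
import shutil
import subprocess


def run_if_available(cmd):
    """Return a command's stripped stdout, or None if it is missing or fails."""
    if shutil.which(cmd[0]) is None:
        return None
    try:
        result = subprocess.run(cmd, capture_output=True, text=True, timeout=30)
    except (OSError, subprocess.SubprocessError):
        return None
    if result.returncode != 0:
        return None
    return result.stdout.strip() or None


def numpy_build_config():
    """Return NumPy's build configuration text (includes BLAS/LAPACK), or None."""
    try:
        import numpy as np  # optional: only present if the environment has it
    except ImportError:
        return None
    buffer = io.StringIO()
    with contextlib.redirect_stdout(buffer):
        np.show_config()
    return buffer.getvalue().strip() or None


def collect_system_info():
    """Collect a small, comparable snapshot of system-level details."""
    return {
        "platform": platform.platform(),
        "python": platform.python_version(),
        # Driver version as reported by the NVIDIA driver utility, if installed.
        "nvidia_driver": run_if_available(
            ["nvidia-smi", "--query-gpu=driver_version", "--format=csv,noheader"]
        ),
        # CUDA compiler version banner, if the toolkit is installed.
        "nvcc": run_if_available(["nvcc", "--version"]),
        "numpy_build_config": numpy_build_config(),
    }


if __name__ == "__main__":
    print(json.dumps(collect_system_info(), indent=2))
```

Running this on each machine and diffing the JSON would at least show whether the conda environments are sitting on top of different system libraries.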
I don't really know enough about the current Snakemake process to say much more than that. It seems like it would likely be possible to have a dirt-simple container which just had system-level libraries installed, and no Python packages, and then let folks do whatever they want in conda on top of that. That might be a way to boost reproducibility without locking people in to specific Python packages/versions in the way that the current Dockerfile does.
I'd be happy to talk about this more if you want to show me the current workflow - I'd guess it might be easiest to do that on a call.