-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dockerfile-default-rocm split into two separate files mainly aimed at different users. #272
base: main
Are you sure you want to change the base?
Conversation
…eanup needed in the Makefile.
ARG BASE_IMAGE | ||
FROM ${BASE_IMAGE} | ||
|
||
# MAY NOT BE IMPORTANT ANYMORE |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
is it important?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I guess we can remove it and if something breaks then we add it back in.
|
||
|
||
# THIS FIX IS FOR SAWMILL, UNCLEAR IF NECESSARY FOR GENERAL USERS | ||
#TODO: is this necessary? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
TODO?
@@ -0,0 +1,55 @@ | |||
ARG BASE_IMAGE | |||
FROM ${BASE_IMAGE} | |||
#why no highlighting? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove comment
|
||
# LIBFABRIC ISSUE | ||
# USE CONDA FOR WORKAROUND | ||
#TODO: MAY NOT BE A PROBLEM ANYMORE? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
TODO?
#TODO: finish iterating here, preferably turn it into a shell script. | ||
RUN if [ -n "$DEEPSPEED_PIP" ]; then DEBIAN_FRONTEND=noninteractive apt-get install -y pdsh libaio-dev&& git clone https://github.com/ROCmSoftwarePlatform/triton.git && cd triton && git checkout triton-mlir && cd python && pip3 install ninja cmake && python setup.py install;fi | ||
RUN if [ -n "$DEEPSPEED_PIP" ]; then DEBIAN_FRONTEND=noninteractive apt-get install -y pdsh libaio-dev&& python -m pip install pydantic==1.10.11 && git clone https://github.com/ROCmSoftwarePlatform/DeepSpeed.git && cd DeepSpeed && python3 setup.py build && python3 setup.py install && python -m deepspeed.env_report; fi | ||
RUN if [ -n "$DEEPSPEED_PIP" ]; then python -m deepspeed.env_report ; fi |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This deepspeed section definitely needs cleanup
INFINITYHUB_PYTORCH_PREFIX := rocm/pytorch | ||
INFINITYHUB_TENSORFLOW_PREFIX := rocm/tensorflow | ||
INFINITYHUB_PYTORCH_VERSION := 2.1.2 | ||
INFINITYHUB_TENSORFLOW_VERSION := |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove this line
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Still needs CI build integration as well, so I expect there to be .circleci
code changes.
ROCM_57_PREFIX := $(REGISTRY_REPO):rocm-5.7- | ||
ROCM_60_PREFIX := $(REGISTRY_REPO):rocm-6.0- | ||
ROCM_61_PREFIX := $(REGISTRY_REPO):rocm-6.1- | ||
ROCM_60_TF_PREFIX := tensorflow-infinity-hub:tensorflow-infinity-hub |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why other images are stored in REGISTRY_REPO := environments
, or a repo with a -dev
suffix, but this one is not?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How do we test these images?
Description
Checklist
bumpenvs
procedure in the determined repo. See README.