-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
batch out of range & loss value becomes 'nan' when running monocular depth estimation #1832
Comments
You seem to be using the older
|
The tutorial which you are referring to has not been migrated to Keras 3 yet, possibly due to some dependency on Tensorflow or Keras 2 APIs. I was able ti run the tutorial successfully for 1 epoch with TensorFlow 2.15 which uses Keras 2.15 in it's backend. |
In the published tutorial we can see output for more number of epochs. |
Issue Type
Bug
Source
source
Keras Version
Keras 2.10
Custom Code
No
OS Platform and Distribution
Windows 11
Python version
3.10.13
GPU model and memory
RTX 3050 6GB
Current Behavior?
When calling the
.fit()
function to train the model, the 1st epoch runs as expected and stops when all batches have been iterated.The problem starts from the 2nd epoch onwards where batches start running out of the given range and loss values become
nan
. Once the epoch is complete, the UI becomes normal again but this behavior is observed again for the 3rd epoch and so on.All tutorials on Youtube running the same Colab notebook given by the Keras Team seem to run without having any issues and the model trains properly but this isn't the case when I run the notebook locally or on Colab using both CPU and GPU.
Standalone code to reproduce the issue or tutorial link
Relevant log output
No response
The text was updated successfully, but these errors were encountered: