offline plotting of prediction and ground truth #15

clechartre · 2024-04-22T08:30:40Z

Purpose

Offline plotting function to handle the plotting of the prediction output as a numpy array against a single .zarr archive. Useful to verify the accuracy of the inference

Code changes:

offline.py: created a module for this purpose
slum_offline.sh: automation of the offline.py module through batch system

Checklist

Before submitting this PR, please make sure:

You have followed the coding standards guidelines established at Code Review Checklist.
Docstrings and type hints are added to new and updated routines, as appropriate
All relevant documentation has been updated or added (e.g. README)

Review

For the review process follow the guidelines at Checklist

sadamov · 2024-04-22T09:54:13Z

Hi @clechartre and thanks for the PR. I have some high-level comments before a deeper code-review. We have a new zarr-archive on Tsa that will be the default MeteoSwiss archive after tomorrow (when creation is done). I have already merged and updated this PR-branch where possible with latest main-branch.

Path: /scratch/cosuna/neural-lam/zarr/cosmo_ml_data.zarr2
The new lat/lon don't require unrotation anymore and that module was removed from the repo, you can remove it as well

One more question: Do we still need cli_plotting.py or can we combine these two workflows?

This is already defined in constants.py

…plot-offline

sadamov

I have merged this branch with #16, added some prints to the log file and some smaller comments. Before merge we should:

Merge Updated code for latest single zarr archive #16 to have a clean history
Discuss whether cli_plotting.py + vis.py workflow is now redundant
Fix minor issues

sadamov · 2024-04-23T20:39:34Z

offline.py

+    vmin = target_all.min().values
+    vmax = target_all.max().values
+
+    for i in range(22):


Suggested change

for i in range(22):

for i in range(time_range):

The length of the prediction should define the plotting range, no?

sadamov · 2024-04-24T03:25:42Z

offline.py

+    start_time = pd.to_datetime(start_time_str, format="%Y%m%d%H")
+
+    # Output the prediction time range
+    time_range = len(predictions[1, :, 1, 1])  # Number of time steps


Suggested change

time_range = len(predictions[1, :, 1, 1]) # Number of time steps

time_range = len(predictions[0, :, 0, 0]) # Number of time steps

use first indices to avoid issues with edge cases

sadamov · 2024-04-24T03:26:24Z

offline.py

+    parser.add_argument(
+        "--variable_to_plot",
+        type=str,
+        default="TQV",


Suggested change

default="TQV",

default="T_2M",

Use more common variable as default. Future checkpoints might not contain TQV

variables are called x,y now

new example zarr archive placed in templates folder

…plot-offline

default now points towards newly created templates folder

sadamov · 2024-04-26T16:47:10Z

I have added a template of the latest zarr archive in data/cosmo/templates, to make sure we are using the latest set of variables and also the unrotated grid. For that reason I removed the unrotate functionality from offline.py file.
Further, a template for a inference.npy files is also available now in data/cosmo/templates
After addressing small comments above we can merge. Note that the scripts are not backward compatible with older input datasets that will be deleted soon anyways.
Note that the latest checkpoint is hi_lam, we need to re-train the model and create one checkpoint for each model soon
Since I merged this PR branch with latest main-branch you should now again only see changes relevant to offline-plotting (which was the goal of the whole exercise).

--> Maybe add a few words in the README.md about the new data/ folder structure

offline plotting of prediction and ground truth

8d2510c

clechartre requested review from twicki and sadamov April 22, 2024 08:30

sadamov added 3 commits April 22, 2024 11:38

prevent figure from being uploaded to git

f230020

Merge remote-tracking branch 'origin/main' into plot-offline

dd04eb5

linter fixes

6238aab

sadamov and others added 6 commits April 23, 2024 10:35

Removed code duplication

a2f00e1

This is already defined in constants.py

Merge remote-tracking branch 'origin/setup_tsa' into plot-offline

62cef2e

Handling a single zarr

dd6733e

Merge branch 'plot-offline' of github.com:clechartre/neural-lam into …

bf81bf0

…plot-offline

Merge remote-tracking branch 'origin/setup_tsa' into plot-offline

3506a2d

added some printouts

1fd9508

sadamov requested changes Apr 24, 2024

View reviewed changes

Simon Adamov and others added 5 commits April 26, 2024 16:49

Merge remote-tracking branch 'origin/main' into plot-offline

9f06872

Latest zarr input data has already unrotated latlon

ff421cd

variables are called x,y now

TQV replaced by T_2M which is part of dataset

24d9d53

new example zarr archive placed in templates folder

Merge branch 'plot-offline' of github.com:clechartre/neural-lam into …

7dc7c09

…plot-offline

default path should not be absolute to folder without permissions

01de4a1

default now points towards newly created templates folder

Capucine Lechartre and others added 2 commits May 1, 2024 14:10

Merge branch 'origin/main' into plot-offline

ce4c520

small data-specific bugfixes

267478d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

offline plotting of prediction and ground truth #15

offline plotting of prediction and ground truth #15

clechartre commented Apr 22, 2024

sadamov commented Apr 22, 2024

sadamov left a comment

sadamov Apr 23, 2024

sadamov Apr 24, 2024

sadamov Apr 24, 2024

sadamov commented Apr 26, 2024

	time_range = len(predictions[1, :, 1, 1]) # Number of time steps
	time_range = len(predictions[0, :, 0, 0]) # Number of time steps

offline plotting of prediction and ground truth #15

Are you sure you want to change the base?

offline plotting of prediction and ground truth #15

Conversation

clechartre commented Apr 22, 2024

Purpose

Code changes:

Checklist

Review

sadamov commented Apr 22, 2024

sadamov left a comment

Choose a reason for hiding this comment

sadamov Apr 23, 2024

Choose a reason for hiding this comment

sadamov Apr 24, 2024

Choose a reason for hiding this comment

sadamov Apr 24, 2024

Choose a reason for hiding this comment

sadamov commented Apr 26, 2024