[refactoring] Shortening the experiment names #47

qetdr · 2023-04-01T12:23:22Z

🔬 Background
Each experiment within a benchmark is given a name. This name is created based on the model name, the dataset set, and the selected parameters. While this is mostly a unique name, it is also very long. Is there a way to adapt the experiment name to be short and unique? Implement an improved experiment name.

🔮 Key changes
The main change was abbreviating the object names in the experiment name.

Reduce the name of a df to max 3 characters.
Reduce the name of a model to use 2-character syllables (e.g., NeuralProphet[Model] -> NePrM)
Create a dictionary (needs to be updated with model parameters) with parameters and their abbreviations. Then, loop over each param (from the model), and abbreviate.
Object separation in the name: '-' separates objects, '_' separates params-args (e.g., parameter:argument -> PM_arg).
Concatenate all to a single string.

Example of the change:

'datasetname_NeuralProphet_seasonality_mode_multiplicative_learning_rate_0.1__data_params__seasonality_mode_ additive, freq_ MS_'

to

'dat-NePrM-SM_mult-LR_0.1-_data_params_-SM_add-freq_MS'

📋 Review Checklist

I have performed a self-review of my own code.
[NA] I have added docstring
[NA] I have added typing
I have modified pytests to check against ValueErrors

LeonieFreisinger

@qetdr great work! I very much appreciate the thoughts you put into renaming and like the idea of abbreviating all names. Also, thanks for the detailed description of the PR changes, that makes it easy to understand your code.
The first part of your changes looks good to me. Have another look at the part, where you rename the model parameters. Here we need to have a solution that generalizes well to all possible model parameters.

As some inspiration: Another idea for naming the experiments would be with a numeric unique identifier. However, in this case, we would still need to hold the option for the user, to view or query the underlying configuration. What are your thoughts on this?

Lastly, please check the following:

black and flake8 is still failing in your workflows
regarding your example 'dat-NePrM-SM_mult-LR_0.1-_data_params_-SM_add-freq_MS' , where does the _ after _data_param come from?

Thank you!!

tot/models/__init__.py

tot/experiment.py

LeonieFreisinger · 2023-04-02T17:22:49Z

tot/experiment.py

+            abbreviated_name += "M" # for 'Model'
+
+            # 3. params (dictionary with abbreviations)
+            d = {'seasonality_mode': 'SM',


The model parameter names can largely vary among the available tsf model. Hence, it might not be a good idea of hard-coding them in a dict. Can you think of another solution?

We should find a solution, that generalizes well to all possible model parameters

qetdr added 2 commits April 1, 2023 12:25

fixed a typo (__initi__.py -> __init__.py)

cc2f544

shortening experiment_name in __post_init__

bf45c07

qetdr changed the title ~~Fix exp names~~ [refactoring] Fix exp names Apr 2, 2023

qetdr changed the title ~~[refactoring] Fix exp names~~ [refactoring] Shortening the experiment names Apr 2, 2023

LeonieFreisinger self-requested a review April 2, 2023 17:04

LeonieFreisinger linked an issue Apr 2, 2023 that may be closed by this pull request

Experiment name too long #45

Open

LeonieFreisinger assigned qetdr Apr 2, 2023

LeonieFreisinger requested changes Apr 2, 2023

View reviewed changes

exp name: rm cmnts & condensed code in __post_init__

b2ad5ba

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[refactoring] Shortening the experiment names #47

[refactoring] Shortening the experiment names #47

qetdr commented Apr 1, 2023

LeonieFreisinger left a comment

LeonieFreisinger Apr 2, 2023

LeonieFreisinger Apr 2, 2023

[refactoring] Shortening the experiment names #47

Are you sure you want to change the base?

[refactoring] Shortening the experiment names #47

Conversation

qetdr commented Apr 1, 2023

LeonieFreisinger left a comment

Choose a reason for hiding this comment

LeonieFreisinger Apr 2, 2023

Choose a reason for hiding this comment

LeonieFreisinger Apr 2, 2023

Choose a reason for hiding this comment