Releases: determined-ai/determined

v0.38.0

22 Nov 21:17
7154424

Release Notes

v0.38.0

Changelog

  • 7154424 chore: release notes 0.38.0 (#10231)
  • 13e49a7 [AUTO-BACKPORT release-0.38.0] 10226: chore: eliminate use of fury repo (#10229)
  • a554cd0 [AUTO-BACKPORT release-0.38.0] 10224: fix: make some k8s tests pass (#10228)
  • 0d373b1 [AUTO-BACKPORT release-0.38.0] 10221: fix: use new migration gist (#10222)
  • c93b848 [AUTO-BACKPORT release-0.38.0] 10213: fix: port k8s perf fix (#10220)
  • 0cc57df chore: backport 10208 to release 0.38.0 (#10219)
  • 7d9c5ed [AUTO-BACKPORT release-0.38.0] 10216: fix: license check tests (#10217)
  • e2d8f47 [AUTO-BACKPORT release-0.38.0] 10206: ci: remove datadog from ci (#10214)
  • 9619dcf [AUTO-BACKPORT release-0.38.0] 10211: chore: fix license check (#10215)
  • 332cefc [AUTO-BACKPORT release-0.38.0] 10207: fix: revert: fix: resolve indefinitely queued (STOPPING_COMPLETED) trials (#10210)
  • e693655 [AUTO-BACKPORT release-0.38.0] 10203: revert: log search (#10205)
  • 50b7690 chore: 0.38.0 environment images (#10197)
  • bb6f140 [AUTO-BACKPORT 10160] fix: maxPoolSlotCapacity bug (#10195)
  • 7db183e [AUTO-BACKPORT 10182] docs: docs changes for searcher context removal (#10194)
  • 23f9793 [AUTO-BACKPORT 10192] fix: keras continue from cloud checkpoint (#10193)
  • 508d400 [AUTO-BACKPORT 10174] docs: update docs for non-Trial-centric world (#10186)
  • 87f5ff8 [AUTO-BACKPORT 10188] fix: include max_length in continue expconf (#10190)
  • e725918 [AUTO-BACKPORT 10183] docs: fix typos in the release note (#10185)
  • 23687db [AUTO-BACKPORT 10178] docs: known issue of tb_plugin (#10181)
  • 5427a68 [AUTO-BACKPORT 10172] fix: ban archive columns in filter for experiment/search search (#10176)
  • 88c8887 [AUTO-BACKPORT 10173] fix: client.logout() re-enables client.login() (#10177)
  • 42f74e6 [AUTO-BACKPORT 10168] chore: ignore test_e2e_longrunning tests when merging auto-backports (#10179)
  • 020fc43 [AUTO-BACKPORT 10161] fix: fix diffusion example [DET-10470] (#10169)
  • c69aa68 [AUTO-BACKPORT 10140] fix: set max slots and checkpoint gc policy should comply with config policies (#10167)
  • b5e6315 fix: set max slots and checkpoint gc policy should comply with config policies (#10140)
  • 8e6a658 [AUTO-BACKPORT 10105] chore: change det deploy aws's default deployment type to simple-rds (#10162)
  • 6fc6710 [AUTO-BACKPORT 10153] docs: checkpoint storage note for config policies (#10165)
  • b366f80 [AUTO-BACKPORT 10138] feat: determined_master_host and friends helm support, better defaults (#10159)
  • d8afc57 [AUTO-BACKPORT 10155] fix: fix iris example to use reported metric name (#10156)
  • 38ae54b [AUTO-BACKPORT 10149] fix: error message fix for duplicate model name (#10154)
  • 47ba6a9 build: INFENG-943: GoReleaser configure prerelease (#10146)
  • aad58c1 build: INFENG-942: Conditionally bypass build-react job checks (#10145)
  • d7f0bbf chore: lock published urls to preserve redirects
  • e3c31f0 Temporarily disable GitHub Actions credentials.
  • 3be954b build: INFENG-938: Update version format in Makefiles (#10142)
  • 69b93b0 build: INFENG-940: Fix logic error in CircleCI config make-component job (#10143)
  • 00870f5 build: INFENG-937: Publish Helm chart release candidates (#10141)
  • 3910426 feat: remove searcher context from harness and master [MD-498] (#10131)
  • 27bebdd build: INFENG-938: Tweak version string format (#10139)
  • 30ad3c0 feat: add master configurations for access token max and default lifespans [DET-10464] (#10101)
  • 782f7a0 revert: "chore: determined_master_host and friends helm support, better defaults" (#10134)
  • 233e095 chore: add checkpoint and max slots config policy enforcements in PATCH experiment (#10125)
  • b3f928b chore: determined_master_host and friends helm support, better defaults (#10092)
  • 6755467 chore: bump Go version used by CI builds to 1.22.8 (#10127)
  • 834eeda feat: add actual select all to glide tables [ET-238] (#10081)
  • c7e0fb5 docs: add log signal release note and update docs (#10126)
  • 02fcc74 test: Add test for filtering user by Role Id (#10095)
  • f97fb5a build: INFENG-933: add GitHub action to start a minor release (#10112)
  • 685918d docs: Add aurora postgres release note (#10115)
  • a84f8c6 chore: SSO improvement feature requires Enterprise Edition. (#10124)
  • c71617c feat: Log Signal Exp Config and Monitoring (#9947)
  • 06b0b31 chore: fix merge exp flake (#10122)
  • 962810a chore: improve messaging when workspace configs conflict with global … (#10121)
  • 6158ef7 docs: Update postgres aurora info (#10116)
  • 4b0c065 docs: log policies restore exp config (#10120)
  • 186962c chore: add config policies to CLI reference docs (#10118)
  • 11ea6f4 chore: clarify version overrides during helm installs (#10094)
  • 4394f29 chore: standardize status api errors for task config policies (#10119)
  • e834302 fix: Add on delete cascade to system_metrics (#10113)
  • 3c59233 chore: populate final merged config with defaults when merging invariant configs (#10107)
  • deb3772 feat: additional APIs to support "actual select all" functions [ET-238] (#10102)
  • fd9cd8a feat: Allow master configuration for ssh key type (#10072)
  • 5e9df7c docs: Update release notes (#10114)
  • c655f33 docs: fix internal link in multi-rm docs page. (#10074)
  • e7186fe docs: Update log policies (#10098)
  • 993296b fix: update copy in experiment and trial headers (#10111)
  • d74a462 docs: Describe sso improvements (#10110)
  • 24d3390 chore: conditionally create VolumeSnapshotClass (#10103)
  • f45ebb9 chore: improve documentation surrounding slot caps helm configuration (#10090)
  • 0013fd0 ci: shorten test_pending_hpc.py (#10104)
  • 22ad457 fix: version upgrade notification bug [CM-411] (#10069)
  • 935fa66 fix: Log searche feedbacks (#10088)
  • 29a08ec Revert "docs: Describe arbitrary metadata logging" (#10099)
  • c6c476c chore: remove e2e_slurm_preemption test series (#10053)
  • e6182ed docs: Describe arbitrary metadata logging (#10073)
  • 539df5e chore: update CLI commands to work with global APIs (#10089)
  • 1f2bea0 feat: update ConfigPolicies with docs link [CM-558] (#10055)
  • 4afc15f build: INFENG-926: Fix version.sh version string output (#10085)
  • 04861dd chore: return error if workspace config violates global constraints (#10076)
  • 912f91e docs: task config policies release note (#10087)
  • 6d56101 fix: remove flake-inducing logretention global singleton (#10016)
  • b70a622 fix: correct token creation CLI to ensure it works with default expiry (#10084)
  • b155332 docs: Describe task config policies (#9969)
  • 27a014b fix: Tensorboard broken on unified install [CM-578] (#10080)
  • bdb56a4 chore: INFENG-922: use correct gh_team tag for infrastructure (#10077)
  • 91e358a INFENG-382: Release redesign (#10002)
  • 34e4749 chore: remove redundant rm.ExternalPreemptionPending interface (#10071)
  • 28bc072 feat: SSO Improvement - alter user_sessions table to include access token, implement CRUD ops, GET, POST, PATCH APIs and det token CLIs (#9867)
  • 472baf9 feat: Add copy task id to task list (#10058)
  • 2e822b7 chore: fix update invariant config and constraints (#10078)
  • d69f7cc chore(deps): bump google.golang.org/grpc from 1.64.0 to 1.64.1 (#9910)
  • e796b92 fix: run checkpoint GC more aggressively to ensure tensorboards are GC'd (#10017)
  • a14525f fix: nil deref in usage of incomplete experiment config policies (#10068)
  • 6c46a46 refactor: remove annotations requiring search ids in bulk action js (ET-241) (#10062)
  • 3ca3418 Docs: describe data files apptainer (#10020)
  • 315f65d chore: ntsc config not supported (#10056)
  • 2e8de9b test: User Management test updates [CM-468] (#10051)
  • 3fc9fed chore: experiment config slots to comply with constraint max slots (#10054)
  • 1d5c984 chore: fix slices and maps merge test (#10063)
  • 219409b chore: fix helptext for det user (#10060)
  • 7d6a1a7 docs: add k8s RP example to the helm values.yaml. (#10027)
  • 9efd96d fix: apply config policy constraints to PATCH /experiments/:id (#10048)
  • dd6aeda chore: change error code back (#10042)
  • 5a39ecb chore: check config policies on 'det notebook set priority' (#10047)
  • 2ef2f12 feat: bulk actions matching filters (ET-241) (#9895)
  • ac82b3c chore: default priority earlier to ensure constraints are satisfied [CM-553] (#10043)
  • 34557ef feat: Extend LogViewer to support scrollable search (#10005)
  • dadf75e chore: take invariant_config priority into account with manage job workflow (#10025)
  • 2356f91 chore: remove e2e_slurm_misconfigured series tests (#10023)
  • b243c26 ci: deflake test_disable_agent_zero_slots (#10040)
  • 4e0f1c4 chore: validate global, admin input against task config policies & constraints (#10028)
  • 3c1630f test: add e2e tests to the "move project" functionality on the "List View" (#10037)
  • 0613cc6 docs: revise postgres permission setup instructions. (#10039)
  • 2594d90 chore: remove e2e_slurm_gpu series tests (#10021)
  • 1f7ccad chore: exp invariant config silent override during add or update (#10019)
  • 30b197d feat: Global Config Policies UI [CM-522] (#10022)
  • c27054d feat: add e2e tests for multi-sort filter on experiments lista (#9992)
  • 9faa0cb chore: wait_for_task_state shows logs on failure (#10029)
  • a166826 fix: Workspace Projects and Tasks test flakes [CM-554] (#10026)
  • 33dfdaf test: Workspace Models tests [CM-538] (#9998)
  • 7e8dbac fix: Update action bar row layout in UserManagement page (#9862)
  • 5b1380c chore: check experiment constraints (#10018)
  • f609a2d fix: remove formatDatetime (#10011)
  • 9b6f0ac docs: Update release notes date (#9999)
  • f5400ea feat: Add regex search to task logs API (#9994)
  • ddca766 fix: correct expToWebhookConfig cache locking (#10014)
  • 80b29fa feat: Config Policies UI, Workspaces Experiments [CM-521] (#10009)
  • 262b4a9 chore: check task conf...

v0.38.0-ee

22 Nov 21:16
7154424
Pre-release

Release Notes

v0.38.0-ee

Changelog

  • 7154424 chore: release notes 0.38.0 (#10231)
  • 13e49a7 [AUTO-BACKPORT release-0.38.0] 10226: chore: eliminate use of fury repo (#10229)
  • a554cd0 [AUTO-BACKPORT release-0.38.0] 10224: fix: make some k8s tests pass (#10228)
  • 0d373b1 [AUTO-BACKPORT release-0.38.0] 10221: fix: use new migration gist (#10222)
  • c93b848 [AUTO-BACKPORT release-0.38.0] 10213: fix: port k8s perf fix (#10220)
  • 0cc57df chore: backport 10208 to release 0.38.0 (#10219)
  • 7d9c5ed [AUTO-BACKPORT release-0.38.0] 10216: fix: license check tests (#10217)
  • e2d8f47 [AUTO-BACKPORT release-0.38.0] 10206: ci: remove datadog from ci (#10214)
  • 9619dcf [AUTO-BACKPORT release-0.38.0] 10211: chore: fix license check (#10215)
  • 332cefc [AUTO-BACKPORT release-0.38.0] 10207: fix: revert: fix: resolve indefinitely queued (STOPPING_COMPLETED) trials (#10210)
  • e693655 [AUTO-BACKPORT release-0.38.0] 10203: revert: log search (#10205)
  • 50b7690 chore: 0.38.0 environment images (#10197)
  • bb6f140 [AUTO-BACKPORT 10160] fix: maxPoolSlotCapacity bug (#10195)
  • 7db183e [AUTO-BACKPORT 10182] docs: docs changes for searcher context removal (#10194)
  • 23f9793 [AUTO-BACKPORT 10192] fix: keras continue from cloud checkpoint (#10193)
  • 508d400 [AUTO-BACKPORT 10174] docs: update docs for non-Trial-centric world (#10186)
  • 87f5ff8 [AUTO-BACKPORT 10188] fix: include max_length in continue expconf (#10190)
  • e725918 [AUTO-BACKPORT 10183] docs: fix typos in the release note (#10185)
  • 23687db [AUTO-BACKPORT 10178] docs: known issue of tb_plugin (#10181)
  • 5427a68 [AUTO-BACKPORT 10172] fix: ban archive columns in filter for experiment/search search (#10176)
  • 88c8887 [AUTO-BACKPORT 10173] fix: client.logout() re-enables client.login() (#10177)
  • 42f74e6 [AUTO-BACKPORT 10168] chore: ignore test_e2e_longrunning tests when merging auto-backports (#10179)
  • 020fc43 [AUTO-BACKPORT 10161] fix: fix diffusion example [DET-10470] (#10169)
  • c69aa68 [AUTO-BACKPORT 10140] fix: set max slots and checkpoint gc policy should comply with config policies (#10167)
  • b5e6315 fix: set max slots and checkpoint gc policy should comply with config policies (#10140)
  • 8e6a658 [AUTO-BACKPORT 10105] chore: change det deploy aws's default deployment type to simple-rds (#10162)
  • 6fc6710 [AUTO-BACKPORT 10153] docs: checkpoint storage note for config policies (#10165)
  • b366f80 [AUTO-BACKPORT 10138] feat: determined_master_host and friends helm support, better defaults (#10159)
  • d8afc57 [AUTO-BACKPORT 10155] fix: fix iris example to use reported metric name (#10156)
  • 38ae54b [AUTO-BACKPORT 10149] fix: error message fix for duplicate model name (#10154)
  • 47ba6a9 build: INFENG-943: GoReleaser configure prerelease (#10146)
  • aad58c1 build: INFENG-942: Conditionally bypass build-react job checks (#10145)
  • d7f0bbf chore: lock published urls to preserve redirects
  • e3c31f0 Temporarily disable GitHub Actions credentials.
  • 3be954b build: INFENG-938: Update version format in Makefiles (#10142)
  • 69b93b0 build: INFENG-940: Fix logic error in CircleCI config make-component job (#10143)
  • 00870f5 build: INFENG-937: Publish Helm chart release candidates (#10141)
  • 3910426 feat: remove searcher context from harness and master [MD-498] (#10131)
  • 27bebdd build: INFENG-938: Tweak version string format (#10139)
  • 30ad3c0 feat: add master configurations for access token max and default lifespans [DET-10464] (#10101)
  • 782f7a0 revert: "chore: determined_master_host and friends helm support, better defaults" (#10134)
  • 233e095 chore: add checkpoint and max slots config policy enforcements in PATCH experiment (#10125)
  • b3f928b chore: determined_master_host and friends helm support, better defaults (#10092)
  • 6755467 chore: bump Go version used by CI builds to 1.22.8 (#10127)
  • 834eeda feat: add actual select all to glide tables [ET-238] (#10081)
  • c7e0fb5 docs: add log signal release note and update docs (#10126)
  • 02fcc74 test: Add test for filtering user by Role Id (#10095)
  • f97fb5a build: INFENG-933: add GitHub action to start a minor release (#10112)
  • 685918d docs: Add aurora postgres release note (#10115)
  • a84f8c6 chore: SSO improvement feature requires Enterprise Edition. (#10124)
  • c71617c feat: Log Signal Exp Config and Monitoring (#9947)
  • 06b0b31 chore: fix merge exp flake (#10122)
  • 962810a chore: improve messaging when workspace configs conflict with global … (#10121)
  • 6158ef7 docs: Update postgres aurora info (#10116)
  • 4b0c065 docs: log policies restore exp config (#10120)
  • 186962c chore: add config policies to CLI reference docs (#10118)
  • 11ea6f4 chore: clarify version overrides during helm installs (#10094)
  • 4394f29 chore: standardize status api errors for task config policies (#10119)
  • e834302 fix: Add on delete cascade to system_metrics (#10113)
  • 3c59233 chore: populate final merged config with defaults when merging invariant configs (#10107)
  • deb3772 feat: additional APIs to support "actual select all" functions [ET-238] (#10102)
  • fd9cd8a feat: Allow master configuration for ssh key type (#10072)
  • 5e9df7c docs: Update release notes (#10114)
  • c655f33 docs: fix internal link in multi-rm docs page. (#10074)
  • e7186fe docs: Update log policies (#10098)
  • 993296b fix: update copy in experiment and trial headers (#10111)
  • d74a462 docs: Describe sso improvements (#10110)
  • 24d3390 chore: conditionally create VolumeSnapshotClass (#10103)
  • f45ebb9 chore: improve documentation surrounding slot caps helm configuration (#10090)
  • 0013fd0 ci: shorten test_pending_hpc.py (#10104)
  • 22ad457 fix: version upgrade notification bug [CM-411] (#10069)
  • 935fa66 fix: Log searche feedbacks (#10088)
  • 29a08ec Revert "docs: Describe arbitrary metadata logging" (#10099)
  • c6c476c chore: remove e2e_slurm_preemption test series (#10053)
  • e6182ed docs: Describe arbitrary metadata logging (#10073)
  • 539df5e chore: update CLI commands to work with global APIs (#10089)
  • 1f2bea0 feat: update ConfigPolicies with docs link [CM-558] (#10055)
  • 4afc15f build: INFENG-926: Fix version.sh version string output (#10085)
  • 04861dd chore: return error if workspace config violates global constraints (#10076)
  • 912f91e docs: task config policies release note (#10087)
  • 6d56101 fix: remove flake-inducing logretention global singleton (#10016)
  • b70a622 fix: correct token creation CLI to ensure it works with default expiry (#10084)
  • b155332 docs: Describe task config policies (#9969)
  • 27a014b fix: Tensorboard broken on unified install [CM-578] (#10080)
  • bdb56a4 chore: INFENG-922: use correct gh_team tag for infrastructure (#10077)
  • 91e358a INFENG-382: Release redesign (#10002)
  • 34e4749 chore: remove redundant rm.ExternalPreemptionPending interface (#10071)
  • 28bc072 feat: SSO Improvement - alter user_sessions table to include access token, implement CRUD ops, GET, POST, PATCH APIs and det token CLIs (#9867)
  • 472baf9 feat: Add copy task id to task list (#10058)
  • 2e822b7 chore: fix update invariant config and constraints (#10078)
  • d69f7cc chore(deps): bump google.golang.org/grpc from 1.64.0 to 1.64.1 (#9910)
  • e796b92 fix: run checkpoint GC more aggressively to ensure tensorboards are GC'd (#10017)
  • a14525f fix: nil deref in usage of incomplete experiment config policies (#10068)
  • 6c46a46 refactor: remove annotations requiring search ids in bulk action js (ET-241) (#10062)
  • 3ca3418 Docs: describe data files apptainer (#10020)
  • 315f65d chore: ntsc config not supported (#10056)
  • 2e8de9b test: User Management test updates [CM-468] (#10051)
  • 3fc9fed chore: experiment config slots to comply with constraint max slots (#10054)
  • 1d5c984 chore: fix slices and maps merge test (#10063)
  • 219409b chore: fix helptext for det user (#10060)
  • 7d6a1a7 docs: add k8s RP example to the helm values.yaml. (#10027)
  • 9efd96d fix: apply config policy constraints to PATCH /experiments/:id (#10048)
  • dd6aeda chore: change error code back (#10042)
  • 5a39ecb chore: check config policies on 'det notebook set priority' (#10047)
  • 2ef2f12 feat: bulk actions matching filters (ET-241) (#9895)
  • ac82b3c chore: default priority earlier to ensure constraints are satisfied [CM-553] (#10043)
  • 34557ef feat: Extend LogViewer to support scrollable search (#10005)
  • dadf75e chore: take invariant_config priority into account with manage job workflow (#10025)
  • 2356f91 chore: remove e2e_slurm_misconfigured series tests (#10023)
  • b243c26 ci: deflake test_disable_agent_zero_slots (#10040)
  • 4e0f1c4 chore: validate global, admin input against task config policies & constraints (#10028)
  • 3c1630f test: add e2e tests to the "move project" functionality on the "List View" (#10037)
  • 0613cc6 docs: revise postgres permission setup instructions. (#10039)
  • 2594d90 chore: remove e2e_slurm_gpu series tests (#10021)
  • 1f7ccad chore: exp invariant config silent override during add or update (#10019)
  • 30b197d feat: Global Config Policies UI [CM-522] (#10022)
  • c27054d feat: add e2e tests for multi-sort filter on experiments lista (#9992)
  • 9faa0cb chore: wait_for_task_state shows logs on failure (#10029)
  • a166826 fix: Workspace Projects and Tasks test flakes [CM-554] (#10026)
  • 33dfdaf test: Workspace Models tests [CM-538] (#9998)
  • 7e8dbac fix: Update action bar row layout in UserManagement page (#9862)
  • 5b1380c chore: check experiment constraints (#10018)
  • f609a2d fix: remove formatDatetime (#10011)
  • 9b6f0ac docs: Update release notes date (#9999)
  • f5400ea feat: Add regex search to task logs API (#9994)
  • ddca766 fix: correct expToWebhookConfig cache locking (#10014)
  • 80b29fa feat: Config Policies UI, Workspaces Experiments [CM-521] (#10009)
  • 262b4a9 chore: check tas...

v0.38.0-rc9

21 Nov 03:36
0d373b1
Pre-release

Release Notes

v0.38.0-rc9

Changelog

  • 0d373b1 [AUTO-BACKPORT release-0.38.0] 10221: fix: use new migration gist (#10222)
  • c93b848 [AUTO-BACKPORT release-0.38.0] 10213: fix: port k8s perf fix (#10220)
  • 0cc57df chore: backport 10208 to release 0.38.0 (#10219)
  • 7d9c5ed [AUTO-BACKPORT release-0.38.0] 10216: fix: license check tests (#10217)
  • e2d8f47 [AUTO-BACKPORT release-0.38.0] 10206: ci: remove datadog from ci (#10214)
  • 9619dcf [AUTO-BACKPORT release-0.38.0] 10211: chore: fix license check (#10215)
  • 332cefc [AUTO-BACKPORT release-0.38.0] 10207: fix: revert: fix: resolve indefinitely queued (STOPPING_COMPLETED) trials (#10210)
  • e693655 [AUTO-BACKPORT release-0.38.0] 10203: revert: log search (#10205)
  • 50b7690 chore: 0.38.0 environment images (#10197)
  • bb6f140 [AUTO-BACKPORT 10160] fix: maxPoolSlotCapacity bug (#10195)
  • 7db183e [AUTO-BACKPORT 10182] docs: docs changes for searcher context removal (#10194)
  • 23f9793 [AUTO-BACKPORT 10192] fix: keras continue from cloud checkpoint (#10193)
  • 508d400 [AUTO-BACKPORT 10174] docs: update docs for non-Trial-centric world (#10186)
  • 87f5ff8 [AUTO-BACKPORT 10188] fix: include max_length in continue expconf (#10190)
  • e725918 [AUTO-BACKPORT 10183] docs: fix typos in the release note (#10185)
  • 23687db [AUTO-BACKPORT 10178] docs: known issue of tb_plugin (#10181)
  • 5427a68 [AUTO-BACKPORT 10172] fix: ban archive columns in filter for experiment/search search (#10176)
  • 88c8887 [AUTO-BACKPORT 10173] fix: client.logout() re-enables client.login() (#10177)
  • 42f74e6 [AUTO-BACKPORT 10168] chore: ignore test_e2e_longrunning tests when merging auto-backports (#10179)
  • 020fc43 [AUTO-BACKPORT 10161] fix: fix diffusion example [DET-10470] (#10169)
  • c69aa68 [AUTO-BACKPORT 10140] fix: set max slots and checkpoint gc policy should comply with config policies (#10167)
  • b5e6315 fix: set max slots and checkpoint gc policy should comply with config policies (#10140)
  • 8e6a658 [AUTO-BACKPORT 10105] chore: change det deploy aws's default deployment type to simple-rds (#10162)
  • 6fc6710 [AUTO-BACKPORT 10153] docs: checkpoint storage note for config policies (#10165)
  • b366f80 [AUTO-BACKPORT 10138] feat: determined_master_host and friends helm support, better defaults (#10159)
  • d8afc57 [AUTO-BACKPORT 10155] fix: fix iris example to use reported metric name (#10156)
  • 38ae54b [AUTO-BACKPORT 10149] fix: error message fix for duplicate model name (#10154)
  • 47ba6a9 build: INFENG-943: GoReleaser configure prerelease (#10146)
  • aad58c1 build: INFENG-942: Conditionally bypass build-react job checks (#10145)
  • d7f0bbf chore: lock published urls to preserve redirects
  • e3c31f0 Temporarily disable GitHub Actions credentials.
  • 3be954b build: INFENG-938: Update version format in Makefiles (#10142)
  • 69b93b0 build: INFENG-940: Fix logic error in CircleCI config make-component job (#10143)
  • 00870f5 build: INFENG-937: Publish Helm chart release candidates (#10141)
  • 3910426 feat: remove searcher context from harness and master [MD-498] (#10131)
  • 27bebdd build: INFENG-938: Tweak version string format (#10139)
  • 30ad3c0 feat: add master configurations for access token max and default lifespans [DET-10464] (#10101)
  • 782f7a0 revert: "chore: determined_master_host and friends helm support, better defaults" (#10134)
  • 233e095 chore: add checkpoint and max slots config policy enforcements in PATCH experiment (#10125)
  • b3f928b chore: determined_master_host and friends helm support, better defaults (#10092)
  • 6755467 chore: bump Go version used by CI builds to 1.22.8 (#10127)
  • 834eeda feat: add actual select all to glide tables [ET-238] (#10081)
  • c7e0fb5 docs: add log signal release note and update docs (#10126)
  • 02fcc74 test: Add test for filtering user by Role Id (#10095)
  • f97fb5a build: INFENG-933: add GitHub action to start a minor release (#10112)
  • 685918d docs: Add aurora postgres release note (#10115)
  • a84f8c6 chore: SSO improvement feature requires Enterprise Edition. (#10124)
  • c71617c feat: Log Signal Exp Config and Monitoring (#9947)
  • 06b0b31 chore: fix merge exp flake (#10122)
  • 962810a chore: improve messaging when workspace configs conflict with global … (#10121)
  • 6158ef7 docs: Update postgres aurora info (#10116)
  • 4b0c065 docs: log policies restore exp config (#10120)
  • 186962c chore: add config policies to CLI reference docs (#10118)
  • 11ea6f4 chore: clarify version overrides during helm installs (#10094)
  • 4394f29 chore: standardize status api errors for task config policies (#10119)
  • e834302 fix: Add on delete cascade to system_metrics (#10113)
  • 3c59233 chore: populate final merged config with defaults when merging invariant configs (#10107)
  • deb3772 feat: additional APIs to support "actual select all" functions [ET-238] (#10102)
  • fd9cd8a feat: Allow master configuration for ssh key type (#10072)
  • 5e9df7c docs: Update release notes (#10114)
  • c655f33 docs: fix internal link in multi-rm docs page. (#10074)
  • e7186fe docs: Update log policies (#10098)
  • 993296b fix: update copy in experiment and trial headers (#10111)
  • d74a462 docs: Describe sso improvements (#10110)
  • 24d3390 chore: conditionally create VolumeSnapshotClass (#10103)
  • f45ebb9 chore: improve documentation surrounding slot caps helm configuration (#10090)
  • 0013fd0 ci: shorten test_pending_hpc.py (#10104)
  • 22ad457 fix: version upgrade notification bug [CM-411] (#10069)
  • 935fa66 fix: Log searche feedbacks (#10088)
  • 29a08ec Revert "docs: Describe arbitrary metadata logging" (#10099)
  • c6c476c chore: remove e2e_slurm_preemption test series (#10053)
  • e6182ed docs: Describe arbitrary metadata logging (#10073)
  • 539df5e chore: update CLI commands to work with global APIs (#10089)
  • 1f2bea0 feat: update ConfigPolicies with docs link [CM-558] (#10055)
  • 4afc15f build: INFENG-926: Fix version.sh version string output (#10085)
  • 04861dd chore: return error if workspace config violates global constraints (#10076)
  • 912f91e docs: task config policies release note (#10087)
  • 6d56101 fix: remove flake-inducing logretention global singleton (#10016)
  • b70a622 fix: correct token creation CLI to ensure it works with default expiry (#10084)
  • b155332 docs: Describe task config policies (#9969)
  • 27a014b fix: Tensorboard broken on unified install [CM-578] (#10080)
  • bdb56a4 chore: INFENG-922: use correct gh_team tag for infrastructure (#10077)
  • 91e358a INFENG-382: Release redesign (#10002)
  • 34e4749 chore: remove redundant rm.ExternalPreemptionPending interface (#10071)
  • 28bc072 feat: SSO Improvement - alter user_sessions table to include access token, implement CRUD ops, GET, POST, PATCH APIs and det token CLIs (#9867)
  • 472baf9 feat: Add copy task id to task list (#10058)
  • 2e822b7 chore: fix update invariant config and constraints (#10078)
  • d69f7cc chore(deps): bump google.golang.org/grpc from 1.64.0 to 1.64.1 (#9910)
  • e796b92 fix: run checkpoint GC more aggressively to ensure tensorboards are GC'd (#10017)
  • a14525f fix: nil deref in usage of incomplete experiment config policies (#10068)
  • 6c46a46 refactor: remove annotations requiring search ids in bulk action js (ET-241) (#10062)
  • 3ca3418 Docs: describe data files apptainer (#10020)
  • 315f65d chore: ntsc config not supported (#10056)
  • 2e8de9b test: User Management test updates [CM-468] (#10051)
  • 3fc9fed chore: experiment config slots to comply with constraint max slots (#10054)
  • 1d5c984 chore: fix slices and maps merge test (#10063)
  • 219409b chore: fix helptext for det user (#10060)
  • 7d6a1a7 docs: add k8s RP example to the helm values.yaml. (#10027)
  • 9efd96d fix: apply config policy constraints to PATCH /experiments/:id (#10048)
  • dd6aeda chore: change error code back (#10042)
  • 5a39ecb chore: check config policies on 'det notebook set priority' (#10047)
  • 2ef2f12 feat: bulk actions matching filters (ET-241) (#9895)
  • ac82b3c chore: default priority earlier to ensure constraints are satisfied [CM-553] (#10043)
  • 34557ef feat: Extend LogViewer to support scrollable search (#10005)
  • dadf75e chore: take invariant_config priority into account with manage job workflow (#10025)
  • 2356f91 chore: remove e2e_slurm_misconfigured series tests (#10023)
  • b243c26 ci: deflake test_disable_agent_zero_slots (#10040)
  • 4e0f1c4 chore: validate global, admin input against task config policies & constraints (#10028)
  • 3c1630f test: add e2e tests to the "move project" functionality on the "List View" (#10037)
  • 0613cc6 docs: revise postgres permission setup instructions. (#10039)
  • 2594d90 chore: remove e2e_slurm_gpu series tests (#10021)
  • 1f7ccad chore: exp invariant config silent override during add or update (#10019)
  • 30b197d feat: Global Config Policies UI [CM-522] (#10022)
  • c27054d feat: add e2e tests for multi-sort filter on experiments lista (#9992)
  • 9faa0cb chore: wait_for_task_state shows logs on failure (#10029)
  • a166826 fix: Workspace Projects and Tasks test flakes [CM-554] (#10026)
  • 33dfdaf test: Workspace Models tests [CM-538] (#9998)
  • 7e8dbac fix: Update action bar row layout in UserManagement page (#9862)
  • 5b1380c chore: check experiment constraints (#10018)
  • f609a2d fix: remove formatDatetime (#10011)
  • 9b6f0ac docs: Update release notes date (#9999)
  • f5400ea feat: Add regex search to task logs API (#9994)
  • ddca766 fix: correct expToWebhookConfig cache locking (#10014)
  • 80b29fa feat: Config Policies UI, Workspaces Experiments [CM-521] (#10009)
  • 262b4a9 chore: check task config policies against slots and max_slots (#10015)
  • a0cc818 ci: replace no_op fixture with a noop api (#9997)
  • 987b2a5 test: add e2e experiment list pagination test (#9993)
  • 1297899 fix: use UID not username to set H...

v0.38.0-rc10

21 Nov 17:16
13e49a7
Pre-release

Release Notes

v0.38.0-rc10

Changelog

  • 13e49a7 [AUTO-BACKPORT release-0.38.0] 10226: chore: eliminate use of fury repo (#10229)
  • a554cd0 [AUTO-BACKPORT release-0.38.0] 10224: fix: make some k8s tests pass (#10228)

0.35.1

09 Nov 01:04

Release Notes

0.35.1

Changelog

  • 9d4bed2 chore: bump version: 0.35.1-rc0 -> 0.35.1
  • 46b3761 fix: perf issue with too many API reqs when listing pods in all ns (#10202)
  • 5b03599 chore: bump version: 0.35.0 -> 0.35.1-rc0
  • 4182da4 chore: bump current environment image versions to 0.35.1

v0.38.0-rc8

04 Nov 17:58
bb6f140
Pre-release

Release Notes

v0.38.0-rc8

Changelog

  • bb6f140 [AUTO-BACKPORT 10160] fix: maxPoolSlotCapacity bug (#10195)

v0.38.0-rc7

04 Nov 15:11
7db183e
Pre-release

Release Notes

v0.38.0-rc7

Changelog

  • 7db183e [AUTO-BACKPORT 10182] docs: docs changes for searcher context removal (#10194)
  • 23f9793 [AUTO-BACKPORT 10192] fix: keras continue from cloud checkpoint (#10193)

v0.38.0-rc6

01 Nov 20:58
508d400
Pre-release

Release Notes

v0.38.0-rc6

Changelog

  • 508d400 [AUTO-BACKPORT 10174] docs: update docs for non-Trial-centric world (#10186)
  • 87f5ff8 [AUTO-BACKPORT 10188] fix: include max_length in continue expconf (#10190)

v0.38.0-rc5

01 Nov 14:51
e725918
Pre-release

Release Notes

v0.38.0-rc5

Changelog

  • e725918 [AUTO-BACKPORT 10183] docs: fix typos in the release note (#10185)
  • 23687db [AUTO-BACKPORT 10178] docs: known issue of tb_plugin (#10181)
  • 5427a68 [AUTO-BACKPORT 10172] fix: ban archive columns in filter for experiment/search search (#10176)
  • 88c8887 [AUTO-BACKPORT 10173] fix: client.logout() re-enables client.login() (#10177)
  • 42f74e6 [AUTO-BACKPORT 10168] chore: ignore test_e2e_longrunning tests when merging auto-backports (#10179)
  • 020fc43 [AUTO-BACKPORT 10161] fix: fix diffusion example [DET-10470] (#10169)

v0.38.0-rc4

31 Oct 18:07
c69aa68
Pre-release

Release Notes

v0.38.0-rc4

Changelog

  • c69aa68 [AUTO-BACKPORT 10140] fix: set max slots and checkpoint gc policy should comply with config policies (#10167)
  • b5e6315 fix: set max slots and checkpoint gc policy should comply with config policies (#10140)
  • 8e6a658 [AUTO-BACKPORT 10105] chore: change det deploy aws's default deployment type to simple-rds (#10162)
  • 6fc6710 [AUTO-BACKPORT 10153] docs: checkpoint storage note for config policies (#10165)
  • b366f80 [AUTO-BACKPORT 10138] feat: determined_master_host and friends helm support, better defaults (#10159)