Skip to content

v0.21.0

Compare
Choose a tag to compare
@teo teo released this 09 Mar 16:34
· 1267 commits to master since this release

This release brings major improvements to AliECS, including multiple fixes for common error conditions, a refactored environment teardown sequence, a reimplementation of the ODC (EPN subcontrol) client based on the new integration plugin system, and miscellaneous bug fixes.

DEPRECATION NOTICE: starting with AliECS v0.21.0 the o2-aliecs-odc-shim component is deprecated and expected to be removed in the near future. It is still accessible by setting odcshim_enabled to true. The existing variable odc_enabled, which was previously used to enable/disable odc-shim, now controls the new implementation.

  • Error handling:

    • [core] Transition configure should return immediately after failing the workflow deployment
    • [core] Log which role failed during deployment
    • [core] Improve output on deployment failure
    • [core] Environment should kill tasks after failed deployment
    • [core] Fix panic close nil channel on teardown
    • [core] Adjust CreateAutoEnvironment to the new Teardown
    • [core] Send event with the correct TaskID
    • [executor] Correct timeout for Kill sequence GetState
    • [executor] Fix PID running check for negative PID (PGID)
    • [executor] Kill leftover tasks when rpc failed to start
  • Environment termination and cleanup:

    • [core] Ensure every Teardown API call is forced if necessary
    • [core] Refactor environment.Manager.CreateEnvironment
    • [core] Use TeardownEnvironment to cleanup after a failed deployment
  • Integration:

    • [core] New ddsched.EnsureTermination call, to be used as DESTROY hook
    • [core] New ODC integration plugin
    • [core] Sort functions in ODC integration plugin
    • [core] Add EnsureCleanup to ODC functions
    • [core] Concurrently initialize all plugins
    • [core] Reduce integration plugin dial timeouts
  • Miscellaneous:

    • [coconut] Print number of FLPs for environment
    • [common] Add ALICE Run 3 System ID mapping
    • [common] Add some non-standard system IDs for convenience
    • [core] Expose number of FLPs per environment
    • [core] Await expressions, optional with trigger
    • [executor] Reduce gRPC dial timeout to 30s thanks to modules v4
    • [misc] Update default configuration parameters
    • [walnut] Fix workflow template schema