Cocoon Scheduler

This Dart project contains logic for constructing infrastructure configs to validate commits in the repositories owned by Flutter.

ci.yaml

This is the config file in a repository used to tell Cocoon what tasks are used to validate commits. It includes both the tasks used in presubmit and postsubmit.

In addition, it supports tasks from different infrastructures, as long as Cocoon supports that scheduler. Currently only LUCI and Cocoon are supported, but contributions are welcome.

Example config:

# /.ci.yaml

# Enabled branches is a list of regexes, with the assumption that these are full line matches.
# Internally, Cocoon prefixes these with ^ and suffixes them with $ to enforce full line matches.
enabled_branches:
  - main
  - flutter-\d+\.\d+-candidate\.\d+

# Platform properties defines common properties shared among targets from the same platform.
platform_properties:
  linux:
    properties:
      # os will be inherited by all Linux targets, but it can be overridden at the target level
      os: Linux

targets:
# A Target is an individual unit of work that is scheduled by Flutter infra
# Targets are composed of the following properties:
# name: A human readable string to uniquely identify this target.
#       The first word indicates the platform this test will be run on. This should match
#       to an existing platform under platform_properties.
# recipe: LUCI recipe the target follows to run tests
#          https://flutter.googlesource.com/recipes/+/refs/heads/main/recipes/
# bringup: Whether this target is under active development and should not block the tree.
#          If true, will not run in presubmit and will not block postsubmit.
# presubmit: Whether to run this target on presubmit (defaults to true).
# postsubmit: Whether to run this target on postsubmit (defaults to true).
# run_if: List of path regexes that can trigger this target on presubmit.
#         If none are passed, it will evaluate run_if_not. If both are empty the target
#         will always run in presubmit.
# run_if_not: List of path regexes used to filter out presubmit targets. The target will
#         be run only if the files changed do not match any paths in this list. If run_if
#         is provided and not empty run_if_not will be ignored.
# enabled_branches: List of strings of branches this target can run on.
#                   This overrides the global enabled_branches.
# properties: A map of string, string. Values are parsed to their closest data model.
# postsubmit_properties: Properties that are only run on postsubmit.
# timeout: Integer defining whole build execution time limit for all steps in minutes.
# dimensions: A list of testbed dimensions that the CI uses to determine which testbed to assign a target to.
#
# Minimal example:
# Linux analyze will run in both presubmit and postsubmit.
 - name: Linux analyze
#
# Bringup example:
# Linux licenses will run on postsubmit, and it also passes the property
# `analyze=license` to the builder. Since `bringup=true`, presubmit is not run,
# and postsubmit runs will not block the tree.
 - name: Linux licenses
   bringup: true
   properties:
     analyze: license

#
# Tags example:
# This test will be categorized as host only framework test.
# Postsubmit runs will be passed "upload_metrics: true".
 - name: Linux analyze
   properties:
     tags: >-
       ["framework", "hostonly"]
   postsubmit_properties:
     upload_metrics: "true"

#
# Devicelab example:
# For tests that are located in https://github.com/flutter/flutter/tree/master/dev/devicelab/bin/tasks:
# 1) target name follows format of `<platform> <taskname>`
# 2) properties
#    2.1) update `tags` based on hosts, devices, and test type. These tags will be used for statistical analysis.
#    2.2) a `task_name` property is required, which should match the task name
#
# Here is the target config for a task named: `analyzer_benchmark.dart`.
 - name: Linux_android analyzer_benchmark
   recipe: devicelab/devicelab_drone
   presubmit: false
   properties:
     tags: >
       ["devicelab", "android", "linux"]
     task_name: analyzer_benchmark
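
The presubmit rules described in the config comments above (bringup, presubmit, enabled_branches, run_if, and run_if_not) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not Cocoon's actual implementation: the function names `branch_enabled` and `runs_in_presubmit` are hypothetical, targets are modeled as plain dicts, and path patterns are treated as Python regexes matched from the start of each changed path.

```python
import re


def branch_enabled(branch, enabled_branches):
    """Return True if `branch` fully matches any enabled-branch regex."""
    # re.fullmatch mirrors wrapping each pattern in ^ and $.
    return any(re.fullmatch(pattern, branch) for pattern in enabled_branches)


def runs_in_presubmit(target, branch, changed_files, global_branches):
    """Decide whether a target (a dict) is scheduled in presubmit."""
    # bringup targets never run in presubmit; presubmit defaults to true.
    if target.get("bringup", False) or not target.get("presubmit", True):
        return False
    # A target-level enabled_branches overrides the global list.
    branches = target.get("enabled_branches") or global_branches
    if not branch_enabled(branch, branches):
        return False
    run_if = target.get("run_if") or []
    run_if_not = target.get("run_if_not") or []
    if run_if:
        # run_if takes precedence; run_if_not is ignored when it is set.
        return any(re.match(p, f) for p in run_if for f in changed_files)
    if run_if_not:
        # Run only if no changed file matches an excluded path.
        return not any(re.match(p, f) for p in run_if_not for f in changed_files)
    # Neither filter is set: always run in presubmit.
    return True


# Hypothetical inputs for illustration.
global_branches = ["main", r"flutter-\d+\.\d+-candidate\.\d+"]
target = {"name": "Linux analyze", "run_if": [r"dev/.*"]}
print(runs_in_presubmit(target, "main", ["dev/bots/test.dart"], global_branches))
```

With these inputs the target is scheduled, because the branch matches `main` and one changed file matches the `run_if` pattern. A change that only touches files outside `dev/` would skip it.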

Adding new targets

All new targets should be added as bringup: true to ensure they do not block the tree.

Targets first need to be mirrored to flutter/infra before they will run. This propagation takes about 30 minutes, after which the target runs only as non-blocking in postsubmit.

The target will show runs in https://ci.chromium.org/p/flutter (under the repo). See https://github.com/flutter/flutter/blob/master/docs/infra/Adding-a-new-Test-Shard.md for up-to-date information on the steps to promote your target to blocking.

For flutter/flutter, there's a GitHub bot that will promote a test that has been passing for the past 50 runs.

Test Ownership

This only applies to flutter/flutter.

To prevent tests from rotting, all targets are required to have a clear owner. Add an owner in TESTOWNERS.

Properties

Targets support specifying properties that can be passed throughout infrastructure. The following is a list of keys that are reserved for special use.

Properties is a Map<String, String>, and any special values must be JSON encoded (e.g. no trailing commas). Additionally, these strings must be compatible with YAML multiline strings.

| Property Name | Description | Default Value | Type | Example |
| --- | --- | --- | --- | --- |
| add_recipes_cq | Whether to add this target to flutter/recipes CQ. This ensures changes to flutter/recipes pass on this target before landing. | "false" | string bool | `add_recipes_cq: "true"` |
| cache_name | The name identifier of the second layer Engine source cache. This is maintained by the flutter/infra team via the cache.py recipe and is separate from LUCI side default caches. | N/A | string | `cache_name: "builder"` |
| cache_paths | The paths of Engine checkout source that will be auto saved to CAS to speed up source checkout when caches no longer exist on the bots. | N/A | list | `cache_paths: >- [ "builder", "git" ]` |
| clobber | Whether to clean the Engine source code cache. | "false" | string bool | `clobber: "true"` |
| config_name | The config name of the target. It is used for `Engine V2 recipes`, and is a one-to-one mapping to the config files located under `ci/builders`. This is not needed for targets that do not use `Engine V2 recipes`. | N/A | string | `config_name: linux_benchmarks` |
| contexts | The list of contexts that recipes add to the ExitStack. For example, `android_virtual_device` initializes and prepares the virtual device used for tests. Other supported contexts include `osx_sdk`, `depot_tools_on_path`, etc. | N/A | list | `contexts: >- [ "android_virtual_device" ]` |
| cores | The machine cores a target will be running against. A higher number of cores may be needed for extensive targets. Note: This property will be auto populated to CI builder dimensions, which CI uses to determine the testbed to run this target. | N/A | string int | `cores: "8"` |
| dependencies | JSON list of objects with "dependency" and optionally "version". The list of supported deps is in the flutter_deps recipe_module. Dependencies generate a corresponding swarming cache that can be used in the recipe code. The path of the cache will be the name of the dependency. Versions can be located in CIPD. | N/A | list | `dependencies: >- [ {"dependency": "android_sdk"}, {"dependency": "chrome_and_driver", "version": "latest"}, {"dependency": "clang"}, {"dependency": "goldctl"} ]` |
| device_type | The phone device type a target will be running against. For host only targets that do not need a phone, a value of `none` should be used. Note: This property will be auto populated to CI builder dimensions, which CI uses to determine the testbed to run this target. | N/A | string | `device_type: "msm8952"` |
| drone_dimensions | A list of testbed dimensions that the CI uses to determine which testbed to assign a subbuild drone of a target to. This property will be auto populated to CI dimensions of a subbuild triggered from the orchestrator target. | N/A | list | `drone_dimensions: > ["device_type=none", "os=Linux"]` |
| $flutter/osx_sdk | Xcode configs including sdk and runtime. | N/A | map | `$flutter/osx_sdk : >- { "sdk_version": "14e222b", "runtime_versions": [ "ios-16-4_14e222b", "ios-16-2_14c18" ] }` |
| gclient_variables | The gclient variables populated to recipes when checking out sources via gclient sync. | N/A | map | `gclient_variables: >- { "download_android_deps": "true" }` |
| ignore_cache_paths | The paths of Engine checkout source that will be skipped when saved to CAS. See `cache_paths`. | N/A | list | `ignore_cache_paths: >- [ "builder/src/flutter/prebuilts/SDKs", "builder/src/flutter/prebuilts/Library" ]` |
| no_goma | Whether to disable goma when building artifacts. | "false" | string bool | `no_goma: "true"` |
| os | The machine os a target will be running against, such as `Linux`, `Mac-13`, etc. Note: This property will be auto populated to CI builder dimensions, which CI uses to determine the testbed to run this target. | N/A | string | `os: Linux` |
| presubmit_max_attempts | The max attempts the target will be auto executed in presubmit. If it is not specified, the default value is `1`, which means no auto rerun will happen. If explicitly defined, it controls the max number of attempts. For example, `3` means it will be auto rescheduled two more times. | "1" | string int | `presubmit_max_attempts: "3"` |
| release_build | Whether the target is required to run to release the Engine. Will be triggered via release_builder.py. | "false" | string bool | `release_build: "true"` |
| shard | The shard name of the sharding target, used in the test.dart test runner. | N/A | string | `shard: web_tests` |
| subshards | The sub shards of the sharding target, used in the test.dart test runner. If omitted with `shard` defined, it will run all unit tests in a single shard. | N/A | list | `subshards: >- ["0", "1", "2", "3", "4", "5", "6", "7_last"]` |
| tags | JSON list of strings. These are currently only used in flutter/flutter to help with test ownership and test flakiness. | N/A | list | `tags: > ["devicelab","hostonly"]` |
| test_timeout_secs | String determining seconds before timeout for an individual test step. Note that this is the timeout for a single test step rather than the entire build execution timeout. | "1800" | string int | `test_timeout_secs: "2700"` |
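
Because every property value is a string that may carry JSON, a consumer has to decode each value to its closest data type. The following Python sketch shows one way this could work. It is an assumption for illustration, not the parsing code Cocoon or the recipes actually use, and the helper name `decode_property` is hypothetical.

```python
import json


def decode_property(value):
    """Decode one property string to its closest Python value.

    Valid JSON (numbers, booleans, lists, maps) is decoded; anything else,
    such as a bare word like Linux, is kept as the original string.
    """
    try:
        return json.loads(value)
    except (json.JSONDecodeError, TypeError):
        return value


# Property values as they would appear after YAML parsing (all strings).
properties = {
    "os": "Linux",
    "cores": "8",
    "add_recipes_cq": "true",
    "tags": '["framework", "hostonly"]',
}
decoded = {key: decode_property(value) for key, value in properties.items()}
print(decoded)
```

In this sketch a value with a trailing comma, such as `["framework", "hostonly",]`, is not valid JSON and therefore stays an undecoded string, which illustrates why the JSON encoding rule matters.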

Updating targets

Properties

1. Find the cipd ref to upgrade to
   - If this is a Flutter managed package, look up its docs on uploading a new version
   - For example, JDK is at https://chrome-infra-packages.appspot.com/p/flutter_internal/java/openjdk/linux-amd64
2. In ci.yaml, find a target that would be impacted by this change
   - Override the version specified in dependencies
     ```
     - name: Linux Host Engine
       recipe: engine
       properties:
         build_host: "true"
         dependencies: >-
           [
             {"dependency": "open_jdk", "version": "11"}
           ]
       timeout: 60
     ```
   - Send a PR and wait for the checks to go green (the change takes effect on both presubmit and postsubmit because Cocoon scheduling fetches the latest change and applies it to new builds immediately)
3. If the check is red, add patches to get it green
4. Once the PR has landed, infrastructure may take 1 or 2 commits to apply the latest properties
   1. PRs/commits that have rebased on the changing PR do not need to wait
   2. PRs/commits that have not rebased on the changing PR need to wait
   3. Local LUCI runs need to wait
   4. The package cache needs to wait for the rollout

Note: updates to entries other than properties will not take effect immediately. The PR needs to land first, and the changes then need time to propagate through infrastructure.
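
Since `dependencies` is a JSON list stored in a string, overriding a version amounts to decoding the list, changing one entry, and encoding it again. The sketch below shows that step in Python. The helper name `override_dependency` and the version `"17"` are hypothetical and chosen only for illustration.

```python
import json


def override_dependency(dependencies_json, name, version):
    """Return the dependencies JSON string with `name` pinned to `version`."""
    deps = json.loads(dependencies_json)
    for dep in deps:
        if dep["dependency"] == name:
            dep["version"] = version
            break
    else:
        # The dependency was not listed yet, so add it.
        deps.append({"dependency": name, "version": version})
    return json.dumps(deps)


current = '[{"dependency": "open_jdk", "version": "11"}, {"dependency": "goldctl"}]'
updated = override_dependency(current, "open_jdk", "17")  # "17" is a made-up version
print(updated)
```

The result is still a single JSON string, so it can be pasted back into the target's `dependencies` property.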

Update target platform

A target uses the platform prefix in its name to decide which platform to run on. This prefix should match an existing platform under platform_properties.
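
As a rough illustration of how the name prefix ties a target to its platform, the Python sketch below derives the platform from the first word of the target name and merges the platform's properties with the target's own, letting the target override inherited values. The function names are hypothetical, and the lowercase lookup is an assumption based on the lowercase keys used under platform_properties in the example config.

```python
def platform_of(target_name):
    """Return the platform key: the first word of the name, lowercased."""
    return target_name.split(" ")[0].lower()


def resolve_properties(target, platform_properties):
    """Merge platform properties with target properties; the target wins."""
    platform = platform_of(target["name"])
    if platform not in platform_properties:
        raise KeyError(f"no platform_properties entry for {platform!r}")
    merged = dict(platform_properties[platform].get("properties", {}))
    merged.update(target.get("properties", {}))
    return merged


platform_properties = {"linux": {"properties": {"os": "Linux"}}}
target = {"name": "Linux analyze", "properties": {"cores": "8"}}
print(resolve_properties(target, platform_properties))
```

Here `os` is inherited from the `linux` platform while `cores` comes from the target itself. A target named `Linux_android ...` would instead need a `linux_android` entry under platform_properties.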

If one target needs to switch running platforms, e.g. from a devicelab bot to a host only bot:

1. Keep the old target entry
2. Add a new entry under the new platform with
   - bringup: true
   - necessary dependencies
   - corresponding tags (tags will only be used for infra metrics analysis)
3. Land the change with the new entry
4. If the new target under the new platform passes in postsubmit
   - Remove the old target entry and mark the new target as bringup: false

Example: say one wants to switch Linux_android web_size__compile_test to a VM.

Existing config:

- name: Linux_android web_size__compile_test
  properties:
    tags: >
        ["devicelab", "android", "linux"]

Add a new config:

- name: Linux web_size__compile_test
  bringup: true # new target
  properties:
    dependencies: >- # optional
      [
        {"dependency": "new-dependency", "version": "new-dependency-version"}
      ]
    tags: >
      ["devicelab", "hostonly", "linux"]

After validating that the new target passes, land the cleanup change by removing the config of the old target Linux_android web_size__compile_test and removing bringup: true from the new target.

Note: this change may affect benchmark metrics. Notify the metrics sheriff to monitor potential regressions.

External Tests

Cocoon supports tests that are not owned by Flutter infrastructure. By default, these should not block the tree but act as FYI to the gardeners.

1. Contact flutter-infra@ with your request (go/flutter-infra-office-hours)
2. Add your system to SchedulerSystem (https://github.com/flutter/cocoon/blob/master/app_dart/lib/src/model/proto/internal/scheduler.proto)
3. Add your service account to https://github.com/flutter/cocoon/blob/master/app_dart/lib/src/request_handling/swarming_authentication.dart
4. Add a custom frontend icon - https://github.com/flutter/cocoon/blob/master/dashboard/lib/widgets/task_icon.dart
5. Add a custom log link - https://github.com/flutter/cocoon/blob/master/dashboard/lib/logic/qualified_task.dart
6. Wait for the next prod roll (every weekday)
7. Add a target to .ci.yaml

   # .ci.yaml
   # Name is an arbitrary string that will show on the build dashboard
   - name: my_external_test_a
     # External tests should not block the tree
     bringup: true
     presubmit: false
     # Scheduler must match what was added to scheduler.proto (any unique name works)
     scheduler: my_external_location

8. Send updates to https://flutter-dashboard.appspot.com/api/update-task-status - https://github.com/flutter/cocoon/blob/master/app_dart/lib/src/request_handlers/update_task_status.dart

Scheduling Targets

Targets using the Cocoon scheduler can run on:

- Presubmit (via GitHub checks)
- Postsubmit (via build dashboard)

By default, all targets should use the Cocoon scheduler.

Presubmit Features

- GitHub checks enable targets to run immediately, and are available on the pull request page.
- Changes to the ci.yaml will be applied during those presubmit runs.
- New targets are required to be brought up with bringup: true

Postsubmit Features

- Targets are immediately triggered on GitHub webhooks for merged pull requests
- Updates are made immediately via LUCI PubSub notifications
- Prioritizes recently failed targets (to unblock the tree quicker)
- Backfills targets at a low swarming priority when nothing is actively running
- Batches targets that have a high queue time, and backfills in off peak hours
- Flakiness monitoring
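
The prioritization described above can be pictured as choosing a swarming priority per scheduled run. The Python sketch below is purely illustrative: the constant names and numeric values are hypothetical, and it only relies on the Swarming convention that a lower priority number is scheduled sooner.

```python
# Hypothetical priority values; in Swarming a lower number is scheduled sooner.
DEFAULT_PRIORITY = 30
FAILED_RERUN_PRIORITY = 29
BACKFILL_PRIORITY = 35


def pick_priority(previous_run_failed, is_backfill):
    """Pick a swarming priority for a postsubmit run of a target."""
    if is_backfill:
        # Backfills yield to everything that is actively scheduled.
        return BACKFILL_PRIORITY
    if previous_run_failed:
        # Recently failed targets run first to unblock the tree quicker.
        return FAILED_RERUN_PRIORITY
    return DEFAULT_PRIORITY


print(pick_priority(previous_run_failed=True, is_backfill=False))
```

The exact numbers matter less than their ordering: reruns of failed targets come before regular runs, and backfills come last.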
