[docs] Update information on AliECS production deployment
teo committed Jul 4, 2024
1 parent e556b5a commit e996a90
Showing 1 changed file with 12 additions and 0 deletions.
12 changes: 12 additions & 0 deletions docs/running.md
@@ -27,3 +27,15 @@ http://centosvmtest:5050/api/v1/scheduler
```

See [Using `coconut`](./coconut/README.md) for instructions on the O² Control core command line interface.

# Running AliECS in production

The AliECS core runs as a systemd service in the O²/FLP cluster at Point 2.

## Health checks

A checker script polls AliECS for its status (`coconut env list`).

1) The checker script (`checkAliECScore`, available in GL) now makes 3 attempts with a 10-second timeout.
2) All failed attempts are recorded in the aliecs local file `/tmp/checkAliECScore.out`.
3) The ILG message is issued on the third consecutive failure.
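
The retry behaviour described above can be sketched as follows. This is only an illustrative Python sketch, not the actual `checkAliECScore` script: the function name `check_core`, its parameters, the log line format, and the `notify` callback (standing in for whatever issues the ILG message) are all assumptions. It also assumes the 10-second timeout applies to each attempt.

```python
import subprocess
import time
from datetime import datetime, timezone


def check_core(cmd=("coconut", "env", "list"), attempts=3, timeout_s=10,
               log_path="/tmp/checkAliECScore.out", notify=print, pause_s=0.0):
    """Hypothetical health check: True if cmd succeeds within `attempts` tries."""
    for attempt in range(1, attempts + 1):
        try:
            # Assumption: a zero exit code from the status command means "healthy".
            result = subprocess.run(list(cmd), capture_output=True, timeout=timeout_s)
            if result.returncode == 0:
                return True
            reason = f"exit code {result.returncode}"
        except subprocess.TimeoutExpired:
            reason = f"timed out after {timeout_s}s"
        except OSError as exc:
            reason = f"could not run command: {exc}"

        # Every failed attempt is appended to the local log file.
        with open(log_path, "a") as log:
            stamp = datetime.now(timezone.utc).isoformat()
            log.write(f"{stamp} attempt {attempt}/{attempts} failed: {reason}\n")

        if attempt < attempts:
            time.sleep(pause_s)

    # Reached only after `attempts` consecutive failures; the real script
    # issues an ILG message here, which this sketch replaces with `notify`.
    notify(f"AliECS core check failed {attempts} consecutive times")
    return False
```

Calling `check_core()` with its defaults would run `coconut env list` up to three times. Passing a different `cmd`, `log_path`, or `notify` makes the sketch easy to try without a running AliECS core.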
