- Maintained by: the Posit Docker team
- Where to get help: our Github Issues page
- RStudio Workbench image: Docker Hub
- RStudio r-session-complete image: Docker Hub
Posit Workbench, formerly RStudio Workbench, is the preferred data analysis and integrated development experience for professional R users and data science teams who use R and Python. Posit Workbench enables the collaboration, centralized management, metrics, security, and commercial support that professional data science teams need to operate at scale.
Some of the functionality that Workbench provides is:
- The ability to develop in Workbench and Jupyter
- Load balancing
- Tutorial API
- Data connectivity and Posit Professional Drivers (formerly RStudio Professional Drivers)
- Collaboration and project sharing
- Scale with Kubernetes and SLURM
- Authentication, access, & security
- Run multiple concurrent R and Python sessions
- Remote execution with Launcher
- Auditing and monitoring
- Advanced R and Python session management
For more information on running RStudio Workbench in your organization please visit https://www.rstudio.com/products/workbench/.
- This image may introduce BREAKING changes; as such we recommend:
- Avoid using the
{operating-system}
tags to avoid unexpected version changes, and - Always read through the NEWS to understand the changes before updating.
- Avoid using the
- Outdated images will be removed periodically from DockerHub as product version updates are made. Please make plans to update at times or use your own build of the images.
- These images are meant as a starting point for your needs. Consider creating a fork of this repo, where you can continue to merge in changes we make while having your own security scanning, base OS in use, or other custom changes. We provide instructions for building for these cases.
- Security Note: These images are provided AS IS based on the build environment at the time their product version was released/updated. They should be reviewed and updated before production use. If your organization has a specific set of security requirements related to CVE/Vulnerability severity levels, you should plan to use the instructions for building to clone this repository, and rebuild these images to your specific internal security standards.
To verify basic functionality as a first step:
# Replace with valid license
export RSW_LICENSE=XXXX-XXXX-XXXX-XXXX-XXXX-XXXX-XXXX
# Run without persistent data using default configuration
docker run -it \
-p 8787:8787 \
-e RSW_LICENSE=$RSW_LICENSE \
rstudio/rstudio-workbench:ubuntu1804
Open http://localhost:8787 to access RStudio Workbench. The default username and password are
rstudio
.
Note that running the RStudio Workbench Docker image requires a valid RStudio Workbench license.
This container includes:
- Two versions of R
- Two versions of Python
- RStudio Workbench
RStudio Workbench is configured via config files in the /etc/rstudio
directory. Mount this directory as
a volume from the host machine. Changes will take effect when the container is restarted.
You can review possible RStudio Workbench configuration in the documentation.
See a complete example of server configuration at workbench/conf
.
In order to persist user files between container restarts please mount the /home
directory from a persistent volume on the host
machine or your docker orchestration system.
The RStudio Workbench Docker image requires a valid license, which can be set in three ways:
- Setting the
RSW_LICENSE
environment variable to a valid license key inside the container - Setting the
RSW_LICENSE_SERVER
environment variable to a valid license server / port inside the container - Mounting a
/etc/rstudio-server/license.lic
single file that contains a valid license for RStudio Workbench
NOTE: the "offline activation process" is not supported by this image today. Offline installations will need to explore using a license server, license file, or custom image with manual intervention.
By default, the container will create a test user, which you can control or disable with the environment
variables: RSW_TESTUSER
, RSW_TESTUSER_PASSWD
, RSW_TESTUSER_UID
.
If you have a directory (LDAP server, Active Directory, etc.) available to
provision users, sssd
is installed in the container and enabled by default (
see Process Management
below). In order to make use of it, you will need to
mount your own configuration file into /etc/sssd/conf.d/
. For instance,
sssd.conf
[sssd]
config_file_version = 2
domains = LDAP
[domain/LDAP]
id_provider = ldap
auth_provider = ldap
chpass_provider = ldap
sudo_provider = ldap
# ... more configuration
Then:
# sssd is picky about file permissions
chmod 600 sssd.conf
docker run -it \
-p 8787:8787 -p 5559:5559 \
-v $PWD/data/rsp:/home \
-v $PWD/server-pro/conf/:/etc/rstudio \
-v $PWD/sssd.conf:/etc/sssd/conf.d/sssd.conf \
-e RSW_LICENSE=$RSW_LICENSE \
rstudio/rstudio-workbench:ubuntu1804
It is worth noting that you may also need to modify the PAM configuration files in the container, if you are using PAM for custom authentication or session behavior. See the RStudio Workbench guide for more information.
Variable | Description | Default |
---|---|---|
RSW_TESTUSER |
Test user to be created in the container, turn off with an empty value | rstudio |
RSW_TESTUSER_PASSWD |
Test user password | rstudio |
RSW_TESTUSER_UID |
Test user UID | 10000 |
RSW_LICENSE |
License key for RStudio Workbench, format should be: XXXX-XXXX-XXXX-XXXX-XXXX-XXXX-XXXX |
None |
RSW_LICENSE_SERVER |
Floating license server, format should be: my.url.com:port |
None |
RSW_LAUNCHER |
Whether or not to use launcher locally / start the launcher process | true |
RSW_LAUNCHER_TIMEOUT |
The timeout, in seconds, to wait for launcher to start listening on the expected port before failing startup | 10 |
Variable | Description |
---|---|
8787 |
Default HTTP Port for RStudio Connect |
5559 |
Port for RStudio Launcher server |
# Replace with valid license
export RSW_LICENSE=XXXX-XXXX-XXXX-XXXX-XXXX-XXXX-XXXX
# Run without persistent data and using an external configuration
docker run -it \
-p 8787:8787 -p 5559:5559 \
-v $PWD/workbench/conf/:/etc/rstudio \
-e RSW_LICENSE=$RSW_LICENSE \
rstudio/rstudio-workbench:ubuntu1804
# Run with persistent data and using an external configuration
docker run -it \
-p 8787:8787 -p 5559:5559 \
-v $PWD/data/rsw:/home \
-v $PWD/workbench/conf/:/etc/rstudio \
-e RSW_LICENSE=$RSW_LICENSE \
rstudio/rstudio-workbench:ubuntu1804
Open http://localhost:8787 to access RStudio Workbench.
The default username and password are rstudio
.
In order for RStudio Workbench to function properly, several services need to
be accounted for. We run these services using
supervisord
. supervisord
is an open source
process supervisor with an active community. It enables running multiple
services in the container, as well as exiting the container if any of those
services exit.
NOTE: generally speaking, running multiple services in a single container is an anti-pattern. However, we have implemented the below as a workaround until RStudio Workbench can handle users and other processes in a more container-friendly fashion
Details on the various processes and their configuration is below:
-
RStudio Workbench: the main server process
- this startup configuration is mounted at
/startup/base
- this startup configuration is mounted at
-
RStudio Job Launcher: enables launching Jupyter, JupyterLab, and VSCode sessions, as well as talking to job schedulers like Slurm and Kubernetes.
- Optional and enabled by default
- this startup configuration is mounted at
/startup/launcher
- to disable, mount an empty volume over
/startup/launcher
-
sssd: often used for user provisioning when connected to an LDAP directory or other user store.
- Optional, and enabled by default, but with a "dummy" domain so it does nothing.
- To use this with your directory, mount required
.conf
files into/etc/sssd/conf.d/
(more details inUser Provisioning
, above) - Startup configuration is installed at
/startup/user-provisioning/
- To disable entirely, mount an empty volume over
/startup/user-provisioning/
-
custom: Do you have a service that you need to run inside the container for user provisioning or otherwise? Mount other configuration files into
/startup/custom
, and they will be started and managed bysupervisord
as well- NOTE: in many cases (i.e. Kubernetes)
initContainers
orsidecar
containers are a better fit
- NOTE: in many cases (i.e. Kubernetes)
Note: This section does not apply to activations using license files.
There is currently a known licensing bug when using our products in containers. If the container is not stopped gracefully, the license deactivation step may fail or be skipped. Failing to deactivate the license can result in a "license leak" where a product activation is used up and cannot be deactivated using traditional methods as the activation state on the container has been lost.
To avoid "leaking" licenses, we encourage users not to force kill containers and to use --stop-timeout 120
and
--time 120
for docker run
and docker stop
commands respectively. This helps ensure the deactivation script has
ample time to run properly.
In some situations, it can be difficult or impossible to avoid a hard termination (e.g. power failure, critical error on host). Unfortunately, any of these cases can still cause a license to leak an activation. To help prevent a license leak in these situations, users can mount the following directories to persistent storage to preserve the license state data across restarts of the container. These directories differ between products.
- License Key
/var/lib/.local
/var/lib/.prof
/var/lib/rstudio-workbench
- Floating License
/var/lib/.TurboFloat
Please note that the files created in these directories are hardware locked and non-transferable between hosts. Due to the nature of the hardware fingerprinting algorithm, any low-level changes to the host or container can cause existing license state files to invalidate. To avoid this problem, we advise that product containers are gracefully shutdown and allowed to deactivate prior to changing any hardware or firmware on the host (e.g. upgrading a network card or updating BIOS) or the container (e.g. changing the network driver used or the allocated number of CPU cores).
While preserving license state data can help avoid license leaks across restarts, it's not a guarantee. If you run into issues with your license, please do not hesitate to contact Posit support.
While neither of these solutions will eliminate the problem, they should help mitigate it. We are still investigating a long-term solution.
The license associated with the RStudio Docker Products repository is located in LICENSE.md.
As is the case with all container images, the images themselves also contain other software which may be under other licenses (i.e. bash, linux, system libraries, etc., along with any other direct or indirect dependencies of the primary software being contained).
It is an image user's responsibility to ensure that use of this image (and any of its dependent layers) complies with all relevant licenses for the software contained in the image.