Source files and build instructions for an OCI image (compatible with e.g. Docker or Podman) for pdftk-java. If PDF is electronic paper, then pdftk-java is an electronic staple-remover, hole-punch, binder, secret-decoder-ring, and X-Ray-glasses. PDFtk is a simple tool for doing everyday things with PDF documents: Merge PDF documents, split PDF pages into a new document, decrypt input as necessary (password required), encrypt output as desired, burst a PDF document into single pages, report on PDF metrics, including metadata and bookmarks, uncompress and re-compress page streams, and repair corrupted PDF (where possible).
Pdftk-java is a port of the original GCJ-based PDFtk to Java. The GNU Compiler for Java (GCJ) is a portable, optimizing, ahead-of-time compiler for the Java programming language, which had no new developments since 2009 and was finally removed in 2016 from the GCC development tree before the release of GCC 7.
The OCI image automatically runs pdftk-java with the given options and arguments. It may be started with Docker using:
docker run --rm --volume $(pwd):/work pdftk/pdftk:latest --help
And it may be started with Podman using:
podman run --rm --volume $(pwd):/work quay.io/pdftk/pdftk:latest --help
For command-line convenience it might be suitable to alias
the command above, e.g.:
alias pdftk='podman run --rm --volume $(pwd):/work quay.io/pdftk/pdftk:latest'
/work
- Default working directory for pdftk-java.
While none of the volumes is required, meaningful usage requires at least persistent storage for /work
.
For custom OCI images, the following build arguments can be passed:
VERSION
- Version of the pdftk-java release tarball, defaults to3.3.3
.GIT
- Git repository URL of pdftk-java, defaults tohttps://gitlab.com/pdftk-java/pdftk.git
.COMMIT
- Git commit, branch or tag of pdftk-java, e.g.master
, unset by default.
To build a custom OCI image from current Git, e.g. --build-arg COMMIT=master
needs to be passed.
Docker Hub and Quay can both automatically build OCI images from a linked GitHub account and automatically push the built image to the respective container repository. However, as of writing, this leads to OCI images for only the amd64
CPU architecture. To support as many CPU architectures as possible (currently 386
, amd64
, arm/v6
, arm/v7
, arm64/v8
, ppc64le
and s390x
), GitHub Actions are used. There, the current standard workflow "Build and push OCI image" roughly uses first a GitHub Action to install QEMU static binaries, then a GitHub Action to set up Docker Buildx and finally a GitHub Action to build and push Docker images with Buildx.
Thus the OCI images are effectively built within the GitHub infrastructure (using free minutes for public repositories) and then only pushed to both container repositories, Docker Hub and Quay (which are also free for public repositories). This not only saves repeated CPU resources but also ensures identical bugs independent from which container repository the OCI image gets finally pulled (and somehow tries to keep it distant from program changes such as Docker Hub Rate Limiting in 2020). The authentication for the pushes to the container repositories happen using access tokens, which at Docker Hub need to be bound to a (community) user and at Quay using a robot account as part of the organization. These access tokens are saved as "repository secrets" as part of the settings of the GitHub project.
For each release of the project, a new Git branch (named like the version of the release, e.g. 3.3.3
) is created (based on the default branch, e.g. master
). The workflow takes care about creating and moving container tags, such as latest
. By not using Git tags but branches, downstream bug fixes can be easily applied to the OCI image (e.g. for bugs in the Dockerfile
or patches for the source code itself). Old branches are not touched anymore, equivalent to old release archives.
Each commit to a Git branch triggers the workflow and leads to OCI images being pushed (except for GitHub pull requests), where the container tag is always based on the Git branch name. OCI images with non-release container tags pushed for testing purposes need to be cleaned up manually at the container repositories. Additionally, a cron-like option in the workflow leads to a nightly build being also tagged as edge
.
Re-running a workflow for failed builds can be performed using the GitHub web interface at the "Actions" section. However, to re-run older or successful builds (e.g. to achieve a newer operating system base image layer for an existing release), git commit --allow-empty -m "Reason" && git push
might do the trick (because the GitHub Actions API doesn't seem to allow such re-runs either).
This project is licensed under the GNU General Public License, version 2 or later - see the LICENSE file for details.
As with all OCI images, these also contain other software under other licenses (such as BusyBox, OpenJDK etc. from the base distribution, along with any direct or indirect dependencies of the contained pdftk-java).
As for any pre-built image usage, it is the image user's responsibility to ensure that any use of this image complies with any relevant licenses for all software contained within.