Apache Hadoop docker images

These images are part of the Bigdata docker image series. All of the images use the same base docker image, which contains plugin scripts to launch different projects in containerized environments.

For more detailed instructions about the available environment variables, see the README in the flokkr/docker-baseimage repository.

The Docker images are tested with Kubernetes.

Getting started with Kubernetes

The easiest way to start is to run kubectl apply -f . from one of the ./examples directories. (Note: these examples use ephemeral storage!)
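As a minimal sketch of that quick start, the helper below applies every manifest in a chosen example directory and then waits for the pods to become ready. The function name `apply_example`, the directory argument, and the 300-second timeout are illustrative assumptions and not part of this repository. It also assumes kubectl is already configured against a running cluster.

```shell
#!/bin/sh
# Hypothetical helper (not part of this repository):
# apply all manifests in one example directory and wait for the pods.
# Usage: apply_example <directory>
apply_example() {
  dir="$1"
  if [ ! -d "$dir" ]; then
    echo "no such directory: $dir" >&2
    return 1
  fi
  if ! command -v kubectl >/dev/null 2>&1; then
    echo "kubectl not found on PATH" >&2
    return 1
  fi
  # Apply every resource file in the directory (ephemeral storage only).
  kubectl apply -f "$dir" || return 1
  # Wait until all pods in the current namespace report Ready.
  # The 300s timeout is an assumption; adjust it for your cluster.
  kubectl wait --for=condition=Ready pod --all --timeout=300s
}
```

For example, calling `apply_example .` from inside one of the example directories mirrors the manual command. Because the storage is ephemeral, data is lost when the pods are removed.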

For more specific use cases, it's recommended to use Flekszible. The resource definitions can be found in this repository (./hadoop, ./hdfs, ./yarn...)

Getting started with Flekszible

Install Flekszible (download the binary and put it on your PATH)

Create a working directory

cd /tmp
mkdir cluster
cd cluster

Add this repository as a source

flekszible source add github.com/flokkr/docker-hadoop

Choose and add required services:

flekszible app add hdfs

Generate Kubernetes resource files

flekszible generate

Launch the rockets:

kubectl apply -f .
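The steps above can be collected into one small script. This is only a sketch: the function names `run` and `bootstrap_cluster` and the `DRY_RUN` variable are hypothetical conveniences, not features of Flekszible or kubectl. With `DRY_RUN=1` the commands are printed instead of executed, which is useful for checking the sequence before touching a cluster.

```shell
#!/bin/sh
# Hypothetical wrapper: print the command when DRY_RUN=1, otherwise run it.
run() {
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

# Hypothetical helper that repeats the documented steps in order.
# Usage: bootstrap_cluster [workdir] [app]
# The body runs in a subshell so the caller's working directory is unchanged.
bootstrap_cluster() (
  workdir="${1:-/tmp/cluster}"
  app="${2:-hdfs}"
  # Create the working directory and enter it.
  mkdir -p "$workdir" || exit 1
  cd "$workdir" || exit 1
  # Add this repository as a source.
  run flekszible source add github.com/flokkr/docker-hadoop || exit 1
  # Choose and add the required service.
  run flekszible app add "$app" || exit 1
  # Generate the Kubernetes resource files.
  run flekszible generate || exit 1
  # Apply the generated resources to the cluster.
  run kubectl apply -f .
)
```

A dry run such as `DRY_RUN=1 bootstrap_cluster /tmp/cluster hdfs` creates the directory and prints the four commands without running them.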

Additional Flekszible options

You can list available apps (after source import):

flekszible app search
+---------+-------------------------------+
| path    | description                   |
+---------+-------------------------------+
| hdfs    | Apache Hadoop HDFS base setup |
| hdfs-ha | Apache Hadoop HDFS, HA setup  |
...

The base setup can be modified with additional transformations:

flekszible definitions search | grep hdfs
...
| hdfs/persistence    | Add real PVC based persistence                                                             |
| hdfs/onenode        | remove scheduling rules to make it possible to run multiple datanode on the same k8s node. |
...

You can apply transformations by modifying the Flekszible descriptor file:

Original version:

source:
- url: github.com/flokkr/docker-hadoop
import:
- path: hdfs

Modified:

source:
- url: github.com/flokkr/docker-hadoop
import:
- path: hdfs
  transformations:
  - type: hdfs/onenode
  - type: image
    image: flokkr/hadoop:3.2.0
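The modified descriptor can also be written from a script, for example when the image tag needs to vary between environments. The sketch below assumes the descriptor is the file named `Flekszible` in the working directory; the function name `write_descriptor` and its arguments are hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: write the modified descriptor shown above.
# Usage: write_descriptor [dir] [image]
write_descriptor() {
  dir="${1:-.}"
  image="${2:-flokkr/hadoop:3.2.0}"
  # Assumption: the descriptor is the file named "Flekszible" in the working directory.
  cat > "$dir/Flekszible" <<EOF
source:
- url: github.com/flokkr/docker-hadoop
import:
- path: hdfs
  transformations:
  - type: hdfs/onenode
  - type: image
    image: $image
EOF
}
```

After changing the descriptor, the resource files presumably need to be regenerated with flekszible generate and applied again with kubectl apply -f . for the transformations to take effect.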
