diff --git a/.gitignore b/.gitignore
new file mode 100644
index 0000000..d944721
--- /dev/null
+++ b/.gitignore
@@ -0,0 +1,2 @@
+_site/
+Gemfile.lock
diff --git a/Gemfile b/Gemfile
new file mode 100644
index 0000000..8c73203
--- /dev/null
+++ b/Gemfile
@@ -0,0 +1,4 @@
+source 'https://rubygems.org'
+gem "github-pages", group: :jekyll_plugins
+
+gem "webrick", "~> 1.8"
diff --git a/README.md b/README.md
new file mode 100644
index 0000000..460588b
--- /dev/null
+++ b/README.md
@@ -0,0 +1,194 @@
+# Corpus of German misogynistic hatespeech posts (GMHP7k)
+## A German Corpus on misogynistic hatespeech posts from Twitter
+On this page we provide the data set for the corpus on German misogynistic hatespeech posts (GMHP7k), which was first presented on the [18th International AAAI Conference on Web and Social Media](https://www.icwsm.org/2024/) (ICWSM 2024) along with a dataset paper.
+
+
+## Description
+We provide a German corpus consisting of 7,061 posts authored by users of social media platforms. A group of volunteers annotated each post according to hatespeech and misogynistic/misogynous hatespeech in a binary fashion. The inter-rater reliability over all annotators according to Fleiss’ Kappa is 0.6409 for hatespeech and 0.8258 for misogynistic hatespeech. Furthermore, baseline measurements with machine learning based text classification with BERT are presented. Initial experiments with the corpus achieve macro average F1-scores up to 0.79 for hatespeech and 0.75 for misogynistic hatespeech.
+
+### Classes to annotate
+During annotation, volunteers rated two aspects of a post: the presence of *hatespeech* and *misogynistic hatespeech*. The availability of hatespeech depends on perception of the comment text by the annotators and can be rated as *hatespeech* or *not hatespeech*. The misogynistic hatespeech, on the other hand, can be either *misogynistic hatespeech* or *not misogynistic hatespeech*.
+
+### Data Description
+
+
+ {% if site.title and site.title != page.title %}
+
+ {% endif %}
+
+ {{ content }}
+
+ {% if site.github.private != true and site.github.license %}
+
+ {% endif %}
+
+
+
+ {% if site.google_analytics %}
+
+ {% endif %}
+
+
diff --git a/assets/css/style.scss b/assets/css/style.scss
new file mode 100644
index 0000000..ff9937e
--- /dev/null
+++ b/assets/css/style.scss
@@ -0,0 +1 @@
+@import "{{ site.theme }}";
diff --git a/images/interrater-reliability_hs.png b/images/interrater-reliability_hs.png
new file mode 100644
index 0000000..14f8098
Binary files /dev/null and b/images/interrater-reliability_hs.png differ
diff --git a/images/interrater-reliability_mhs.png b/images/interrater-reliability_mhs.png
new file mode 100644
index 0000000..d150cb3
Binary files /dev/null and b/images/interrater-reliability_mhs.png differ
diff --git a/images/logo-ccwi.png b/images/logo-ccwi.png
new file mode 100644
index 0000000..6b07184
Binary files /dev/null and b/images/logo-ccwi.png differ
diff --git a/images/wordcloud_hatespeech_50.png b/images/wordcloud_hatespeech_50.png
new file mode 100644
index 0000000..1870b89
Binary files /dev/null and b/images/wordcloud_hatespeech_50.png differ
diff --git a/images/wordcloud_misogynistic_hatespeech_50.png b/images/wordcloud_misogynistic_hatespeech_50.png
new file mode 100644
index 0000000..cfbd646
Binary files /dev/null and b/images/wordcloud_misogynistic_hatespeech_50.png differ
diff --git a/images/wordcloud_neutral_50.png b/images/wordcloud_neutral_50.png
new file mode 100644
index 0000000..6d3870f
Binary files /dev/null and b/images/wordcloud_neutral_50.png differ