CLARIAH-PLUS use cases

Introduction

This repository collects all use cases for CLARIAH-PLUS, across all work packages and interest groups. Our aim is to have a central repository where everybody can gain insight into the wide variety of use cases we currently deal with. With this transparency we intend to foster cooperation and provide a basis for discussion and implementation, both in the CLARIAH interest groups and beyond.

At this stage, the structure of the use cases is still very much open-ended. We intend to gather use cases first and distill a more structured format at a later stage.

How to contribute?

Everybody working in CLARIAH is welcome and encouraged to contribute their use cases. Please see the contribution guidelines and the template.

Use cases

The listing below links to all use cases, grouped by state but otherwise in no particular order. A better ordering will be applied at a later stage and is subject to further discussion as we grow.

Proposed

| Name | WP | IG | Tech |
| --- | --- | --- | --- |
| A Cross-Media Analysis of the Refugee Crisis | 5, 6 | WF, AV, TP | MediaSuite |
| Annotatable collections | 6, 3, 5 | Ann, AV, LoD | |
| Corpus-based history of ideas | 6 | TP | |
| Deep-learning for Dutch text mining | 3 | TP | DeepFrog |
| Exploiting UD treebanks for the extraction of word combination statistics | 3 | TP | |
| Exploring Dialect2Keyword | 3 | Ann, Cur | |
| Historical enrichment: Proof of concept corpus exploitation | 3, 6 | TP | |
| Historical research across (audio)visual and textual archives and collections | 5 | Ann, WF, Cur | MediaSuite |
| Interoperability Integrated Text Annotation and Linked Open Data | 3, 6 | Ann, LoD, TP | FoLiA |
| Key point detection/pose analysis for Eye Jean Desmet collection through DANE | 5 | AV | MediaSuite |
| Micro-frontends for manual scholarly annotations | 6, (2, 3, 5) | Ann, LoD, Prv | |
| Parallel corpus mining | 3 | TP | |
| Parallel historical corpus mining | 3 | TP | |
| Sex, Beer and RomComs: Studying the Debate on Dutch film | 5, 2, 3, 4, 6 | AV, TP, Ann, LoD, WF, Cur, UI | |
| Stories in Motion: Integrating oral histories into the Media Suite | 5 | AV | MediaSuite |
| Triples-workbench: store, browse, query and visualize triples | 4 | LoD, UI, WF | |
| [Neural-Tscan]: adapt the existing tscan for stylistic analysis of text with neural models and linguistic features for interpretability | 3, 6 | | FoLiA, NAF |

In progress

| Name | WP | IG | Tech |
| --- | --- | --- | --- |
| ASR for sensitive data | 3 | AV | |
| Computer vision annotations 'n' enrichments of audiovisual data | 5 | AV, DO, WF | DANE |
| Curation of transcribed historical newspaper corpus (Wp6 Use case 2) | 6 | TP | |
| DANS CMDI use case for CLARIAH WP3 | 3 | Prv | CMDI |
| Digitization workflow for historical newspapers (WP6 use case 2) | 6, 3 | TP | |
| Historical research on media-events across heterogenous broadcast datasets with linked and missing data | 5 | AV, TP, Ann, LoD | MediaSuite |
| Linkage of Dutch Civil Records | 4 | LoD, WF, Cur | burgerLinker |
| Providing Language and Speech webservices at CLST (Radboud University, Nijmegen) | 3, 2 | DO, TP, WF | CLAM, LaMachine |
| Retrodigitization of Text-critical Editions | 3 | TP, Ann | FoLiA, FLAT |
| Speech transcription of audiovisual data | 5 | AV, Do | Media Suite |
| Store, share and search web annotations | 6, 3, 5 | AG, LoD | |
| Tokenising, lemmatising, tagging and dependency parsing annotation of Frisian text using UD Pipe Frysk | 3 | Ann, TP | UDPipe |
| Tracing Re-use | 5 | WF, AN, AVP | Media Suite |
| Vocab Recommender | 4 | LoD, CuR, WF | |
| WP6 Use Case 005: Lossless Text Data Exchange | 6 | TP | |
| WP6 Use case 3: Tools to data | 6 | DevOps, TP | |
| WP6 use case VOC | 6 | TP | |

Completed

| Name | WP | IG | Tech |
| --- | --- | --- | --- |
| Annotation of spelling correction for CLIN28 Shared Task | 3 | Ann | FLAT |
| Automatic linguistic enrichment for Dutch texts using Frog | 3 | TP | Frog, FoLiA |
| COW: Integrated CSV to RDF converter | 4 | LoD, Cur, WF | COW |
| Data format for linguistically-annotated corpora | 3 | Ann, TP, LoD | FoLiA |
| Extracting Information about Flood Disasters | 3 | Ann | FLAT |
| Nederlab: Automatic Linguistic Enrichment of Historical Dutch | 3, 6 | TP, Ann | Frog, FoLiA |
| Negation Annotation in Dutch dialogue | 3 | Ann | FLAT |
| PARSEME: Annotation of verbal multi-word expressions | 3 | Ann | FoLiA |
| PICCL deployment at a CLARIN centre | 3 | DO | LaMachine |
| Quickly building webservices with CLAM | 3, 2 | DO, WF, TP | CLAM |
| Research Environment for Workshop: Cataloguing of Textual Cultural Heritage Objects | 3 | DO, TP | LaMachine |
| Syntactic Movement Annotation | 3 | Ann | FLAT |
| Tools to the data: Text Mining for Health Inspection | 3 | DO, TP | LaMachine, Frog |
| grlc -> sparql queries as api and with metadata | 4 | LoD | grlc |

Legend

Interest groups:

The technology column refers to the most prominent CLARIAH products that feature in the use case (keep it short):

  • BurgerLinker - Command line tool for linking civil registries
  • CLAM - A framework to quickly build RESTful webservices and have a generic web-UI
  • CMDI - CLARIN's Component Metadata Infrastructure
  • COW - CSV to RDF converter (CSVW)
  • DANE - Handles compute task assignment and file storage for the automatic annotation of content
  • DeepFrog - Deep-learning NLP tool & models for Dutch
  • FLAT - A web-based annotation tool for (linguistic) annotation of text documents
  • FoLiA - An XML-based Format for Linguistic Annotation
  • Frog - An NLP-suite for Dutch
  • grlc - Converts your SPARQL queries into RESTful APIs
  • LaMachine - A meta-distribution with various NLP/CLARIAH tools and services
  • MediaSuite - A research environment to search, analyse, and annotate media collections.