Skip to content
dirkpieper edited this page Jan 30, 2020 · 5 revisions

Minimal requirements

  • The data contains an academic institution's expenditures on a per-article basis for publishing in fee-based Wiley journals
  • The data should be made available in a machine-readable, platform independent format (CSV).
  • The data is provided under an Open Data Commons license to ensure public access and reusability.
  • A contact person is designated at the contributing institution.

Data set

In comparison to APCs in OA and hybrid journals, there are no established reporting routines for costs incurred for articles in transformation contracts. OpenAPC has already created an additional data set (visualisation) for data from transformation contracts as part of the INTACT project, but only a few data reported by the FWF contain cost information to date. In the course of the current funding phase by the BMBF, OpenAPC intends to report articles from transformation contracts with cost information per article and participating institution from 2020 onwards. This is in line with current recommendations (Expert Group to the European Commission, European University Association, Coalition S / Plan S), which stress the need for cost transparency in transformation contracts as well. In addition, cost data are relevant for the transformation of acquisition budgets and the development of cost distribution models at local and national level. With the DEAL Wiley deal, a nationwide transformation contract is available in Germany for the first time.

The data set for transformation contracts is compiled from the distributed tables of the contributing institutions, since MPDL Services GmbH does not yet want to support the institutions participating in DEAL with a central data delivery. In order to make the data comparable, the publisher and journal titles will be added using automatic enrichment procedures (CrossRef). By further enriching the records with information e.g. from the disciplinary repository Europe PubMed Central or the DOAJ, additional fields are automatically filled in.

As discussed at the Open Access Days 2019, the data from the DEAL Wiley hybrid journals will be recorded in the data set for transformative agreements, the data for DEAL Wiley Open Access journals in the original OpenAPC data set. This takes into account the fact that APCs and PAR fees are different. The connecting element, however, is that they each represent (average) costs per article.

For visibility and re-use purposes, the data set is made available via GitHub.

Data schema

Every schema field is represented by a table column and every article conforms to a single table row.

The OpenAPC data schema is described here. This contribution from Leipzig University is an example of a table which conforms to the schema.

Mandatory fields

The data schema for articles from transformation agreements corresponds to the well-known OpenAPC data schema. Each variable (mandatory fields and optional fields) forms a column, and one row is used per article.These variables must be present in every contribution:

institution — Top-level organisation which covered the fee

period — Year of APC payment

euro — The final amount that was paid in Euro. See below for details on how to calculate this number.

doi — Digital Object Identifier

is_hybrid — Should be TRUE if the article was published in a subscription-based Journal ('hybrid journal'), FALSE if the journal was fully Open Access.

Optional fields

We assume that there will be no articles without a DOI in transformation agreements, so no additional data need be provided.