Skip to content

In this repository, we are working on enriching the Book Crossing dataset with author information.

Notifications You must be signed in to change notification settings

SavvinaDaniil/EnrichBookCrossing

Repository files navigation

EnrichBookCrossing

In this repository, we are working on enriching the Book-Crossing dataset with author information.

Book-Crossing is a commonly used dataset for recommendation, created in 2004 by crawling the website of Book-Crossing for four weeks. Book-Crossing is an international book exchange community whose members can leave books anywhere in the world and indicate their location to other users. The users can then update their profile with the information that they read a new book, and also submit a rating from 1 to 10.

The dataset consists of three subsets: book ratings, users, and items. In this repository, we are using external sources to enrich it with additional author information when publicly available. The following graph roughly depicts the process. FAccTRec drawio

The process of linking the dataset to external sources in order to enrich it with author information is performed in notebook A. The following data sources are used:

  1. Google Books
  2. VIAF
  3. WikiData

In notebook B, we are analyzing the resulting dataset in terms of author characteristics.

About

In this repository, we are working on enriching the Book Crossing dataset with author information.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published