Skip to content

Programmatic access to ProteinGym datasets in R/Bioconductor

Notifications You must be signed in to change notification settings

ccb-hms/ProteinGymR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ProteinGymR

Data resources from ProteinGym

The ProteinGymR package provides analysis-ready data resources from ProteinGym, generated by Notin et al., 2023. ProteinGym comprises a collection of benchmarks for evaluating the performance of models predicting the effect of point mutations. This package provides access to two datasets: 1. Deep mutational scanning (DMS) scores from 217 assays measuring the impact of all possible amino acid substitutions across 186 proteins, and 2. AlphaMissense pathogenicity scores for ~1.6 M substitutions in the ProteinGym DMS data.


Installation

Install the package from Bioconductor or GitHub, ensuring correct Bioconductor dependencies.

if (!"BiocManager" %in% rownames(installed.packages()))
    install.packages("BiocManager", repos = "https://cloud.R-project.org")

BiocManager::install("ccb-hms/ProteinGymR")

Load the library.

library(ProteinGymR)

About

Programmatic access to ProteinGym datasets in R/Bioconductor

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages