methods/data: add GTEx v8 URL
miltondp committed Jun 18, 2024
1 parent d8d6f38 commit 3c71849
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions content/08.05.methods.data.md
@@ -1,5 +1,5 @@
### Gene expression data and preprocessing {#sec:data_gtex}

-We downloaded GTEx v8 data for all tissues, normalized using TPM (transcripts per million), and focused our primary analysis on whole blood, which has a good sample size (755).
-We selected the top 5,000 genes from whole blood with the largest variance after standardizing with $log(x + 1)$ to avoid a bias towards highly-expressed genes.
-We then computed Pearson, Spearman, MIC and CCC on these 5,000 genes across all 755 samples on the TPM-normalized data, generating a pairwise similarity matrix of size 5,000 x 5,000.
+We downloaded gene expression data from GTEx v8 ([https://gtexportal.org/](https://gtexportal.org/)) for all tissues, normalized using TPM (transcripts per million), and focused our primary analysis on whole blood, which has a good sample size (755).
+We selected the top 5,000 genes from whole blood with the largest variance after standardizing with $log(x + 1)$ to avoid a bias toward highly expressed genes.
+We then computed Pearson, Spearman, MIC and CCC on these 5,000 genes across all 755 samples, generating a pairwise similarity matrix of size 5,000 x 5,000.
