Author: Daniel Scheuermann
Date: 2024-01-31
This repository contains a comprehensive R script and analysis to replicate a study conducted in 2010, which focused on the Pasilla gene using RNA-seq data. The Pasilla gene was depleted, and RNA was isolated to prepare single-end and paired-end libraries for treated vs untreated samples. The resulting libraries were sequenced to obtain RNA-seq data, and this script compares the gene expression in treated vs untreated samples.
Ensure you have the following R packages installed before running the script:
- DESeq2
- pheatmap
- dplyr
- RColorBrewer
- ggplot2
- ggrepel
- clusterProfiler
- goseq
Please for RNASeq-Pasilla.rmd for a breakdown of the code and the exploration of the data resulting from each section.
Provides an overview of the study and its inspiration.
Reads and sets up the count data and sample information.
Creates the DESeq2 object and specifies treatment factors.
Filters genes and performs DESeq2 analysis.
Examines specific genes, focusing on the Pasilla gene.
Filters differentially expressed genes.
Saves manipulated data and normalized counts to CSV files.
Generates dispersion and PCA plots for visualizing sample relationships.
Produces heatmaps, MA plots, and volcano plots for differential gene expression visualization.
Describes the results of Gene Ontology (GO) analysis.
Summarizes the findings of Kegg pathway analysis.
Outlines the author's reflections and future goals for the project.