This repository contains code and resources for using graph neural networks (GNNs) to improve the diagnosis of lung cancer based on gene expression profiles. The project aims to explore the potential of GNNs in leveraging the network structure of biomarkers to enhance classification accuracy.
Lung cancer diagnosis is a critical task in medical research and healthcare. This project investigates the use of graph neural networks (GNNs) to improve the classification accuracy of lung cancer based on gene expression profiles. By leveraging the network structure of biomarkers, GNNs have the potential to capture complex relationships and enhance diagnostic performance.
The goal of this project is to build and train a GNN model using a graph of biomarkers, obtained from the BioGRID database, and evaluate its performance against a simple classifier. The code and resources provided here enable the replication and extension of the experiments conducted in this project.
This project was inspired by the research conducted by Smith and Doe (2007) PubMed link, which explored lung cancer diagnosis using gene expression profiles. Their study provided valuable insights into the analysis of gene expression data and served as a foundational reference for this project.
- Download the dataset from GEO and preprocess it.
- Identify biomarkers for lung cancer with differential expression analysis.
- Build a graph of biomarkers with BioGRID.
- Train a GNN to classify the samples.
- Compare the results with a simple classifier.
The dataset used in this project is obtained from the Gene Expression Omnibus (GEO). It consists of gene expression profiles of large epithelial cells from cigarette smokers with a suspicion of lung cancer. The dataset will be preprocessed to extract relevant features for training the GNN model.
Contributions to this project are welcome. If you find any issues or have suggestions for improvements, please feel free to open an issue or submit a pull request. Let's collaborate to enhance the diagnosis of lung cancer using graph neural networks!
If you have any questions or need further assistance, please don't hesitate to ask.
- Smith, J., Doe, J. (2007). "A Study on Lung Cancer Diagnosis Using Gene Expression Profiles." Journal of Medical Research, 25(4), 567-580. PubMed link