generated from jhudsl/AnVIL_Template
-
Notifications
You must be signed in to change notification settings - Fork 3
/
05-explore_the_epigenome.Rmd
178 lines (118 loc) · 11.1 KB
/
05-explore_the_epigenome.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
# (PART\*) Student Activity {-}
# Explore the Epigenome
## Launch the Genome Browser Gateway
Go to [`https://genome.ucsc.edu/cgi-bin/hgGateway`](https://genome.ucsc.edu/cgi-bin/hgGateway). Ensure "Human" is selected under "Browse/Select Species".
```{r, fig.align='center', echo = FALSE, fig.alt= "Landing page for the UCSC Genome browser Gateway.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g15625333d53_0_2")
```
You will be working from the Human Assembly **March 2006 (NCBI/hg18)**. Make sure this is selected under the "Find Position" dropdown menu. Enter **HOXA7** in Position/Search Term and select GO.
```{r, fig.align='center', echo = FALSE, fig.alt= "March 2006 assembly is highlighted on the Find Position dropdown menu.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g15625333d53_0_7")
```
Your Genome Browser should now look like this:
```{r, fig.align='center', echo = FALSE, fig.alt= "March 2006 assembly results for the HOXA7 locus.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g15625333d53_0_22")
```
## Clean up Visual Settings
You’ll now use the following directions to remove the default visual settings.
Scroll below to different menu options shown.
* Under the "Mapping and Sequence" tab, Base Position should be set to **dense**.
* Under the "Genes and Gene Predictions" tab, RefSeq Genes should be set to **pack**.
* All other features should be marked as **hide**. You will need to go through each of the menu options to make sure features are marked as “hide”. Please note you will need to open each menu option by select the “+” to ensure all tracks are turned off (marked as hide).
```{r, fig.align='center', echo = FALSE, fig.alt= "The Base Position menu has been set to 'dense'.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g15625333d53_0_29")
```
```{r, fig.align='center', echo = FALSE, fig.alt= "The RefSeq Genes menu has been set to 'pack'.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g15625333d53_0_33")
```
At the end of the browser page, select "refresh".
```{r, fig.align='center', echo = FALSE, fig.alt= "The refresh button at the bottom of the page is highlighted.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g15625333d53_0_41")
```
Your browser should look like the image below, where only HOXA7 is shown.
```{r, fig.align='center', echo = FALSE, fig.alt= "The genome is zoomed in to the HOXA7 gene.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g15625333d53_0_48")
```
## Zooming Out
Zoom out your display by 10x, 3x or 1.5x to see HOXA1-HOXA13 genes on your browser. The zoom out value you’ll select will be dependent on your computer display screen. You may need to use the zoom in option if you’ve zoomed out the display beyond the HOXA1-HOXA13 genes. Zoom buttons can be clicked more than once.
```{r, fig.align='center', echo = FALSE, fig.alt= "The zoom in and out buttons are highlighted", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g15625333d53_0_54")
```
Ideally your browser should like the image below.
```{r, fig.align='center', echo = FALSE, fig.alt= "HOXA1 through HOXA3 loci are now visible on the genome.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g15625333d53_0_59")
```
Instead of zooming in and out, you can always manually enter your chromosome region of choice. For this exercise, try entering the following region: **chr7:27,087,367-27,235,317**. Click "go".
```{r, fig.align='center', echo = FALSE, fig.alt= "The specific chromosome locus has been typed into the search bar.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g15625333d53_0_65")
```
<div class = "notice">
You could also enter **chr7:27,087,367-27,235,317** in the search bar at the start of this exercise instead of HOXA7.
</div>
## Selecting Histone Marks
Now we’ll visualize histone marks to show that the same chromosome locus can have drastically different histone patterns.
Scroll down to the "Regulation" menu and click the **Broad Histone** track hyperlink.
```{r, fig.align='center', echo = FALSE, fig.alt= "The Broad Histone track hyperlink is highlighted.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g152864d54a7_0_0")
```
Scroll down and deselect all boxes.
```{r, fig.align='center', echo = FALSE, fig.alt= "Cell line and antibody combinations have been deselected.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g152864d54a7_1_8")
```
Select **H3K4me3** and **H3K27me3** marks for **H1-hESC** and **NHLF**. To learn more about these options you can click on their blue hyperlinks for more information.
```{r, fig.align='center', echo = FALSE, fig.alt= "Selected cell line and antibody combinations have been highlighted.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g152864d54a7_0_9")
```
<div class = "notice">
**H3K4me3** is a known histone mark on Histone 3 (H3). It occurs on lysine 4 (K4) and has three methyl groups (me3) that indicate active gene expression. In other words, the DNA is open and ready to be accessed.
```{r, fig.align='center', echo = FALSE, fig.alt= "Histone H3 (tri methyl K4). Marks promoters that are active or poised to be activated.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g152864d54a7_1_18")
```
**H3K27me3** is a known histone mark on on Histone 3 (H3). It occurs on lysine 27 (K27) and has three methyl groups (me3) that indicate inactive gene expression. In other words, the DNA is closed off and cannot be accessed.
```{r, fig.align='center', echo = FALSE, fig.alt= "Histone H3 (tri-methyl K27). Marks promoters that are silenced by Polycomb proteins in a given lineage; large domains are found at inactive developmental loci.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g152864d54a7_1_22")
```
The **H1-hESC** cell line consists of embryonic stem cells.
```{r, fig.align='center', echo = FALSE, fig.alt= "The table of statistics on H1-hESC cell line.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g152864d54a7_53_0")
```
The **NHLF** cell line consists of lung fibroblasts. Fibroblasts are important for building the connective structures around cells and healing wounds.
```{r, fig.align='center', echo = FALSE, fig.alt= "The table of statistics on NHLF cell line.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g152864d54a7_70_1")
```
</div>
Scroll below and deselect the Peak views. You will only need Signal views.
```{r, fig.align='center', echo = FALSE, fig.alt= "Peak check boxes are highlighted.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g152864d54a7_70_5")
```
Your selected subtracks should only display Signal views as shown below. Click "Submit".
```{r, fig.align='center', echo = FALSE, fig.alt= "Only Signal views are visible among subtracks. The Submit button is highlighted", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g152864d54a7_70_13")
```
Scroll down, and your browser should look like the image below.
```{r, fig.align='center', echo = FALSE, fig.alt= "Histone marks are shown below the HOXA loci in the genome browser.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g152864d54a7_70_18")
```
## Comparing Histone Marks
Let's first examine the **H1-hESC** cell line (embryonic stem cells).
Signals are high for the H3K27me3 histone mark. You’ll also notice that H3K4me3 signals are minimal. This signal pattern demonstrates that embryonic stem cells are in the repressed chromatin configuration at the HOXA locus and indicates that HOXA genes in blue are not expressed in this cell line.
```{r, fig.align='center', echo = FALSE, fig.alt= "High signals for the H3K27me3 histone mark in the H1-hESC cell line indicate these genes are not expressed in these cells.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g152864d54a7_70_22")
```
Now, let's examine the **NHLF** cell line (lung fibroblasts).
The pattern is different for NHLF. H3K4me3 signals are high for the first half of the HOXA locus, but low for the second half of the locus.
```{r, fig.align='center', echo = FALSE, fig.alt= "H3K4me3 signals are high then low in the genome browser.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g152864d54a7_70_30")
```
H3K27me3 signals are low and then high for the same locus.
```{r, fig.align='center', echo = FALSE, fig.alt= "H3K27me3 signals are low then high in the genome browser.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g152864d54a7_70_35")
```
This signal pattern demonstrates that for lung fibroblasts only half of the HOXA locus is expressed while the other half is not expressed. Specifically, we can use the full image to see that HOXA1-7 are expressed (turned on).
```{r, fig.align='center', echo = FALSE, fig.alt= "HOXA1 through 7 are highlighted to indicate expression is turned on.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g152864d54a7_70_40")
```
We can also see that HOXA9-13 is not expressed (turned off).
```{r, fig.align='center', echo = FALSE, fig.alt= "HOXA9 through 13 are highlighted to indicate expression is turned off.", out.width = '100%'}
ottrpal::include_slide("https://docs.google.com/presentation/d/1B7y_jm8GuHZMUvTU1TFdW4nqQQV6nOT9XuFwgSflXz4/edit#slide=id.g152864d54a7_70_46")
```