Human methylome studies SRP033252 Track Settings
 
Genome-wide profiling of the functional DNA methylation landscape at base-pair resolution in human cancer types [BS-seq] [Blood (B-cells), Brain, Brain (grey matter), Brain (white matter), Breast, Colon, Liver, Lung, Placenta, Prostate]

Track collection: Human methylome studies

+  All tracks in this collection (463)

Maximum display mode:       Reset to defaults   
Select views (Help):
CpG methylation ▾       PMD       AMR       HMR       CpG reads ▾      
Select subtracks by views and experiment:
 All views CpG methylation  PMD  AMR  HMR  CpG reads 
experiment
SRX381553 
SRX381569 
SRX381585 
SRX381601 
SRX381611 
SRX381621 
SRX381631 
SRX381646 
SRX381661 
SRX381666 
SRX381671 
SRX381678 
SRX381693 
SRX381697 
SRX381701 
SRX381706 
SRX381710 
SRX381713 
SRX381716 
SRX381719 
SRX381722 
SRX381725 
experiment
 All views CpG methylation  PMD  AMR  HMR  CpG reads 
List subtracks: only selected/visible    all    ()
  experiment↓1 views↓2   Track Name↓3  
hide
 SRX381553  HMR  Colon / SRX381553 (HMR)   Schema 
hide
 Configure
 SRX381553  CpG methylation  Colon / SRX381553 (CpG methylation)   Schema 
hide
 Configure
 SRX381569  CpG methylation  Colon / SRX381569 (CpG methylation)   Schema 
hide
 SRX381569  HMR  Colon / SRX381569 (HMR)   Schema 
hide
 SRX381631  HMR  Breast / SRX381631 (HMR)   Schema 
hide
 Configure
 SRX381585  CpG methylation  Colon / SRX381585 (CpG methylation)   Schema 
hide
 Configure
 SRX381601  CpG methylation  Prostate / SRX381601 (CpG methylation)   Schema 
hide
 SRX381646  HMR  Prostate / SRX381646 (HMR)   Schema 
hide
 Configure
 SRX381611  CpG methylation  Breast / SRX381611 (CpG methylation)   Schema 
hide
 SRX381666  HMR  Liver / SRX381666 (HMR)   Schema 
hide
 Configure
 SRX381621  CpG methylation  Breast / SRX381621 (CpG methylation)   Schema 
hide
 SRX381671  HMR  Liver / SRX381671 (HMR)   Schema 
hide
 SRX381693  HMR  Brain (grey matter) / SRX381693 (HMR)   Schema 
hide
 Configure
 SRX381631  CpG methylation  Breast / SRX381631 (CpG methylation)   Schema 
hide
 SRX381706  HMR  Brain (white matter) / SRX381706 (HMR)   Schema 
hide
 Configure
 SRX381646  CpG methylation  Prostate / SRX381646 (CpG methylation)   Schema 
hide
 Configure
 SRX381661  CpG methylation  Liver / SRX381661 (CpG methylation)   Schema 
hide
 SRX381713  HMR  Lung / SRX381713 (HMR)   Schema 
hide
 SRX381725  HMR  Blood (B-cells) / SRX381725 (HMR)   Schema 
hide
 Configure
 SRX381666  CpG methylation  Liver / SRX381666 (CpG methylation)   Schema 
hide
 Configure
 SRX381671  CpG methylation  Liver / SRX381671 (CpG methylation)   Schema 
hide
 Configure
 SRX381678  CpG methylation  Prostate / SRX381678 (CpG methylation)   Schema 
hide
 Configure
 SRX381693  CpG methylation  Brain (grey matter) / SRX381693 (CpG methylation)   Schema 
hide
 Configure
 SRX381697  CpG methylation  Brain / SRX381697 (CpG methylation)   Schema 
hide
 Configure
 SRX381701  CpG methylation  Brain / SRX381701 (CpG methylation)   Schema 
hide
 Configure
 SRX381706  CpG methylation  Brain (white matter) / SRX381706 (CpG methylation)   Schema 
hide
 Configure
 SRX381710  CpG methylation  Placenta / SRX381710 (CpG methylation)   Schema 
hide
 Configure
 SRX381713  CpG methylation  Lung / SRX381713 (CpG methylation)   Schema 
hide
 Configure
 SRX381716  CpG methylation  Lung / SRX381716 (CpG methylation)   Schema 
hide
 Configure
 SRX381719  CpG methylation  Lung / SRX381719 (CpG methylation)   Schema 
hide
 Configure
 SRX381722  CpG methylation  Lung / SRX381722 (CpG methylation)   Schema 
hide
 Configure
 SRX381725  CpG methylation  Blood (B-cells) / SRX381725 (CpG methylation)   Schema 
    

Study title: Genome-wide profiling of the functional DNA methylation landscape at base-pair resolution in human cancer types [BS-seq]
SRA: SRP033252
GEO: GSE52271
Pubmed: 26813288

Experiment Label Methylation Coverage HMRs HMR size AMRs AMR size PMDs PMD size Conversion Details
SRX381553 Colon 0.672 46.0 39500 1123.0 12293 1132.0 2227 11334.7 0.998 title: Bsseq normal colon; {"tissue": "Colon", "disease": "Normal", "label": "Colon"}
SRX381569 Colon 0.657 43.9 64108 2489.8 13113 1127.5 2137 434987.3 0.998 title: Bsseq tumor colon; {"tissue": "Colon", "disease": "Cancer", "label": "Colon_P"}
SRX381585 Colon 0.617 47.0 66552 5455.0 11292 1110.2 1912 541396.1 0.998 title: Bsseq metastasis colon; {"tissue": "Colon", "disease": "Metastasis", "label": "Colon_M"}
SRX381601 Prostate 0.541 29.1 85795 7945.6 2023 1055.5 3396 270015.6 0.999 title: Bsseq tumor prostate; {"tissue": "Prostate", "disease": "Cancer", "label": "22RV1"}
SRX381611 Breast 0.566 37.3 160561 5524.4 2044 993.8 2680 390350.1 0.999 title: Bsseq metastasis breast; {"tissue": "Breast", "disease": "Metastasis", "label": "468LN"}
SRX381621 Breast 0.438 37.4 157544 6936.3 4057 1115.5 6328 211256.8 0.999 title: Bsseq tumor breast; {"tissue": "Breast", "disease": "Cancer", "label": "468PT"}
SRX381631 Breast 0.715 34.7 60673 1115.2 7147 965.4 2866 15254.4 0.997 title: Bsseq normal breast; {"tissue": "Breast", "disease": "Normal", "label": "Breast"}
SRX381646 Prostate 0.729 37.4 64626 1125.7 4927 932.6 3254 10576.9 0.998 title: Bsseq normal prostate; {"tissue": "Prostate", "disease": "Normal", "label": "Prostate"}
SRX381661 Liver 0.496 50.7 68528 8706.7 12170 1034.2 3051 270125.8 0.999 title: Bsseq tumor liver; {"tissue": "Liver", "disease": "Carcinoma", "label": "Liver_P1"}
SRX381666 Liver 0.629 52.5 39936 1462.8 34089 1772.3 1529 817861.8 0.998 title: Bsseq tumor liver; {"tissue": "Liver", "disease": "Carcinoma", "label": "Liver_P2"}
SRX381671 Liver 0.718 55.5 55188 1203.7 6553 1028.2 3711 21288.8 0.995 title: Bsseq normal liver; {"tissue": "Liver", "disease": "Normal", "label": "Liver"}
SRX381678 Prostate 0.532 40.0 134037 5837.8 2234 1019.2 3992 205716.8 0.999 title: Bsseq tumor prostate; {"tissue": "Prostate", "disease": "Cancer", "label": "PC3"}
SRX381693 Brain (grey matter) 0.729 15.8 43145 1160.6 2226 944.7 4041 15343.4 0.981 title: Bsseq normal brain grey-matter; {"tissue": "Brain (grey matter)", "disease": "Normal", "label": "Brain_G"}
SRX381697 Brain 0.640 16.2 82379 8318.9 4369 1304.2 2302 356703.4 0.995 title: Bsseq tumor neuroblastoma; {"tissue": "Brain", "disease": "Neuroblastoma", "label": "SH5YSY"}
SRX381701 Brain 0.546 17.0 83080 8071.6 1539 1052.2 3024 254719.9 0.995 title: Bsseq tumor glioblastoma; {"tissue": "Brain", "disease": "Glioma", "label": "U87MG"}
SRX381706 Brain (white matter) 0.738 16.0 60013 1098.1 894 1031.2 3562 19610.3 0.991 title: Bsseq normal brain white-matter; {"tissue": "Brain (white matter)", "disease": "Normal", "label": "Brain_W"}
SRX381710 Placenta 0.565 29.7 53819 11001.0 5272 1016.6 2610 392655.4 0.996 title: Bsseq normal placenta; {"tissue": "Placenta", "disease": "Normal", "label": "Placenta"}
SRX381713 Lung 0.711 18.8 40823 1110.5 2102 903.3 2792 10945.9 0.996 title: Bsseq normal lung; {"tissue": "Lung", "disease": "Normal", "label": "Lung"}
SRX381716 Lung 0.475 20.6 115726 9586.3 456 936.9 3852 345753.1 0.997 title: Bsseq adenocarcinoma lung; {"tissue": "Lung", "disease": "Adenocarcinoma", "label": "H1437"}
SRX381719 Lung 0.416 21.0 107730 11488.8 1548 1000.5 3476 503769.8 0.997 title: Bsseq squamous_cell tumor lung; {"tissue": "Lung", "disease": "Squamous cell cancer", "label": "H157"}
SRX381722 Lung 0.629 21.1 91060 6082.8 1459 1025.4 2812 235689.5 0.996 title: Bsseq small_cell tumor lung; {"tissue": "Lung", "disease": "Small cell lung cancer", "label": "H1672"}
SRX381725 Blood (B-cells) 0.698 18.1 52213 987.5 1747 980.7 2783 8926.3 0.996 title: Bsseq normal CD19; {"tissue": "Blood (B-cells)", "disease": "Normal", "label": "CD19"}

Methods

All analysis was done using a bisulfite sequnecing data analysis pipeline DNMTools developed in the Smith lab at USC.

Mapping reads from bisulfite sequencing: Bisulfite treated reads are mapped to the genomes with the abismal program. Input reads are filtered by their quality, and adapter sequences in the 3' end of reads are trimmed. This is done with cutadapt. Uniquely mapped reads with mismatches/indels below given threshold are retained. For pair-end reads, if the two mates overlap, the overlapping part of the mate with lower quality is discarded. After mapping, we use the format command in dnmtools to merge mates for paired-end reads. We use the dnmtools uniq command to randomly select one from multiple reads mapped exactly to the same location. Without random oligos as UMIs, this is our best indication of PCR duplicates.

Estimating methylation levels: After reads are mapped and filtered, the dnmtools counts command is used to obtain read coverage and estimate methylation levels at individual cytosine sites. We count the number of methylated reads (those containing a C) and the number of unmethylated reads (those containing a T) at each nucleotide in a mapped read that corresponds to a cytosine in the reference genome. The methylation level of that cytosine is estimated as the ratio of methylated to total reads covering that cytosine. For cytosines in the symmetric CpG sequence context, reads from the both strands are collapsed to give a single estimate. Very rarely do the levels differ between strands (typically only if there has been a substitution, as in a somatic mutation), and this approach gives a better estimate.

Bisulfite conversion rate: The bisulfite conversion rate for an experiment is estimated with the dnmtools bsrate command, which computes the fraction of successfully converted nucleotides in reads (those read out as Ts) among all nucleotides in the reads mapped that map over cytosines in the reference genome. This is done either using a spike-in (e.g., lambda), the mitochondrial DNA, or the nuclear genome. In the latter case, only non-CpG sites are used. While this latter approach can be impacted by non-CpG cytosine methylation, in practice it never amounts to much.

Identifying hypomethylated regions (HMRs): In most mammalian cells, the majority of the genome has high methylation, and regions of low methylation are typically the interesting features. (This seems to be true for essentially all healthy differentiated cell types, but not cells of very early embryogenesis, various germ cells and precursors, and placental lineage cells.) These are valleys of low methylation are called hypomethylated regions (HMR) for historical reasons. To identify the HMRs, we use the dnmtools hmr command, which uses a statistical model that accounts for both the methylation level fluctations and the varying amounts of data available at each CpG site.

Partially methylated domains: Partially methylated domains are large genomic regions showing partial methylation observed in immortalized cell lines and cancerous cells. The pmd program is used to identify PMDs.

Allele-specific methylation: Allele-Specific methylated regions refers to regions where the parental allele is differentially methylated compared to the maternal allele. The program allelic is used to compute allele-specific methylation score can be computed for each CpG site by testing the linkage between methylation status of adjacent reads, and the program amrfinder is used to identify regions with allele-specific methylation.

For more detailed description of the methods of each step, please refer to the DNMTools documentation.