new
Note: Released Aug. 01, 2025
Description
The Macaque Genotype and Phenotype Resource (mGAP) provides reference population-level short variant
data for rhesus macaques. The current dataset was generated from ~4,000 rhesus macaques, primarily
derived from the NIH National Primate Research Center (NPRC) colonies, although other colonies are
represented. These are outbred, genetically diverse populations.
This track contains the ~130 million high-quality short variants (SNVs and INDELs), generated from
whole genome and whole exome data. Samples were jointly called to ensure each sample was genotyped
for all loci. This track omits sample-level genotypes; however, the complete dataset with genotypes
is available to browse or download from https://mgap.ohsu.edu. Allele frequency and other annotations are provided.
One of the unique features of the mGAP dataset is that most samples are derived from captive
macaques housed within the NIH NPRC system. As such, these animals have lifelong health records and
usually come from a genetically validated pedigree. If a variant of interest is identified in the
mGAP dataset, it is often possible to identify specific macaques harboring that variant and
interrogate other data that exists on these animals. The mGAP website contains additional search features for these data.
Email mgap@ohsu.edu with any questions.
Methods
Sequence reads are aligned to the Mmul10 (rheMac10) genomic assembly. Variant calling is performed
using a modified version of the Broad Institute
GATK SNP and Indel Discovery Best Practices, adapted for macaques. Raw variant
calls are subjected to fairly strict quality filtering approach, validated using Mendelian
violations and other analyses to identify regions of the macaque genome with systematic errors in
short read alignment or genotype calling.
Annotation takes place in two phases. First, variants are annotated according to their predicted
consequence on protein coding. Second, a pipeline is run that maps macaque variants to the human
genome and applies a panel of annotations.
Please see the mGAP website for more detail (requires user registration).
Credits
mGAP is supported by NIH R24OD021324. Please remember to cite this funding source in all
publications that make use of mGAP data.
References
Benjamin N Bimber, Melissa Y Yan, Samuel M Peterson, Betsy Ferguson.
mGAP: the macaque genotype and phenotype resource, a framework for accessing and interpreting macaque variant data, and identifying new models of human disease.
BMC Genomics.. 2019 Mar 6.
PMID: 30841849
|