Open main menu
Home
Random
Recent changes
Special pages
Community portal
Preferences
About Wikipedia
Disclaimers
Incubator escapee wiki
Search
User menu
Talk
Dark mode
Contributions
Create account
Log in
Editing
International HapMap Project
(section)
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
== Background == Unlike with the [[rare disease|rarer]] [[Mendelian]] diseases, combinations of different [[genes]] and the environment play a role in the development and progression of common diseases (such as [[diabetes]], [[cancer]], [[heart disease]], [[stroke]], [[clinical depression|depression]], and [[asthma]]), or in the individual response to [[pharmacological]] agents.<ref>{{cite journal |last1=Crouch |first1=Daniel J. M. |last2=Bodmer |first2=Walter F. |title=Polygenic inheritance, GWAS, polygenic risk scores, and the search for functional variants |journal=Proceedings of the National Academy of Sciences |date=11 August 2020 |volume=117 |issue=32 |pages=18924β18933 |doi=10.1073/pnas.2005634117|pmid=32753378 |pmc=7431089 |bibcode=2020PNAS..11718924C |doi-access=free }}</ref> To find the genetic factors involved in these diseases, one could in principle do a [[genome-wide association study]]: obtain the complete genetic sequence of several individuals, some with the disease and some without, and then search for differences between the two sets of genomes. At the time, this approach was not feasible because of the cost of [[full genome sequencing]]. The HapMap project proposed a shortcut. Although any two unrelated people share about 99.5% of their [[DNA]] sequence, their [[genome]]s differ at specific [[nucleotide]] locations. Such sites are known as [[single nucleotide polymorphisms]] (SNPs), and each of the possible resulting gene forms is called an [[allele]].<ref name="NHGRI_allele">{{cite web |title=Allele |url=https://www.genome.gov/genetics-glossary/Allele |website=Genome.gov |publisher=National Human Genome Research Institute |language=en}}</ref> The HapMap project focuses only on common SNPs, those where each allele occurs in at least 1% of the population. Each person has two copies of all [[chromosomes]], except the [[sex chromosomes]] in [[male]]s. For each SNP, the combination of alleles a person has is called a [[genotype]]. [[Genotyping]] refers to uncovering what genotype a person has at a particular site. The HapMap project chose a sample of 269 individuals and selected several million well-defined SNPs, genotyped the individuals for these SNPs, and published the results.<ref name="HapMapNaturePaper2003">{{cite journal |last1=The International HapMap Consortium |title=The International HapMap Project |journal=Nature |date=December 2003 |volume=426 |issue=6968 |pages=789β796 |doi=10.1038/nature02168 |pmid=14685227 |hdl=2027.42/62838 |s2cid=8151693 |doi-access=free |hdl-access=free }}</ref> The alleles of nearby SNPs on a single chromosome are correlated. Specifically, if the allele of one SNP for a given individual is known, the alleles of nearby SNPs can often be predicted, a process known as ''genotype imputation''.<ref name="Deng2022">{{cite journal |last1=Deng |first1=Tianyu |last2=Zhang |first2=Pengfei |last3=Garrick |first3=Dorian |last4=Gao |first4=Huijiang |last5=Wang |first5=Lixian |last6=Zhao |first6=Fuping |title=Comparison of Genotype Imputation for SNP Array and Low-Coverage Whole-Genome Sequencing Data |journal=Frontiers in Genetics |date=2022 |volume=12 |page=704118 |doi=10.3389/fgene.2021.704118 |pmid=35046990 |pmc=8762119 |doi-access=free }}</ref> This is because each SNP arose in evolutionary history as a single point [[mutation]], and was then passed down on the chromosome surrounded by other, earlier, point mutations. SNPs that are separated by a large distance on the chromosome are typically not very well correlated, because [[Genetic recombination|recombination]] occurs in each generation and mixes the allele sequences of the two chromosomes. A sequence of consecutive alleles on a particular chromosome is known as a [[haplotype]].<ref name="NHGRI_Haplotype">{{cite web |title=Haplotype |url=https://www.genome.gov/genetics-glossary/haplotype |website=Genome.gov |publisher=National Human Genome Research Institute |access-date=25 June 2022 |language=en}}</ref> To find the genetic factors involved in a particular disease, one can proceed as follows. First a certain region of interest in the genome is identified, possibly from earlier inheritance studies. In this region one locates a set of [[tag SNP]]s from the HapMap data; these are SNPs that are very well correlated with all the other SNPs in the region. Using these, genotype imputation can be used to determine (impute) the other SNPs and thus the entire haplotype with high confidence. Next, one determines the genotype for these tag SNPs in several individuals, some with the disease and some without. By comparing the two groups, one determines the likely locations and haplotypes that are involved in the disease.
Edit summary
(Briefly describe your changes)
By publishing changes, you agree to the
Terms of Use
, and you irrevocably agree to release your contribution under the
CC BY-SA 4.0 License
and the
GFDL
. You agree that a hyperlink or URL is sufficient attribution under the Creative Commons license.
Cancel
Editing help
(opens in new window)