Open main menu
Home
Random
Recent changes
Special pages
Community portal
Preferences
About Wikipedia
Disclaimers
Incubator escapee wiki
Search
User menu
Talk
Dark mode
Contributions
Create account
Log in
Editing
Shotgun sequencing
(section)
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
==Example== For example, consider the following two rounds of shotgun reads: {| class="wikitable" |- ! Strand ! Sequence |- | Original | <code><big>AGCATGCTGCAGTCATGCTTAGGCTA</big></code> |- | First shotgun sequence | <code><big>AGCATGCTGCAGTCATGCT-------</big></code><br/><code><big>-------------------TAGGCTA</big></code> |- | Second shotgun sequence | <code><big>AGCATG--------------------</big></code><br/><code><big>------CTGCAGTCATGCTTAGGCTA</big></code> |- | Reconstruction | <code><big>AGCATGCTGCAGTCATGCTTAGGCTA</big></code> |} In this extremely simplified example, none of the reads cover the full length of the original sequence, but the four reads can be assembled into the original sequence using the overlap of their ends to align and order them. In reality, this process uses enormous amounts of information that are rife with ambiguities and sequencing errors. Assembly of complex genomes is additionally complicated by the great abundance of [[repeated sequence (DNA)|repetitive sequences]], meaning similar short reads could come from completely different parts of the sequence. Many overlapping reads for each segment of the original DNA are necessary to overcome these difficulties and accurately assemble the sequence. For example, to complete the [[Human Genome Project]], most of the human genome was sequenced at 12X or greater ''coverage''; that is, each base in the final sequence was present on average in 12 different reads. Even so, current methods have failed to isolate or assemble reliable sequence for approximately 1% of the ([[euchromatin|euchromatic]]) human genome, as of 2004.<ref name=HGS2004>{{cite journal |author=International Human Genome Sequencing Consortium |title=Finishing the euchromatic sequence of the human genome |journal=Nature |date=21 October 2004 |volume=431 |issue=7011 |pages=931β945 |doi=10.1038/nature03001 |pmid=15496913 |bibcode=2004Natur.431..931H |doi-access=free}}</ref>
Edit summary
(Briefly describe your changes)
By publishing changes, you agree to the
Terms of Use
, and you irrevocably agree to release your contribution under the
CC BY-SA 4.0 License
and the
GFDL
. You agree that a hyperlink or URL is sufficient attribution under the Creative Commons license.
Cancel
Editing help
(opens in new window)