Template:Short description

File:Origins of DNA replication Figure 1.jpg
Models for bacterial (A) and eukaryotic (B) DNA replication initiation. A) Circular bacterial chromosomes contain a cis-acting element, the replicator, that is located at or near replication origins. i) The replicator recruits initiator proteins in a DNA sequence-specific manner, which results in melting of the DNA helix and loading of the replicative helicase onto each of the single DNA strands (ii). iii) Assembled replisomes bidirectionally replicate DNA to yield two copies of the bacterial chromosome. B) Linear eukaryotic chromosomes contain many replication origins. Initiator binding (i) facilitates replicative helicase loading (ii) onto duplex DNA to license origins. iii) A subset of loaded helicases is activated for replisome assembly. Replication proceeds bidirectionally from origins and terminates when replication forks from adjacent active origins meet (iv).

The origin of replication (also called the replication origin) is a particular sequence in a genome at which replication is initiated.<ref>Template:Cite book</ref> Propagation of the genetic material between generations requires timely and accurate duplication of DNA by semiconservative replication prior to cell division to ensure each daughter cell receives the full complement of chromosomes.<ref name="Ekundayo et al">Template:Cite journal File:CC-BY icon.svg Material was copied from this source, which is available under a Creative Commons Attribution 4.0 International License.</ref> This can either involve the replication of DNA in living organisms such as prokaryotes and eukaryotes, or that of DNA or RNA in viruses, such as double-stranded RNA viruses.<ref>Template:Cite journal</ref> Synthesis of daughter strands starts at discrete sites, termed replication origins, and proceeds in a bidirectional manner until all genomic DNA is replicated. Despite the fundamental nature of these events, organisms have evolved surprisingly divergent strategies that control replication onset.<ref name="Ekundayo et al"/> Although the specific replication origin organization structure and recognition varies from species to species, some common characteristics are shared.

FeaturesEdit

A key prerequisite for DNA replication is that it must occur with extremely high fidelity and efficiency exactly once per cell cycle to prevent the accumulation of genetic alterations with potentially deleterious consequences for cell survival and organismal viability.<ref>Template:Cite journal</ref> Incomplete, erroneous, or untimely DNA replication events can give rise to mutations, chromosomal polyploidy or aneuploidy, and gene copy number variations, each of which in turn can lead to diseases, including cancer.<ref>Template:Cite journal</ref><ref name=":1">Template:Cite journal</ref> To ensure complete and accurate duplication of the entire genome and the correct flow of genetic information to progeny cells, all DNA replication events are not only tightly regulated with cell cycle cues but are also coordinated with other cellular events such as transcription and DNA repair.<ref name="Ekundayo et al"/><ref>Template:Cite journal</ref><ref>Template:Cite journal</ref><ref name=":2">Template:Cite journal</ref> Additionally, origin sequences commonly have high AT-content across all kingdoms, since repeats of adenine and thymine are easier to separate because their base stacking interactions are not as strong as those of guanine and cytosine.<ref>Template:Cite journal</ref>

DNA replication is divided into different stages. During initiation, the replication machineries – termed replisomes – are assembled on DNA in a bidirectional fashion. These assembly loci constitute the start sites of DNA replication or replication origins. In the elongation phase, replisomes travel in opposite directions with the replication forks, unwinding the DNA helix and synthesizing complementary daughter DNA strands using both parental strands as templates. Once replication is complete, specific termination events lead to the disassembly of replisomes. As long as the entire genome is duplicated before cell division, one might assume that the location of replication start sites does not matter; yet, it has been shown that many organisms use preferred genomic regions as origins.<ref name=":15">Template:Cite journal</ref><ref name=":16">Template:Cite journal</ref> The necessity to regulate origin location likely arises from the need to coordinate DNA replication with other processes that act on the shared chromatin template to avoid DNA strand breaks and DNA damage.<ref name="Ekundayo et al"/><ref name=":1" /><ref name=":2" /><ref>Template:Cite journal</ref><ref name="#8638128">Template:Cite journal</ref><ref name="#27362223"/><ref>Template:Cite journal</ref><ref>Template:Cite journal</ref>

Replicon modelEdit

More than five decades ago, Jacob, Brenner, and Cuzin proposed the replicon hypothesis to explain the regulation of chromosomal DNA synthesis in E. coli.<ref name=":0">Template:Cite journal</ref> The model postulates that a diffusible, trans-acting factor, a so-called initiator, interacts with a cis-acting DNA element, the replicator, to promote replication onset at a nearby origin. Once bound to replicators, initiators (often with the help of co-loader proteins) deposit replicative helicases onto DNA, which subsequently drive the recruitment of additional replisome components and the assembly of the entire replication machinery. The replicator thereby specifies the location of replication initiation events, and the chromosome region that is replicated from a single origin or initiation event is defined as the replicon.<ref name="Ekundayo et al"/>

A fundamental feature of the replicon hypothesis is that it relies on positive regulation to control DNA replication onset, which can explain many experimental observations in bacterial and phage systems.<ref name=":0" /> For example, it accounts for the failure of extrachromosomal DNAs without origins to replicate when introduced into host cells. It further rationalizes plasmid incompatibilities in E. coli, where certain plasmids destabilize each other's inheritance due to competition for the same molecular initiation machinery.<ref>Template:Cite journal</ref> By contrast, a model of negative regulation (analogous to the replicon-operator model for transcription) fails to explain the above findings.<ref name=":0" /> Nonetheless, research subsequent to Jacob's, Brenner's and Cuzin's proposal of the replicon model has discovered many additional layers of replication control in bacteria and eukaryotes that comprise both positive and negative regulatory elements, highlighting both the complexity and the importance of restricting DNA replication temporally and spatially.<ref name="Ekundayo et al"/><ref>Template:Cite journal</ref><ref name=":3">Template:Cite book</ref><ref name=":4">Template:Cite journal</ref>

The concept of the replicator as a genetic entity has proven very useful in the quest to identify replicator DNA sequences and initiator proteins in prokaryotes, and to some extent also in eukaryotes, although the organization and complexity of replicators differ considerably between the domains of life.<ref name="#15459665">Template:Cite journal</ref><ref>Template:Cite journal</ref> While bacterial genomes typically contain a single replicator that is specified by consensus DNA sequence elements and that controls replication of the entire chromosome, most eukaryotic replicators – with the exception of budding yeast – are not defined at the level of DNA sequence; instead, they appear to be specified combinatorially by local DNA structural and chromatin cues.<ref name=":6">Template:Cite journal</ref><ref>Template:Cite journal</ref><ref name=":7">Template:Cite journal</ref><ref name=":8">Template:Cite journal</ref><ref name=":9">Template:Cite journal</ref><ref name=":10">Template:Cite journal</ref><ref name=":11">Template:Cite journal</ref><ref>Template:Cite journal</ref><ref>Template:Cite journal</ref><ref>Template:Cite journal</ref> Eukaryotic chromosomes are also much larger than their bacterial counterparts, raising the need for initiating DNA synthesis from many origins simultaneously to ensure timely replication of the entire genome. Additionally, many more replicative helicases are loaded than activated to initiate replication in a given cell cycle. The context-driven definition of replicators and selection of origins suggests a relaxed replicon model in eukaryotic systems that allows for flexibility in the DNA replication program.<ref name="#15459665"/> Although replicators and origins can be spaced physically apart on chromosomes, they often co-localize or are located in close proximity; for simplicity, we will thus refer to both elements as ‘origins’ throughout this review. Taken together, the discovery and isolation of origin sequences in various organisms represents a significant milestone towards gaining mechanistic understanding of replication initiation. In addition, these accomplishments had profound biotechnological implications for the development of shuttle vectors that can be propagated in bacterial, yeast and mammalian cells.<ref name="Ekundayo et al"/><ref>Template:Cite journal</ref><ref>Template:Cite journal</ref><ref>Template:Cite journal</ref>

BacterialEdit

File:Origins of DNA replication Figure 2.jpg
Origin organization and recognition in bacteria. A) Schematic of the architecture of E. coli origin oriC, Thermotoga maritima oriC, and the bipartite origin in Helicobacter pylori. The DUE is flanked on one side by several high- and weak-affinity DnaA-boxes as indicated for E. coli oriC. B) Domain organization of the E. coli initiator DnaA. Magenta circle indicates the single-strand DNA binding site. C) Models for origin recognition and melting by DnaA. In the two-state model (left panel), the DnaA protomers transition from a dsDNA binding mode (mediated by the HTH-domains recognizing DnaA-boxes) to an ssDNA binding mode (mediated by the AAA+ domains). In the loop-back model, the DNA is sharply bent backwards onto the DnaA filament (facilitated by the regulatory protein IHF)<ref>Template:Cite journal</ref> so that a single protomer binds both duplex and single-stranded regions. In either instance, the DnaA filament melts the DNA duplex and stabilizes the initiation bubble prior to loading of the replicative helicase (DnaB in E. coli). HTH – helix-turn-helix domain, DUE – DNA unwinding element, IHF – integration host factor.

Most bacterial chromosomes are circular and contain a single origin of chromosomal replication (oriC). Bacterial oriC regions are surprisingly diverse in size (ranging from 250 bp to 2 kbp), sequence, and organization;<ref name=":12">Template:Cite journal</ref><ref name=":13">Template:Cite journal</ref> nonetheless, their ability to drive replication onset typically depends on sequence-specific readout of consensus DNA elements by the bacterial initiator, a protein called DnaA.<ref name=":14">Template:Cite journal</ref><ref>Template:Cite journal</ref><ref>Template:Cite journal</ref><ref>Template:Cite journal</ref> Origins in bacteria are either continuous or bipartite and contain three functional elements that control origin activity: conserved DNA repeats that are specifically recognized by DnaA (called DnaA-boxes), an AT-rich DNA unwinding element (DUE), and binding sites for proteins that help regulate replication initiation.<ref name=":15" /><ref>Template:Cite journal</ref><ref>Template:Cite journal</ref> Interactions of DnaA both with the double-stranded (ds) DnaA-box regions and with single-stranded (ss) DNA in the DUE are important for origin activation and are mediated by different domains in the initiator protein: a Helix-turn-helix (HTH) DNA binding element and an ATPase associated with various cellular activities (AAA+) domain, respectively.<ref name=":17">Template:Cite journal</ref><ref>Template:Cite journal</ref><ref>Template:Cite journal</ref><ref name=":18">Template:Cite journal</ref><ref name=":19">Template:Cite journal</ref><ref name=":20">Template:Cite journal</ref><ref>Template:Cite journal</ref> While the sequence, number, and arrangement of origin-associated DnaA-boxes vary throughout the bacterial kingdom, their specific positioning and spacing in a given species are critical for oriC function and for productive initiation complex formation.<ref name="Ekundayo et al"/><ref name=":12" /><ref name=":13" /><ref>Template:Cite journal</ref><ref>Template:Cite journal</ref><ref>Template:Cite journal</ref><ref name=":21">Template:Cite journal</ref><ref>Template:Cite journal</ref>

Among bacteria, E. coli is a particularly powerful model system to study the organization, recognition, and activation mechanism of replication origins. E. coli oriC comprises an approximately ~260 bp region containing four types of initiator binding elements that differ in their affinities for DnaA and their dependencies on the co-factor ATP. DnaA-boxes R1, R2, and R4 constitute high-affinity sites that are bound by the HTH domain of DnaA irrespective of the nucleotide-binding state of the initiator.<ref name=":14" /><ref name="#2995681">Template:Cite journal</ref><ref name="#8663334">Template:Cite journal</ref><ref name="#7615570">Template:Cite journal</ref><ref name="#9351837">Template:Cite journal</ref><ref name="#2542031">Template:Cite journal</ref> By contrast, the I, τ, and C-sites, which are interspersed between the R-sites, are low-affinity DnaA-boxes and associate preferentially with ATP-bound DnaA, although ADP-DnaA can substitute for ATP-DnaA under certain conditions.<ref name="#14978287">Template:Cite journal</ref><ref name="#15901724">Template:Cite journal</ref><ref name="#10545126">Template:Cite journal</ref><ref name=":21" /> Binding of the HTH domains to the high- and low-affinity DnaA recognition elements promotes ATP-dependent higher-order oligomerization of DnaA's AAA+ modules into a right-handed filament that wraps duplex DNA around its outer surface, thereby generating superhelical torsion that facilitates melting of the adjacent AT-rich DUE.<ref name=":17" /><ref name="#19833870">Template:Cite journal</ref><ref name="#16829961">Template:Cite journal</ref><ref name="#22581769">Template:Cite journal</ref> DNA strand separation is additionally aided by direct interactions of DnaA's AAA+ ATPase domain with triplet repeats, so-called DnaA-trios, in the proximal DUE region.<ref name="#27281207">Template:Cite journal</ref> The engagement of single-stranded trinucleotide segments by the initiator filament stretches DNA and stabilizes the initiation bubble by preventing reannealing.<ref name=":19" /> The DnaA-trio origin element is conserved in many bacterial species, indicating it is a key element for origin function.<ref name="#27281207" /> After melting, the DUE provides an entry site for the E. coli replicative helicase DnaB, which is deposited onto each of the single DNA strands by its loader protein DnaC.<ref name="Ekundayo et al"/>

Although the different DNA binding activities of DnaA have been extensively studied biochemically and various apo, ssDNA-, or dsDNA-bound structures have been determined,<ref name=":18" /><ref name=":19" /><ref name=":20" /><ref name="#16829961" /> the exact architecture of the higher-order DnaA-oriC initiation assembly remains unclear. Two models have been proposed to explain the organization of essential origin elements and DnaA-mediated oriC melting. The two-state model assumes a continuous DnaA filament that switches from a dsDNA binding mode (the organizing complex) to an ssDNA binding mode in the DUE (the melting complex).<ref name="#16829961" /><ref name="#20595381">Template:Cite journal</ref> By contrast, in the loop-back model, the DNA is sharply bent in oriC and folds back onto the initiator filament so that DnaA protomers simultaneously engage double- and single-stranded DNA regions.<ref name="#22053082">Template:Cite journal</ref> Elucidating how exactly oriC DNA is organized by DnaA remains thus an important task for future studies. Insights into initiation complex architecture will help explain not only how origin DNA is melted, but also how a replicative helicase is loaded directionally onto each of the exposed single DNA strands in the unwound DUE, and how these events are aided by interactions of the helicase with the initiator and specific loader proteins.<ref name="Ekundayo et al"/>

ArchaealEdit

File:Origins of DNA replication Figure 3.jpg
Origin organization and recognition in archaea. A) The circular chromosome of Sulfolobus solfataricus contains three different origins. B) Arrangement of initiator binding sites at two S. solfataricus origins, oriC1 and oriC2. Orc1-1 association with ORB elements is shown for oriC1. Recognition elements for additional Orc1/Cdc6 paralogs are also indicated, while WhiP binding sites have been omitted. C) Domain architecture of archaeal Orc1/Cdc6 paralogs. The orientation of ORB elements at origins leads to directional binding of Orc1/Cdc6 and MCM loading in between opposing ORBs (in B). (m)ORB – (mini-)origin recognition box, DUE – DNA unwinding element, WH – winged-helix domain.

Archaeal replication origins share some but not all of the organizational features of bacterial oriC. Unlike bacteria, Archaea often initiate replication from multiple origins per chromosome (one to four have been reported);<ref name="#10864870">Template:Cite journal</ref><ref name="#17511521">Template:Cite journal</ref><ref name="Hawkins_2013">Template:Cite journal</ref><ref name="#24271389">Template:Cite journal</ref><ref name="#23991938">Template:Cite journal</ref><ref name="#22812406">Template:Cite journal</ref><ref name="#14718164">Template:Cite journal</ref><ref name="#15107501">Template:Cite journal</ref><ref name=":13" /> yet, archaeal origins also bear specialized sequence regions that control origin function.<ref name="#29357055">Template:Cite book</ref><ref name="#28146124">Template:Cite journal</ref><ref name="#24808892">Template:Cite journal</ref> These elements include both DNA sequence-specific origin recognition boxes (ORBs or miniORBs) and an AT-rich DUE that is flanked by one or several ORB regions.<ref name="#14718164" /><ref name="#11562464">Template:Cite journal</ref> ORB elements display a considerable degree of diversity in terms of their number, arrangement, and sequence, both among different archaeal species and among different origins in a single species.<ref name="#17511521" /><ref name="#14718164" /><ref name="#22978470">Template:Cite journal</ref> An additional degree of complexity is introduced by the initiator, Orc1/Cdc6 in archaea, which binds to ORB regions. Archaeal genomes typically encode multiple paralogs of Orc1/Cdc6 that vary substantially in their affinities for distinct ORB elements and that differentially contribute to origin activities.<ref name="#14718164" /><ref name="#22918580">Template:Cite book</ref><ref name="#23375370">Template:Cite journal</ref><ref name="#16978641">Template:Cite journal</ref> In Sulfolobus solfataricus, for example, three chromosomal origins have been mapped (oriC1, oriC2, and oriC3), and biochemical studies have revealed complex binding patterns of initiators at these sites.<ref name="#14718164" /><ref name="#15107501" /><ref name="#17392430">Template:Cite journal</ref><ref name="#17255945">Template:Cite journal</ref> The cognate initiator for oriC1 is Orc1-1, which associates with several ORBs at this origin.<ref name="#14718164" /><ref name="#23375370" /> OriC2 and oriC3 are bound by both Orc1-1 and Orc1-3.<ref name="#14718164" /><ref name="#23375370" /><ref name="#17255945" /> Conversely, a third paralog, Orc1-2, footprints at all three origins but has been postulated to negatively regulate replication initiation.<ref name="#14718164" /><ref name="#17255945" /> Additionally, the WhiP protein, an initiator unrelated to Orc1/Cdc6, has been shown to bind all origins as well and to drive origin activity of oriC3 in the closely related Sulfolobus islandicus.<ref name="#23375370" /><ref name="#17392430" /> Because archaeal origins often contain several adjacent ORB elements, multiple Orc1/Cdc6 paralogs can be simultaneously recruited to an origin and oligomerize in some instances;<ref name="#16978641" /><ref name="#17761879">Template:Cite journal</ref> however, in contrast to bacterial DnaA, formation of a higher-order initiator assembly does not appear to be a general prerequisite for origin function in the archaeal domain.<ref name="Ekundayo et al"/>

Structural studies have provided insights into how archaeal Orc1/Cdc6 recognizes ORB elements and remodels origin DNA.<ref name="#17761879" /><ref name="#17761880">Template:Cite journal</ref> Orc1/Cdc6 paralogs are two-domain proteins and are composed of a AAA+ ATPase module fused to a C-terminal winged-helix fold.<ref name="#15358831">Template:Cite journal</ref><ref name="#11030343">Template:Cite journal</ref><ref name="#15465044">Template:Cite journal</ref> DNA-complexed structures of Orc1/Cdc6 revealed that ORBs are bound by an Orc1/Cdc6 monomer despite the presence of inverted repeat sequences within ORB elements.<ref name="#17761879" /><ref name="#17761880" /> Both the ATPase and winged-helix regions interact with the DNA duplex but contact the palindromic ORB repeat sequence asymmetrically, which orients Orc1/Cdc6 in a specific direction on the repeat.<ref name="#17761879" /><ref name="#17761880" /> Interestingly, the DUE-flanking ORB or miniORB elements often have opposite polarities,<ref name="#17511521" /><ref name="#14718164" /><ref name="#16978641" /><ref name="#12612604">Template:Cite journal</ref><ref name="#14526006">Template:Cite journal</ref> which predicts that the AAA+ lid subdomains and the winged-helix domains of Orc1/Cdc6 are positioned on either side of the DUE in a manner where they face each other.<ref name="#17761879" /><ref name="#17761880" /> Since both regions of Orc1/Cdc6 associate with a minichromosome maintenance (MCM) replicative helicase,<ref name="#16150924">Template:Cite journal</ref><ref name="#26725007">Template:Cite journal</ref> this specific arrangement of ORB elements and Orc1/Cdc6 is likely important for loading two MCM complexes symmetrically onto the DUE.<ref name="#14718164" /> Surprisingly, while the ORB DNA sequence determines the directionality of Orc1/Cdc6 binding, the initiator makes relatively few sequence-specific contacts with DNA.<ref name="#17761879" /><ref name="#17761880" /> However, Orc1/Cdc6 severely underwinds and bends DNA, suggesting that it relies on a mix of both DNA sequence and context-dependent DNA structural features to recognize origins.<ref name="#17761879" /><ref name="#17761880" /><ref name="#21227921">Template:Cite journal</ref> Notably, base pairing is maintained in the distorted DNA duplex upon Orc1/Cdc6 binding in the crystal structures,<ref name="#17761879" /><ref name="#17761880" /> whereas biochemical studies have yielded contradictory findings as to whether archaeal initiators can melt DNA similarly to bacterial DnaA.<ref name="#23375370" /><ref name="#16978641" /><ref name="#19787415">Template:Cite journal</ref> Although the evolutionary kinship of archaeal and eukaryotic initiators and replicative helicases indicates that archaeal MCM is likely loaded onto duplex DNA (see next section), the temporal order of origin melting and helicase loading, as well as the mechanism for origin DNA melting, in archaeal systems remains therefore to be clearly established. Likewise, how exactly the MCM helicase is loaded onto DNA needs to be addressed in future studies.<ref name="Ekundayo et al"/>

EukaryoticEdit

File:Origins of DNA replication Figure 4.jpg
Origin organization and recognition in eukaryotes. Specific DNA elements and epigenetic features involved in ORC recruitment and origin function are summarized for S. cerevisiae, S. pombe, and metazoan origins. A schematic of the ORC architecture is also shown, highlighting the arrangement of the AAA+ and winged-helix domains into a pentameric ring that encircles origin DNA. Ancillary domains of several ORC subunits involved in targeting ORC to origins are included. Other regions in ORC subunits may also be involved in initiator recruitment, either by directly or indirectly associating with partner proteins. A few examples are listed. Note that the BAH domain in S. cerevisiae Orc1 binds nucleosomes<ref name="#18158899">Template:Cite journal</ref> but does not recognize H4K20me2.<ref name="#22398447" />
BAH – bromo-adjacent homology domain, WH – winged-helix domain, TFIIB – transcription factor II B-like domain in Orc6, G4 – G quadruplex, OGRE – origin G-rich repeated element. ORC gene names are indicated by a single number; e.g. 3 refers to ORC3.

Origin organization, specification, and activation in eukaryotes are more complex than in bacterial or archaeal domains and significantly deviate from the paradigm established for prokaryotic replication initiation. The large genome sizes of eukaryotic cells, which range from 12 Mbp in S. cerevisiae to more than 100 Gbp in some plants, necessitates that DNA replication starts at several hundred (in budding yeast) to tens of thousands (in humans) origins to complete DNA replication of all chromosomes during each cell cycle.<ref name=":3" /><ref name=":10" /> With the exception of S. cerevisiae and related Saccharomycotina species, eukaryotic origins do not contain consensus DNA sequence elements but their location is influenced by contextual cues such as local DNA topology, DNA structural features, and chromatin environment.<ref name="#15459665"/><ref name=":9" /><ref name=":11" />

Eukaryotic origin function relies on a conserved initiator protein complex to load replicative helicases onto DNA during the late M and G1 phases of the cell cycle, a step known as origin licensing.<ref name="#28209641">Template:Cite journal</ref> In contrast to their bacterial counterparts, replicative helicases in eukaryotes are loaded onto origin duplex DNA in an inactive, double-hexameric form and only a subset of them (10-20% in mammalian cells) is activated during any given S phase, events that are referred to as origin firing.<ref name="#21282109">Template:Cite journal</ref><ref name="#19896182">Template:Cite journal</ref><ref name="#19910535">Template:Cite journal</ref>

The location of active eukaryotic origins is therefore determined on at least two different levels, origin licensing to mark all potential origins, and origin firing to select a subset that permits assembly of the replication machinery and initiation of DNA synthesis. The extra licensed origins serve as backup and are activated only upon slowing or stalling of nearby replication forks, ensuring that DNA replication can be completed when cells encounter replication stress.<ref name="#18079179">Template:Cite journal</ref><ref name="#18579778">Template:Cite journal</ref> In the absence of stress, firing of extra origins is suppressed by a replication-associated signaling mechanism.<ref>Template:Cite journal</ref><ref>Template:Cite journal</ref> Together, the excess of licensed origins and the tight cell cycle control of origin licensing and firing embody two important strategies to prevent under- and overreplication and to maintain the integrity of eukaryotic genomes.<ref name="Ekundayo et al" />

Early studies in S. cerevisiae indicated that replication origins in eukaryotes might be recognized in a DNA-sequence-specific manner analogously to those in prokaryotes. In budding yeast, the search for genetic replicators lead to the identification of autonomously replicating sequences (ARS) that support efficient DNA replication initiation of extrachromosomal DNA.<ref name="#388229">Template:Cite journal</ref><ref name="#3311385">Template:Cite journal</ref><ref name="#2822257">Template:Cite journal</ref> These ARS regions are approximately 100-200 bp long and exhibit a multipartite organization, containing A, B1, B2, and sometimes B3 elements that together are essential for origin function.<ref name="#1536007">Template:Cite journal</ref><ref name="#7935478">Template:Cite journal</ref> The A element encompasses the conserved 11 bp ARS consensus sequence (ACS),<ref name="#6345070">Template:Cite journal</ref><ref name="#6392851">Template:Cite journal</ref> which, in conjunction with the B1 element, constitutes the primary binding site for the heterohexameric origin recognition complex (ORC), the eukaryotic replication initiator.<ref name="#7892251">Template:Cite journal</ref><ref name="#7781615">Template:Cite journal</ref><ref name="#1579162">Template:Cite journal</ref><ref name="#29973722">Template:Cite journal</ref> Within ORC, five subunits are predicated on conserved AAA+ ATPase and winged-helix folds and co-assemble into a pentameric ring that encircles DNA.<ref name="#29973722" /><ref name="#25762138">Template:Cite journal</ref><ref name="#23851460">Template:Cite journal</ref> In budding yeast ORC, DNA binding elements in the ATPase and winged-helix domains, as well as adjacent basic patch regions in some of the ORC subunits, are positioned in the central pore of the ORC ring such that they aid the DNA-sequence-specific recognition of the ACS in an ATP-dependent manner.<ref name="#29973722" /><ref name="#26456755">Template:Cite journal</ref> By contrast, the roles of the B2 and B3 elements are less clear. The B2 region is similar to the ACS in sequence and has been suggested to function as a second ORC binding site under certain conditions, or as a binding site for the replicative helicase core.<ref name="#3284655">Template:Cite journal</ref><ref name="#11756674">Template:Cite journal</ref><ref name="#28729513">Template:Cite journal</ref><ref name="#10757793">Template:Cite journal</ref><ref name="#11172708">Template:Cite journal</ref> Conversely, the B3 element recruits the transcription factor Abf1, albeit B3 is not found at all budding yeast origins and Abf1 binding does not appear to be strictly essential for origin function.<ref name="Ekundayo et al"/><ref name="#1536007" /><ref name="#1579168">Template:Cite journal</ref><ref name="#3281162">Template:Cite journal</ref>

Origin recognition in eukaryotes other than S. cerevisiae or its close relatives does not conform to the sequence-specific read-out of conserved origin DNA elements. Pursuits to isolate specific chromosomal replicator sequences more generally in eukaryotic species, either genetically or by genome-wide mapping of initiator binding or replication start sites, have failed to identify clear consensus sequences at origins.<ref name="#27436900">Template:Cite journal</ref><ref name="#19996087">Template:Cite journal</ref><ref name="#21177973">Template:Cite journal</ref><ref name="#23187890">Template:Cite journal</ref><ref name="#26560631">Template:Cite journal</ref><ref name="#21750104">Template:Cite journal</ref><ref name="#21148149">Template:Cite journal</ref><ref name="#17304213">Template:Cite journal</ref><ref name="#21813623">Template:Cite journal</ref><ref name="#28009254">Template:Cite journal</ref><ref name="#28112731">Template:Cite journal</ref><ref name="#22751019">Template:Cite journal</ref> Thus, sequence-specific DNA-initiator interactions in budding yeast signify a specialized mode for origin recognition in this system rather than an archetypal mode for origin specification across the eukaryotic domain. Nonetheless, DNA replication does initiate at discrete sites that are not randomly distributed across eukaryotic genomes, arguing that alternative means determine the chromosomal location of origins in these systems. These mechanisms involve a complex interplay between DNA accessibility, nucleotide sequence skew (both AT-richness and CpG islands have been linked to origins), Nucleosome positioning, epigenetic features, DNA topology and certain DNA structural features (e.g., G4 motifs), as well as regulatory proteins and transcriptional interference.<ref name=":15" /><ref name=":16" /><ref name=":8" /><ref name=":9" /><ref name=":11" /><ref name="#9545253">Template:Cite journal</ref><ref name="#19360092">Template:Cite journal</ref><ref name="#21750104"/><ref name="#30718387">Template:Cite journal</ref> Importantly, origin properties vary not only between different origins in an organism and among species, but some can also change during development and cell differentiation. The chorion locus in Drosophila follicle cells constitutes a well-established example for spatial and developmental control of initiation events. This region undergoes DNA-replication-dependent gene amplification at a defined stage during oogenesis and relies on the timely and specific activation of chorion origins, which in turn is regulated by origin-specific cis-elements and several protein factors, including the Myb complex, E2F1, and E2F2.<ref name="#10541550">Template:Cite journal</ref><ref name="#12490953">Template:Cite journal</ref><ref name="#15256498">Template:Cite journal</ref><ref name="#15545624">Template:Cite journal</ref><ref name="#11231579">Template:Cite journal</ref> This combinatorial specification and multifactorial regulation of metazoan origins has complicated the identification of unifying features that determine the location of replication start sites across eukaryotes more generally.<ref name="Ekundayo et al"/>

To facilitate replication initiation and origin recognition, ORC assemblies from various species have evolved specialized auxiliary domains that are thought to aid initiator targeting to chromosomal origins or chromosomes in general. For example, the Orc4 subunit in S. pombe ORC contains several AT-hooks that preferentially bind AT-rich DNA,<ref name="#10077566">Template:Cite journal</ref> while in metazoan (animal) ORC the TFIIB-like domain of Orc6 is thought to perform a similar function.<ref name="#17283052">Template:Cite journal</ref> Metazoan Orc1 proteins also harbor a bromo-adjacent homology (BAH) domain that interacts with H4K20me2-nucleosomes.<ref name="#22398447">Template:Cite journal</ref> Particularly in mammalian cells, H4K20 methylation has been reported to be required for efficient replication initiation, and the Orc1's BAH domain facilitates ORC association with chromosomes and Epstein-Barr virus origin-dependent replication.<ref name="#20953199">Template:Cite journal</ref><ref name="#23152447">Template:Cite journal</ref><ref name="#28778956">Template:Cite journal</ref><ref name="#30209253">Template:Cite journal</ref><ref name="#17066079">Template:Cite journal</ref> Therefore, it is intriguing to speculate that both observations are mechanistically linked at least in a subset of metazoa, but this possibility needs to be further explored in future studies. In addition to the recognition of certain DNA or epigenetic features, ORC also associates directly or indirectly with several partner proteins that could aid initiator recruitment, including LRWD1, PHIP (or DCAF14), HMGA1a, among others.<ref name=":7" /><ref name="#22645314">Template:Cite journal</ref><ref name="#27924004">Template:Cite journal</ref><ref name="#21029866">Template:Cite journal</ref><ref name="#20850016">Template:Cite journal</ref><ref name="#26496610">Template:Cite journal</ref><ref name="#18234858">Template:Cite journal</ref><ref name="#27272143">Template:Cite journal</ref> Interestingly, Drosophila ORC, like its budding yeast counterpart, bends DNA and negative supercoiling has been reported to enhance DNA binding of this complex, suggesting that DNA shape and malleability might influence the location of ORC binding sites across metazoan genomes.<ref name=":6" /><ref name="#29973722" /><ref name="#29899147">Template:Cite journal</ref><ref name="#18824234">Template:Cite journal</ref><ref name="#9372948">Template:Cite journal</ref> A molecular understanding for how ORC's DNA binding regions might support the read out of structural properties of the DNA duplex in metazoans rather than of specific DNA sequences as in S. cerevisiae awaits high-resolution structural information of DNA-bound metazoan initiator assemblies. Likewise, whether and how different epigenetic factors contribute to initiator recruitment in metazoan systems is poorly defined and is an important question that needs to be addressed in more detail.<ref name="Ekundayo et al"/>

Once recruited to origins, ORC and its co-factors Cdc6 and Cdt1 drive the deposition of the minichromosome maintenance 2-7 (Mcm2-7) complex onto DNA.<ref name="#28209641" /><ref name="#28717046">Template:Cite journal</ref> Like the archaeal replicative helicase core, Mcm2-7 is loaded as a head-to-head double hexamer onto DNA to license origins.<ref name="#21282109" /><ref name="#19896182" /><ref name="#19910535" /> In S-phase, Dbf4-dependent kinase (DDK) and Cyclin-dependent kinase (CDK) phosphorylate several Mcm2-7 subunits and additional initiation factors to promote the recruitment of the helicase co-activators Cdc45 and GINS, DNA melting, and ultimately bidirectional replisome assembly at a subset of the licensed origins.<ref name=":4" /><ref name="#25308420">Template:Cite journal</ref> In both yeast and metazoans, origins are free or depleted of nucleosomes, a property that is crucial for Mcm2-7 loading, indicating that chromatin state at origins regulates not only initiator recruitment but also helicase loading.<ref name="#21148149" /><ref name="#20824081">Template:Cite journal</ref><ref name="#20351051">Template:Cite journal</ref><ref name="#28322723">Template:Cite journal</ref><ref name="#20129055">Template:Cite journal</ref><ref name="#26227968">Template:Cite journal</ref> A permissive chromatin environment is further important for origin activation and has been implicated in regulating both origin efficiency and the timing of origin firing. Euchromatic origins typically contain active chromatin marks, replicate early, and are more efficient than late-replicating, heterochromatic origins, which conversely are characterized by repressive marks.<ref name=":3" /><ref name="#28322723" /><ref name="#29357061">Template:Cite book</ref> Not surprisingly, several chromatin remodelers and chromatin-modifying enzymes have been found to associate with origins and certain initiation factors,<ref name="#29357053">Template:Cite book</ref><ref name="#23751185">Template:Cite journal</ref> but how their activities impact different replication initiation events remains largely obscure. Remarkably, cis-acting “early replication control elements” (ECREs) have recently also been identified to help regulate replication timing and to influence 3D genome architecture in mammalian cells.<ref name="#30595451">Template:Cite journal</ref> Understanding the molecular and biochemical mechanisms that orchestrate this complex interplay between 3D genome organization, local and higher-order chromatin structure, and replication initiation is an exciting topic for further studies.<ref name="Ekundayo et al"/>

Why have metazoan replication origins diverged from the DNA sequence-specific recognition paradigm that determines replication start sites in prokaryotes and budding yeast? Observations that metazoan origins often co-localize with promoter regions in Drosophila and mammalian cells and that replication-transcription conflicts due to collisions of the underlying molecular machineries can lead to DNA damage suggest that proper coordination of transcription and replication is important for maintaining genome stability.<ref name="#19996087" /><ref name="#23187890" /><ref name="#21750104" /><ref name="#21813623" /><ref name="#18838675">Template:Cite journal</ref><ref name="#8638128"/><ref name="#27362223">Template:Cite journal</ref><ref name="#19560424">Template:Cite journal</ref> Recent findings also point to a more direct role of transcription in influencing the location of origins, either by inhibiting Mcm2-7 loading or by repositioning of loaded Mcm2-7 on chromosomes.<ref name="#26656162">Template:Cite journal</ref><ref name="#30718387"/> Sequence-independent (but not necessarily random) initiator binding to DNA additionally allows for flexibility in specifying helicase loading sites and, together with transcriptional interference and the variability in activation efficiencies of licensed origins, likely determines origin location and contributes to the co-regulation of DNA replication and transcriptional programs during development and cell fate transitions. Computational modeling of initiation events in S. pombe, as well as the identification of cell-type specific and developmentally-regulated origins in metazoans, are in agreement with this notion.<ref name="#21177973" /><ref name="#28112731" /><ref name="#21258320">Template:Cite journal</ref><ref name="#27168766">Template:Cite journal</ref><ref name="#22090375">Template:Cite journal</ref><ref name="#25921534">Template:Cite journal</ref><ref name="#9499407">Template:Cite journal</ref><ref name="#30718387"/> However, a large degree of flexibility in origin choice also exists among different cells within a single population,<ref name="#21750104" /><ref name="#22751019" /><ref name="#27168766" /> albeit the molecular mechanisms that lead to the heterogeneity in origin usage remain ill-defined. Mapping origins in single cells in metazoan systems and correlating these initiation events with single-cell gene expression and chromatin status will be important to elucidate whether origin choice is purely stochastic or controlled in a defined manner.<ref name="Ekundayo et al"/>

ViralEdit

File:Hhv6 genome2.png
Genome of human herpesvirus-6, a member of the Herpesviridae family. The origin of replication is labeled as "OOR."

Viruses often possess a single origin of replication.

A variety of proteins have been described as being involved in viral replication. For instance, Polyoma viruses utilize host cell DNA polymerases, which attach to a viral origin of replication if the T antigen is present.

VariationsEdit

Although DNA replication is essential for genetic inheritance, defined, site-specific replication origins are technically not a requirement for genome duplication as long as all chromosomes are copied in their entirety to maintain gene copy numbers. Certain bacteriophages and viruses, for example, can initiate DNA replication by homologous recombination independent of dedicated origins.<ref name="#9928485">Template:Cite journal</ref> Likewise, the archaeon Haloferax volcanii uses recombination-dependent initiation to duplicate its genome when its endogenous origins are deleted.<ref name="Hawkins_2013"/> Similar non-canonical initiation events through break-induced or transcription-initiated replication have been reported in E. coli and S. cerevisiae.<ref name="#28134821">Template:Cite journal</ref><ref name="#8344265">Template:Cite journal</ref><ref name="#17671506">Template:Cite journal</ref><ref name="#2446774">Template:Cite journal</ref><ref name="#25902524">Template:Cite journal</ref> Nonetheless, despite the ability of cells to sustain viability under these exceptional circumstances, origin-dependent initiation is a common strategy universally adopted across different domains of life.<ref name="Ekundayo et al"/>

In addition, detailed studies of replication initiation have focused on a limited number of model systems. The extensively studied fungi and metazoa are both members of the opisthokont supergroup and exemplify only a small fraction of the evolutionary landscape in the eukaryotic domain.<ref name="#24789819">Template:Cite journal</ref> Comparably few efforts have been directed at other eukaryotic model systems, such as kinetoplastids or tetrahymena.<ref name="#25569357">Template:Cite journal</ref><ref name="#18007594">Template:Cite journal</ref><ref name="#19153611">Template:Cite journal</ref><ref name="#29491738">Template:Cite journal</ref><ref name="#26951375">Template:Cite journal</ref><ref name="#22412905">Template:Cite journal</ref><ref name="#26481451">Template:Cite journal</ref> Surprisingly, these studies have revealed interesting differences both in origin properties and in initiator composition compared to yeast and metazoans.<ref name="Ekundayo et al"/>

See alsoEdit

ReferencesEdit

Template:Academic peer reviewed Template:Notelist Template:Reflist

Further readingEdit

Template:Refbegin

Template:Refend

External linksEdit

Template:DNA replication Template:Self-replicating organic structures