Open main menu
Home
Random
Recent changes
Special pages
Community portal
Preferences
About Wikipedia
Disclaimers
Incubator escapee wiki
Search
User menu
Talk
Dark mode
Contributions
Create account
Log in
Editing
Protein structure prediction
(section)
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
==Protein structure and terminology== Proteins are chains of [[amino acid]]s joined together by [[peptide bond]]s. Many conformations of this chain are possible due to the rotation of the main chain about the two torsion angles φ and ψ at the Cα atom (see figure). This conformational flexibility is responsible for differences in the three-dimensional structure of proteins. The peptide bonds in the chain are polar, i.e. they have separated positive and negative charges (partial charges) in the [[carbonyl group]], which can act as hydrogen bond acceptor and in the NH group, which can act as hydrogen bond donor. These groups can therefore interact in the protein structure. Proteins consist mostly of 20 different types of L-α-amino acids (the [[proteinogenic amino acid]]s). These can be classified according to the chemistry of the side chain, which also plays an important structural role. [[Glycine]] takes on a special position, as it has the smallest side chain, only one hydrogen atom, and therefore can increase the local flexibility in the protein structure. [[Cysteine]] in contrast can react with another cysteine residue to form one [[cystine]] and thereby form a cross link stabilizing the whole structure. The protein structure can be considered as a sequence of secondary structure elements, such as [[alpha helix|α helices]] and [[beta sheet|β sheets]]. In these secondary structures, regular patterns of H-bonds are formed between the main chain NH and CO groups of spatially neighboring amino acids, and the amino acids have similar [[Dihedral angle#Proteins|Φ and ψ angles]].<ref>{{cite journal |title=IUPAC-IUB Commission on Biochemical Nomenclature. Abbreviations and symbols for the description of the conformation of polypeptide chains. Tentative rules (1969) |journal=Biochemistry |date=1 September 1970 |volume=9 |issue=18 |pages=3471–3479 |doi=10.1021/bi00820a001|last1=Iupac-Iub Comm. On Biochem. Nomenclature |pmid=5509841 |s2cid=196933}}</ref> [[File:fipsi.png|thumb|200px|Torsion angles φ and ψ of the protein main chain]] The formation of these secondary structures efficiently satisfies the hydrogen bonding capacities of the peptide bonds. The secondary structures can be tightly packed in the protein core in a hydrophobic environment, but they can also present at the polar protein surface. Each amino acid side chain has a limited volume to occupy and a limited number of possible interactions with other nearby side chains, a situation that must be taken into account in molecular modeling and alignments.<ref name="Mount">{{cite book |author=Mount DM |title=Bioinformatics: Sequence and Genome Analysis |publisher=Cold Spring Harbor Laboratory Press |year=2004 |isbn=978-0-87969-712-9 |volume=2}}</ref><ref name=":1">Yousif, Ragheed Hussam, et al. "Exploring the Molecular Interactions between Neoculin and the Human Sweet Taste Receptors through Computational Approaches." ''Sains Malaysiana'' 49.3 (2020): 517-525.</ref> ===α-helix=== {{Main|α-helix}} [[File:Alpha helix.png|thumb|right|100px|An alpha-helix with hydrogen bonds (yellow dots)]] The α-helix is the most abundant type of secondary structure in proteins. The α-helix has 3.6 amino acids per turn with an H-bond formed between every fourth residue; the average length is 10 amino acids (3 turns) or 10 [[Angstrom|Å]] but varies from 5 to 40 (1.5 to 11 turns). The alignment of the H-bonds creates a dipole moment for the helix with a resulting partial positive charge at the amino end of the helix. Because this region has free NH<small>2</small> groups, it will interact with negatively charged groups such as phosphates. The most common location of α-helices is at the surface of protein cores, where they provide an interface with the aqueous environment. The inner-facing side of the helix tends to have hydrophobic amino acids and the outer-facing side hydrophilic amino acids. Thus, every third of four amino acids along the chain will tend to be hydrophobic, a pattern that can be quite readily detected. In the leucine zipper motif, a repeating pattern of leucines on the facing sides of two adjacent helices is highly predictive of the motif. A helical-wheel plot can be used to show this repeated pattern. Other α-helices buried in the protein core or in cellular membranes have a higher and more regular distribution of hydrophobic amino acids, and are highly predictive of such structures. Helices exposed on the surface have a lower proportion of hydrophobic amino acids. Amino acid content can be predictive of an α-helical region. Regions richer in [[alanine]] (A), [[glutamic acid]] (E), [[leucine]] (L), and [[methionine]] (M) and poorer in [[proline]] (P), [[glycine]] (G), [[tyrosine]] (Y), and [[serine]] (S) tend to form an α-helix. Proline destabilizes or breaks an α-helix but can be present in longer helices, forming a bend. ===β-sheet=== {{Main|β sheet}} β-sheets are formed by H-bonds between an average of 5–10 consecutive amino acids in one portion of the chain with another 5–10 farther down the chain. The interacting regions may be adjacent, with a short loop in between, or far apart, with other structures in between. Every chain may run in the same direction to form a parallel sheet, every other chain may run in the reverse chemical direction to form an anti parallel sheet, or the chains may be parallel and anti parallel to form a mixed sheet. The pattern of H bonding is different in the parallel and anti parallel configurations. Each amino acid in the interior strands of the sheet forms two H-bonds with neighboring amino acids, whereas each amino acid on the outside strands forms only one bond with an interior strand. Looking across the sheet at right angles to the strands, more distant strands are rotated slightly counterclockwise to form a left-handed twist. The Cα-atoms alternate above and below the sheet in a pleated structure, and the R side groups of the amino acids alternate above and below the pleats. The Φ and Ψ angles of the amino acids in sheets vary considerably in one region of the [[Ramachandran plot]]. It is more difficult to predict the location of β-sheets than of α-helices. The situation improves somewhat when the amino acid variation in multiple sequence alignments is taken into account. ===Deltas=== Some parts of the protein have fixed three-dimensional structure, but do not form any regular structures. They should not be confused with [[Intrinsically disordered proteins|disordered or unfolded segments]] of proteins or [[random coil]], an unfolded polypeptide chain lacking any fixed three-dimensional structure. These parts are frequently called "[[Coordination_complex#Optical_isomerism|deltas]]" (''Δ'') because they connect β-sheets and α-helices. Deltas are usually located at protein surface, and therefore mutations of their residues are more easily tolerated. Having more substitutions, insertions, and deletions in a certain region of a sequence alignment maybe an indication of some delta. The positions of [[introns]] in genomic DNA may correlate with the locations of loops in the encoded protein {{Citation needed|date=March 2012}}. Deltas also tend to have charged and polar amino acids and are frequently a component of active sites.
Edit summary
(Briefly describe your changes)
By publishing changes, you agree to the
Terms of Use
, and you irrevocably agree to release your contribution under the
CC BY-SA 4.0 License
and the
GFDL
. You agree that a hyperlink or URL is sufficient attribution under the Creative Commons license.
Cancel
Editing help
(opens in new window)