
Full text loading...
Category: Clinical Microbiology
Mobile Bacterial Group II Introns at the Crux of Eukaryotic Evolution, Page 1 of 2
< Previous page | Next page > /docserver/preview/fulltext/10.1128/9781555819217/9781555819200_Chap52-1.gif /docserver/preview/fulltext/10.1128/9781555819217/9781555819200_Chap52-2.gifAbstract:
Group II introns are remarkable mobile retroelements that use the combined activities of an autocatalytic RNA and an intron-encoded reverse transcriptase (RT) to propagate efficiently within genomes. But perhaps their most noteworthy feature is the pivotal role they are thought to have played in eukaryotic evolution. Mobile group II introns are ancestrally related to nuclear spliceosomal introns, retrotransposons and telomerase, which collectively comprise more than half of the human genome. Additionally, group II introns are postulated to have been a major driving force in the evolution of eukaryotes themselves, including for the emergence of the nuclear envelope to separate transcription from translation.
Full text loading...
Group II intron RNA splicing mechanism and structure. (A) Splicing and reverse splicing. Step 1. The 2′-OH of the branch-point adenosine acts as nucleophile to attack the 5′ splice site. Step 2. The 3′-OH of the upstream exon is the nucleophile that attacks the 3′ splice site to generate ligated exons and an excised intron lariat. Both reactions are reversible. (B) Group II intron secondary structure. The L. lactis Ll.LtrB group IIA intron is shown, with the six domains DI-DVI. Exons are represented by thicker lines with the EBS-IBS pairings shown by dashed black lines. The IEP ORF is looped out of DIV (not drawn to scale). (C) Group II intron crystal structure. The representation is of DI-DV of the O. iheyensis group IIC intron (PDB:4E8K) bound to ligated exons, before the spliced exon reopening reaction, provided by Marcia and Pyle ( 13 ). Colors are coded to the domain labels in panel B, although these are different introns that belong to different structural subgroups. The 5′ exon is black and the 3′ exon is dark blue. (D) Base-pairing interactions of group IIA, IIB, and IIC introns with flanking exons. Group IIA and IIB recognize 5′ exons by similar IBS1-EBS1 and IBS2-EBS2 interactions, but use different interactions to recognize 3′ exons (δ-δ’ in IIA introns and EBS3/IBS3 in IIB introns ( 214 ). Group IIC ribozymes are only ∼400 nt long, considerably smaller than IIA and IIB introns, and they are located downstream of inverted repeats, such as transcription terminators, which contribute to exon recognition along with short EBS1/IBS1 and EBS3/IBS3 interactions similar to those of IIB introns ( 215 , 216 ). Panel D is adapted from reference 4, with permission of the publisher (© Cold Spring Harbor Laboratory Press).
Group II intron and related reverse transcriptase (RTs). (A) Schematics of RTs. Two group II intron RTs, L. lactis Ll.LtrB (denoted LtrA protein; GenBank: AAB06503) and Sinorhizobium meliloti RmInt (NCBI Reference Sequence: NP_438012) are compared with two non-LTR-retrotransposon RTs, Bombyx mori R2Bm (GenBank: AAB59214) and human LINE-1 (UniProtKB/Swiss-Prot: O00370) RTs; yeast telomerase RT (GenBank: AAB64520); and retrovirus HIV-1 RT (PDB:2HMI). Conserved sequence blocks in the RT domain are numbered, and the sequence motif containing two of the conserved aspartic acid residues at the RT active site is shown below for each protein. (B) Three-dimensional model of the Ll.LtrB RT. Regions identified by unigenic evolution analysis as being required for binding the high-affinity binding site DIVa and catalytic core regions of group II intron RNAs are highlighted in red and dark blue in the left and right panels, respectively. Pink in the left panel indicates a region of the protein that may contribute to DIVa binding by stabilizing the structure of neighboring regions ( 46 ). The model was constructed by threading the amino acid sequence of the Ll.LtrB RT onto a HIV-1 RT crystal structure, with one subunit (denoted α; gray) modeled based on the catalytic p66 subunit of HIV-1 RT and the other subunit (denoted β; cyan) modeled based on the p51 subunit of HIV-1 RT ( 17 ). The N-terminal 36 amino acid residues of the Ll.LtrB RT could not be modeled based on the HIV-1 RT and are represented as spheres. APE, apurinic endonuclease domain; CTS, conserved carboxy-terminal segment found to bind RNA nonspecifically in human LINE-1 RT ( 217 ); Cys, cysteine-rich sequence conserved in LINE-1 RT; DB, DNA-binding domain in R2Bm RT; REL, restriction endonuclease-like domain; TEN, telomerase N-terminal domain; TRBD, telomerase RNA-binding domain including motifs CP and T, which contact telomerase RNA ( 139 , 218 ).
Representative retrohoming and retrotransposition pathways. (A, B) Retrohoming into cognate sites is represented by the two pathways on the left. (C, D) Retrotransposition to ectopic sites is represented by the two pathways on the right. For all panels, the intron is red (DNA solid lines, RNA hatched lines); the intron RNP is represented by a grey rectangle with a red intron lariat and an IEP with RT and maturase (X) domains and either containing or lacking an En domain. Exons of the donor (encircled D) are white, and those of the recipient (encircled R) are either white (retrohoming) or grey (retrotransposition). Each pathway ends with a product (encircled P). Green exon fragment represents primer for reverse transcription. The retrohoming pathways (A, B) differ by the presence of the En domain, and whether dsDNA (A) or ssDNA, such as at a replication fork (B), is the target. The black dot in the recipient strand represents the intron-insertion site. In pathway A, the IEP contains an En domain and after reverse splicing into the top strand, cleavage of the bottom strand occurs 9 or 10 nt downstream (step 1′), as for the Ll.LtrB and yeast aI2 introns, respectively ( 54 , 101 ). In pathway B, the IEP lacks an En domain and integrates into DNA at a replication fork (step 1) as demonstrated for RmInt1 intron ( 62 ) and En-deficient mutants of the Ll.LtrB intron ( 78 ), which preferentially use lagging or leading strand primers, respectively. Use of a leading strand primer is not shown in the figure (see reference 78 ). In both pathways, cDNA synthesis proceeds with the intron as template, using the 3′-OH of either the cleaved bottom strand (A) or an Okazaki fragment (B) to prime reverse transcription (step 2). Intron degradation and second-strand cDNA synthesis (step 3) is followed by DNA repair to generate the retrohoming products for both pathways (step 4). Host factors that participate in the process are indicated on pathway A, as established for the Ll.LtrB intron in E. coli, with those that silence the pathway shown in red, and those that promote retrohoming indicated in green ( Table 1 ) ( 72 , 73 ). Retrotransposition (C, D) occurs when the intron integrates into ectopic sites with reduced specificity. This occurs for the Ll.LtrB intron into the lagging strand template for DNA synthesis, as in pathway B, where Okazaki fragments prime cDNA synthesis (C) ( 58 ). Stimulatory and repressive host factors are again represented in green and red, respectively ( Table 1 ) ( 45 , 74 , 97 ). Alternatively, primers can be provided by nicks introduced into dsDNA by relaxase, the product of the ltrB gene that hosts the intron (pathway D) ( 93 ). Steps 1 to 4 in pathway D are as for retrohoming. EPP, error-prone polymerase.
DNA target site recognition by group IIA, IIB, and IIC introns. Target site interactions are shown for retrohoming ( 100 , 101 , 103 ) and retrotransposition ( 59 , 92 , 93 ) of the Ll.LtrB group IIA intron, and for retrohoming of group IIB introns EcI5 ( 104 ) and RmInt1 ( 77 ), and group IIC intron B.h.I1-B ( 107 ). Intron RNA regions involved in EBS1-IBS1, EBS2-IBS2, δ-δ’ or EBS3-IBS3 base-pairing interactions with the DNA target site are shown in red. A representative ectopic site is shown for the Ll.LtrB retrotransposition pathway. Base-pairs in the 5′ and 3′ exons that are recognized by the IEP are highlighted in mauve and blue, respectively. CS, bottom-strand cleavage site; IS, intron-insertion site; RF, replication fork; RH, retrohoming; RTP, retrotransposition; SL, stem-loop. CS for EcI5 is not known.
Similar RNA active sites of group II introns and the spliceosome. (A) Schematic of group II intron (left) and spliceosome (right). Group II intron elements are colored as in Fig. 1B and C , with corresponding spliceosomal elements in the same color (D1 and U5 blue; DV and U6 green; DVI and U2, purple). Conserved residues at the splice sites are in red and the branch-point adenosine (A) is circled. Elements are not drawn to scale. (B) Crystal structure of O. iheyensis group II intron in the precatalytic state (from PDB file 4FFAQ). The 5′ exon (black) is shown before 5′ splice site hydrolysis. Color-coding is as in Fig. 1B and C and Fig. 5A , with the two catalytic Mg2+ ions shown as yellow spheres bound to DV (green) ( 13 , 219 ). The putative water nucleophile is a cyan sphere. Images in panel B and D provided by Marcia and Pyle. (C) Secondary structures and Mg2+ interactions in O. iheyensis intron DV and spliceosomal U6 snRNA. DV (left, shown also in panel B) corresponds to the internal stem loop (ISL) of U6 (right). M1 and M2 are catalytic Mg2+ ions coordinated by phosphate oxygens of the nucleotides shown in red. The circled A in the intron is the adenosine nucleophile with the 2′-OH corresponding to the water molecule in B, which attacks the 5′ splice site (arrow). The introns are depicted in red hatch marks, with a small segment in green representing DV of the group II intron detailed to the right. U2 3′ to the U2-U6 pairing extends to interact with the branch-point region of the intron (arrow to circled A). The three boxed nucleotides in each case comprise the conserved catalytic triad ( 163 , 178 , 183 , 220 ). Dotted brown lines join residues involved in base triples, which are formed by two pairings between the catalytic triad and ether J2/3 or the 5′-ACAGAGA-3′ box and the third pairing to the bulge in each structure ( 163 , 180 ). (D) Base triples in DV of the O. iheyensis intron. The J2/3 nucleotides (orange ribbon) form a triple helix with the major groove at the base of DV (from reference 10 ). Two metal ions (yellow spheres) are bound near the twisted bulge loop. (E) Example of base triples in DV and U2-U6. The color-coded base triples are shown for the lower-most base-pair in the catalytic triad with a nucleotide in J2/3 in the group II intron or the 5′-ACAGAGA-3′ box in U6 of the spliceosome, as diagramed in panel C.
Comparison of Prp8 to group II intron and related RTs. Schematic comparing S. cerevisiae Prp8 (PDB:4I43) with the Ll.LtrB group II intron RT (LtrA protein; GenBank:AAB06503), Arabidopis thaliana nMat1 and nMat2 proteins (NCBI Reference Sequence: NP_174294 and NP_177575, respectively), non-LTR-retrotranspson R2Bm RT (GenBank:AAB59214), and Neurospora crassa RVT RT (GenBank:CAE76174). The plant nMat1 and nMat2 proteins are previous examples of group II intron RTs that were subsumed into nuclear genomes and retained RNA splicing function, with the nMat1 proteins acquiring a novel conserved domain (green) in place of the En domain. The Prp8 configuration of the thumb, long linker, and REL domain is similar to that in the R2Bm RT. Although group II intron RTs and the R2Bm RT lack an RNase H domain, RNase H domains are known to be acquired sporadically in different non-LTR-retrotransposon lineages ( 135 , 191 ). RVT RTs, another potential candidate for an ancestor of Prp8 ( 181 ), lack En and integrase domains and are further distinguished from group II intron and non-LTR-retrotransposon RTs by a large acidic insertion within RT-2a and by conserved N- and C-terminal domains that are not found in other proteins ( 212 ). Conserved sequence blocks in the RT domain are numbered, and the sequence motif containing two of the conserved aspartic acid residues at the RT active site is shown below for each protein. The locations of conserved RT sequence blocks in Prp8 are from structure-based sequence alignments by G Mohr (UT Austin). DB, DNA binding domain; REL, restriction endonuclease-like domain.