Lecture Notes on RNA Structural Elements and Function Laura Landweber, 9/20/95 Double-stranded helices ------------- ||||||||| ------------- at least 50% DS in solution of any random RNA molecule incl. mRNA (~55-60% in tRNA, rRNA) short-range pairings --> hairpins, loops, bulges, junctions long-range pairing intramolecular structural roles- stabilizing 3D folding eg: Group I introns, RNase P, rRNA tested by site-specific mutagenesis functional roles (stabilization and/or specificity of splicing) Gp II 5 prime splice site selection mediated by two exon-binding sites (EBSs) Gp I 6nt IGS (internal guide sequence) = 5 prime EBS also stabilized by tert. interactions b/n 2 OHs intermolecular W.-C. pairing - bimolecular recognition nuclear pre-mRNA splicing snRNAs U4-U6: regulatory, pairing disrupted during splicing U1: recognition, 5 prime splice-site selection branch point selection, etc. translation initiation: Shine-Dalgarno binds 6nt at 3 prime end of 16S rRNA anticodon-codon interactions prokaryotic antisense regulation replication, conjugation, gene expression specific loop-loop interactions, or "kissing hairpins" --> complete hybridization of 2 RNAs RNA editing in trypanosomes - guide RNAs (gRNAs) more functional roles protein recognition of ssRNA "presented" by paired regions --> correct orientation of loop, conformation (non-sequence-specific protein-RNA interactions incl. phosphate backbone and protein side chains, eg. Glu-tRNA synthase protein recognition of duplex regions DEAD (or DEAH) box proteins, incl pre-mRNA splicing factors and 2 euk translation IFs DSRAD unwinds dsRNA in Xenopus and other mammals A -> I editing protein __ __________/ \ Hairpin Loops |||||||||| | ----------\__/ non-WC pairing often shortens unpaired regions (semantic point - not really all "loops") proof by phylogenetic analysis eg. 2 tetraloops actually biloops with one non-WC bp closing the loop wobble GU or unusual GA pair base stacking also stabilizes "loops" tetraloops and some 3nt loops like UUU interchangeable in rRNA => not interacting w/ proteins or protein interactions are sequence independent loops may change conformation on binding, requiring increase in loop free energy some more flexible eg. HIV-1 TAR has many conformations all w/ sim. free energy, not dominated by one structure, but undergoes conformational change on binding protein Tat and host proteins roles in translation initiation, propagation and termination Shine-Dalgarno (SD) or AUG sequestered in a stem and hairpin loop --> low rate of translation initiation (verified by experimental elimination of hairpin by deletion of nts on one side of the hairpin) euk hairpins between 5 prime cap and first AUG inhibit translation hairpins 12-15 nt downstream of AUG enhance init. by stalling 40S mRNA stability - structures in 3 prime UT control of translation by binding hairpin upstream of SD in own mRNAs eg. T4 DNA polymerase viral coat proteins bind hairpin including AUG of replicase genes ribosomal frameshifting --> synthesis of 2 or more proteins from one init. site eg. 5 prime-X, XXY, YYZ-3 prime can slip into -1 reading frame without disrupting 2/3 bps facilitated by ribosomal pausing. Interpretation of UGA as silenocysteine codon instead of stop facilitated when hairpin immediately downstream of UGA or ribosome stalled control of iron metabolism by iron-response element (IRE), a stem-hairpin loop unwinding vs. binding proteins regulate mRNA levels of ferritin (Fe storage) and transferrin receptor (uptake) [Fe] high --> incr. ferritin and decr. ferritin receptor synthesis and v.v. IREs in 5 prime UTR of ferritin mRNA and 3 prime UTR of receptor mRNA opposite regulation by the same RNA structures! evolutionary strategy --> concerted response hairpin binding proteins probably confer stability of mRNAs w/3 prime hairpins transcription termination signal in proks. RNA hairpin followed by 6-8 Us (--> unstable dA-rU hybrid --> release) transcription attenuation in proks by terminator hairpin and a run of Us at a 5 prime leader of downstream gene inhibits txn of downstream genes ____ Mismatches and Internal Loops ------/ \----- |||||| ||||| ----------------- filled with non-WC base pairs (pairing maximized) stabilized by tightly bound water molecules water-mediated H-bonds buttress the structure effect of widening the major groove eg. Loop E in euk 5S RNA a 9 nt asymmetric internal loop actually has 4 non-WC bps (GA, Hoogsteen UA, AA and UU) + a bulged G experiment: delete a U --> 8 nt loop, less stable (less base pairing) acts like dsRNA in chemical and enzymatic probing sens. to RNase V1 and bases less reactive to chemical probes --> specific protein recognition by TFIIIA ribosomal proteins also often recognize internal loops in rRNA HIV-1 Rev protein binds a 10 nt asymmetric internal loop Gp I intron guanosine binding site = loop between two stems ___ / \ \ / Bulge Loops -------+ +------ |||||||| ||||||| ---------------- cause bends in helices HIV-1 Tat protein binds TAR element (trans activation response) incr. txn hairpin with 3 nt bulge, conformation and sequence specific binding also binds free arg excess of unpaired As in 16S rRNA often protected from chemical modification by presumed protein contacts in 30S subunit nucleophile in Gp II and pre-mRNA 5 prime splice site = a bulged A Pseudoknots L3 (spans shallow groove) ______ / \ 3 prime <------ +-----\---> 5 prime ||||| /||| | +-----/-------+ \ / L2=0 Classical Pseudoknot with coaxial stacking \__/ (one turn of the quasi-continuous helix) L1 (spans deep groove) any tertiary interaction involving W-C bps b/n 2 regions of unpaired nts not true topological knots always defined by two stems (S1 and S2) and 2 or 3 loop regions (appendix 2) potential for coaxial stacking (appendix 2) by juxtaposition of stems loop regions can be unstructured can be long-range interaction distortion in stacking at junctions of stems and loops minimum loop sizes can be predicted (and can be really small: L1 can be 1 or 2nt and span a 5-7 base paired region because of the turn of the helix) equilibrium between pseudoknot and hairpin conformations marginally more stable than possible hairpins conformation labile --> good for control of gene expression modulated by proteins offer many sites for interaction with proteins examples: rRNA, tRNA, Gp I intron core, etc. control of translational initiation of E. coli rpsO gene encoding RPS15 tRNA-like structures in translational operators of tRNA synthetase genes