IRESite record type: plasmid_with_promoter_and_putative_IRES_translationally_characterized
The shape of the nucleic acid molecule translated: linear
The quality of the mRNA/+RNA sequence: our_best_guess
The mRNA/+RNA description:
Putative in vivo transcript driven from the CMV IE promoter and terminated after the SV40 late poly(A) signal,
containing GLuc (Gaussia luciferase) and GFP cistrons separated by the putative eIF4G IRES.
The mRNA/+RNA sequence represented in the +DNA notation:
Credibility of mRNA sequence: end-to-end_sequence_reverse_engineered_and_should_match_experiment
The name of the promoter used to express this mRNA: CMV_IE
The in vivo produced transcripts are heterogeneous (due to any of promoter?/splicing?/cleavage?/breakage?): yes
The in vivo produced heterogeneous transcripts occur due to alternative splicing: yes
A promoter reported in cDNA corresponding to IRES sequence: not tested
Summary of possible issues when IRES cDNA is experimentally transcribed in vivo:
Summary of experiments studying integrity of the in vivo transcripts in a particular host:
Integrity (uniformity) of mRNA tested using Northern-blot: not_tested
Integrity (uniformity) of mRNA tested using RNase protection: not_tested
Integrity (uniformity) of mRNA tested using 5'-RACE: not_tested
Integrity (uniformity) of mRNA tested using primer extension: not_tested
Integrity (uniformity) of mRNA tested using RT-PCR: heterogeneous_population_of_molecules_found
Integrity (uniformity) of mRNA tested using real-time quantitative polymerase chain reaction (rtqPCR): not_tested
Integrity (uniformity) of mRNA tested using RNAi: not_tested
Integrity (uniformity) of mRNA tested using S1 nuclease mapping: not_tested
Cryptic promoter presence was confirmed by expression from a promoter-less plasmid: not_tested
Cryptic promoter presence was confirmed in an experimental setup involving inducible promoter: not_tested
Integrity (uniformity) of mRNA molecules or possible promoter presence expressed in vivo was tested using another method, please specify in Remarks: not_tested
The description of the protein encoded in this ORF: green fluorescent protein
The translational frameshift (ribosome slippage) involved: 0
The ribosome read-through involved: no
The alternative forms of this protein occur by the alternative initiation of translation: not tested
The ORF absolute position (the base range includes START and STOP codons or their equivalents): 1028-1747
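As a quick consistency check on the ORF coordinates above, the following Python sketch computes the ORF length from the inclusive base range and verifies that it is a whole number of codons. The function name `orf_summary` is hypothetical and not part of IRESite; the sketch assumes 1-based inclusive coordinates with the STOP codon included, as the field label states.

```python
def orf_summary(start, end):
    """Summarize an ORF given 1-based inclusive coordinates (STOP codon included).

    Hypothetical helper for illustration only; not an IRESite function.
    """
    nt_length = end - start + 1          # inclusive range
    if nt_length % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    codons = nt_length // 3              # includes the STOP codon
    return {"nt_length": nt_length, "codons": codons, "residues": codons - 1}


# Coordinates taken from this record: 1028-1747
gfp_orf = orf_summary(1028, 1747)
print(gfp_orf)
```

For the range 1028-1747 this gives 720 nt, that is 240 codons including the STOP codon, or 239 encoded residues.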
Remarks:
In addition to the unspliced message, 4 alternatively spliced products were reported in HeLa cells transiently
transfected with this plasmid. Although the authors intended to use a plasmid depleted of
5'-splice donor sites (Supplementary Figure 6), several 5'-splice donor sites remained in the GLuc coding
region and even in the eIF4G sequence itself. Please refer to Supplementary Figure 7 in Baranick et al.
(2008).
The sequences around the 5'-splice donor sites were:
TGCTGGTGGGgtagggatga (in the eIF4G sequence itself)
GGGCCCCAGGgtgtgcagcc (in the eIF4G sequence itself, intron emphasized in lowercase letters of this IRESite
record)
CCAGGGCCAAgtcgataaga (in GLuc coding region, C-terminus)
ACCTTCGCAAgcaagatcca (in GLuc coding region, C-terminus, also functional in XIAP-based plasmid IRESite_Id:563)
The 3'-splice acceptor region within eIF4G was: ttatagctttctttccccagAT
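The junction sequences listed above can be inspected programmatically. The Python sketch below assumes that uppercase letters mark exonic bases and lowercase letters mark intronic bases, as the lowercase-intron note in this record suggests, and reports the first two intronic bases of each donor and the last two intronic bases of the acceptor. The helper names are hypothetical and are not part of IRESite.

```python
def intron_part(junction):
    """Return the lowercase (assumed intronic) part of a mixed-case junction string."""
    return "".join(base for base in junction if base.islower())


def donor_dinucleotide(junction):
    """First two intronic bases at a 5'-splice donor site."""
    return intron_part(junction)[:2]


def acceptor_dinucleotide(junction):
    """Last two intronic bases at a 3'-splice acceptor site."""
    return intron_part(junction)[-2:]


# Sequences copied verbatim from this record
DONORS = [
    "TGCTGGTGGGgtagggatga",  # eIF4G sequence
    "GGGCCCCAGGgtgtgcagcc",  # eIF4G sequence
    "CCAGGGCCAAgtcgataaga",  # GLuc coding region
    "ACCTTCGCAAgcaagatcca",  # GLuc coding region
]
ACCEPTOR = "ttatagctttctttccccagAT"

donor_ends = [donor_dinucleotide(d) for d in DONORS]
acceptor_end = acceptor_dinucleotide(ACCEPTOR)
print(donor_ends, acceptor_end)
```

With the sequences as written in this record, the first three donors begin with the canonical gt and the acceptor ends with ag, while the fourth donor as listed begins with gc. The record does not say whether this reflects a non-canonical donor or the placement of the case boundary.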
Comments from IRESite curator:
The first splice variant has an intercistronic region of 122 bp (instead of 379 bp in the unspliced message),
with both reported ORFs intact.
The second splice variant has an intercistronic region of 88 bp (instead of 379 bp in the unspliced message),
with both reported ORFs intact.
The third splice variant has the first cistron fused in another frame to the second cistron; quite
surprisingly, the fusion protein extends 17 bp beyond the GFP ORF. A scanning ribosome would have
to bypass the AUGs of 6-10 ORFs over a distance of 538-720 bp to synthesize GFP (2 of these are uORFs).
The fourth splice variant has a modified C-terminal region of the first cistron, which spans the eIF4GI 5'-UTR
insert sequence and hits a STOP codon at the very beginning of the GFP ORF. However, another ORF could
produce GFP in its original reading frame with an extra 26 amino acid residues fused to its N-terminus.
And, obviously, unaltered GFP could be produced as well. A scanning ribosome would have to bypass the AUGs of
6-9 ORFs over a distance of 538-644 bp to synthesize GFP (3 of these are uORFs).
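For the first two splice variants, the amount of sequence removed from the intercistronic region follows directly from the sizes quoted above. The Python sketch below does that arithmetic. It assumes that each of these two variants differs from the unspliced message only by an excision confined to the intercistronic region, which is consistent with both ORFs being reported intact but is not stated explicitly in the record. The variable names are hypothetical.

```python
# Intercistronic region sizes in bp, as quoted in the curator comments
UNSPLICED_BP = 379
VARIANT_BP = {"variant_1": 122, "variant_2": 88}

# Assumption: the difference is entirely due to removed intercistronic sequence
removed_bp = {name: UNSPLICED_BP - size for name, size in VARIANT_BP.items()}

for name, removed in removed_bp.items():
    remaining = VARIANT_BP[name]
    print(f"{name}: {removed} bp removed, {remaining} bp remaining")
```

Under that assumption, 257 bp are removed in the first variant and 291 bp in the second.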