The description of the protein encoded in this ORF: N-deacetylase/N-sulfotransferase (heparan glucosaminyl) 4S (shorter) isoform
The translational frameshift (ribosome slippage) involved: 0
The ribosome read-through involved: no
The alternative forms of this protein occur by the alternative initiation of translation: not tested
The ORF absolute position (the base range includes START and STOP codons or their equivalents): 419-3037
Remarks:
Hypothetical mRNA sequence of NDST4S, assembled in silico by a two-step multiple sequence alignment and
polished by manual editing. First, EST sequences matching the cloned 5'-UTR region from brain and embryonic
sources (deposited in GenBank under GI:21780283, 669 bp) were aligned, and a consensus sequence of the 5'-UTR
region, partly overlapping the ORF region, was created. The ORF-overlapping part of that consensus was then
used to find NDST4 protein records, and their mRNA sequences were aligned to the previously created consensus.
The final consensus sequence is this hypothetical mRNA. The NDST4S variant has a shorter 5'-UTR because it is
transcribed from promoter P2, which is located in the 5'-UTR of the NDST4L variant (Fig. 1) and consists of
CAAT and TATA boxes (5'-CAAT-GTGTG-TATAAAT-3'). Therefore, the NDST4L sequence was trimmed from the 5'-end to
match the sequence of plasmid pCAT-NDST4S-eGFP provided by the original author. For an unknown reason, the
resulting 5'-UTR is 418 bp rather than the 377 bp indicated in Fig. 1 (140 bp + 237 bp for the short variant).
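
The lengths quoted above can be cross-checked against the ORF coordinates given in this record. The following
is a minimal sketch, assuming that position 1 is the first base of the assembled mRNA and that the ORF range
is 1-based and inclusive of the STOP codon, as the position field states. The function name orf_summary and
the dictionary keys are hypothetical and serve only for illustration.

```python
# Coordinates copied from this record (1-based, inclusive, STOP codon included).
ORF_START = 419
ORF_END = 3037
# 5'-UTR segment lengths quoted from Fig. 1 for the short variant.
FIG1_UTR_PARTS = (140, 237)


def orf_summary(start: int, end: int) -> dict:
    """Derive basic lengths from 1-based inclusive ORF coordinates."""
    orf_len = end - start + 1
    if orf_len % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    codons = orf_len // 3
    return {
        "utr5_len": start - 1,    # bases upstream of the START codon
        "orf_len": orf_len,       # nucleotides, START through STOP
        "codons": codons,         # includes the STOP codon
        "residues": codons - 1,   # translated amino acids (STOP excluded)
    }


summary = orf_summary(ORF_START, ORF_END)
expected_utr5 = sum(FIG1_UTR_PARTS)               # 377 bp according to Fig. 1
discrepancy = summary["utr5_len"] - expected_utr5  # 418 - 377 = 41 bp

print(summary)
print("5'-UTR difference from Fig. 1 (bp):", discrepancy)
```

Under these assumptions the 5'-UTR is 418 bp, which agrees with the remark above, the ORF spans 2619 nt
(873 codons including STOP), and the difference from the Fig. 1 value is 41 bp.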
For the consensus sequence of the 5'-UTR, records GI:26415164, GI:21780283 (the one published by Grobe et al.,
2002), GI:17092388, GI:27141716, GI:16482267 and GI:32466133 were used. For the consensus sequence of the ORF
and 3'-UTR, record GI:12000418 was used.