DPGLEAN19072 in OGS1.0

New model in OGS2.0DPOGS205919 
Genomic Positionscaffold1853:- 19091-22586
See gene structure
CDS Length1053
Paired RNAseq reads  960
Single RNAseq reads  3897
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA007082 (0.0)
Best Drosophila hit  U2 small nuclear riboprotein auxiliary factor 50 (4e-157)
Best Human hitsplicing factor U2AF 65 kDa subunit isoform b (1e-136)
Best NR hit (blastp)  U2 small nuclear ribonucleoprotein auxiliary factor 2 [Bombyx mori] (0.0)
Best NR hit (blastx)  U2 small nuclear ribonucleoprotein auxiliary factor 2 [Bombyx mori] (6e-175)
GeneOntology terms











  
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0005686 U2 snRNP
GO:0008187 poly-pyrimidine tract binding
GO:0005681 spliceosomal complex
GO:0000245 spliceosome assembly
GO:0005634 nucleus
GO:0046982 protein heterodimerization activity
GO:0003729 mRNA binding
GO:0000166 nucleotide binding
GO:0051168 nuclear export
GO:0000381 regulation of alternative nuclear mRNA splicing, via spliceosome
GO:0007052 mitotic spindle organization
GO:0071011 precatalytic spliceosome
InterPro families

  
IPR000504 RNA recognition motif domain
IPR006529 U2 snRNP auxilliary factor, large subunit, splicing factor
IPR012677 Nucleotide-binding, alpha-beta plait
Orthology groupMCL14185

Nucleotide sequence:

ATGCAGGCAGCAGGCCAAATCCCGGCAAACATAGTAGCAGATACACCACAGGCTGCTGTG
CCAGTAGTAGGGTCCACAATAACTAGACAAGCAAGAAGATTATATGTAGGGAACATACCA
TTTGGTGTCACAGAAGAGGAAACAATGGAGTTCTTTAATCAGCAAATGCATCTTTCTGGT
TTGGCTCAAGCTGCGGGCAACCCTGTTTTAGCGTGTCAGATAAACTTAGATAAAAACTTT
GCATTCCTTGAGTTTAGATCTATTGATGAGACTACACAGGCCATGGCATTTGATGGCATT
AATTTTAAGGGTCAAAGTTTGAAAATAAGGCGACCTCATGACTATCAACCAATGCCAGGA
ACCGAAAACCCAGCAATCAATGTACCTGCTGGTGTTATCAGTACTGTAGTTCCAGATTCA
CCCCATAAAATCTTTATTGGAGGTCTTCCTAACTATCTTAATGAAGATCAAGTGAAAGAA
CTTCTGATGTCATTTGGTCAGCTGCGAGCTTTCAACTTGGTGAAGGATTCTTCGACGGGC
CTAAGCAAGGGTTATGCCTTTGCTGAATATGTTGACATTTCTATGACTGATCAGGCTATC
GCTGGTTTGAATGGCATGCAGCTGGGTGACAAGAAACTCATTGTCCAACGGGCAAGCATT
GGAGCAAAGAACTCGACATTAGCTATGACAGGGGCTGCTCCGGTGACTCTTCAAGTGGCA
GGGCTGACATTGGCTGGTGCAGGCCCTGCCACAGAGGTACTCTGCCTCCTGAACATGGTT
ACACCGGATGAGCTTCGAGACGAAGAGGAGTATGAAGACATTTTGGAAGATATCAAAGAG
GAATGCAACAAATATGGGTGTGTGCGTAGTATAGAAATTCCAAGGCCTATTGAAGGCGTC
GAAGTGCCTGGTTGTGGAAAGGTATTTGTCGAATTTAACAGCATCGCAGATTGCCAGAAA
GCTCAACAAACATTGACAGGCAGAAAATTCAGCAACCGCGTCGTAGTTACCTCTTACTTC
GACCCCGACAAATATCACCGCAGAGAGTTTTAA

Protein sequence:

MQAAGQIPANIVADTPQAAVPVVGSTITRQARRLYVGNIPFGVTEEETMEFFNQQMHLSG
LAQAAGNPVLACQINLDKNFAFLEFRSIDETTQAMAFDGINFKGQSLKIRRPHDYQPMPG
TENPAINVPAGVISTVVPDSPHKIFIGGLPNYLNEDQVKELLMSFGQLRAFNLVKDSSTG
LSKGYAFAEYVDISMTDQAIAGLNGMQLGDKKLIVQRASIGAKNSTLAMTGAAPVTLQVA
GLTLAGAGPATEVLCLLNMVTPDELRDEEEYEDILEDIKEECNKYGCVRSIEIPRPIEGV
EVPGCGKVFVEFNSIADCQKAQQTLTGRKFSNRVVVTSYFDPDKYHRREF