DPGLEAN06112 in OGS1.0

New model in OGS2.0DPOGS212285 
Genomic Positionscaffold399:+ 25764-35008
See gene structure
CDS Length2049
Paired RNAseq reads  2127
Single RNAseq reads  6150
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011445 (1e-42)
Best Drosophila hit  CG31211, isoform B (5e-06)
Best Human hitsplicing factor, arginine/serine-rich 18 (2e-09)
Best NR hit (blastp)  PREDICTED: similar to CG31211-PA, isoform A [Apis mellifera] (6e-52)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC011125 [Tribolium castaneum] (3e-34)
GeneOntology terms


  
GO:0005634 nucleus
GO:0005575 cellular_component
GO:0003674 molecular_function
GO:0008150 biological_process
InterPro families  ND
Orthology groupMCL17180

Nucleotide sequence:

ATGTTCTCTAGCAAAGATGCCGTAAATCCAGGATATCCCACGCAGTGGGCTTTAAATCCA
ACTGCATATCAAAATATAGATTCAAGCCAAGTAGATTGGGCAGCTTTAGCACAACAATGG
ATTGCTATGAAAGAAGCCGCGGTGATTGTGTCAACGCCACAGTCAAAAGCCGATGTGGAA
GAGGGCGAAGCACCTATGGAAGTAGAAAATCCAGAGGCCAGCGAACCACCTATCGGTGCA
GGCCCCGAGTGGAATGGCTCAACGAACTCATGGGGAGGTTCTTGGAACCAGTGGGGATGG
GGTTGGAGTGGCACAGGACCAATGGATCCTAAAATGGCTAGTGATCCATCAATAGCCATG
GGTCCAATGATGGATAGCTATCCCGTAGCAGATAACAATACAACCATGCCAGGTTACACG
AGCGGTGCTGTGCCGACTCCAACATTTCAACATGGTTATTGGACGGCTCAAAACTCTGAC
CAGTCGGGCAATAACCGGAATCGCGATAGGAGATCTAAGAGCAAAACTAGAGAAATTAAA
CCTACCAGAAGTAGATCACATAGAGATAAGTTACCTTTAATACCACCTGTGATGGAACCG
CTGGTTATGCCGACTCCGACATCTACTATTGACGCAGCTAAGAGACGACAGCTACCGGCT
TGGATAAGAGAAGGCTTAGAGAAAATGGAACGAGAGAAACAGAAAGCAATAGAAAGGGAA
CAGGAGAAGAAAGCGAGAGAGGAAGCGGAGAAGGAGAAGAAGAGGATTGAAGAGGAAGAG
TTGCAGAGGCTGAGGGACGAGGGACATACTGTGCTGCCGGCCAAGAGCAAATTCGATTCA
GACTCCGAGGGCGAAGCTCCCCCTCCCCCTCCCGCTGTCATTCCTCCGCCCCTGGGACGA
AAATCAAAAGAGGAGGCTCTTCAGGATGTGATGTTAGCGGTGCGTCGTTCGCTAACAGAG
ATCCTGTTGGAGGTCACTGACAGCGAGATACAAACGGTGAGCCAGGAAGAAGTGGCTCGG
TATAACGCCGCACAAGCGTCGCGACTCAACGCTATGAAGGCGAGCAAGTCCAAGGCGCTC
GCGTCCATCGCCAGCGGTCTCGGTCTTGGAGCGTACGAGAGCAGCGAGGACAGCGGCGAC
GAAGACCAACACGATATGTCCGACCAGCAGTTACAGGAGGTCATAAGACGGAAGCGTCAA
GAATTCGAACGTACCTCACGAGAGATAGAGGCGGAGGTGAGACGAGCTGAACAACGAGAG
AATGAAGAAGAGGGCTCTCAGCATCACGACACGCCGGAGAGACCGCGCAGATCACGTTCT
TCTGCTACGCCGCCGCCGCTGGACAGTGAGACGCCAGAGAAGAAACCGGAACGTCGCCCG
TCTAAGGATAAAAGAAGCAACCACAAGTCATCTGAGAAGACAGATAGAAAATTGGACGTC
ATTCAAGAAGAGAAGACACCAAAGAAAACGAACAAATACGAAACTACACCAACCATGACG
AAAGCTATCAAGTCGAGCTCAAATTCAAGTTCCAGTGACTCAGACGACGACTCGTCTAGT
ACCAGTAAATCATCGTCAGAGAGTGAACCGGAAGTGAAAGTAGAAAACAATAGGACTAAG
AAACGTAAGCGTAGAAGTACCAGCTCCAGCGACACTAACAAGAAATCAAAGAAACATAAG
AAAGACAAATCACACAAATCGAGCGAGAAGAGTTACTCCAAGAAACATCAGGAGGAATAC
GATAGGAATGATAAATCTAGATCCAAGAGAAAAGACGAATATTATGAAAAGCACAAACAT
AGAAGCCGGGACGAAAGGTCGCACAAAGACAAGTACAGGGAGGAGTCGGACGAGGACAGG
GCGAGGAAGCGGTCCAAGCGTTCAGTTAGCTACGAATCAAGATCGGGACGACGAAAAAGT
AGAGATCGATCCGAAGATAGATCTCGAAGGCGGGACAAACGATCCTACGATAGGGATAGG
TCTTACGATAGGTCCAGAGACTACGATAGGTATGATCGCCACGACAACTATTCCCGCCAT
CGCAGATGA

Protein sequence:

MFSSKDAVNPGYPTQWALNPTAYQNIDSSQVDWAALAQQWIAMKEAAVIVSTPQSKADVE
EGEAPMEVENPEASEPPIGAGPEWNGSTNSWGGSWNQWGWGWSGTGPMDPKMASDPSIAM
GPMMDSYPVADNNTTMPGYTSGAVPTPTFQHGYWTAQNSDQSGNNRNRDRRSKSKTREIK
PTRSRSHRDKLPLIPPVMEPLVMPTPTSTIDAAKRRQLPAWIREGLEKMEREKQKAIERE
QEKKAREEAEKEKKRIEEEELQRLRDEGHTVLPAKSKFDSDSEGEAPPPPPAVIPPPLGR
KSKEEALQDVMLAVRRSLTEILLEVTDSEIQTVSQEEVARYNAAQASRLNAMKASKSKAL
ASIASGLGLGAYESSEDSGDEDQHDMSDQQLQEVIRRKRQEFERTSREIEAEVRRAEQRE
NEEEGSQHHDTPERPRRSRSSATPPPLDSETPEKKPERRPSKDKRSNHKSSEKTDRKLDV
IQEEKTPKKTNKYETTPTMTKAIKSSSNSSSSDSDDDSSSTSKSSSESEPEVKVENNRTK
KRKRRSTSSSDTNKKSKKHKKDKSHKSSEKSYSKKHQEEYDRNDKSRSKRKDEYYEKHKH
RSRDERSHKDKYREESDEDRARKRSKRSVSYESRSGRRKSRDRSEDRSRRRDKRSYDRDR
SYDRSRDYDRYDRHDNYSRHRR