DPGLEAN18272 in OGS1.0

New model in OGS2.0DPOGS204526 
Genomic Positionscaffold943:- 4750-27070
See gene structure
CDS Length2964
Paired RNAseq reads  15344
Single RNAseq reads  39681
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004309 (5e-12)
Best Drosophila hit  Dek, isoform A (2e-17)
Best Human hitprotein DEK isoform 1 (2e-14)
Best NR hit (blastp)  PREDICTED: similar to LOC398543 protein [Tribolium castaneum] (2e-23)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC008666 [Tribolium castaneum] (7e-21)
GeneOntology terms

  
GO:0006397 mRNA processing
GO:0005634 nucleus
GO:0003676 nucleic acid binding
InterPro families  IPR014876 DEK, C-terminal
Orthology groupMCL24234

Nucleotide sequence:

ATGTCGGGTGATACCGATAAAGCGAAAATCAGTGACAATCAGGATGAAGACAAGAAATCC
GGAACAGGCGACGGGCAGACGACGGAAGACGAATCATCCCAGGACTCGAAGGGTGAGTCG
GCGCCAATGGACGCGCAAGGTAACAACGTAGCAGACAATATAGAAGGAGACGCGAAGTCA
ACGACGGCGAACGGCAAAGAAGATGACTGTCAGGGCGAGGACTCCAAGGATGATAGGGCT
GTGAAGACGGAACACGCTAAGAAGGTAATGTACAAAAACGGTGAAACCCACGACGATGAC
GACAAAGAGGATTTGAAGGAAAATGACAAAGGGGCTAAGAAGTTAGCGCCGAAGAAGAAG
CCCAAGAAAGAGGACACCGAAGAAGAGGAAGAAGATGAAGAGGAGGAAGATGAGGAGGAG
GAGGATGAAGAAGGAGAAGAGGGCGATGAGGAGGAAGAGCAGAAGACAAAAGAGAAAGTA
AAGAAACCAGTCAAGAAACCTAAAGACGGAGAGGAGGAAGGTGACGACGAGGAAGAGGGG
GAGGAGGAGGGGGAGGAAGAAGAGGAAGAGGAGGAGGAAGAAGAGGCTCCGAAACCGAAA
CCCAAAAAGGAGCCAACTGAACCCCCTGTACCACTACCAGCTGGTAAAGGAATACCCTTG
GGGCATATCAGCAACGTGGAAGTGTCACTGTCGCGCTTCAAGACCCAAGACCAGAAGATA
TTACACCAGTATTTGTACGGACAACTCTGCCTGGATCGCAACGTCAAAAGGAACATCAAG
AAGTTCAAAGGCTACGAATGGGCGATCGGTTCCACGGAGTACAAAGCTAAACTAGAAGAA
ACAGCCAAAATGGAGCCCAAACAGTTGAGGACGATGTGCGAGATGTTGGACTTGGACAAA
AAAGGTGGCGCCAGTGAGTTGGCTGCTCGTCTGGTCGGTTTCCTTCAGCAGCCGGTCGCG
AACTCTCCCCACGCCCGCGGCGTTGCTCGCCCCCCTACCACGCAAGCGGCGACACCCGGC
GGCCGACCGAGACGATCAGCGGCCGTCAAGATACACAACAGAGATATTAATATCACGTCT
ACTGTGGCGGCTGCGATGCCCCCTTCCCGCGCTCCCTGTGTTGTCTACCCCGAGCACACC
CTTCCAACAGCCGAGGCCACTCCCCCGGCGGTTTCGCACGTGCCACTCGGACCTCAGTGT
TCGTTAAAAAGTGTGCGGCGTCGTAGGTGTCACCACCCGCACCCCTATCACGGCGCGAAA
CACGTGGTTGTAACGATGTCGTGTAGCTACTCGGACGAAGAGTATGAATCTGATCCGGAG
ACCAAGGTGAAGGGTCCCAAGCAGCCCAAGGACGGCTCGGAGGACTCTGATGGCTCCTTC
AACCCGAGCGGATCTGAGGCGGACTCGGACTTCGACCCTGAAGGTGGTGAGGGTGTGAGC
GGAGCCGCGCGCAAGAGGAAGAGCTCTGGACGACGCCGGTCCAGTAAGGGGAAGAGGGGG
CGCAAGAGCAAGGGGAGGAAGAAGGGTGGCAGCAGAGGCCGCGGCCGACGGGCACGGTCA
GACAGCGAGGACGAGAGTGAACGCTCTGATAGTGACAGCGAATTGGACTCGGCCAGCGAC
GGAGACGAATCAGATGAACCGAAGTCCAAACGTGGTAGGCCCGCGGGGTCCGTGTCTAAG
GGTCGCAAGGGAGCTGTAGCAAAGGCTAGCGCTAAAGCGACTCCCGCTAAGCGGAAAGCG
CCTACACCCACAGGGAAGAAGAAGGCCGGTGCCAAGCCAGTCGGTAGACCAGCCAAGAAG
GGCAAGCGGGCGTCCTCCGACGAATCTGGAGATGGAGAAGAAGGCAGCGAGGAAGAAGAT
GAAGAAGGCAGCGAGGAGGAGGAGAGCGGAGAAGAGGATGACGAGCCAACTGACAAGAAA
GCCAAGCGTCCACCTACAGACGAGGAGATTAAGAAGTACGTGAAGCAGATCCTGGAGGGC
GCGAACCTGGAGCAGATCACCATGAAGACGGTCTGCAAGCAGGTCTACAGCCACTATCCG
GACTTTGACCTGGCGCACAAGAAGGACTTCATTAAAGCTACTGTCAAATCGACCTGCTGT
TGCAAGGGCTCTGATGATGAATCAGATCAAGGGGATCCGGGGGCGGTGCTGAATGTGCCT
GAGTTTCGAATACCAAATGGCCTGGGTGTACCATTGGGATACTTACACAATGTAAACGAC
GCCCTCAACCGATACCATGTTGTGGACCTGAAGATGTTGCATCTCTATTTATATGGAGTA
AGCGGGACGAGGGATCAGGCTTTCAGACTCGTGGAGTTCCTCGTGAAGCCGGAGTCCAAC
TCTCCATATTTCAGATGTGTTCGTACGATCGAGAGCATTGCTAAGACTTACCATAGAGCT
GTTTCGGACAACGACTATGAATCCGATGCTGAGACGATGGTGGCGTGTTCAGAGAGGGCC
AGAGATGGGTCAGACTACTCCGAAGGTTCGTGTGTTGGAAGTTCAGAGTGTTGTCGCGCG
ATCTGTGACATCAGTGACAGCGAGAAGGAGGCTGACACTAAGATGATATCTGACAAGGAA
ATATCCTCTTCTATCGATGACACCACTGTGCTATCGCAGCCAGATGAAAGTGGCTCTTTG
GCGTCCGGGGAGATAACAAGCGTGTGTGATGAAAAAGTAGCCAAACCAGGGATTGGACAG
GAAATTGCTTTACCCGGTTATGCTAGAGCGAGACGTCTACCAACAGACGAGGATATAAGG
AAGTTTTTGCAGAAAATTTTGACGGGATTGAATCTGGAGAAGATTACTATGAACACCATC
TGCAGAGCGGTTTACAGCCAGTTCCCGGACTACGATCTGGTCAACAAGAGGGACTTCATC
ACAGCTACAGTCAAATCGGTAATTATTGAACCAAAATTAAAGGCCCCTAGTGGCGACTTT
TGTATTTGTCTATCCATATGCTAG

Protein sequence:

MSGDTDKAKISDNQDEDKKSGTGDGQTTEDESSQDSKGESAPMDAQGNNVADNIEGDAKS
TTANGKEDDCQGEDSKDDRAVKTEHAKKVMYKNGETHDDDDKEDLKENDKGAKKLAPKKK
PKKEDTEEEEEDEEEEDEEEEDEEGEEGDEEEEQKTKEKVKKPVKKPKDGEEEGDDEEEG
EEEGEEEEEEEEEEEAPKPKPKKEPTEPPVPLPAGKGIPLGHISNVEVSLSRFKTQDQKI
LHQYLYGQLCLDRNVKRNIKKFKGYEWAIGSTEYKAKLEETAKMEPKQLRTMCEMLDLDK
KGGASELAARLVGFLQQPVANSPHARGVARPPTTQAATPGGRPRRSAAVKIHNRDINITS
TVAAAMPPSRAPCVVYPEHTLPTAEATPPAVSHVPLGPQCSLKSVRRRRCHHPHPYHGAK
HVVVTMSCSYSDEEYESDPETKVKGPKQPKDGSEDSDGSFNPSGSEADSDFDPEGGEGVS
GAARKRKSSGRRRSSKGKRGRKSKGRKKGGSRGRGRRARSDSEDESERSDSDSELDSASD
GDESDEPKSKRGRPAGSVSKGRKGAVAKASAKATPAKRKAPTPTGKKKAGAKPVGRPAKK
GKRASSDESGDGEEGSEEEDEEGSEEEESGEEDDEPTDKKAKRPPTDEEIKKYVKQILEG
ANLEQITMKTVCKQVYSHYPDFDLAHKKDFIKATVKSTCCCKGSDDESDQGDPGAVLNVP
EFRIPNGLGVPLGYLHNVNDALNRYHVVDLKMLHLYLYGVSGTRDQAFRLVEFLVKPESN
SPYFRCVRTIESIAKTYHRAVSDNDYESDAETMVACSERARDGSDYSEGSCVGSSECCRA
ICDISDSEKEADTKMISDKEISSSIDDTTVLSQPDESGSLASGEITSVCDEKVAKPGIGQ
EIALPGYARARRLPTDEDIRKFLQKILTGLNLEKITMNTICRAVYSQFPDYDLVNKRDFI
TATVKSVIIEPKLKAPSGDFCICLSIC