DPGLEAN22014 in OGS1.0

New model in OGS2.0DPOGS201772 
Genomic Positionscaffold1298:- 66-4426
See gene structure
CDS Length2373
Paired RNAseq reads  240
Single RNAseq reads  674
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008588 (8e-164)
Best Drosophila hit  CG9170, isoform A (4e-27)
Best Human hitcentrosomal protein of 164 kDa (2e-12)
Best NR hit (blastp)  TATA element modulatory factor, putative [Pediculus humanus corporis] (5e-30)
Best NR hit (blastx)  TATA element modulatory factor, putative [Pediculus humanus corporis] (5e-31)
GeneOntology terms  GO:0005515 protein binding
InterPro families  IPR001202 WW/Rsp5/WWP
Orthology groupMCL40294

Nucleotide sequence:

ATGACTTCGCCATCGGCTGTGGTGTGTCGTGAAATATTTGATGAAAATAGCCAACCATCA
GCCGAAGAAGTCTCAGACTATGCCTGTCAGCTCGGCATTGATCCAGAGAGTGAGAGTCAT
CTTCTACCCCTGGCTAGAGATGGTCTCATGCAAGCTCTGCCGGCCCCATGGAAAGCCTAT
TTTGATGAAAAGCTTCAAACTCATTACTACTACAATGAAGAAACAAAGAAGACTCAATGG
GAACATCCTCTTGATCATGTTTATAAAGAATTGGTGAAGAAAGCCAGGGATGCCTCTGTG
CAAGATGACACCTGTGCATCGGTTCAGGAACTTCTCACATCTGAAGAAAACACCAAAAAC
CTTGAACGCGTCGAAACCAAAGTGGAGAGCGAAGATGAAGATTTAAGCACCGATAGCGAA
CAACATGTGTCTGATGTCAAGGACAGTGCAAATCCTGCGGCAATGTCCGGAAGTAGACGT
TTGGCTCCTTTGGGAAGGCCACCCCTGGCTCCACTGTCCAGATTGGAGAAAAAACTGAGT
GATCTGCGAATATCGCCTCTAAGACGCAGTATAGAAAGTCCTTCATCGCCTAAACCGAGT
CTAGTCAGAAATATATCCGATAGAGATATTCTTAGCCGTCCCACGTTTGAGAGGCCTAAA
ATGTTGTTCAAACAGCAATCCGAAATAATTGATTTGAAGATGCACATTTTGACTAGTCCG
GAGGAAGAAAATTTTAGTCCACTTTTATCATCCACTAAAGTAGACAGAGGGTTACCGTTG
ACAGGCAAAGGAAACATGTTCCTCAAGCTGTCTCGTTCGAATCTTCCCAGTCCAGATACA
GAGAAGTCTCTACAACTAGATTCAGTGACTAAAAGTGACCCACCGAAGGGTATTCTGCGA
GAGAAAACATATGACGCGATCCAGAAAAAGGCTGAAATTATTCTCGGGAAGGCCAAGCAA
AGTCCGAGTTTTGAAGAAGATAAGAAAAGTGTTAGGTTCAAACTGGAAAATCTCCCGGAT
CCTACAGTCAGTCCCGGTTCAAACAGCTCCTCCGAACAGAACGACGCTCAGAGTTCAATA
CGAGCTGCCTTGCCATCACCTACGTCGACCCCCATAAATCCTATGCCTTTATCACCATTG
GTATCCAGACCCCCATTACCGCCTAAATTGGATAGCCCCAGGGAAGTTAAAAGCATTTCC
GGTTCCGAAAGAGATTCAATTGAGAGCGAAAGTTTAAATCGTGGAACTTCTCCAAGACGT
CGCCTTGTGAAACCCTCGCCAGCGGATTATATTAAACCTGAATTGTTCCAGAAGAATTTC
CAAAAAATTTCAGATTTAGTCCGAAGAAGCGATATTGAACCGACGCCTATGACTTTGGAA
GAACCTTCGTCTGACAAAGAAATCAGACCGAGATCACCGTTAATACCACTAAGGAGCAAG
ATATCAATAAATCTGATGGAAAGCATCGAATCTGAAACTTCGATTGATTCCCCTGATAGA
GAGTTCGCCAACTTGGATCTAAACGACTTAGACGACTCAAACGTTAAGGTAACCGATACA
AAAGACTCTGACAAGGAAAACAGTCAAAAAACAGAGGACTCGTCAGAGAAATCACAATCT
AAAGTGACCGAGTCACTGCCGAAACCACGAGATCCGATCCCGCAGATTAAAGTTCCTAAA
TTAGAACCCATACAGAAGCAGAATAAAATTACTCATACAGATTCAGAGGATTCCAAGAAG
AGTACAAACGACGACGAAGCAAAGACTAGCGTCAGATCAAATAGTATAGAAATACCTAAG
GCTAGACCACAACTAAAACTATCTCCAACACCAAGCTTAGGGTCGGATAGGTCGCCTAGA
CTAGAATTCGCTAAGAACTGGTCCAGTCCTTTAACGACTTTCAAACCTTTCAATAAAAAT
GTCATAACGCCATTGAAATCATCAGATTCTGCGTCCAGTTTGGGCAAAGGCATAACCAGT
CCGAGGTTAGACGGCGTCATACTATCTCAGGGGAAATCGAGCACAGATAACGTCGTTGTG
GTATATCAGTTCGAAACGCAGGAAGACACACCCAAACCAATAAAATCTCCATTGATACCG
GATATGGGGGTCAGGGACATGCTTGAGAGGAAGCAAGATGAGCGTAGAAGGTTGGAGTTA
GCTCTACAGAAGGTGTTGGAGGTTATAAGAATAGAGTTCAGTGCGAAAGAGAAAAGAATG
AGGTCGGAGTTGCAGGAAGAGTTGAAGGAAGCGGAAGAGAAGTTTCTGAACGAAAAAACG
ATGAGGTTAAAGGAACAATCGGATCGACATAAGAGGGAAATGGATCAGGTGAGAATTATA
CGACGTTTAATATTAAACCGTCTTTTGCTTTGA

Protein sequence:

MTSPSAVVCREIFDENSQPSAEEVSDYACQLGIDPESESHLLPLARDGLMQALPAPWKAY
FDEKLQTHYYYNEETKKTQWEHPLDHVYKELVKKARDASVQDDTCASVQELLTSEENTKN
LERVETKVESEDEDLSTDSEQHVSDVKDSANPAAMSGSRRLAPLGRPPLAPLSRLEKKLS
DLRISPLRRSIESPSSPKPSLVRNISDRDILSRPTFERPKMLFKQQSEIIDLKMHILTSP
EEENFSPLLSSTKVDRGLPLTGKGNMFLKLSRSNLPSPDTEKSLQLDSVTKSDPPKGILR
EKTYDAIQKKAEIILGKAKQSPSFEEDKKSVRFKLENLPDPTVSPGSNSSSEQNDAQSSI
RAALPSPTSTPINPMPLSPLVSRPPLPPKLDSPREVKSISGSERDSIESESLNRGTSPRR
RLVKPSPADYIKPELFQKNFQKISDLVRRSDIEPTPMTLEEPSSDKEIRPRSPLIPLRSK
ISINLMESIESETSIDSPDREFANLDLNDLDDSNVKVTDTKDSDKENSQKTEDSSEKSQS
KVTESLPKPRDPIPQIKVPKLEPIQKQNKITHTDSEDSKKSTNDDEAKTSVRSNSIEIPK
ARPQLKLSPTPSLGSDRSPRLEFAKNWSSPLTTFKPFNKNVITPLKSSDSASSLGKGITS
PRLDGVILSQGKSSTDNVVVVYQFETQEDTPKPIKSPLIPDMGVRDMLERKQDERRRLEL
ALQKVLEVIRIEFSAKEKRMRSELQEELKEAEEKFLNEKTMRLKEQSDRHKREMDQVRII
RRLILNRLLL