DPGLEAN21587 in OGS1.0

New model in OGS2.0  DPOGS202986
Genomic Position  scaffold8:- 397620-433910
CDS Length  4710
Paired RNAseq reads  5859
Single RNAseq reads  16446
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA012331 (1e-14)
Best Drosophila hit  CG11148, isoform F (3e-30)
Best Human hit  PERQ amino acid-rich with GYF domain-containing protein 2 isoform c (2e-11)
Best NR hit (blastp)  hypothetical protein Phum_PHUM498170 [Pediculus humanus corporis] (2e-33)
Best NR hit (blastx)  PREDICTED: similar to CG11148-PA, isoform A isoform 1 [Apis mellifera] (2e-34)
GeneOntology terms
  GO:0003674 molecular_function
  GO:0008150 biological_process
  GO:0005575 cellular_component
InterPro families  IPR003169 GYF
Orthology group  MCL17910
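The genomic position and CDS length above can be compared with a little arithmetic. The sketch below is illustrative only: `parse_position` is a hypothetical helper, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_position(text):
    """Split a 'scaffold:strand start-end' string into its four parts.

    Hypothetical helper; the format is inferred from this record only.
    """
    m = re.fullmatch(r"\s*(\S+?):([+-])\s*(\d+)-(\d+)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = m.groups()
    return scaffold, strand, int(start), int(end)

scaffold, strand, start, end = parse_position("scaffold8:- 397620-433910")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
cds_length = 4710  # "CDS Length" field of this record

# Whatever part of the span is not CDS would be introns and/or UTRs.
non_cds = span - cds_length

print(scaffold, strand, span, non_cds)
```

Under that assumption the model spans 36291 bp on the minus strand of scaffold8, of which 4710 bp are coding sequence.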

Nucleotide sequence:

ATGGGCGATCGTAACAATCCTATTAAGTTCGGCCCCGAGTGGCTGCGTAATTTGGCACGC
GAGCGTACTGCTGGGAGGGCAACTACAGCACAGACCACAAGGCCCGGGGCCGCAGGCGGC
AGCAGGCCTGCCGGCTCCCAGGGGGCCGGCTCGACGTGCACAGGCTCCCCGGGGGCCGGC
CCCTCTACCGCCACGGGAGTATCGAGTTCGGGCCCCGTCGGGGCCTCGCTGACTTCAGGA
CCCGCGTCGTCCGCCGCAGCTTCCAGCACGACCGCAGTCGCCGGAGCCTCAGGCACAAAC
TCAAGAAATACAAACACTAACCATCAGAAGATTCAGCTCGCTAAGCTAAGATACGGCAGG
GAGGAGATGCTGGCATTGTACGACAGGAACGCGGAGGCTCCAGAGGAATTGAAATACTTT
GACTTGTTGTACCAGCCGCGGGGGAAACCGCCTGTTGCTCTCAACACATACGACGATGAT
ACGATGGTTTATAATGGTTTGATGACTAATTTCCCTCGTCTGGGTGATATCACGAGATGC
CACATGCCGGTCTATCTTGATATTTTCCAAGATTTTATTTATCCTCACAGGTCTTCCGCT
GGCGGGCTTATCCATGATGTCATTTTCACCGGCTCTGAATCGTCGAAACCATTCCTACCC
GGTTCGAAGGGGGCAACATCATTGCATTCCGGTATTCCCCGAATGCCAGGGTATAGCGGT
GGGCTCGATGAGGAGGGCCCCAGTAGACCTTGGAGTAGCAGCAATAACAGCGGTTCACCC
AGAACTGATCAAGGCGATTGGACGACTAATAAAATGTTTCGCAGGCGACAAGCAAATAAT
ACTAATTGGAGGCAAACGTCTCGCGACGAAGGAGACGAGTGGCGTCAGGAGAATTCCAGA
CCTCCCAATCGTTCTAGTGTCGATAAATGGGACCGTGATTGGAGCGACCGGCCGTCCGGG
GAGAGGCCGCAGTCCTGGGCTCCCAGCCGGCGACAGTGGCCCGGTGACTCCAACAACGAC
GACAACTTGCCAGAGTGGGCGGTGGACAGTGCGGAGGCTGGTGCTGGTACCTTCGACTCC
TCGGGAGCCTTCCACGGGTATAGCAATGACGATACAAACATACCTAAGTCACAGGAGTCC
ACTTACCCGTTGACTCGGTCGCATACACACGGTGGTAGTATTGCGCGCTCTAAAACTGTG
GAAGAAGGCTCTGAAGAGTGGTGGGCTTCAGAGAAAGCCAAGAAGCTATCACCGAAAAGG
TTTGAGGCTGGCGATAGTAGATATAAAAAGTCTTTGAGTACTGGAACAGACGAAGTCAGC
GGTGGAGCGGTAAGTGTCAAACGGACCGATAACACGGAGAAGACTAACGATCTAGAGTCG
TCGGAGAGTGTAGACACGCCGGAACCGGAAGCGGACGCCAGTAACGCACAGGCAACTACT
GACGAGCAGAAACAAAATGATTTGAGACAAAAGCTCTCGGATAGTAAGACATTTGACGCG
TTTATGAGATCAGATATAGAATACCCTGAACCCAACGAAGACAAAGGAAACTTCCAGTCC
GTCATGATCAATTCCAACAACGGCTTGCGGCAGAAACATCAGAATATAGTGACGGTGAGC
AACGAGACCGCCATGAGCCGGCAACAGATGAACGCCACCGGCCTGTTGCAGATGCTGCAC
GGCAGGCAAATGGGCGATCAGAACCCTGAAGAGGAAACTTCTAAAACCAACGAGGAAAAG
ATCGTTGAAGATCTTATGGACATGACTTTGGAGGACGGTCGCATGCGTCCTAATCCAGCG
CACCAACCCGGTGTCATCGCATCAGGAATGATTAATCAAAGCCAGCTGCTACGTATTGCC
AGCCCCGCGGTGCCGCAGCAAGGGATGGTTCTTAATGCGGGACAAGGGATACAGAATGTT
GGCATCCCCAACCAGGCGCTCAACTCTTCGCTAGGACTGAATATGGGCCCAGGAAATGCT
CACAGCTTGCCCATGCAAGGATTGCTACCACCAGTGATGAATACTCTGAATCCCGCTATG
GGCACGGCCATGCAAGCCCGCGTTATTGGAGCGTTCCAACAGAACGCCGGACTGCCGGTA
ATGCCAAGTCCTAACGTCGCTAATAATTCTCTATTCATGGGACAAAATAACTCTCAGCAA
CTACCAAGCGGTGATATGCAGATATCGACACACACGGCTCAGAGCAACCTGTTCCCGATG
CACGGGATGCAACATGGCAACCCCGGATTCAGCTCTATCTACGGCAACATTATGCCGCCG
ACAAATATGGGGGGCAACATGTCGACTAACATCGGTGCTAATATGCCAAACAGTCTCGGC
CCCAATATGAATACCAATATGGCTGGGAATATTGGGAGCAACATCGCCGCCAATATGGGT
AACAACATTGGCGGGAACATTGGCGCAAACATTAGCGGGAACATAGGCGGAAACATCGGC
GGAAATATTGCCGGTAATATAGGTGGAAATATTGGTGCCAACATTGCTGGAAACATTGGA
AGTAATATAATTGGCAACATTGGCGGCGCCATTGGTGGGACGATTGGCGGTAACCTCGGT
GGTAACATTGGAACCAACCTCAACGCTAGCATTGGTGGTAACATCGGTGGTAACATAGGC
ACCAACATGGCCGATCAGTGGTATTATGAAGACCCCAAAAAAGTAGTCCAGGGTCCATTC
TCGTCTAAGGAGATGTACAGCTGGTATAGGGCGGGTTTCTTTAGCCCCAGCCTGATGGTG
CGTAGGGCCTGCGAAACTCATATGCGTCCGTTAGGCTCGTACGGGCCCGTGGTACCGTTC
GCGCAAGTGGAGGTACTTCCGCCATATCCGATTACTGGATTCGAACCCCGACCCCAAAAT
CATGAAATGCTAAATCAGCAGCCGGCTCTCACTATGGAAGAGTCGCTGTGGGGTCAGCCG
GCTACCAATCAAGATTTGTTGTGGATGCAGCAGATGCCTCGCGATCGCGGCAACAATCTG
CCGATGTTCTTCTGGGATCAGCCATCCTCCGCTATATCTTCCAATGCCTTATTGCCCGAG
GAGATAGCTAAGGAGATGAAAACAGAGGATCAGATCCTCGCACAGCTCCGGGCCTCCCAG
AACCTCCCCAACCCGGCACCCTTTCTGAACGATACCCCCAGCTCAAGCTCCACAGCTTTG
AGTGAAGAGTCTTATACCACGAACGTCAGCTCGACACCGGATCTCAAACAGCTGCAAAAG
TTGATGATAAGCGAAAAACTCGCTCCTCAACCAAGGGATATCAAAGCTTCTAGCGTAGAG
CGAGAGGCTAAACCTGAGAAACCAAATAAGAAGGATCAGAACACGACTGAGACCATTGCT
GCTAAGACCCAGCCTACAAAGGCCGAATCAAAGGCTGCCAAGCAATCCAAGACCGAGAAT
GAAAAGGCCAAAAACAAAGAGACTACCACAAAGAGTAAGAAACAAAAGGCCAAAGAAGAA
AAGAAAGAGGAAGAGAACAAAGTCAAAGAGGATGACAAAGAAAAAACGACACATGAAATT
TCACCGACTAAAGGCAAGAAGGAAGACAAAATGAATAGGAAGGAATTAGAAAAAGAGAAG
AAGGAATGGATCAAGGAAGGATTCACTATTGTGAAGGGCCCTGAGAAGGAAAGCAAAAAG
GAAAATAAGAAAAAACTAGAAGAAGCCAAGGCCGCTGAAGAAGCTGAACGCAAAAAGAAA
GACGAGGAGAAGTCAGTGACCGAAGAAGATAAAAAGAAGAAAACAGTAGAATCAAAAAAG
CAGCAGGAGCATCCACAACGGAACATAGAGACAAAGAAGGCGCCCTGGTCGGCACCACAG
ATAGGACAGTTGCGTGACGGACTACCGCTGGGAGAGATTCAGCGTTTGGAGAGAGAAAAG
AAATTAGAGCAGATCAGAGAACAGCAGCACATGGTACAACTGCTCGCGCAGGAGCAGGCT
GCCGTCGCCGCCAGGGAACAGGTTATCAATGAGATGCAGGCGAATAATCCGCCGTGGACC
AAGAAGAAAATTGACCGCCCCAACAACGGAACCAGCCAGAGCTTTGCTGATATTCAGGCG
GAGACACGTCGCCAAGGAACGGCTTCCGCTCATCCTCCACCGATGCCAGTGGAGGATACT
CTGACGACCAGCAGTCAGGCGCCATGGGCCAATACCCAGAACGGAGGAGGATTCTGGGAT
ACACAGCCGAATACGTCGAAAGCTGCTGAGAAAGCGAGGGACAATAGACCCGAGACCAGC
AAGAAGAAGAAACCAGCGGTCGCCGCGTCGCCAAAGAAGGAGAGCTCTCCGTGTGCTGAA
TTTGACACTTGGTCCCAATCAGCGCTCGCTTCCTGGAGCTCCAAGATTGATGTGCCAACA
TTCGTCGGCTTTCTGAAGGACATCGAATCGCCCTACGAGGTGAAGGACTACGTTAAATGC
TACTTGGGCGAGTCCAAGGACTCCAGCGACTTTGCGAGGCAGTTCCTCGAGAAACGATCT
AAACTACTCCGTGTTGGGATGGTGACCCCCTCCGATGATCTCTGCTCACCAGCTATGGCT
GTCAATCCGCGAGCCGCACTCGACTACCAGGAGGGGAAAGGCAAAAAATCAAAGAAGAAC
AAGATGTTAAAGGTGGACGCGCGTATACTGGGCTTCTCCGTGACAGCCTCCGAGGATAGG
ATCAACGTGGGGGATATCGACACCGTTTGA

Protein sequence:

MGDRNNPIKFGPEWLRNLARERTAGRATTAQTTRPGAAGGSRPAGSQGAGSTCTGSPGAG
PSTATGVSSSGPVGASLTSGPASSAAASSTTAVAGASGTNSRNTNTNHQKIQLAKLRYGR
EEMLALYDRNAEAPEELKYFDLLYQPRGKPPVALNTYDDDTMVYNGLMTNFPRLGDITRC
HMPVYLDIFQDFIYPHRSSAGGLIHDVIFTGSESSKPFLPGSKGATSLHSGIPRMPGYSG
GLDEEGPSRPWSSSNNSGSPRTDQGDWTTNKMFRRRQANNTNWRQTSRDEGDEWRQENSR
PPNRSSVDKWDRDWSDRPSGERPQSWAPSRRQWPGDSNNDDNLPEWAVDSAEAGAGTFDS
SGAFHGYSNDDTNIPKSQESTYPLTRSHTHGGSIARSKTVEEGSEEWWASEKAKKLSPKR
FEAGDSRYKKSLSTGTDEVSGGAVSVKRTDNTEKTNDLESSESVDTPEPEADASNAQATT
DEQKQNDLRQKLSDSKTFDAFMRSDIEYPEPNEDKGNFQSVMINSNNGLRQKHQNIVTVS
NETAMSRQQMNATGLLQMLHGRQMGDQNPEEETSKTNEEKIVEDLMDMTLEDGRMRPNPA
HQPGVIASGMINQSQLLRIASPAVPQQGMVLNAGQGIQNVGIPNQALNSSLGLNMGPGNA
HSLPMQGLLPPVMNTLNPAMGTAMQARVIGAFQQNAGLPVMPSPNVANNSLFMGQNNSQQ
LPSGDMQISTHTAQSNLFPMHGMQHGNPGFSSIYGNIMPPTNMGGNMSTNIGANMPNSLG
PNMNTNMAGNIGSNIAANMGNNIGGNIGANISGNIGGNIGGNIAGNIGGNIGANIAGNIG
SNIIGNIGGAIGGTIGGNLGGNIGTNLNASIGGNIGGNIGTNMADQWYYEDPKKVVQGPF
SSKEMYSWYRAGFFSPSLMVRRACETHMRPLGSYGPVVPFAQVEVLPPYPITGFEPRPQN
HEMLNQQPALTMEESLWGQPATNQDLLWMQQMPRDRGNNLPMFFWDQPSSAISSNALLPE
EIAKEMKTEDQILAQLRASQNLPNPAPFLNDTPSSSSTALSEESYTTNVSSTPDLKQLQK
LMISEKLAPQPRDIKASSVEREAKPEKPNKKDQNTTETIAAKTQPTKAESKAAKQSKTEN
EKAKNKETTTKSKKQKAKEEKKEEENKVKEDDKEKTTHEISPTKGKKEDKMNRKELEKEK
KEWIKEGFTIVKGPEKESKKENKKKLEEAKAAEEAERKKKDEEKSVTEEDKKKKTVESKK
QQEHPQRNIETKKAPWSAPQIGQLRDGLPLGEIQRLEREKKLEQIREQQHMVQLLAQEQA
AVAAREQVINEMQANNPPWTKKKIDRPNNGTSQSFADIQAETRRQGTASAHPPPMPVEDT
LTTSSQAPWANTQNGGGFWDTQPNTSKAAEKARDNRPETSKKKKPAVAASPKKESSPCAE
FDTWSQSALASWSSKIDVPTFVGFLKDIESPYEVKDYVKCYLGESKDSSDFARQFLEKRS
KLLRVGMVTPSDDLCSPAMAVNPRAALDYQEGKGKKSKKNKMLKVDARILGFSVTASEDR
INVGDIDTV
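
The two sequences above can be cross-checked against each other. The sketch below assumes the standard genetic code, which the record does not state explicitly; `translate` is a hypothetical helper. It translates the first and last lines of the nucleotide sequence and checks that the CDS length agrees with the protein length.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First and last lines of the nucleotide sequence in this record.
# Each full line holds 60 nt (20 codons), so every line starts in frame.
first_line = "ATGGGCGATCGTAACAATCCTATTAAGTTCGGCCCCGAGTGGCTGCGTAATTTGGCACGC"
last_line = "ATCAACGTGGGGGATATCGACACCGTTTGA"

head = translate(first_line)
tail = translate(last_line)
print(head)
print(tail)

# 4710 nt = 1570 codons; dropping the stop codon leaves 1569 residues.
cds_length = 4710
expected_protein_length = cds_length // 3 - 1
print(expected_protein_length)
```

The translated ends match the start (`MGDRNNPIKFGPEWLRNLAR`) and end (`INVGDIDTV`, followed by a TGA stop) of the protein sequence, and the expected length of 1569 residues matches the 26 full lines of 60 residues plus the final 9.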