New model in OGS2.0 | DPOGS202986  |
---|---|
Genomic Position | scaffold8:- 397620-433910 |
See gene structure | |
CDS Length | 4710 |
Paired RNAseq reads   | 5859 |
Single RNAseq reads   | 16446 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012331 (1e-14) |
Best Drosophila hit   | CG11148, isoform F (3e-30) |
Best Human hit | PERQ amino acid-rich with GYF domain-containing protein 2 isoform c (2e-11) |
Best NR hit (blastp)   | hypothetical protein Phum_PHUM498170 [Pediculus humanus corporis] (2e-33) |
Best NR hit (blastx)   | PREDICTED: similar to CG11148-PA, isoform A isoform 1 [Apis mellifera] (2e-34) |
GeneOntology terms    | GO:0003674 molecular_function GO:0008150 biological_process GO:0005575 cellular_component |
InterPro families   | IPR003169 GYF |
Orthology group | MCL17910 |
Nucleotide sequence:
ATGGGCGATCGTAACAATCCTATTAAGTTCGGCCCCGAGTGGCTGCGTAATTTGGCACGC
GAGCGTACTGCTGGGAGGGCAACTACAGCACAGACCACAAGGCCCGGGGCCGCAGGCGGC
AGCAGGCCTGCCGGCTCCCAGGGGGCCGGCTCGACGTGCACAGGCTCCCCGGGGGCCGGC
CCCTCTACCGCCACGGGAGTATCGAGTTCGGGCCCCGTCGGGGCCTCGCTGACTTCAGGA
CCCGCGTCGTCCGCCGCAGCTTCCAGCACGACCGCAGTCGCCGGAGCCTCAGGCACAAAC
TCAAGAAATACAAACACTAACCATCAGAAGATTCAGCTCGCTAAGCTAAGATACGGCAGG
GAGGAGATGCTGGCATTGTACGACAGGAACGCGGAGGCTCCAGAGGAATTGAAATACTTT
GACTTGTTGTACCAGCCGCGGGGGAAACCGCCTGTTGCTCTCAACACATACGACGATGAT
ACGATGGTTTATAATGGTTTGATGACTAATTTCCCTCGTCTGGGTGATATCACGAGATGC
CACATGCCGGTCTATCTTGATATTTTCCAAGATTTTATTTATCCTCACAGGTCTTCCGCT
GGCGGGCTTATCCATGATGTCATTTTCACCGGCTCTGAATCGTCGAAACCATTCCTACCC
GGTTCGAAGGGGGCAACATCATTGCATTCCGGTATTCCCCGAATGCCAGGGTATAGCGGT
GGGCTCGATGAGGAGGGCCCCAGTAGACCTTGGAGTAGCAGCAATAACAGCGGTTCACCC
AGAACTGATCAAGGCGATTGGACGACTAATAAAATGTTTCGCAGGCGACAAGCAAATAAT
ACTAATTGGAGGCAAACGTCTCGCGACGAAGGAGACGAGTGGCGTCAGGAGAATTCCAGA
CCTCCCAATCGTTCTAGTGTCGATAAATGGGACCGTGATTGGAGCGACCGGCCGTCCGGG
GAGAGGCCGCAGTCCTGGGCTCCCAGCCGGCGACAGTGGCCCGGTGACTCCAACAACGAC
GACAACTTGCCAGAGTGGGCGGTGGACAGTGCGGAGGCTGGTGCTGGTACCTTCGACTCC
TCGGGAGCCTTCCACGGGTATAGCAATGACGATACAAACATACCTAAGTCACAGGAGTCC
ACTTACCCGTTGACTCGGTCGCATACACACGGTGGTAGTATTGCGCGCTCTAAAACTGTG
GAAGAAGGCTCTGAAGAGTGGTGGGCTTCAGAGAAAGCCAAGAAGCTATCACCGAAAAGG
TTTGAGGCTGGCGATAGTAGATATAAAAAGTCTTTGAGTACTGGAACAGACGAAGTCAGC
GGTGGAGCGGTAAGTGTCAAACGGACCGATAACACGGAGAAGACTAACGATCTAGAGTCG
TCGGAGAGTGTAGACACGCCGGAACCGGAAGCGGACGCCAGTAACGCACAGGCAACTACT
GACGAGCAGAAACAAAATGATTTGAGACAAAAGCTCTCGGATAGTAAGACATTTGACGCG
TTTATGAGATCAGATATAGAATACCCTGAACCCAACGAAGACAAAGGAAACTTCCAGTCC
GTCATGATCAATTCCAACAACGGCTTGCGGCAGAAACATCAGAATATAGTGACGGTGAGC
AACGAGACCGCCATGAGCCGGCAACAGATGAACGCCACCGGCCTGTTGCAGATGCTGCAC
GGCAGGCAAATGGGCGATCAGAACCCTGAAGAGGAAACTTCTAAAACCAACGAGGAAAAG
ATCGTTGAAGATCTTATGGACATGACTTTGGAGGACGGTCGCATGCGTCCTAATCCAGCG
CACCAACCCGGTGTCATCGCATCAGGAATGATTAATCAAAGCCAGCTGCTACGTATTGCC
AGCCCCGCGGTGCCGCAGCAAGGGATGGTTCTTAATGCGGGACAAGGGATACAGAATGTT
GGCATCCCCAACCAGGCGCTCAACTCTTCGCTAGGACTGAATATGGGCCCAGGAAATGCT
CACAGCTTGCCCATGCAAGGATTGCTACCACCAGTGATGAATACTCTGAATCCCGCTATG
GGCACGGCCATGCAAGCCCGCGTTATTGGAGCGTTCCAACAGAACGCCGGACTGCCGGTA
ATGCCAAGTCCTAACGTCGCTAATAATTCTCTATTCATGGGACAAAATAACTCTCAGCAA
CTACCAAGCGGTGATATGCAGATATCGACACACACGGCTCAGAGCAACCTGTTCCCGATG
CACGGGATGCAACATGGCAACCCCGGATTCAGCTCTATCTACGGCAACATTATGCCGCCG
ACAAATATGGGGGGCAACATGTCGACTAACATCGGTGCTAATATGCCAAACAGTCTCGGC
CCCAATATGAATACCAATATGGCTGGGAATATTGGGAGCAACATCGCCGCCAATATGGGT
AACAACATTGGCGGGAACATTGGCGCAAACATTAGCGGGAACATAGGCGGAAACATCGGC
GGAAATATTGCCGGTAATATAGGTGGAAATATTGGTGCCAACATTGCTGGAAACATTGGA
AGTAATATAATTGGCAACATTGGCGGCGCCATTGGTGGGACGATTGGCGGTAACCTCGGT
GGTAACATTGGAACCAACCTCAACGCTAGCATTGGTGGTAACATCGGTGGTAACATAGGC
ACCAACATGGCCGATCAGTGGTATTATGAAGACCCCAAAAAAGTAGTCCAGGGTCCATTC
TCGTCTAAGGAGATGTACAGCTGGTATAGGGCGGGTTTCTTTAGCCCCAGCCTGATGGTG
CGTAGGGCCTGCGAAACTCATATGCGTCCGTTAGGCTCGTACGGGCCCGTGGTACCGTTC
GCGCAAGTGGAGGTACTTCCGCCATATCCGATTACTGGATTCGAACCCCGACCCCAAAAT
CATGAAATGCTAAATCAGCAGCCGGCTCTCACTATGGAAGAGTCGCTGTGGGGTCAGCCG
GCTACCAATCAAGATTTGTTGTGGATGCAGCAGATGCCTCGCGATCGCGGCAACAATCTG
CCGATGTTCTTCTGGGATCAGCCATCCTCCGCTATATCTTCCAATGCCTTATTGCCCGAG
GAGATAGCTAAGGAGATGAAAACAGAGGATCAGATCCTCGCACAGCTCCGGGCCTCCCAG
AACCTCCCCAACCCGGCACCCTTTCTGAACGATACCCCCAGCTCAAGCTCCACAGCTTTG
AGTGAAGAGTCTTATACCACGAACGTCAGCTCGACACCGGATCTCAAACAGCTGCAAAAG
TTGATGATAAGCGAAAAACTCGCTCCTCAACCAAGGGATATCAAAGCTTCTAGCGTAGAG
CGAGAGGCTAAACCTGAGAAACCAAATAAGAAGGATCAGAACACGACTGAGACCATTGCT
GCTAAGACCCAGCCTACAAAGGCCGAATCAAAGGCTGCCAAGCAATCCAAGACCGAGAAT
GAAAAGGCCAAAAACAAAGAGACTACCACAAAGAGTAAGAAACAAAAGGCCAAAGAAGAA
AAGAAAGAGGAAGAGAACAAAGTCAAAGAGGATGACAAAGAAAAAACGACACATGAAATT
TCACCGACTAAAGGCAAGAAGGAAGACAAAATGAATAGGAAGGAATTAGAAAAAGAGAAG
AAGGAATGGATCAAGGAAGGATTCACTATTGTGAAGGGCCCTGAGAAGGAAAGCAAAAAG
GAAAATAAGAAAAAACTAGAAGAAGCCAAGGCCGCTGAAGAAGCTGAACGCAAAAAGAAA
GACGAGGAGAAGTCAGTGACCGAAGAAGATAAAAAGAAGAAAACAGTAGAATCAAAAAAG
CAGCAGGAGCATCCACAACGGAACATAGAGACAAAGAAGGCGCCCTGGTCGGCACCACAG
ATAGGACAGTTGCGTGACGGACTACCGCTGGGAGAGATTCAGCGTTTGGAGAGAGAAAAG
AAATTAGAGCAGATCAGAGAACAGCAGCACATGGTACAACTGCTCGCGCAGGAGCAGGCT
GCCGTCGCCGCCAGGGAACAGGTTATCAATGAGATGCAGGCGAATAATCCGCCGTGGACC
AAGAAGAAAATTGACCGCCCCAACAACGGAACCAGCCAGAGCTTTGCTGATATTCAGGCG
GAGACACGTCGCCAAGGAACGGCTTCCGCTCATCCTCCACCGATGCCAGTGGAGGATACT
CTGACGACCAGCAGTCAGGCGCCATGGGCCAATACCCAGAACGGAGGAGGATTCTGGGAT
ACACAGCCGAATACGTCGAAAGCTGCTGAGAAAGCGAGGGACAATAGACCCGAGACCAGC
AAGAAGAAGAAACCAGCGGTCGCCGCGTCGCCAAAGAAGGAGAGCTCTCCGTGTGCTGAA
TTTGACACTTGGTCCCAATCAGCGCTCGCTTCCTGGAGCTCCAAGATTGATGTGCCAACA
TTCGTCGGCTTTCTGAAGGACATCGAATCGCCCTACGAGGTGAAGGACTACGTTAAATGC
TACTTGGGCGAGTCCAAGGACTCCAGCGACTTTGCGAGGCAGTTCCTCGAGAAACGATCT
AAACTACTCCGTGTTGGGATGGTGACCCCCTCCGATGATCTCTGCTCACCAGCTATGGCT
GTCAATCCGCGAGCCGCACTCGACTACCAGGAGGGGAAAGGCAAAAAATCAAAGAAGAAC
AAGATGTTAAAGGTGGACGCGCGTATACTGGGCTTCTCCGTGACAGCCTCCGAGGATAGG
ATCAACGTGGGGGATATCGACACCGTTTGA
Protein sequence:
MGDRNNPIKFGPEWLRNLARERTAGRATTAQTTRPGAAGGSRPAGSQGAGSTCTGSPGAG
PSTATGVSSSGPVGASLTSGPASSAAASSTTAVAGASGTNSRNTNTNHQKIQLAKLRYGR
EEMLALYDRNAEAPEELKYFDLLYQPRGKPPVALNTYDDDTMVYNGLMTNFPRLGDITRC
HMPVYLDIFQDFIYPHRSSAGGLIHDVIFTGSESSKPFLPGSKGATSLHSGIPRMPGYSG
GLDEEGPSRPWSSSNNSGSPRTDQGDWTTNKMFRRRQANNTNWRQTSRDEGDEWRQENSR
PPNRSSVDKWDRDWSDRPSGERPQSWAPSRRQWPGDSNNDDNLPEWAVDSAEAGAGTFDS
SGAFHGYSNDDTNIPKSQESTYPLTRSHTHGGSIARSKTVEEGSEEWWASEKAKKLSPKR
FEAGDSRYKKSLSTGTDEVSGGAVSVKRTDNTEKTNDLESSESVDTPEPEADASNAQATT
DEQKQNDLRQKLSDSKTFDAFMRSDIEYPEPNEDKGNFQSVMINSNNGLRQKHQNIVTVS
NETAMSRQQMNATGLLQMLHGRQMGDQNPEEETSKTNEEKIVEDLMDMTLEDGRMRPNPA
HQPGVIASGMINQSQLLRIASPAVPQQGMVLNAGQGIQNVGIPNQALNSSLGLNMGPGNA
HSLPMQGLLPPVMNTLNPAMGTAMQARVIGAFQQNAGLPVMPSPNVANNSLFMGQNNSQQ
LPSGDMQISTHTAQSNLFPMHGMQHGNPGFSSIYGNIMPPTNMGGNMSTNIGANMPNSLG
PNMNTNMAGNIGSNIAANMGNNIGGNIGANISGNIGGNIGGNIAGNIGGNIGANIAGNIG
SNIIGNIGGAIGGTIGGNLGGNIGTNLNASIGGNIGGNIGTNMADQWYYEDPKKVVQGPF
SSKEMYSWYRAGFFSPSLMVRRACETHMRPLGSYGPVVPFAQVEVLPPYPITGFEPRPQN
HEMLNQQPALTMEESLWGQPATNQDLLWMQQMPRDRGNNLPMFFWDQPSSAISSNALLPE
EIAKEMKTEDQILAQLRASQNLPNPAPFLNDTPSSSSTALSEESYTTNVSSTPDLKQLQK
LMISEKLAPQPRDIKASSVEREAKPEKPNKKDQNTTETIAAKTQPTKAESKAAKQSKTEN
EKAKNKETTTKSKKQKAKEEKKEEENKVKEDDKEKTTHEISPTKGKKEDKMNRKELEKEK
KEWIKEGFTIVKGPEKESKKENKKKLEEAKAAEEAERKKKDEEKSVTEEDKKKKTVESKK
QQEHPQRNIETKKAPWSAPQIGQLRDGLPLGEIQRLEREKKLEQIREQQHMVQLLAQEQA
AVAAREQVINEMQANNPPWTKKKIDRPNNGTSQSFADIQAETRRQGTASAHPPPMPVEDT
LTTSSQAPWANTQNGGGFWDTQPNTSKAAEKARDNRPETSKKKKPAVAASPKKESSPCAE
FDTWSQSALASWSSKIDVPTFVGFLKDIESPYEVKDYVKCYLGESKDSSDFARQFLEKRS
KLLRVGMVTPSDDLCSPAMAVNPRAALDYQEGKGKKSKKNKMLKVDARILGFSVTASEDR
INVGDIDTV