DPGLEAN12979 in OGS1.0

New model in OGS2.0  DPOGS210192
Genomic Position  scaffold2167:- 25424-42144
CDS Length  3336
Paired RNAseq reads  3184
Single RNAseq reads  8677
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA003263 (3e-10)
Best Drosophila hit  encore, isoform A (9e-19)
Best Human hit  cAMP-regulated phosphoprotein 21 isoform 1 (3e-15)
Best NR hit (blastp)  encore protein [Aedes aegypti] (1e-59)
Best NR hit (blastx)  PREDICTED: similar to encore protein [Tribolium castaneum] (4e-46)
GeneOntology terms
GO:0007282 cystoblast division
GO:0048477 oogenesis
GO:0007294 germarium-derived oocyte fate determination
GO:0007293 germarium-derived egg chamber formation
GO:0048134 germ-line cyst formation
GO:0005737 cytoplasm
GO:0007317 regulation of pole plasm oskar mRNA localization
GO:0007310 oocyte dorsal/ventral axis specification
GO:0007067 mitosis
GO:0007309 oocyte axis specification
GO:0030717 karyosome formation
GO:0003676 nucleic acid binding
InterPro families  IPR001374 Single-stranded nucleic acid binding R3H
Orthology group  MCL22689

Nucleotide sequence:

ATGGAGCAACCGGAAGTCGAGAAGGGCTGCGGCAGCTCGGAGTACGGAGCACAACTTACA
AGAAATAGAAGTTTCAAATCGAAGCAGCTAGTCCGCAGTCAGGCGATACGAGAAGCAACG
TCGCCGCCACGAACAGCGTCACCGTTGGCGAGCGACAAACCAAACGTGACCGGAAACGCG
GTCGAGAAGAGGTCGAACAGCGAGACCTCCAGTCACAGCGACGGAGGTTCCGTAGTCAGC
AATGACAGCAAACAACCCGTTGAAATACAGATAACATCCAACTCCTGGGAGGACGGTAAC
AGTGACCAACGCCAGCGGACCCGCCGCTGGCTGGGTCACCACCACTCAGACTCCGCGAGG
GACCTTCTGACCCACGGGCCGCGGGTGGCCTGCGTCTGTGGGGCCTGCGAGTGTCCGCAC
TGCAGAGGGCGCAGAAGAAAACACGCGTGTCCGACGAAACAGGACTCTGGCATCGTTTGT
TCTGACGATTGTCCTGATTGCACCGACTCAGAACATGCGGGTCCTAACGGTGACGGCGCT
AGGATGTCCGGGAGTCTGGACAGCGACGATCAGGCGTATTATTGCAGATGCATCGATCGC
AAGGACAAGCCGAAGAGTATGTGCATAGGTGGCAATGAATTTGAGGATAAGACGGATCTA
TCAGGTCCGGAGCTGGTGGCCTTCATCAAAGAAACCCTGAACAAGAACCCAAGAGACCGC
GCCACTCTGTTGAGGATTGAAAAGGAACTGCATGGTCTTGTAACAGACAACAGTCGTTGC
ATCGTCCGCTTCCCCGTGATGACGTCATACGGCCGTATGCTGGTCCACCGCTGCGCAGCT
CTGTTCCAACTGGCGCACCACCTCGACCACTCGAACAAGAACTCCGTGCTGGTGTCCAAA
AGCGGCACGTGCGGTGGCCGCCTGCCCTGTACCTCCTTCCGTGAGTGGTGCACCACCGTG
TTCCCGAGGTCACCCACACACGAGGACACGCTCGCCAAGTCCATTCTAAAGCGTTGTTCG
GGACCACCTGGTACAGCTAGCTCGGCAGCTGGCAGGAGCAAGTCGTTGGAACAACGAGAG
AGGGAATACGAGAGAGTCAGGAGAAGAATTTTCAGCACGGATAACTGCACCCAGGACGAG
ACGCAATGGCCCTGGCTGACTTCCGGACCCGTTAAGCTGCTGACACCGGACACTGGCAGG
AACAAGTTATTGAAGGTGCACTCCCTGGAGGCGAAGTCCCCGGGCCGGGGGGTGGTGTCG
AAGAGTCACAGCTTCGGGGGATACACGGACCCCCAGCAGAGAGTGCTCAGCAGACAGGGT
GACCTGGCGTCATCCAGCTGGCGTCTCTCTCCGTCCAGCTCCGGGTACAAGACCCTCAGT
TTGCGGAGCACGGATTCAGTCACACCATCACCCACAGGCGGTGCGAGTCCTGAGCCGGGG
CCCCCTTCCCTGTGCGTGCCGGGGACCTCGGGGGCCCTCGTGTGGGCTGTGACCGACATG
GCCGCGGTCCCGCCCGGGGCACTAGTCATACATCCGCAGACAGGCCGGCCGCTCACCAAC
CCGGACGGCAGCCTGTACCACTTCGACCCCGAGAACCCGCCGCGCCTGTACGCGGACCGG
GGGGAGGTCGACGGCAGCGCGGAGAAGAGACGCGGGAAACTGGAAAAACAGAACTCCTTC
ATAGATAACGAATGCGACTTCGATTCCAAGCGGGACAAGCGCTGTGACTGCGCTCCAGAC
AACGACGGAGGCCAGCGGAAACCGAAGACGCCGGCAGCGGCGAGCCCGAACAACACGCGG
ACCGCGCACGACGAGCAGGCGGCGCCGGCCGCGCCCGCGCCCAACGGAGACGTGGACCAA
AGCGAAGTCGCTGAAATAAAACAGGCTCTCGAGAATATTAAAATAACACAGAAATCACCG
GTCAAAGAAAAGAAAGACGTCCAAATCGAACCCGTCAATCAGATTCAGTCTCCGAGGTAC
GATGCGGCCAATCAGGTCGCGTCTTCGCCGAGGTTCGAGTCGCCGGCCAGTCAGACGGCC
TCGAACCAGCCGCAGAGGTTCGAAACGGCCAATCAGATGCAGCAGATTCAGAGATTCGAT
TCACCGGCCAATCACGTGCAGGCTGTCCAGAGATTCGATTCCCCCGCGAATAACAGACAG
TACGACAGATATGACGTCCCCAGTAAGGCTTTGGAGAATAGGAATTTTGACAATCAGAGG
AAGTTTCTAGAAGAGGCTTATCACGAGAGCTACGTCCCGTATAAGAGTGAGGAGGCGCCG
CAGATGATGGTGAACTATCCGTCGGAGATGGCCGACATGCAGGTGGTGCAGGCGAAGATG
ACGCCAGTCCCGGTGCAGGACAACATGCGTGGTGTGCCCGTCCCCAGTCATCCCGGGTAT
TACCAGCCCGTGCAGAACTACCAGTATGTTCCCTACAGAGTGGAGCAGCCGGTACAGCAC
CTGCAGCCACAGCTGTACCAGCAGATGACCGAGGACCAGAAGCAGATGACGCCGTCCCCA
CACAGCGAGAACACGTTCAGGATCGATCCGAGCTATCCGTACTTGGCGGACTTTAATAAT
GCTTGTGGTGCCTGTGACCCGACTCAAGCTAGAGGATACAACGTCCAGTACCAGCCGGAG
TCCGTGATGTACCCCAACGTGCTGATGCAGCCGGTGCAGCAGTATCCTTACCAGGAGCAG
ATGCAGGTGCCGTGGCAGAACGTGGCCCCGGCAGCGCCCAAGGTCGTGATGCACGACGTG
TACCCTATAGTGTACCCCAGCGTGTACCCTAACGTGTACCCACCCTACAACATAGTGTAC
CCTCAGGTCCTGCCGCAATACCCCATCCAGCAGTTCCAAGATAGACGGAAGAAGAGGTAT
GATAGAGAGGCGGAGATAGCTATGAGGATACAACAGATAAAACAACAGGTGGATATGATG
GATGGAGACAGGGGACGGAACAGCGGAGGAGGTATACTGGGCAATGCTCCGGGGAGAGTC
GGGGCCGACGACAGCCAGCTGACCACAGCGGCGCGGGCCATCGTCAACTCCATAAGGAAC
ATGCAGGCCAAGAACCCATACCACGAGTCAAGACGCTCTCCCCCGCGCCCGGAACAGAGG
AGGAGAGCCGCGGGCCCGGTGTATAGACAGATGTCTCCAGGCGCCTGGTGCCGCTCTCCA
GCTCCGATGCCGCAAACCTTCAGCCAACCTCGCCGGCCACACCCGGACAACAGGAACGCT
CGTCGGGCTCCCGACTGGCGCCGGTCGTGCGGACGAGCCGTCACCTCCGCGTTGACAGCT
GTCAAACGTCCGGCTGACGTTGACAGCTGTCAGTGA

Protein sequence:

MEQPEVEKGCGSSEYGAQLTRNRSFKSKQLVRSQAIREATSPPRTASPLASDKPNVTGNA
VEKRSNSETSSHSDGGSVVSNDSKQPVEIQITSNSWEDGNSDQRQRTRRWLGHHHSDSAR
DLLTHGPRVACVCGACECPHCRGRRRKHACPTKQDSGIVCSDDCPDCTDSEHAGPNGDGA
RMSGSLDSDDQAYYCRCIDRKDKPKSMCIGGNEFEDKTDLSGPELVAFIKETLNKNPRDR
ATLLRIEKELHGLVTDNSRCIVRFPVMTSYGRMLVHRCAALFQLAHHLDHSNKNSVLVSK
SGTCGGRLPCTSFREWCTTVFPRSPTHEDTLAKSILKRCSGPPGTASSAAGRSKSLEQRE
REYERVRRRIFSTDNCTQDETQWPWLTSGPVKLLTPDTGRNKLLKVHSLEAKSPGRGVVS
KSHSFGGYTDPQQRVLSRQGDLASSSWRLSPSSSGYKTLSLRSTDSVTPSPTGGASPEPG
PPSLCVPGTSGALVWAVTDMAAVPPGALVIHPQTGRPLTNPDGSLYHFDPENPPRLYADR
GEVDGSAEKRRGKLEKQNSFIDNECDFDSKRDKRCDCAPDNDGGQRKPKTPAAASPNNTR
TAHDEQAAPAAPAPNGDVDQSEVAEIKQALENIKITQKSPVKEKKDVQIEPVNQIQSPRY
DAANQVASSPRFESPASQTASNQPQRFETANQMQQIQRFDSPANHVQAVQRFDSPANNRQ
YDRYDVPSKALENRNFDNQRKFLEEAYHESYVPYKSEEAPQMMVNYPSEMADMQVVQAKM
TPVPVQDNMRGVPVPSHPGYYQPVQNYQYVPYRVEQPVQHLQPQLYQQMTEDQKQMTPSP
HSENTFRIDPSYPYLADFNNACGACDPTQARGYNVQYQPESVMYPNVLMQPVQQYPYQEQ
MQVPWQNVAPAAPKVVMHDVYPIVYPSVYPNVYPPYNIVYPQVLPQYPIQQFQDRRKKRY
DREAEIAMRIQQIKQQVDMMDGDRGRNSGGGILGNAPGRVGADDSQLTTAARAIVNSIRN
MQAKNPYHESRRSPPRPEQRRRAAGPVYRQMSPGAWCRSPAPMPQTFSQPRRPHPDNRNA
RRAPDWRRSCGRAVTSALTAVKRPADVDSCQ
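
Consistency check (illustrative sketch):

The listed CDS length of 3336 nt corresponds to 1112 codons, that is, 1111 residues plus one stop codon, which matches the protein sequence above. The short Python sketch below shows one way to check this, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `translate`, `CDS_FIRST_LINE` and `CDS_LAST_LINE` are hypothetical helpers introduced here for illustration and are not part of the record; only the first and last lines of the nucleotide sequence are used.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated
# with bases in the order T, C, A, G. Assumed, not stated in the record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the nucleotide sequence from the record.
# The last line is in frame because the 55 preceding lines hold 3300 nt.
CDS_FIRST_LINE = "ATGGAGCAACCGGAAGTCGAGAAGGGCTGCGGCAGCTCGGAGTACGGAGCACAACTTACA"
CDS_LAST_LINE = "GTCAAACGTCCGGCTGACGTTGACAGCTGTCAGTGA"

CDS_LENGTH = 3336                        # "CDS Length" field of the record
expected_residues = CDS_LENGTH // 3 - 1  # subtract the terminal stop codon

print(translate(CDS_FIRST_LINE))  # MEQPEVEKGCGSSEYGAQLT
print(translate(CDS_LAST_LINE))   # VKRPADVDSCQ
print(expected_residues)          # 1111
```

The two translated fragments agree with the start and end of the protein sequence, and the expected residue count agrees with its length.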