New model in OGS2.0 | DPOGS210192  |
---|---|
Genomic Position | scaffold2167:- 25424-42144 |
See gene structure | |
CDS Length | 3336 |
Paired RNAseq reads   | 3184 |
Single RNAseq reads   | 8677 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003263 (3e-10) |
Best Drosophila hit   | encore, isoform A (9e-19) |
Best Human hit | cAMP-regulated phosphoprotein 21 isoform 1 (3e-15) |
Best NR hit (blastp)   | encore protein [Aedes aegypti] (1e-59) |
Best NR hit (blastx)   | PREDICTED: similar to encore protein [Tribolium castaneum] (4e-46) |
GeneOntology terms    | GO:0007282 cystoblast division GO:0048477 oogenesis GO:0007294 germarium-derived oocyte fate determination GO:0007293 germarium-derived egg chamber formation GO:0048134 germ-line cyst formation GO:0005737 cytoplasm GO:0007317 regulation of pole plasm oskar mRNA localization GO:0007310 oocyte dorsal/ventral axis specification GO:0007067 mitosis GO:0007309 oocyte axis specification GO:0030717 karyosome formation GO:0003676 nucleic acid binding |
InterPro families   | IPR001374 Single-stranded nucleic acid binding R3H |
Orthology group | MCL22689 |
Nucleotide sequence:
ATGGAGCAACCGGAAGTCGAGAAGGGCTGCGGCAGCTCGGAGTACGGAGCACAACTTACA
AGAAATAGAAGTTTCAAATCGAAGCAGCTAGTCCGCAGTCAGGCGATACGAGAAGCAACG
TCGCCGCCACGAACAGCGTCACCGTTGGCGAGCGACAAACCAAACGTGACCGGAAACGCG
GTCGAGAAGAGGTCGAACAGCGAGACCTCCAGTCACAGCGACGGAGGTTCCGTAGTCAGC
AATGACAGCAAACAACCCGTTGAAATACAGATAACATCCAACTCCTGGGAGGACGGTAAC
AGTGACCAACGCCAGCGGACCCGCCGCTGGCTGGGTCACCACCACTCAGACTCCGCGAGG
GACCTTCTGACCCACGGGCCGCGGGTGGCCTGCGTCTGTGGGGCCTGCGAGTGTCCGCAC
TGCAGAGGGCGCAGAAGAAAACACGCGTGTCCGACGAAACAGGACTCTGGCATCGTTTGT
TCTGACGATTGTCCTGATTGCACCGACTCAGAACATGCGGGTCCTAACGGTGACGGCGCT
AGGATGTCCGGGAGTCTGGACAGCGACGATCAGGCGTATTATTGCAGATGCATCGATCGC
AAGGACAAGCCGAAGAGTATGTGCATAGGTGGCAATGAATTTGAGGATAAGACGGATCTA
TCAGGTCCGGAGCTGGTGGCCTTCATCAAAGAAACCCTGAACAAGAACCCAAGAGACCGC
GCCACTCTGTTGAGGATTGAAAAGGAACTGCATGGTCTTGTAACAGACAACAGTCGTTGC
ATCGTCCGCTTCCCCGTGATGACGTCATACGGCCGTATGCTGGTCCACCGCTGCGCAGCT
CTGTTCCAACTGGCGCACCACCTCGACCACTCGAACAAGAACTCCGTGCTGGTGTCCAAA
AGCGGCACGTGCGGTGGCCGCCTGCCCTGTACCTCCTTCCGTGAGTGGTGCACCACCGTG
TTCCCGAGGTCACCCACACACGAGGACACGCTCGCCAAGTCCATTCTAAAGCGTTGTTCG
GGACCACCTGGTACAGCTAGCTCGGCAGCTGGCAGGAGCAAGTCGTTGGAACAACGAGAG
AGGGAATACGAGAGAGTCAGGAGAAGAATTTTCAGCACGGATAACTGCACCCAGGACGAG
ACGCAATGGCCCTGGCTGACTTCCGGACCCGTTAAGCTGCTGACACCGGACACTGGCAGG
AACAAGTTATTGAAGGTGCACTCCCTGGAGGCGAAGTCCCCGGGCCGGGGGGTGGTGTCG
AAGAGTCACAGCTTCGGGGGATACACGGACCCCCAGCAGAGAGTGCTCAGCAGACAGGGT
GACCTGGCGTCATCCAGCTGGCGTCTCTCTCCGTCCAGCTCCGGGTACAAGACCCTCAGT
TTGCGGAGCACGGATTCAGTCACACCATCACCCACAGGCGGTGCGAGTCCTGAGCCGGGG
CCCCCTTCCCTGTGCGTGCCGGGGACCTCGGGGGCCCTCGTGTGGGCTGTGACCGACATG
GCCGCGGTCCCGCCCGGGGCACTAGTCATACATCCGCAGACAGGCCGGCCGCTCACCAAC
CCGGACGGCAGCCTGTACCACTTCGACCCCGAGAACCCGCCGCGCCTGTACGCGGACCGG
GGGGAGGTCGACGGCAGCGCGGAGAAGAGACGCGGGAAACTGGAAAAACAGAACTCCTTC
ATAGATAACGAATGCGACTTCGATTCCAAGCGGGACAAGCGCTGTGACTGCGCTCCAGAC
AACGACGGAGGCCAGCGGAAACCGAAGACGCCGGCAGCGGCGAGCCCGAACAACACGCGG
ACCGCGCACGACGAGCAGGCGGCGCCGGCCGCGCCCGCGCCCAACGGAGACGTGGACCAA
AGCGAAGTCGCTGAAATAAAACAGGCTCTCGAGAATATTAAAATAACACAGAAATCACCG
GTCAAAGAAAAGAAAGACGTCCAAATCGAACCCGTCAATCAGATTCAGTCTCCGAGGTAC
GATGCGGCCAATCAGGTCGCGTCTTCGCCGAGGTTCGAGTCGCCGGCCAGTCAGACGGCC
TCGAACCAGCCGCAGAGGTTCGAAACGGCCAATCAGATGCAGCAGATTCAGAGATTCGAT
TCACCGGCCAATCACGTGCAGGCTGTCCAGAGATTCGATTCCCCCGCGAATAACAGACAG
TACGACAGATATGACGTCCCCAGTAAGGCTTTGGAGAATAGGAATTTTGACAATCAGAGG
AAGTTTCTAGAAGAGGCTTATCACGAGAGCTACGTCCCGTATAAGAGTGAGGAGGCGCCG
CAGATGATGGTGAACTATCCGTCGGAGATGGCCGACATGCAGGTGGTGCAGGCGAAGATG
ACGCCAGTCCCGGTGCAGGACAACATGCGTGGTGTGCCCGTCCCCAGTCATCCCGGGTAT
TACCAGCCCGTGCAGAACTACCAGTATGTTCCCTACAGAGTGGAGCAGCCGGTACAGCAC
CTGCAGCCACAGCTGTACCAGCAGATGACCGAGGACCAGAAGCAGATGACGCCGTCCCCA
CACAGCGAGAACACGTTCAGGATCGATCCGAGCTATCCGTACTTGGCGGACTTTAATAAT
GCTTGTGGTGCCTGTGACCCGACTCAAGCTAGAGGATACAACGTCCAGTACCAGCCGGAG
TCCGTGATGTACCCCAACGTGCTGATGCAGCCGGTGCAGCAGTATCCTTACCAGGAGCAG
ATGCAGGTGCCGTGGCAGAACGTGGCCCCGGCAGCGCCCAAGGTCGTGATGCACGACGTG
TACCCTATAGTGTACCCCAGCGTGTACCCTAACGTGTACCCACCCTACAACATAGTGTAC
CCTCAGGTCCTGCCGCAATACCCCATCCAGCAGTTCCAAGATAGACGGAAGAAGAGGTAT
GATAGAGAGGCGGAGATAGCTATGAGGATACAACAGATAAAACAACAGGTGGATATGATG
GATGGAGACAGGGGACGGAACAGCGGAGGAGGTATACTGGGCAATGCTCCGGGGAGAGTC
GGGGCCGACGACAGCCAGCTGACCACAGCGGCGCGGGCCATCGTCAACTCCATAAGGAAC
ATGCAGGCCAAGAACCCATACCACGAGTCAAGACGCTCTCCCCCGCGCCCGGAACAGAGG
AGGAGAGCCGCGGGCCCGGTGTATAGACAGATGTCTCCAGGCGCCTGGTGCCGCTCTCCA
GCTCCGATGCCGCAAACCTTCAGCCAACCTCGCCGGCCACACCCGGACAACAGGAACGCT
CGTCGGGCTCCCGACTGGCGCCGGTCGTGCGGACGAGCCGTCACCTCCGCGTTGACAGCT
GTCAAACGTCCGGCTGACGTTGACAGCTGTCAGTGA
Protein sequence:
MEQPEVEKGCGSSEYGAQLTRNRSFKSKQLVRSQAIREATSPPRTASPLASDKPNVTGNA
VEKRSNSETSSHSDGGSVVSNDSKQPVEIQITSNSWEDGNSDQRQRTRRWLGHHHSDSAR
DLLTHGPRVACVCGACECPHCRGRRRKHACPTKQDSGIVCSDDCPDCTDSEHAGPNGDGA
RMSGSLDSDDQAYYCRCIDRKDKPKSMCIGGNEFEDKTDLSGPELVAFIKETLNKNPRDR
ATLLRIEKELHGLVTDNSRCIVRFPVMTSYGRMLVHRCAALFQLAHHLDHSNKNSVLVSK
SGTCGGRLPCTSFREWCTTVFPRSPTHEDTLAKSILKRCSGPPGTASSAAGRSKSLEQRE
REYERVRRRIFSTDNCTQDETQWPWLTSGPVKLLTPDTGRNKLLKVHSLEAKSPGRGVVS
KSHSFGGYTDPQQRVLSRQGDLASSSWRLSPSSSGYKTLSLRSTDSVTPSPTGGASPEPG
PPSLCVPGTSGALVWAVTDMAAVPPGALVIHPQTGRPLTNPDGSLYHFDPENPPRLYADR
GEVDGSAEKRRGKLEKQNSFIDNECDFDSKRDKRCDCAPDNDGGQRKPKTPAAASPNNTR
TAHDEQAAPAAPAPNGDVDQSEVAEIKQALENIKITQKSPVKEKKDVQIEPVNQIQSPRY
DAANQVASSPRFESPASQTASNQPQRFETANQMQQIQRFDSPANHVQAVQRFDSPANNRQ
YDRYDVPSKALENRNFDNQRKFLEEAYHESYVPYKSEEAPQMMVNYPSEMADMQVVQAKM
TPVPVQDNMRGVPVPSHPGYYQPVQNYQYVPYRVEQPVQHLQPQLYQQMTEDQKQMTPSP
HSENTFRIDPSYPYLADFNNACGACDPTQARGYNVQYQPESVMYPNVLMQPVQQYPYQEQ
MQVPWQNVAPAAPKVVMHDVYPIVYPSVYPNVYPPYNIVYPQVLPQYPIQQFQDRRKKRY
DREAEIAMRIQQIKQQVDMMDGDRGRNSGGGILGNAPGRVGADDSQLTTAARAIVNSIRN
MQAKNPYHESRRSPPRPEQRRRAAGPVYRQMSPGAWCRSPAPMPQTFSQPRRPHPDNRNA
RRAPDWRRSCGRAVTSALTAVKRPADVDSCQ