New model in OGS2.0 | DPOGS211604  |
---|---|
Genomic Position | scaffold1245:+ 10807-27845 |
See gene structure | |
CDS Length | 3351 |
Paired RNAseq reads   | 840 |
Single RNAseq reads   | 2195 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008219 (9e-125) |
Best Drosophila hit   | ND |
Best Human hit | ND |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC007953 [Tribolium castaneum] (4e-101) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC007953 [Tribolium castaneum] (3e-50) |
GeneOntology terms   | ND |
InterPro families    | IPR021129 Sterile alpha motif, type 1 IPR001374 Single-stranded nucleic acid binding R3H |
Orthology group | MCL16011 |
Nucleotide sequence:
ATGGCCAGGACTGTAAGGAAACTAAAACGAAGTAAAGGGAAAGAGGCAAAACAGCTCCCG
GCCATAATAATAACCCCGCCCGAGTTAGACATGAAATTCAAATCTACCAACTACAAGGAA
AACTCATCCACAACCGACATGGAGTTCAGTGTTCACGATCTGATAGCTCAGCTCTCCGAG
AACTCGAAGGCATCGAAGGAGGAAATAGAGAGCATACAGAGGAAGCTGCTCCATCAGGCC
AGCGAGATCCTGAAAGTGGACAGGTTCGCCAACCAACCGTCTCCAGACCCACGTGTCGTG
AGCGCCACCCAAAAATACAACGGGCCGATCTACGGCAGACCCATATCGATCCGGAGACAA
TCGGTGATCAATCAAGACGCTCAATTAGTGCCCGAGAAATTACAAGAGAATGTCAGCGCG
GTAAACAAGCATCAGGTCAACGTGGTCGGCGCGGGAAGAGGAGATGTCTTTAACGGAACG
GGTTCTGAGGCTAGAGTCGCCAGTGCGCGCGACGCGGCCGTCCGAGGTACCTCAGAGCCG
GTGCCTCCGTACAGAATGCCGCCAGCGCCCGAGGCGGCTCTACCCGGTGCCCCCGCGCCC
CCCGTCCAAGGCATACACACGCACGCCAAGTTTCCCATAGAGCGAGATGTAATTCTGTCG
TCTGGAGATAAGAATTCAAAAGTGCCGATATTACAAGAGGCTAGAAAGAGAGGGAGGCTG
GGAGCAGTGGCTCCCGATACACCCCCCAAAGCTCTGTCAGCGTGGACTCATACTGAGAAT
CAACAAGCCGGCCCATCTGGTGACTTTAGACAATACGAACAAAACTACGGGTATGCTATG
CACCAAAACGTTAATCCGAATCAGGGTGCTGTGGCTAGAAACTTACAGAACTATGACTCA
CCAAAACCTCCTGTACCAGCCAAGAACGCTTACAAACCGGAACAAAACAATTCTCATATA
ACAGTCACAGTCGAAACCGGAAAAGATAACACAAAAGATGCTCCTAAGAGCAACAAGTCG
GTTAGCAAAACTTATCACACTTTAAAAGACATGATATCGAGTAGATTTAAAAATAAAGAC
GGGAACGATGCAGAGAAAAACAACGAGGAAGCCAGACTAAATAATAACGAAGAACGGAAA
AAGCCAGATCAAGAACCAGTAACACCACGAGAGACACCGAGGAAAGTTGAACAAGGCATT
TATGGAAGACCAATGCCACAAAATCGACCAGATATGCAATACAATCAAGGCATGCCAAAC
AATATGGCATATCACAGCCCTTCCCCTCATAGACAGTTAATTCACCAGCAACAGCAAATA
GTCCAACAGCAAATGATGTTGAACCATCAGGCTCGCTCTCAGGAGATGTTGGCTCACCGA
CCACAGGCTTTGGGTGCAGACGCTCTTTACCAATACGGGCCGCCAGGAAGACGTAGTGCT
GTTTATCAAAGGGAAGATTTGAGGTCTTTGGCAAATTTCACATCGCTGAAAACAACTCCA
CAACCACAATTCGAAAATTCACATTCGAGGCTTGATCTCAGAAGTCCACAACAATTAGAA
AGGGATATTGGACGGCAAAGAGGATTGGGTGAAGGAAGGCGAGCAGCGTCACATCCACAT
CTTCTAGAAGAGATACAACATCGACAAGAAATAGTCAGCCCCCAAATTCATAATCGATCT
AGGAGAAATTCCCAAGCAAACTTGTTAGATGGAATATCTCATGAAAATGATCCCATAAGA
AATAACGAAGAGCGGGAGTCTGATGACGGTGGTTTTAGACTGAGGCATGCTACCCAAAGT
AGGTTAAGTTATGAAGAAAGAATTCGAACCAGTGGTCGGTCCTTAGAATCACACCACGAA
AGATCTCATGAAATGTATCGAAGGACACCCGATAGCCATAAAGAATCAAGAAGAACGCCC
GATTCCTTAAGCCTAAGACAAAAAGATGAACCTTCAACATCCAGAGAAATAGAGAGGAAT
GACGAAAGTGCGAGTCAAAAATCAGCAGACAGTGTGTATAACTCCAGTGGAAAAGCCGAG
GCGTACACTCCCCAACCGTCTTCCTCAAGACAAACACCGAGTAGGATCGAAGACTTAAAG
GCTCATGGAAAGAAAGGACCCAGTGGATCAGGAGCCAGTTCGGATTATGATAAAACCGGC
GGTCAATCTTCCAACGTGGATTCAGGTCGTGGGAGCGCTGCGAACTCGAGCGGGAGACGC
GCAGAGACCACACGAGCACCTCCGCATGATGCCACAGCTGCACCAGAAAACGAATGGGCA
GATTTAGTGGAATGCGAGTTGCGTCAAATCCTGGAGCCGAAGCTCTCCAGCATGAGGTTG
GACAGCTCGGCCAGTTCGGATGGATCGGTCACGCCTCCACTACCACCGCTGTCTCCATCT
TCAGATCTTCACAAACGGAACAGTCTTCCCGGCCGTGTTGAGTATTCTGACGATCGACGT
CGCCGCGAGTCCCCTCGCTGGCCCTCTCACTCACACTCGCACTCACACAAGAAATCGTCA
AAAAGAGATCATCATTACAAGAAGCACTCCTTTGGCCCTGACACAACGGACGTCACTTCA
ACGACGACACGCAGTCTGGATCTGTCTTCCTTGTTAGATGCAAGAACAGACAGCGACGCA
TCCACAGATGCACGCGCCATACGAAGGCAGCTCCGAGGACTGGAGAACATGTACGGGGAG
GTGCTGCAGTTGTTGGGGGTCAGGAAACCAGCTGGAAAGAACTCCTGGGAGGCACGGTTA
ACTTCCAAGCGTCGTTATGGCAGCATGTCCTCGCTGCCGTCCAGCTCCGTCAGCAGTCGA
CCTGTCAGGGATAAACGAAGGTCATCCAACGAACATCGGAAGAAGAATGATTATAAGGGC
ATCAACAAGCGCTTCCAGCGGCTTGAATCCCACGTGGTGACACTGGCTCGGTCGGTGGCG
CACTTGTCGTCCGAGATGAGAACACACCACTTGGTGCTGCAAGAGATGGACACCATCCGC
GCCGAACTGGCCGCCCTCAGGCACATGTACAGATCTGGCGCCCCAAGTCGAAGACGCACT
TCAGGGTTCAGTGACCCCGAGCGTGTGAAACGTCTCACCAAATTCTTTGGAGATGAACCA
CCGCTCATGAGACTGTTCCTCAAGAAACTTGGATACGAGAAATATGCAGCTCTTCTTGAA
AAGGAGAAGGTGGGCGCGGCGGAACTGCCCTACGTCGGGGAGGACAAACTCAGAGCCCTA
GGAGTTCCATTAGGTCCTAGGATGAGAATACTCAAAGAGGCTGGGATCCATCAGGACCTA
CATTTATCTAGAGATGATCATAACACAACGACTACTTTGGCTATAGTGTAA
Protein sequence:
MARTVRKLKRSKGKEAKQLPAIIITPPELDMKFKSTNYKENSSTTDMEFSVHDLIAQLSE
NSKASKEEIESIQRKLLHQASEILKVDRFANQPSPDPRVVSATQKYNGPIYGRPISIRRQ
SVINQDAQLVPEKLQENVSAVNKHQVNVVGAGRGDVFNGTGSEARVASARDAAVRGTSEP
VPPYRMPPAPEAALPGAPAPPVQGIHTHAKFPIERDVILSSGDKNSKVPILQEARKRGRL
GAVAPDTPPKALSAWTHTENQQAGPSGDFRQYEQNYGYAMHQNVNPNQGAVARNLQNYDS
PKPPVPAKNAYKPEQNNSHITVTVETGKDNTKDAPKSNKSVSKTYHTLKDMISSRFKNKD
GNDAEKNNEEARLNNNEERKKPDQEPVTPRETPRKVEQGIYGRPMPQNRPDMQYNQGMPN
NMAYHSPSPHRQLIHQQQQIVQQQMMLNHQARSQEMLAHRPQALGADALYQYGPPGRRSA
VYQREDLRSLANFTSLKTTPQPQFENSHSRLDLRSPQQLERDIGRQRGLGEGRRAASHPH
LLEEIQHRQEIVSPQIHNRSRRNSQANLLDGISHENDPIRNNEERESDDGGFRLRHATQS
RLSYEERIRTSGRSLESHHERSHEMYRRTPDSHKESRRTPDSLSLRQKDEPSTSREIERN
DESASQKSADSVYNSSGKAEAYTPQPSSSRQTPSRIEDLKAHGKKGPSGSGASSDYDKTG
GQSSNVDSGRGSAANSSGRRAETTRAPPHDATAAPENEWADLVECELRQILEPKLSSMRL
DSSASSDGSVTPPLPPLSPSSDLHKRNSLPGRVEYSDDRRRRESPRWPSHSHSHSHKKSS
KRDHHYKKHSFGPDTTDVTSTTTRSLDLSSLLDARTDSDASTDARAIRRQLRGLENMYGE
VLQLLGVRKPAGKNSWEARLTSKRRYGSMSSLPSSSVSSRPVRDKRRSSNEHRKKNDYKG
INKRFQRLESHVVTLARSVAHLSSEMRTHHLVLQEMDTIRAELAALRHMYRSGAPSRRRT
SGFSDPERVKRLTKFFGDEPPLMRLFLKKLGYEKYAALLEKEKVGAAELPYVGEDKLRAL
GVPLGPRMRILKEAGIHQDLHLSRDDHNTTTTLAIV