New model in OGS2.0 | DPOGS211404  |
---|---|
Genomic Position | scaffold881:+ 72501-102272 |
CDS Length | 3696 |
Paired RNAseq reads   | 633 |
Single RNAseq reads   | 1708 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA004677 (2e-46) |
Best Drosophila hit   | ND |
Best Human hit | ND |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC012889 [Tribolium castaneum] (1e-92) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC012889 [Tribolium castaneum] (3e-72) |
GeneOntology terms   | ND |
InterPro families   | ND |
Orthology group | MCL18534 |
Nucleotide sequence:
ATGACTGAAATTATATCAAACGCGTTCTGTCTCGCGTTGTTCTTGGGTTTGGGGGTTTCA
ACACCTCAAGGAAGTAGGTACAACGAACCGATTCTACAAAATGAAAGATATAGTGAATTC
AATGCAGCACGAAAGTTTGATGAACCCGCGGGGGCGTACGATGTTAAATATAAGCCGAAT
GGGGTAATGGCGCAGGAAGCTCAAAGGGATCTATTATTTCAAAATGAAGGCTATAATCAA
AATACCTTTCAAGGCAATAGAGAAGAAATTTCGCTGAATACCCGGCATGTGCCTCTCAAT
CCATCTCTAACGACTCAAGAATATCAGCGAGAACCAGTTCTAAACAAATATGAAATAAGT
GCCATCAACAACGAACAAAGGATATTGCATAAATATCCAGAAGATTTCAAAAATATCAAC
ATACAAAGGAATATAATGCCTCGGGATAAAGAGGAGCTTATCAATGAGGAAAGGAATATA
AAGAAAGGAAAAGATCTCGCACCTGCAGTGCTATATAAACCTTCCTTGTCTCAAGTAAAC
AGAATTCAAAACTTTCGACCAAACTATGAAGAAAACCGTGAAATCAATGTTCCTGGCATC
GGTGCCATGATGTCAGAATCAAATAACAACAGAAATGTTATTTTATCAAAAGTAAATTCT
ATTTCTCAAATAAACCATATCAATGAGAGACTCAATTTGCCAGTTGAGCGTTTAGCCAAT
AGCATGGACCAAAATCGTGACGAAGACGGAATACAAATAGACGTACCAATAGCAACACCT
GAGTACATAAATTCAATTTTGAATAACTACAAACCCATGACACAAAGAAACTTAGTCTCC
CTTAATGGACAAGCACGCGTGGGAAATATTTTAGGCAACCCTCAAAGATTAAAAGATATT
TATATATCTTCAAAACCATCGAAGCGTTACATGGTTGTGTATCCAGATGGTAGAGTGGAG
CAAGTAGATGAGTTGGAAAAAATCCAGAAAAATTATCCAAATTATGTAATGCTTCAAGCT
AAAGAACTTTTGCAGAAAAGTCAAGTGAACCCGGTTATAACTCCTGAAATACGAGGCTCT
GATAAGCCAGTTCTCTCTACTATTTCGCCGATACTACCAATAAATCCCTTAACGGATAGC
AAAACAAATCAATTAAATAAAATTGATTCTCAAACTGTAAAAGATCAAAAAGTACCTGTC
ACTACCGACACTGTCATTGGTCCGTCTAAAGATGCTGAAGATCTGTTGTCAGATTTGAAG
CTCGAGAAGGCTGAAATAGAACCCATTAATCCAGTTCAGCAAGTCTCTTCTCATAATGAG
AACGTAATAAGTACACCTGAAATTAGTCAAGGACAAAACATTTTGGATAAAAATGAATAC
CCAAATCCTGGCCATAATGAAATAATTCCTACAACAGAAATCCCCTCAAAGGAAGAAATA
GTTACAACTTCTAAAAACACGGTATCAGTTGAACCGGACACTCCTGGGAATAATCAAGTT
TCTGATAATCAAGATCAACACAATAGATCCTCGCTAGATAACAGTCGTGAGATAGATCAA
TTGCCTACAAAGTCGGAAAAAGTAACAGATAACCATATAAAACCTAATAAAAGCACAGAA
TCAAATGAAAATTATGAGGAGGGTAAAACTACAGAAACCTCTAATCCTGAAATACGTTCC
CCTAACGAAGACACAGAATTATCTAATCAAAGTTTAGGTTTAGATATAAGCAGTTCCCCA
ACGCCACCCGAACTATCGCAAAATCCTGAAGCTAGTAAGACAAACGTATCAAATGAGACA
GTAACCGCTGATGAAGACAAGGATTCAAATGGGAAAATACCTAGCACAGAAGTTGAAAAG
AAATATGATGATGAAAACGTAGATAGCATAGCACCGGGTACGGAATCGGATATAAATTCT
ACTACCAGTGAAGCTACTTCACTATTTTCTCCAGTGAACCCTGAAACAACGGATCCCACG
ATACACAGAACAACGGTCGAAAATGAAAGTGAAAACATAACCAATTATACACCACTTTTA
TCAAATGGAGGAGCACTTATAGATGGAGAGGAAGCCTCGAATTCAACTCAACCGGAATCA
ATTAGTTCAATCTCCAGTCAGCGAGATAATAATAACACTTCCAAAGTCCAGACTTCTGAT
GTGTATGAGGATGCTTCTTATAAGGATGCGTCTCCCTCTCGTAATAATTCTGAAAGTATA
CCCCTTGAAGACGACGTTTCACGAGAGGAATCAGATTTAAATAAAAAGCCGAGCTTAAGT
CTTAAACAGAATGTATCCAATTCTGAGGAGGAATTGAACCCTATATCAAAGGAAAGTGTG
AAAGAAAATTCAAGTATAGCGATCGACGACGTCCAAGACACAGTTATAAATTCAAGTCCA
GACAGTGGACTGTCCGCTTCAAATAATGAAGAGAGTTCAGTACCAATAGCTACTTCTTAT
GATTCATACTACCAGAATTCTACCGACAAGAATGTTGAAGAGAGACGCTATGAAGATGAA
GACGATAAAGAAGAAACGTCAAATGATAAACTTCAAACAGACGTGGCCACACCACCAGCT
AGTGCTGAAAGTACAGACAGTGAACCATCGTGGTTTAACTTCAATGAGAGTCTGTTAAAA
AAACTATTGAGTAACGATGACTTACACAATTCATTCTATTTCCTGTTCCCGCGACCGAAA
CCTAATATAAAAGAGACGAAAGAGACAATTTATCCGAACGGAACCGTCGTTATTGAGACG
ACTCAGACGATTGATGCTGACGAGTGCAATGGACGGGCGGAGGGAGGTGGGTTGCCGGAG
GTGGAGCTTACTAAGGAATATAATCATATGCTGTTGCTGAGAACTGACGCCGTCAACAAG
ATGTCGGCAGCGGGCTCCGCGGTGGCGATGATCGGCGCCAACCTGCGCTTCGGTTCAGCG
GGCGCTTCCAACTCCTCTGTGGTTAACAACACGAGGCTTTGCCCTCCAGGGGGAGCAGCT
AGCGCCGCCGCTCTCCTCGCCCTCTCAGGCCTCGGTATGACCGCTAACATCGCGCTCATG
GCCGTCATACTGAGCAAGAAGCAACTGAGACGGTGGTCTCACGGCTTACTCTTCCACCAG
GCGATGGTAGACTGTGCCCGCGCCGCCATCCTTCTACCTCTGGGTATAGCGGTGTTCCGA
TGTCAACCAGTCTATAAATGTTCGCTTGTGGAAACTGCGTTTTTATTACTTGTAACCGTC
TCAACGGTCAACATGCTCACGACAGTTCTCAACGACAGTCCCATTTTCCCCGAAAACGAA
GAGGAGCAGGCTGATTTATCAGCGCCTTTATTAATGGATAGTCCACAATGCGTCCTTTTC
GGAACATTTATGATATGGTTTGCGTCAATTACCATTAACCTCGGACCCACTTTCTTATCG
GGAGCGTTGGCCGCCAGTGCGGGCTCTTACGGCTCCTATGGACCGTCACCGTCCTGCCCC
CTGGTTCGAGGACCGTTCAGACATTACGTTCTCAACGCCCTCTGGATAGGAGTTAATGCC
GTTTGCGTCGGATTGACATTATTCCATTTACGAAAGCTCCATCGGGACCTCACAAAGCCG
TTTGAGGTCGTTGTTCGCGAGGAGGCTGATGAAAGTAATTCCAACTCGTCGAAGCTCGCC
GGTGATTTGCAATGCAAGAATACTGAATATGCATAA
Protein sequence:
MTEIISNAFCLALFLGLGVSTPQGSRYNEPILQNERYSEFNAARKFDEPAGAYDVKYKPN
GVMAQEAQRDLLFQNEGYNQNTFQGNREEISLNTRHVPLNPSLTTQEYQREPVLNKYEIS
AINNEQRILHKYPEDFKNINIQRNIMPRDKEELINEERNIKKGKDLAPAVLYKPSLSQVN
RIQNFRPNYEENREINVPGIGAMMSESNNNRNVILSKVNSISQINHINERLNLPVERLAN
SMDQNRDEDGIQIDVPIATPEYINSILNNYKPMTQRNLVSLNGQARVGNILGNPQRLKDI
YISSKPSKRYMVVYPDGRVEQVDELEKIQKNYPNYVMLQAKELLQKSQVNPVITPEIRGS
DKPVLSTISPILPINPLTDSKTNQLNKIDSQTVKDQKVPVTTDTVIGPSKDAEDLLSDLK
LEKAEIEPINPVQQVSSHNENVISTPEISQGQNILDKNEYPNPGHNEIIPTTEIPSKEEI
VTTSKNTVSVEPDTPGNNQVSDNQDQHNRSSLDNSREIDQLPTKSEKVTDNHIKPNKSTE
SNENYEEGKTTETSNPEIRSPNEDTELSNQSLGLDISSSPTPPELSQNPEASKTNVSNET
VTADEDKDSNGKIPSTEVEKKYDDENVDSIAPGTESDINSTTSEATSLFSPVNPETTDPT
IHRTTVENESENITNYTPLLSNGGALIDGEEASNSTQPESISSISSQRDNNNTSKVQTSD
VYEDASYKDASPSRNNSESIPLEDDVSREESDLNKKPSLSLKQNVSNSEEELNPISKESV
KENSSIAIDDVQDTVINSSPDSGLSASNNEESSVPIATSYDSYYQNSTDKNVEERRYEDE
DDKEETSNDKLQTDVATPPASAESTDSEPSWFNFNESLLKKLLSNDDLHNSFYFLFPRPK
PNIKETKETIYPNGTVVIETTQTIDADECNGRAEGGGLPEVELTKEYNHMLLLRTDAVNK
MSAAGSAVAMIGANLRFGSAGASNSSVVNNTRLCPPGGAASAAALLALSGLGMTANIALM
AVILSKKQLRRWSHGLLFHQAMVDCARAAILLPLGIAVFRCQPVYKCSLVETAFLLLVTV
STVNMLTTVLNDSPIFPENEEEQADLSAPLLMDSPQCVLFGTFMIWFASITINLGPTFLS
GALAASAGSYGSYGPSPSCPLVRGPFRHYVLNALWIGVNAVCVGLTLFHLRKLHRDLTKP
FEVVVREEADESNSNSSKLAGDLQCKNTEYA
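
The stated CDS length (3696) and the two sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the standard nuclear genetic code (the record does not name a translation table); the helper names `translate` and `expected_protein_length` are hypothetical and not part of any database tooling. Only the first and last nucleotide lines of the record are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed; the record does not state which table applies).
# Codons are enumerated in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a nucleotide string codon by codon; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count implied by a CDS length that includes the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# First and last nucleotide lines copied from the record.
first_line = "ATGACTGAAATTATATCAAACGCGTTCTGTCTCGCGTTGTTCTTGGGTTTGGGGGTTTCA"
last_line = "GGTGATTTGCAATGCAAGAATACTGAATATGCATAA"

print(translate(first_line))          # compare with the start of the protein sequence
print(translate(last_line))           # compare with the end of the protein sequence
print(expected_protein_length(3696))  # compare with the protein sequence length
```

Translating the full nucleotide sequence the same way (all lines joined) should reproduce the protein sequence followed by a single `*` if the record is internally consistent under the assumed code.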