New model in OGS2.0 | DPOGS214770  |
---|---|
Genomic Position | scaffold1263:+ 7789-22884 |
See gene structure | |
CDS Length | 3360 |
Paired RNAseq reads   | 1678 |
Single RNAseq reads   | 4355 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000174 (0.0) |
Best Drosophila hit   | CG5741, isoform A (2e-34) |
Best Human hit | testis-expressed sequence 2 protein (1e-29) |
Best NR hit (blastp)   | GF10141 [Drosophila ananassae] (3e-74) |
Best NR hit (blastx)   | PREDICTED: similar to LOC100101310 protein [Nasonia vitripennis] (4e-55) |
GeneOntology terms    | GO:0005575 cellular_component GO:0008150 biological_process GO:0003674 molecular_function |
InterPro families   | IPR019411 Domain of unknown function DUF2404 |
Orthology group | MCL13120 |
Nucleotide sequence:
ATGGAAGTTTCAGGCGCAGGGAAATCCCCAAATACGTCACTATCATTTAGATACAATGCT
AATAATGAGGAACTGGAGGAACTTCTTCAAGCCTGTGAGGACGACCCCCCAACACCACAA
GCTGAGCCAGCTCCGGTTAGGAGTGAAAGTGCCAGTCCTAAAAGAGCCGAGAAGAATATA
TCCATCATAGATAAATATTTCAAATCTATGCGTACAGAGAAAACAAATGAAAAGACAGAG
GATATCAAACCAGACATCAAACCGGAAGAAACTCGTTCTATAATACCTCCCAAAGAAATC
ACTTCATCCCCTATGAAAGAGTATCTAAACCGGCTTGGAAAAAGAAACACCCCAGAGGTC
GCTCCGGAGGGAAAGGAGGCTAATGAAACTTGGAAAATATTTCATGATTTTAAGTTCAAA
ATAGCTCAAGCTGTTGAAGATATGAAAACGCGTTCTGTTGAAGACACTAAGGACAAAAGT
CTACCGAGAGAAAACTCGACATCAGATTCTGAAGAGAATTCAGCTGTTAAGGACTCGGAT
CAGCAGAGTATGGGTGATACGGACAATCAAGTGTCTTCATTGGACAGCAGTATGCAGAAC
TTATCTGACATGACTCAGCCATTAACGGCCAAGCCTATGGAGTTTGAGCTGATAAAGGAA
ACGAATTTAAACAAAAGTCACTCGGAATCCAGTGATGATACGCATAAGAATTTACCATAT
CAAGAATCGGACTTAGCGAGGGAATTCTTACAAGACAGTATAGAGATCGAGTCCGGTGTT
GAAGCCCTCGAGGACACGATCGACGCTTTCGGCGACGCCAATCAGATAGAAATGGAAGAC
AATAAAAGAACAACAGAAACACCCACAAACAATGCGAACACTTCAAACTTCGGACAGCCG
CCCAACAAGAACAACATACACAAAGACGAACCAAAGAAGAATTATTTCTTTAAATTCTCA
ATGTATTTCCTAACGATATTTCTATTCGTTAACTACGTGCTATTCCCTAATTCGAATATA
TGGAACGGTTTCCTATTGGGCATTTGGTTTTTCTGTTTCGCAACCAACGTCCGCGATTGG
CTGATGGATAACTACTTCAGGGATTGGGAACCGAGGAAGGGTGGTTTCTTCCAATTGAAA
CAGAGCACAACACTACCATTTATATACACGATACCAACTATAAAGGAGCACAGGCCGTTG
AAAAAGTTTGAGGGTTGGTTAAATCATTATAGGTTTCCAGACTACGATCCCTTCACGTAT
CACATCAATAAGACAACAACGGCTTTCGTCAAACTTGAAGGTTGCAACCTACGCATATCG
TACACAAGGACAAAGGTTGCTAAGCGCGCTCTTTGGGACGAAAAGATTGAAAATGTAACT
TTCTACCAGCACCGGCTATACAACCTCACTGGTGCGAGAGTCATATTGCTGCCGAAAGGT
TTAGTTAAAAGGAGGCAATGGAGCAAGAAATATCCAATTTGCATAATATTGAACGAAAAG
GAGAAGATACAGGTGTTAGAGAAAGAAAACACGGATAAGAAATCTGATACACAGGAGAAG
AAAAATAAAGATGTACCAGCGGAAGTTGAAAGCGCTGAAACAAACACGACGCCCGAAAAG
AAGAGGAAGTTCGTGTGGAGAAAAAAGGAGAAGAGACCCCCGGCCTACACGTCATGTCAC
ACTCAGCTAAACCCCTTCCTTCCTAGCCCCTACAAAAAGATCGACAACACCTCACAGTGC
ACCGAACCCGAGACGAGTCATTCAGATAAGAACACACAACTTGAATATGAAGCGGCGGAA
ACGGAAATAGAACTGACCAAGACGGCCAACAGCAAAGATGAGGACGTGGAGCTCGATGAG
AGCGAGCTAACCAAGATCAAAGAGTGTCTAGAGGAAACTGAGTTGGAGACGGGAGCAGAT
GGTGCGGCCGAGGGGGAGTGGAGCGTCCACGTCAAACACTCAAGAGATAAGCATTCTAGA
TTGTATCTCTTCGCAAGGACCGGAAGAGATAAACTCGAATGGTACCGCCGCCTCCTAGTA
GCCGTGTCAGAGGCCCGTTCCGACTCTCCCGTTGAGGACGAACGACACACGGACGACAAG
GACCCCATCGAGCTCGCTGTATACAAGCTCACTGAGAAGGATACTGTCGCCTTCGCTAAG
TCCAAGTCCAACGAGCAGGTGTCTGAGGGGGTGACTATCAGCAGTACGGCGCAGCCGCTG
CCGTCTAACTTTGACCTATACGAGAAGTCGTTTTGGCCGTACCTACTGAAGATTATACAG
AACCACGAGACGTCCTCTAAACAGACGACGGACGCGGGCGTCATGTGTCAGATAGAGCCC
ACGCCGCCCGACAGGAGTAAGAGTAAAAAGAAGAAAAAGACAGCTTCCCAGGGCGCGGAG
TGCACGTGTCGCGTGTTGCCGGCGGAGGTGTCGTGGGTGAACACCGTGCTGGCGAGGCTC
ATGTATGACGTCATGAGGGACCCCGCCATGGTGGCCCGCGTCCAGAACAGGATACAGAGG
AAGCTTAACACGCTCAAGCTGCCGTCGTTCATGTCTCCGCTGGTGGTGACGGAGCTGGTG
CTGGCGGGCTCGTGCCCAGCCGTGTGCGGGGTGGGCTCGCCCTCGCTGGACGCGCGCGGC
CTGTGGCTGCACGCCCTGCTGCGCTACGACGGCGGCGCCACCATCACCATCCTCACACAG
ATCAACCTGCTCAAGCTTAAGGAGAAGAATCTCACCTTAGAAGATCAACTACTAGCGGCA
GCCGAAAATACAGTCGAGAGCGATGCTAGCTGCTCTATACCGAGCACGCTGCTCACAGAC
AAGAAACGCAAACCGGCGATCTACGACTCGGAGGTGGAGGATTCGGCGGAGTCCAGCAGC
GACGACGAGAGCCCGCCCGTGCAGCCCGTCGACAGCACAGAGAATGTATTGGCCGCGGAC
TCTGTATCGTCAACAAACGAAGGCGGCTCGTCCAAGAAGAAGTTCCTCCGTATGGTTGAC
AAGATAGCCACCAACAAATACTTCCAGCAGGTAACCGACTACAAGTACGTGAAGCGGGCC
ATGGAAGGTCTCAGCAACACGGACATCAAGCTGCAGCTGGAGGTGAACGGCCTGGAGGGG
AGGCTCGCCATCAACCTGCCGCCGCCGCCGCACGACCGACTGTGGATAGGGTTCTCCACC
AACCCGCAGCTGGTGCTGAAGGCGCGGCCGGCGGTCGGCGCGCGGGCGCTTCGGTTCGCT
CACATCTCCAACTGGATAGAGCAGAAGCTCACCAAGGAGTTCGAGAAGGTCCTCGTGCTG
CCCAACATGGAGGACTTCATCATAGACGTCATGTCACCCACGCCTATAGAGTTCGAGTAG
Protein sequence:
MEVSGAGKSPNTSLSFRYNANNEELEELLQACEDDPPTPQAEPAPVRSESASPKRAEKNI
SIIDKYFKSMRTEKTNEKTEDIKPDIKPEETRSIIPPKEITSSPMKEYLNRLGKRNTPEV
APEGKEANETWKIFHDFKFKIAQAVEDMKTRSVEDTKDKSLPRENSTSDSEENSAVKDSD
QQSMGDTDNQVSSLDSSMQNLSDMTQPLTAKPMEFELIKETNLNKSHSESSDDTHKNLPY
QESDLAREFLQDSIEIESGVEALEDTIDAFGDANQIEMEDNKRTTETPTNNANTSNFGQP
PNKNNIHKDEPKKNYFFKFSMYFLTIFLFVNYVLFPNSNIWNGFLLGIWFFCFATNVRDW
LMDNYFRDWEPRKGGFFQLKQSTTLPFIYTIPTIKEHRPLKKFEGWLNHYRFPDYDPFTY
HINKTTTAFVKLEGCNLRISYTRTKVAKRALWDEKIENVTFYQHRLYNLTGARVILLPKG
LVKRRQWSKKYPICIILNEKEKIQVLEKENTDKKSDTQEKKNKDVPAEVESAETNTTPEK
KRKFVWRKKEKRPPAYTSCHTQLNPFLPSPYKKIDNTSQCTEPETSHSDKNTQLEYEAAE
TEIELTKTANSKDEDVELDESELTKIKECLEETELETGADGAAEGEWSVHVKHSRDKHSR
LYLFARTGRDKLEWYRRLLVAVSEARSDSPVEDERHTDDKDPIELAVYKLTEKDTVAFAK
SKSNEQVSEGVTISSTAQPLPSNFDLYEKSFWPYLLKIIQNHETSSKQTTDAGVMCQIEP
TPPDRSKSKKKKKTASQGAECTCRVLPAEVSWVNTVLARLMYDVMRDPAMVARVQNRIQR
KLNTLKLPSFMSPLVVTELVLAGSCPAVCGVGSPSLDARGLWLHALLRYDGGATITILTQ
INLLKLKEKNLTLEDQLLAAAENTVESDASCSIPSTLLTDKKRKPAIYDSEVEDSAESSS
DDESPPVQPVDSTENVLAADSVSSTNEGGSSKKKFLRMVDKIATNKYFQQVTDYKYVKRA
MEGLSNTDIKLQLEVNGLEGRLAINLPPPPHDRLWIGFSTNPQLVLKARPAVGARALRFA
HISNWIEQKLTKEFEKVLVLPNMEDFIIDVMSPTPIEFE