Monarch geneset OGS2.0

DPOGS200057
TranscriptDPOGS200057-TA1425 bp
ProteinDPOGS200057-PA474 aa
Genomic positionDPSCF300044 - 1026069-1035366
RNAseq coverage2571x (Rank: top 5%)
Annotation
HeliconiusHMEL0161623e-16775.89% 
BombyxBGIBMGA002509-TA9e-8056.94% 
Drosophilatmod-PN4e-15071.92% 
EBI UniRef50UniRef50_A8JRH33e-14871.92%Tropomodulin, isoform G n=59 Tax=Pancrustacea RepID=A8JRH3_DROME
NCBI RefSeqXP_970292.22e-16278.65%PREDICTED: similar to GA13696-PA [Tribolium castaneum]
NCBI nr blastpgi|2700010202e-16276.84%hypothetical protein TcasGA2_TC011298 [Tribolium castaneum]
NCBI nr blastxgi|2700010206e-16577.26%hypothetical protein TcasGA2_TC011298 [Tribolium castaneum]
Group
Gene OntologyGO:00058561.4e-147cytoskeleton
GO:00055231.4e-147tropomyosin binding
KEGG pathway 
InterPro domain[20-363] IPR0049341.4e-147Tropomodulin
Orthology groupMCL13180 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200057-TA
ATGAAAAAATACTGTAGTATCAAGAAGTCGTCCAGTACAAAGACGACAACAATGACGACGCCGGCCAAGCTTTATGGACGCGAGCTGTCGGCGTACGATGAGGTTGACGTTGATGAGTTGCTGTCCAAGTTATCACAAGAGGAACTCACCATGTTGGCTAAAGAAGTTGATCCAGATGACAACTTTCTCCCTCCCTCGCAGCGGAATAACTATGCATGCGAGAAGGACCCCACCGGCCCCCTCAACAGGAAGAAATTAATCGAACACATAAACAAGCAAGCCTTAGAAACACCCGATAGGCCGGAAGTCAAACCATATGTCCCCGGTGTTGTAAGGGGTAAAAAGTGGATTCCACCACCGCCCTCCGAGAAAGTCCGTGATGCTGAAGAGCAAATCACTATAGATCTGGGTGATGAATATGAACAGGCACTCAGCACCGCCTCCCAGGAGGAGATTATTGATCTTGCAGCTATCTTGGGCTTCCACTCTATGATGAACCAAGACCAGTACCACGCCTCTCTACTGAACAAAGGCCAGCCGGTTGGGCTCGGCTGGGACGGTATAACAAAAGCTTCTAAACCCAAGGTGTACCCAATGGATCCTCCCAACGACACAGACCCAGATAAGACCATCGACAGGGTCAAACAGAATGATCAGACGTTCCTCGATCTCAATTGGAATAATATTAAGAATATAAGTGACGAGAAATTCGAAAAACTATTTGAGAGTCTCAAAACTAATACGCATTTGGAGGTATTGTCACTAGTGAATGTCGGCCTCAACGACCGTACCGCCCAACTGTTGGCCGATGCTTTGGAAGTCAACAGTACGCTGCGTGTGGTTAACGTGGAGACGAACTTTATAAGCCCGGCGGGAGTGGTTCAGCTGGTGAAGGCTCTGCTCACTACCACCAGCGTTGAAGAGTTCCGCGCTTCGAACCAGCGGTCGCAAGTCCTGGGCAACAAGACGGAGATGGAGATCACGCGGCTGGTGGAGCAGAATCCCACGCTGCTGCGGCTTGGGTTGCATCTCGAGTACAGCGACGCTCGCCACCGGGTCGCTTCTCACCTGCAGAGGAACATCGACAGAAATTGCCGATTACAAAAGCGGGCTAGCGTATCGCTAAGCTTGCGCTTGCCGCGCGGGCGTCCGCCGCCGGTCGGAGTCGACTCTAGCTTCTTCAACCTAACGCCCCCTAGCGCTGAGACTCCAACGCCGACGAGCGAGCGAAGAATTAACGATAAACCTATAATGGAAACAGATAATGAGAACGGTTTCCCTAATGCGAGGACGCCAGATATAGACCCCGCTACCCACTTGTGGAAGGGTCCGGCGCGTGAGCCAGTTCACTCGCAGCCCCAGCTACGTTGTTGGCTGGATCCATTAGTCCATCAATATTCTGTCACGCTCTTTGTCACTGTGTAG

Protein sequence:

>DPOGS200057-PA
MKKYCSIKKSSSTKTTTMTTPAKLYGRELSAYDEVDVDELLSKLSQEELTMLAKEVDPDDNFLPPSQRNNYACEKDPTGPLNRKKLIEHINKQALETPDRPEVKPYVPGVVRGKKWIPPPPSEKVRDAEEQITIDLGDEYEQALSTASQEEIIDLAAILGFHSMMNQDQYHASLLNKGQPVGLGWDGITKASKPKVYPMDPPNDTDPDKTIDRVKQNDQTFLDLNWNNIKNISDEKFEKLFESLKTNTHLEVLSLVNVGLNDRTAQLLADALEVNSTLRVVNVETNFISPAGVVQLVKALLTTTSVEEFRASNQRSQVLGNKTEMEITRLVEQNPTLLRLGLHLEYSDARHRVASHLQRNIDRNCRLQKRASVSLSLRLPRGRPPPVGVDSSFFNLTPPSAETPTPTSERRINDKPIMETDNENGFPNARTPDIDPATHLWKGPAREPVHSQPQLRCWLDPLVHQYSVTLFVTV-