Monarch geneset OGS2.0

DPOGS213725
Transcript: DPOGS213725-TA; 3087 bp
Protein: DPOGS213725-PA; 1028 aa
Genomic position: DPSCF300310 + 80892-89838
RNAseq coverage: 1481x (Rank: top 9%)
Annotation
Heliconius: HMEL005047; 0.0; 76.52%
Bombyx: BGIBMGA011630-TA; 0.0; 73.67%
Drosophila: CG33232-PA; 0.0; 43.76%
EBI UniRef50: UniRef50_UPI00022C9AA7; 0.0; 43.98%; UPI00022C9AA7 related cluster n=1 Tax=unknown RepID=UPI00022C9AA7
NCBI RefSeq: XP_002427103.1; 0.0; 42.36%; Supervillin, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|350410487; 0.0; 43.98%; PREDICTED: hypothetical protein LOC100747888 [Bombus impatiens]
NCBI nr blastx: gi|350410487; 0.0; 43.92%; PREDICTED: hypothetical protein LOC100747888 [Bombus impatiens]
Group
Gene Ontology:
  GO:0003779; 3.8e-248; actin binding
  GO:0007010; 1.5e-21; cytoskeleton organization
KEGG pathway 
InterPro domain:
  [169-1028] IPR007122; 3.8e-248; Gelsolin
  [169-1028] IPR015628; 3.8e-248; Supervillin
  [965-1028] IPR003128; 1.5e-21; Villin headpiece
  [580-667] IPR007123; 1.9e-08; Gelsolin domain
Orthology group: MCL14047; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213725-TA
ATGAAAGTTCTTTTAATTCTGTCGCAGGGATGGCGGAAGCGTGTTCCCCAGAACGATGCGTCGCTGTTCACAGTCGCTGGTCGCTTGGAGCGTGACAAAGTGAACACGACCACGCCACCACTGACGCCACCGCCCGTCACCACTCCGCCTGTGTCCTCACCCGCTATCACACCAGCCATACCACCACCCAATAGGTTCAGATCTCGTAAAGCTCCAACGTCCCCTACCAATGGTTTCGCATCAGGTCCGCTCCGATCGGCGTCGTGTGCCGTGATGAGCGCCGCTGAAAAACCAAAGGAAACACCAAAAATTGATAGAGAATCGTTCAAACGAAGTCACTCTGTCAGCGAGAGTATCAGTCGAGTCGATGAAAGACAGGCTAAAGACATTATAAATACAAGACAAAAGCATTACAGTAATGCAGCGATGAATCACTTTGTTTGTGTGCGGAAGAGTTTGTCTAACGAATGGGCGAGACGAGCCAAGCGTGAACGTCGCCACTTGTCCCGCAACCCGCTACGAGCTCTGGCCGCTAGGACCGACCTCAGACAGGAATATATCGCGCAACCACCGGCAACCAACAACAAACAGGCGACCAAGGAGAAAGTGACAGCTAACTGCGGGCTGGCCGCCGAAGCGTTGGCAGCGCTCGCTACTAAAGAAGATTTCTCAAACGTGGCGCTGCGGAGTGCTTCAGCCAATAATATACCGAGCCAAGGGACGAAATCTCTCATGCTGATGCACGTTAAAGGGAGGAGACGGCTCCAGACGCGTCTTGTTGAACCCGTACATACAAACGTGAACCGAGGCGATTGCTTCGTACTCGTCACCAGCGACCAGTTGTTCCTCTATATTGGACTCTACGCCAACGTCATTGAAAGAAATCGCAGCACAGACATCGCACAACACATATACAATACAAAGGATCTGGGATGCAAAAACGCGACGGGAGTTATAAAGATAGACGAACAAACAAAGAACTACTCAAACAAACATTGGAACCAGTTCTGGTCGCTCCTCGGCGTCACAGATGGTATTGAGGAGTACAGGCCCGCTGAAACTGGTTCGCCGGACGAAGACGAAATATACGAGTCGTGTGTCATACAGACGAACATGTGCTACGAAGTCATAGACGATGAACTGGTACCGATCAAGGAATACTGGGGACAGGTACCGAAAATAGCCATGCTGCACCAGTCAAAAGTCATAGTGTTCGACTTCGGCTCGGAAATGTACATATGGTACGGGAAGAATGTCCCGCTGGAGAGTAGGAGGCGGGCCGCCCAGCTAGCGCAGGAGCTGTTCGATGAAGGTTACAATTACGAGGAGTGCCACATAAATCCAATAAACGCGGCGGCTTCTCAAGGACTCAGGAATGATACGCCGTCGAACGCCAAATCCGATAAAATTCGGCCGGAATGGACGATTTTGTCGAAAGTCACACAGCACATGGAAACCATATTGTTCAAAGAGAAATTCCTCGACTGGCCGGACTACAGTCGGGTTATAAAAGTTAAAACTCAAGAGAATAAATCGAACAGTGTTGAAATAAACCCGTGCGACGCCGAGGAGATGTGGTCCAATGAGTACCAAGACCCGGATCTCATACTGGAAGGCTCCCACATCGGCCGCGGGACACATTACTACGACAAGGACACGATGCGGCATTACGACATTAAAACCAAATCGGTTTGCAAATGGGTCATACAGGAATATGACTACCAGGAGGTGAAGAACGAGTCGGATGTCGGGGAGTTCTTCTCGGGGGATAGCTATATCATAAGATGGGAATACCAGATAACAGTTACGGGACGGGAGTTGAATGGCAAACCGTCAAAGCACAATTTAACTGGTAGAGAAAGATGTGCTTACTTCTGTTGGCAAGGCAAAGACGCATCCTCTAATGAAAAAGGTGCAGCCGCTTTGTTGACGGTTGAACTAGACAGAGAGAAAGGTCCCCAAATAAGAGTGGCCCAGGGCAATGAACCGCCGGCATTCTT
GAACCTGTTCCAAGGAAACTTAGTCATCCATCAAGGCAAGAAGGGGACAGATAAGAGCCGCTTCCGGTTATACGGAACGCGAGGAAATGTTTTAAACGAGGCGTATTTGTTGCAAGTTCCGTGTTCTGTCAGGCAACTCCGCAGTCGCGGTTCCCTCATATTAGTCGACACTGAAAAATCCTGCGTATACATATGGCACGGCTCCCGGAGCATGAAGCACACACAGCACATAGCCATCGAGTTAGCCAATAAGCTGGTTGCCCAAAAATGTCACATATTTAGCAGCTCCGACATAAAAGTGAGTTCAGTCAAGGAGGGAGAGGAGAGTAAGGAATTCCTTGATAGTTTAGGAGTCACCAGCAAACAATACTACAATTCAGTTCTAAGTGGGAGAGATGTCGGTTCGGATGTGACTCCTAGACTATTCCATTTCACGGATTTGGGAGGTCAGTTCGAAGCGCACGAGGTGTTGTCACCGCTTCGGCACGAGACACTAGTGACACCCTTCCCTTTTGAACAAAAGGAGCTGTATTCCGCTTCTCAACCTGCCCTGTTCCTGATGGACGACGGTGTTTGCGTGTGGCTGTGGCAGGGCTGGTGGCCGAGAGGGGAGGACGGGGAGTTCGAGCCCGAACGGAACACAGGTGTGGGAGCGTTCGCCGCTCGCTGGCAGGCGATGCGGGTGGCGGCTCTACGTACCGCTGAGCAGTACTGGCGCGTGTCGCGCGAGGGTAGGCCGGACGTGCGAGTGGTGGCCGCGGGCCTCGAACCTAAGGCCTTCATGGACCTGTTCGACACGTGGACGGATCACGACGAGGCCGCGGATGCCAATATTGCTCACGGGTACAAGGCTGGTGAGTGCGTGTCGGGGACGGTGGAGTTGTCGCGTCTGTCTAGTTGTGACGCAGTGTTGCCGCTCGCAGCACTCCAGAGACGGCCGCTGCCCGACCACGTGGACCCCCACCACCTCGAGAGACACCTGTCCTCGCCCGACTTCCTGGAGGCTTTTGGAATGACCAAAGAAGAATTTTCAGTGTTACCCGCTTGGAAACAGACCAACATGAAGAAGGACGTGGGATTGTTCTGA

Protein sequence:

>DPOGS213725-PA
MKVLLILSQGWRKRVPQNDASLFTVAGRLERDKVNTTTPPLTPPPVTTPPVSSPAITPAIPPPNRFRSRKAPTSPTNGFASGPLRSASCAVMSAAEKPKETPKIDRESFKRSHSVSESISRVDERQAKDIINTRQKHYSNAAMNHFVCVRKSLSNEWARRAKRERRHLSRNPLRALAARTDLRQEYIAQPPATNNKQATKEKVTANCGLAAEALAALATKEDFSNVALRSASANNIPSQGTKSLMLMHVKGRRRLQTRLVEPVHTNVNRGDCFVLVTSDQLFLYIGLYANVIERNRSTDIAQHIYNTKDLGCKNATGVIKIDEQTKNYSNKHWNQFWSLLGVTDGIEEYRPAETGSPDEDEIYESCVIQTNMCYEVIDDELVPIKEYWGQVPKIAMLHQSKVIVFDFGSEMYIWYGKNVPLESRRRAAQLAQELFDEGYNYEECHINPINAAASQGLRNDTPSNAKSDKIRPEWTILSKVTQHMETILFKEKFLDWPDYSRVIKVKTQENKSNSVEINPCDAEEMWSNEYQDPDLILEGSHIGRGTHYYDKDTMRHYDIKTKSVCKWVIQEYDYQEVKNESDVGEFFSGDSYIIRWEYQITVTGRELNGKPSKHNLTGRERCAYFCWQGKDASSNEKGAAALLTVELDREKGPQIRVAQGNEPPAFLNLFQGNLVIHQGKKGTDKSRFRLYGTRGNVLNEAYLLQVPCSVRQLRSRGSLILVDTEKSCVYIWHGSRSMKHTQHIAIELANKLVAQKCHIFSSSDIKVSSVKEGEESKEFLDSLGVTSKQYYNSVLSGRDVGSDVTPRLFHFTDLGGQFEAHEVLSPLRHETLVTPFPFEQKELYSASQPALFLMDDGVCVWLWQGWWPRGEDGEFEPERNTGVGAFAARWQAMRVAALRTAEQYWRVSREGRPDVRVVAAGLEPKAFMDLFDTWTDHDEAADANIAHGYKAGECVSGTVELSRLSSCDAVLPLAALQRRPLPDHVDPHHLERHLSSPDFLEAFGMTKEEFSVLPAWKQTNMKKDVGLF-
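
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the MonarchBase record, assuming the standard genetic code and assuming that the trailing "-" in the protein sequence marks the stop codon. The names translate, CODON_TABLE, head and tail are hypothetical and introduced only for this illustration; head and tail are the first 36 and last 15 bases of DPOGS213725-TA as listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1; stops are written as stop_symbol."""
    cds = "".join(cds.split()).upper()  # drop line breaks inside the FASTA body
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# Fragments copied from the DPOGS213725-TA record above.
head = "ATGAAAGTTCTTTTAATTCTGTCGCAGGGATGGCGG"
tail = "GTGGGATTGTTCTGA"

print(translate(head))           # MKVLLILSQGWR
print(translate(tail))           # VGLF-
print(3087 == 3 * (1028 + 1))    # True: 1028 codons plus one stop codon
```

Both fragments reproduce the start and end of DPOGS213725-PA, and the stated lengths (3087 bp, 1028 aa) agree once the stop codon is counted. Passing the full nucleotide sequence to translate should work the same way, since line breaks inside the sequence are removed before translation.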