Monarch geneset OGS2.0

DPOGS209712
TranscriptDPOGS209712-TA1449 bp
ProteinDPOGS209712-PA482 aa
Genomic positionDPSCF300105 - 376350-380399
RNAseq coverage8x (Rank: top 86%)
Annotation
HeliconiusHMEL0080185e-5830.06% 
BombyxBGIBMGA008923-TA0.060.24% 
DrosophilaCG43129-PD2e-9539.34% 
EBI UniRef50UniRef50_E2BYA12e-11142.05%Protein msta, isoform B n=9 Tax=Endopterygota RepID=E2BYA1_HARSA
NCBI RefSeqXP_396314.27e-12144.80%PREDICTED: similar to CG17086-PA [Apis mellifera]
NCBI nr blastpgi|665214641e-11944.80%PREDICTED: protein msta, isoform B-like [Apis mellifera]
NCBI nr blastxgi|665214642e-11844.87%PREDICTED: protein msta, isoform B-like [Apis mellifera]
Group
KEGG pathway 
Orthology groupMCL12869 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209712-TA
ATGGCTGACAATTGCCGTTATGATGTTAAAAGAAATGTAAAACTGGGCCGATATTTAGTTGCTAATGCTGAACTCGGATGTGGAGATTTAATTTTTACAGATTACCCCTTTGCTGTGGGCCCGAAACCAGGCAAGTATACACCCCCACTATGTTTAAGTTGTTACTGCCCGATAGAGAGCAAATATTGCTCGAGATGTAGCTGGCCAATATGCAGTGCTGAGTGTGAGCTATCTCCTAATCATCAACCAGAATGTTCGGTATTTTCAAAAGCAAAAATCAAATTTCAGCCAGTAGAGGACTGGACTGTTAGCGCACCTCAATTGGATTGTGTAACACCCTTAAGGTTACTGGTAGCTAAGGAACAAGACCCAGATCGATGGAGTAAAGAAGTACAAGCAATGGAAACGCATACAGAGCAGCGCAGGAAACGTTCCACGTGGAAAGCTGATCAAATAAATATTGTCAATTATCTCGTAGATCACTGCAAACTTAACTGCAGATTCTCTAAAGAACTGGTCGAACAAGTGTGCGCATCTCGTGGAGGATTCAGCATACGCGCTGTTTATCCTCGACTAGCCATCGCCGCCCACAGCTGTGTTCCAAACATCGTACATAGTATATTACAACCAGATTACCGTGTAGAAGTAAGAGCAGCCGTGCCGTTGCAAAAAGGACAGAGATTACACCTAAGTTACACTCACGTGTTATCTCCTACTATTCTCCGTCGTGAGCATCTCCGGGAGTCCAAGTTCTTCGACTGCGACTGTCCTCGCTGTACCGACCCGACTGAACTGGGCACACACTTGAGCACTTTTAAATGCAGTAAATGTAAAAAAGGAATCGTATTGTCTAAAAATCCTTTAGACAAAGAAGCGTCATGGAATTGCTCAGAAAATGACTGTGATTTTCGGACATCGAGCGCCGTTATGCACAAGCTCTTGTCCGACTTACAAGATGAATTGGACTCGCTGGACTCCTTGGATACGGCGTCTGAAGCTGTAGAGCAGCGAGAGGCTTTTATTAATAAGTACCAGTTGATATTACACCACCGGCACTCTTTCATGTTGTGTGTGAAGCACGCGTTGGTCCAATTGTACGGTCGTATCGAAGATCGCGGAATTGAAGAACATGTCATGTCGAAAAGGAAGGTTGAGCTCGGCAAACAGGTTCTTCAGACTCTGGATGTGATAACACCCGGGGAAAGTAGAATGAGAGGTATGTTGCTGTACGACCTACATACTCCCTTGATGAATCTGGCCAGAAGTGATTTTCGTGCTGGTGTCATTACAAAGGAGAAACTAAAAGAAAAATTGAAAGTGTCTCTGCAATGTTTGACTGATGCTGCGAGAATATTATGCAGAGAGGATGACCAAAGTCCTGAAGGAATTACAGGGAAAATAGCATTTCAATCTATGGAACAATTACAAGCCAGTATTGCAATCCTATGA

Protein sequence:

>DPOGS209712-PA
MADNCRYDVKRNVKLGRYLVANAELGCGDLIFTDYPFAVGPKPGKYTPPLCLSCYCPIESKYCSRCSWPICSAECELSPNHQPECSVFSKAKIKFQPVEDWTVSAPQLDCVTPLRLLVAKEQDPDRWSKEVQAMETHTEQRRKRSTWKADQINIVNYLVDHCKLNCRFSKELVEQVCASRGGFSIRAVYPRLAIAAHSCVPNIVHSILQPDYRVEVRAAVPLQKGQRLHLSYTHVLSPTILRREHLRESKFFDCDCPRCTDPTELGTHLSTFKCSKCKKGIVLSKNPLDKEASWNCSENDCDFRTSSAVMHKLLSDLQDELDSLDSLDTASEAVEQREAFINKYQLILHHRHSFMLCVKHALVQLYGRIEDRGIEEHVMSKRKVELGKQVLQTLDVITPGESRMRGMLLYDLHTPLMNLARSDFRAGVITKEKLKEKLKVSLQCLTDAARILCREDDQSPEGITGKIAFQSMEQLQASIAIL-