Monarch geneset OGS2.0

DPOGS203201
Transcript: DPOGS203201-TA, 1326 bp
Protein: DPOGS203201-PA, 441 aa
Genomic position: DPSCF300035 + 615102-616668
RNAseq coverage: 386x (Rank: top 31%)
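The lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are made up for this example, and it assumes that the transcript entry is the coding sequence including its stop codon and that the genomic coordinates are 1-based and inclusive. Neither assumption is stated by the record itself.

```python
# Values copied from the record; variable names are hypothetical.
transcript_bp = 1326
protein_aa = 441
genomic_start, genomic_end = 615102, 616668

# A coding sequence should be a whole number of codons.
assert transcript_bp % 3 == 0
codons = transcript_bp // 3

# Assumption: the final codon is a stop codon and encodes no residue.
residues = codons - 1

# Assumption: coordinates are 1-based and inclusive.
genomic_span = genomic_end - genomic_start + 1

# Genomic bases not present in the transcript (plausibly intronic).
extra_genomic_bp = genomic_span - transcript_bp

print(codons, residues, genomic_span, extra_genomic_bp)
```

Under these assumptions the codon count minus the stop codon agrees with the 441 aa protein length, and the genomic span is longer than the transcript, which would be consistent with the gene containing intronic sequence.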
Annotation
Heliconius: HMEL005788, 0.0, 77.85%
Bombyx: BGIBMGA011087-TA, 0.0, 78.36%
Drosophila: ste24a-PA, 6e-118, 46.91%
EBI UniRef50: UniRef50_Q7K172, 9e-116, 46.91%, LD04933p n=26 Tax=Coelomata RepID=Q7K172_DROME
NCBI RefSeq: XP_001659506.1, 1e-130, 50.45%, caax prenyl protease ste24 [Aedes aegypti]
NCBI nr blastp: gi|94469292, 1e-130, 50.57%, prenyl-dependent CAAX metalloprotease [Aedes aegypti]
NCBI nr blastx: gi|94469292, 5e-129, 50.34%, prenyl-dependent CAAX metalloprotease [Aedes aegypti]
Group
Gene Ontology: GO:0016020, 1.7e-47, membrane
Gene Ontology: GO:0006508, 1.7e-47, proteolysis
Gene Ontology: GO:0004222, 1.7e-47, metalloendopeptidase activity
KEGG pathway 
InterPro domain: [190-428] IPR001915, 1.7e-47, Peptidase M48
Orthology group: MCL14008, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203201-TA
ATGGACGAGAATATTCTCTTATTTCTTATATTAAGCTTCTCATGGATTGAATATTTATGGGAACTATATCTCTCCTTACGGCAGCGTAAAATATATAAAACTAACAAAAACATCCCAGAGGACCTTAAAACAATGCTAAATGAAGAGCAATTTGAAAAAGCTCGTATATATGGAATTGATAAGACAAACCTCAAAATAGCTAAGGAATTTTACAGTATGACAATAACATCAATAATTCTTTATAAGAGATGGATATCTGTTGCATGGCATAAGTCAGAAGGCATTGCTGAGATTTTCAATGTTAGTCCCAAACAAGAAATTTTAATAAGTTGTACTTTTATGACATTTGTAACATTATTCAACTTTGTTACTAATATGCCTTTCTCAATCTATGGCACTTTTGTACTCGAACAGAAACATGGTTTCAACAAGCAAACTGTTGGCTTTTTCATAAAAGACCAATTGAAATCTTTAGTACTCAGTCTTGTTATAACATTACCAGTGGTTTCAATGGCAATATACATAATAATGCTTGGTGGTAAAATGTTTGTAGTTTGGTTATGGCTATTTACCACTGTGACTACTTTGTTATTGCTAATGCTTTACCCTTCAGTAATAGCACCTTTGTTTGATAAATTTGTTCCTCTATCTGATGGTTCTCTGAGAACCGCGATAGAGAATCTAGCATCAAAACTTAAATTTCCTCTTACTCAGATATACATTGTGGAAGGTTCCAAGAGATCGGCTCATAGTAATGCATATTTCAGTGGTCTGTTTGGTGCTAAAAGAATTGTTTTATTTGATACTTTACTAGAGAAAGTAGATGAAGATACAAAAGTTACAACGGGCTGTACGGAAAGTGAAATTTTGGGTGTTTTAGCACATGAACTTGGTCACTGGAGTTGTAGCCACATTTATAAATCCATAGCTCTAACTGAAGTTAACTTACTTCTTCTATTTACAGCATTTGGGGCTCTTTTCAGATATTCCATGTTATATATGGCTTTAGGATTCCCTCAAGGCCAGGAACCAATTATTATTGGACTGATTGTTGTCTTACAACTTATTCTTGCACCGTACAATTCACTTTTGTCATTCTTTGCAACAGCTTTATCTAGAAAATTTGAGTTTGAAGCTGATAACTTTGCAGTTTCCCTTAATTATTCTAAGGAACTGAGATCTGCTCTCATTAAGTTAGGGAAAGATAATTTGGATTTCCCGATCTACGACAAACTTTACTCAGCCTGGTACCATTCACATCCAACTTTGTTGCATAGAATTGAAAATATTCAGAATTTAATCAAAGAGCAAAAGAAAGAATTGTAA

Protein sequence:

>DPOGS203201-PA
MDENILLFLILSFSWIEYLWELYLSLRQRKIYKTNKNIPEDLKTMLNEEQFEKARIYGIDKTNLKIAKEFYSMTITSIILYKRWISVAWHKSEGIAEIFNVSPKQEILISCTFMTFVTLFNFVTNMPFSIYGTFVLEQKHGFNKQTVGFFIKDQLKSLVLSLVITLPVVSMAIYIIMLGGKMFVVWLWLFTTVTTLLLLMLYPSVIAPLFDKFVPLSDGSLRTAIENLASKLKFPLTQIYIVEGSKRSAHSNAYFSGLFGAKRIVLFDTLLEKVDEDTKVTTGCTESEILGVLAHELGHWSCSHIYKSIALTEVNLLLLFTAFGALFRYSMLYMALGFPQGQEPIIIGLIVVLQLILAPYNSLLSFFATALSRKFEFEADNFAVSLNYSKELRSALIKLGKDNLDFPIYDKLYSAWYHSHPTLLHRIENIQNLIKEQKKEL-
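
The relationship between the two sequences above can be illustrated by translating the ends of the nucleotide sequence and comparing them with the ends of the protein sequence. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the helper name `translate` and the variables `head` and `tail` are hypothetical and only serve the example. `head` is the first 24 bases and `tail` the last 24 bases of DPOGS203201-TA.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First and last 24 bases of the transcript shown above.
head = "ATGGACGAGAATATTCTCTTATTT"
tail = "AAAGAGCAAAAGAAAGAATTGTAA"

print(translate(head))
print(translate(tail))
```

The translated head corresponds to the first eight residues of DPOGS203201-PA, and the translated tail to its last seven residues followed by the stop codon, which this record writes as a trailing `-` rather than `*`.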