Monarch geneset OGS2.0

DPOGS211020
TranscriptDPOGS211020-TA1677 bp
ProteinDPOGS211020-PA558 aa
Genomic positionDPSCF300004 + 1303424-1313125
RNAseq coverage131x (Rank: top 56%)
Annotation
HeliconiusHMEL0060518e-9047.84% 
BombyxBGIBMGA010348-TA3e-6130.42% 
DrosophilaCG31823-PA4e-6332.39% 
EBI UniRef50UniRef50_E2BXB05e-6232.57%Retinoid-inducible serine carboxypeptidase n=8 Tax=Endopterygota RepID=E2BXB0_HARSA
NCBI RefSeqXP_001605442.12e-6634.24%PREDICTED: similar to CG3344-PA [Nasonia vitripennis]
NCBI nr blastpgi|1565553964e-6534.24%PREDICTED: retinoid-inducible serine carboxypeptidase-like [Nasonia vitripennis]
NCBI nr blastxgi|1565553965e-6534.24%PREDICTED: retinoid-inducible serine carboxypeptidase-like [Nasonia vitripennis]
Group
Gene OntologyGO:00065084.9e-101proteolysis
GO:00041854.9e-101serine-type carboxypeptidase activity
KEGG pathway 
InterPro domain[174-556] IPR0015634.9e-101Peptidase S10, serine carboxypeptidase
Orthology groupMCL25250 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211020-TA
ATGTCAGGCCTATGGATGTGTGCAACAGGTTTTGTTTTAGTTCTAAGTATTGCGATACAGATAGAAGCAGTCACAGTGAACCCGTTGTGTCCTACTAATAAAATTGCGCCTCATCGTAGGGCGAGGCGATTCAGAAATTCTCAGCCCAGAACATTTTACGCATATTACGATTTATTGAGACCAGGTGGGTCTTATAATGCACCCAGCAGTTTGCAAGAAAAGCCCCAGGAAGACTCCGAAGATTGCGGAGAAATAGAGGATTATGATGAAAATGACGTCAGTTCAGCTACAACAACTAGGGCAGGTAATCCTACATCAAATAGAAAATATGGTTTTTATAATCACTATGGTTTCTATCGCCCAATGCAAACAACAACTGCACCTAAAAGACAAACTCGACCCCCTGCAGGAGGATATTTTGGAAATAACGCATACAATCCACCCAAGCCTTTGAATCCTACAGAACTTAATCCATACTTAGAGGCTTTAAGTCATAAGCCATGGAAAAGTATTCAAAGCTATATCGAGGTTCGCCCCGGAGCCTTTCTCTTCTACTGGTTCTATTACGCTGATGGTTCTGTGAATGGAGCTAGAGAGAAGCCTCTTATAATATGGATACAAGGCGGTCCAGGATTAGCAGCTAGTGGAATAGCAAATTTTGCAGAAATCGGACCGTTGAATATGAATATGCAACCTAGAAATCATACTTGGGTTAAGGGTAACAATTTGTTACTAATAGATCATCCAGTAGGGACTGGCTTTAGTTATGCTTCAAATAAATCTCTATACGTTCGGACCGATAAAGGTGCCGCCAGAGATCTTCTGAGAGCAATAAAAGAATTTTTTAAACGTCACAAGGAGTTTCGCAAGACACCAACTTATTTAATCGGACAAAGTTATGGAGGTAAATTGTGTCCGAGGTTGGGATATTATTTGTATACGGCTATGAAGAATAAACGTTTAAAGATGAATTTTAAAGGCATCGGAATTGGCAGCGGATGGGTTGATCCAAAACAAAGCTCTTTAGTGCAACCGGAATTTTTATATAATATGGGTGTAATTGACTTATCAACTTTCGTGAAATCAAAGAAGATTGTAAAGCAAATGTGTGAATTAATCGAGGCCAAAGAATACGTGACCGCCGGAAGATTTTCTACTATACTATTTAACATGTTTAATGTGGAAGCAGCAATGGATATAAACTTTAATAATATCAACCAAGAGAGTCCTTATCCGGCTCTATACCGATTGGCGTTAAAAGTTAACAAATACGTTAAACCAACTTTAAAAGAAGTGGATCAGAATTTAGATTGGAGTTTTATTTCCGATGATGTCTTTGAAAGTTTAAGCGAAAGCTTCCTTGTACCTTCTAGCAAATACTTAGAAACACTGCTAAATCATACGAATCTTCGGATCGTTGTTTATAATGGAAATCTTGATGTGGTTACACCTCTTGCTGGAGCAACGAATTGGGTTCACGCACTAAAATGGCGCGGCTCAAGGGAACTAATGAACGCAACGAGAATTCCTATTAAAGGACATCGGAATGGTTTCTATAAAACTGCGAGACAACTAAGTCTTTGGTCTGTCTTTGGTTCCGGACACTGGGTACCAGAAGAGAATCCTGTGGCAATGGAAGAAATACTCAAATATTTGATGTCCGCTGAATACAGATAA

Protein sequence:

>DPOGS211020-PA
MSGLWMCATGFVLVLSIAIQIEAVTVNPLCPTNKIAPHRRARRFRNSQPRTFYAYYDLLRPGGSYNAPSSLQEKPQEDSEDCGEIEDYDENDVSSATTTRAGNPTSNRKYGFYNHYGFYRPMQTTTAPKRQTRPPAGGYFGNNAYNPPKPLNPTELNPYLEALSHKPWKSIQSYIEVRPGAFLFYWFYYADGSVNGAREKPLIIWIQGGPGLAASGIANFAEIGPLNMNMQPRNHTWVKGNNLLLIDHPVGTGFSYASNKSLYVRTDKGAARDLLRAIKEFFKRHKEFRKTPTYLIGQSYGGKLCPRLGYYLYTAMKNKRLKMNFKGIGIGSGWVDPKQSSLVQPEFLYNMGVIDLSTFVKSKKIVKQMCELIEAKEYVTAGRFSTILFNMFNVEAAMDINFNNINQESPYPALYRLALKVNKYVKPTLKEVDQNLDWSFISDDVFESLSESFLVPSSKYLETLLNHTNLRIVVYNGNLDVVTPLAGATNWVHALKWRGSRELMNATRIPIKGHRNGFYKTARQLSLWSVFGSGHWVPEENPVAMEEILKYLMSAEYR-