Monarch geneset OGS2.0

DPOGS204518
Transcript        DPOGS204518-TA  1578 bp
Protein           DPOGS204518-PA  525 aa
Genomic position  DPSCF300205 + 65706-73624
RNAseq coverage   47x (Rank: top 71%)
Annotation
Heliconius      HMEL008975        6e-144  58.47%
Bombyx          BGIBMGA012452-TA  1e-133  59.43%
Drosophila      CG3734-PA         4e-75   36.19%
EBI UniRef50    UniRef50_C9W8J6   5e-158  60.91%  Carboxypeptidase 3 n=2 Tax=Obtectomera RepID=C9W8J6_9NEOP
NCBI RefSeq     XP_001608051.1    1e-82   39.50%  PREDICTED: similar to ENSANGP00000013861 [Nasonia vitripennis]
NCBI nr blastp  gi|237700855      2e-157  60.91%  carboxypeptidase 3 [Mamestra configurata]
NCBI nr blastx  gi|237700855      2e-157  59.69%  carboxypeptidase 3 [Mamestra configurata]
Group
Gene Ontology   GO:0006508  2.6e-114  proteolysis
                GO:0008236  2.6e-114  serine-type peptidase activity
KEGG pathway 
InterPro domain  [51-445]  IPR008758  2.6e-114  Peptidase S28
Orthology group  MCL20556  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204518-TA
ATGAAATTGCTGTGTATATTGTTGACCGTATCGAAATTGGCATGGGGGGAGTCGGACATTTTGGAACCTCCACTAAGAGTGGACCTTAAACCACCTGTAGAGTACAACGGGTTTGAATCCTCTCACTTTTGGCGCACCTACGATATGCCCATTGACCACTTTGATCCTCAAAATCGTGAAACATATCAAATGCGATATATGTACAATGAGGAATTTTTTGGTGGTAATAATTATCCTATTTTTATTATGGTTGGCGGCGAGTGGAACATTCAACCCGGATGGTTGCTAGCAGGGAATATGTACTTAATGGCTCAGGAAAATAGAGGATATCTTTTCTATACAGAACATAGATATTATGGGGAAAGTTTACCCTACACAACTTTTACTACCGAAAACCTACGTTTCCTAAATGTCGATCAGGCGCTCGCTGACCTTGCATATTTCATATCGGAAATTAAAAAAATACCGAGTTTTGTGAACAGCAAGGTTGTGTTATACGGAGGTTCATACGCAGGCAATATGGTGTTATGGCTAAAACAAAGATACCCGCATTTGGTAGTGGGTGTTGTTGCTTCAAGCGGACCTATAAAAGCACAAGTTGATATTCCTGGTTATTTGGAGGTAGTACACAACGCATTCCTATCAGAGGGAGGACAAGAATGCGTAGATACTATTAAACAAGGCATCGCTGATACTATAGCTGCAATGGAAACCGAAGACGGAAGAAGGTCTATACAGAGAATTTACAGACTATGCGTTCCCCTAGACTATAGCTCCCGTCTATCGATGGGTTATTTCTCCGGCTACATTACTTGGACGTTTTCTACTTCTGTACAGACTGCGAGACCTGGTTCACTGACAGCTATATGTCAGAACTTCACTAATAACGTCTACGGTTCCACGCCAATGGAGCAAATTGGGGGCTATATTGCAGACAGCCGGTCAATTTCAAATTGTCTGAATGTAACCTACGATAATTACGTAGCCTCCTATAATAAGACTGTCCCTTCGAATGGCAAAGCGTGGTACTATCAGACATGCACGGAATATGGCTACTATCAAACAGCACCAAAATCTGGTACCGCCTTCGACCAACTAACTTGGCTCGATGTGCCCTTCTATGTTGACTTTTGCAAAAGAGTATTCAGTGAAAAATTTACCGAGTCGTTTGTGATGAACGCGATTGATCGGGTGAATCTTATGTTCGGTGGATTGTATCCAAATGTCAATAACACGATCAATATCCACGGTGATATAGATCCGTGGCACGTGCTCGGAGTTTACGACCGGGACCTTAAGGAAACATCGCCAACCATACTTGTACCCAGAGCTCTTAGAGTGGTGGAAAATGTATTCGGCATATTGTCTGCAGTGTTTAGTTTTGCGGAAACCTATGTTGTTAGATCTGAAAACCGTGGCATCTGTGGTAGCTTTGACGAAGAATCACACGGCCAGGTAAGATTAGGAAACTGGAGAGAAGATAACAATAACATGACTAGTATGTTACCCTTGCAAATAAGACTGCGCAAGGCCACAAGATCTTCTCACAATGATCGTAATGAAGTTGCTGAATATTGA

Protein sequence:

>DPOGS204518-PA
MKLLCILLTVSKLAWGESDILEPPLRVDLKPPVEYNGFESSHFWRTYDMPIDHFDPQNRETYQMRYMYNEEFFGGNNYPIFIMVGGEWNIQPGWLLAGNMYLMAQENRGYLFYTEHRYYGESLPYTTFTTENLRFLNVDQALADLAYFISEIKKIPSFVNSKVVLYGGSYAGNMVLWLKQRYPHLVVGVVASSGPIKAQVDIPGYLEVVHNAFLSEGGQECVDTIKQGIADTIAAMETEDGRRSIQRIYRLCVPLDYSSRLSMGYFSGYITWTFSTSVQTARPGSLTAICQNFTNNVYGSTPMEQIGGYIADSRSISNCLNVTYDNYVASYNKTVPSNGKAWYYQTCTEYGYYQTAPKSGTAFDQLTWLDVPFYVDFCKRVFSEKFTESFVMNAIDRVNLMFGGLYPNVNNTINIHGDIDPWHVLGVYDRDLKETSPTILVPRALRVVENVFGILSAVFSFAETYVVRSENRGICGSFDEESHGQVRLGNWREDNNNMTSMLPLQIRLRKATRSSHNDRNEVAEY-
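
The lengths and sequences in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record, assuming the standard genetic code and assuming the trailing "-" in the protein sequence marks the stop codon. The function name `translate` and the short excerpts (the first 54 and last 18 nucleotides of DPOGS204518-TA) are chosen here for illustration only; pasting in the full sequences works the same way.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: no
# alternative translation table applies to this nuclear gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Excerpts copied from the record above (illustrative, not the full CDS).
cds_start = "ATGAAATTGCTGTGTATATTGTTGACCGTATCGAAATTGGCATGGGGGGAGTCG"
cds_end = "GAAGTTGCTGAATATTGA"

print(translate(cds_start))  # MKLLCILLTVSKLAWGES
print(translate(cds_end))    # EVAEY*

# Length bookkeeping from the summary table.
transcript_bp = 1578
protein_aa = 525
codons_with_stop = protein_aa + 1
print(codons_with_stop * 3 == transcript_bp)  # True

# Genomic span on the scaffold (inclusive coordinates assumed).
genomic_span = 73624 - 65706 + 1
print(genomic_span)  # 7919
```

Under these assumptions the 1578 bp transcript is exactly 525 codons plus one stop codon, and the 7919 bp genomic span is considerably longer than the transcript, which would be consistent with the gene containing introns.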