Monarch geneset OGS2.0

DPOGS212083
TranscriptDPOGS212083-TA1299 bp
ProteinDPOGS212083-PA432 aa
Genomic positionDPSCF300038 - 1198396-1207328
RNAseq coverage89x (Rank: top 63%)
Annotation
HeliconiusHMEL0125761e-6860.09% 
BombyxBGIBMGA006715-TA7e-4957.67% 
DrosophilaCG17633-PA9e-3426.42% 
EBI UniRef50UniRef50_A0FDQ43e-3685.23%Carboxypeptidase n=1 Tax=Bombyx mori RepID=A0FDQ4_BOMMO
NCBI RefSeqXP_001608062.17e-3927.79%PREDICTED: similar to carboxypeptidase A [Nasonia vitripennis]
NCBI nr blastpgi|3454823551e-3727.79%PREDICTED: zinc carboxypeptidase A 1-like [Nasonia vitripennis]
NCBI nr blastxgi|1482987803e-5744.23%carboxypeptidase [Bombyx mori]
Group
Gene OntologyGO:00065084.2e-31proteolysis
GO:00082704.2e-31zinc ion binding
GO:00041814.2e-31metallocarboxypeptidase activity
KEGG pathway 
InterPro domain[126-224] IPR0008344.2e-31Peptidase M14, carboxypeptidase A
[24-113] IPR0090201.9e-06Proteinase inhibitor, propeptide
Orthology groupMCL25268 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212083-TA
ATGTGGGTGACACGTATTTTTGTATTTATTGCCTTTGTACGATTAACAAGGACATTTACTGTCGAGAAAGTTAAGGAGTACAAAAACTACAAATACTACGATGTAAAAGGCAGTAGCAGCACCTTGGAACGCTTAAAAACTTTACTAGAAGAGAAAGATGATTCTCTAATTTATTTAGACAATAAAAGAACAACCCAATTATTGATTGCTCCGGAATTGGAAGCAGCTTTTAAAGAAATTATTGAAAAAGTTAAATTAAACGCTACGCTCCTTCATAATGACATTTCTGAGGTTCTCCGAGAAGAGAAGCCACCAACAACACAAAAAATATATAGATATTCTTGGAATGCTTACTACGATGTTAATCAAGCAAACGTAAGTAGGTCGCATCCAGAATGGGCAGAGGTGATAGTGGGCGGAAAGAGTTACGAGGGACGTGAAATACGAGGGCTCAGAATAAATACGCCGGTCGACGGTGACGATAATCCGAATAAGCCTGTATTTTTTATCGAATCAGGAATACATGCACGAGAATGGATAGCTCCAGCCACGACTACGTACTTCATCAATCAACTGCTTACCAGCAAAGATCCTAATGTAACAAGATTGAGGGATCAATTTGACTGGCGGATCTTCCCCACTGTTAATCCCGATGGATATCATTACAGCTATATGTTTGTACAAATCGGTAACGTTTCACTTGAATACGGATACAAGGTGAATCAAAAAAGATATGATGGACCAGGCACCGCTGCGGAGACCCTGTATAAAGCGTCAGGTGGAAGTATGGATTGGGTTCGTAACAAATTGAATACACCGCTCGTTTTCACATACGAGCTACGCGGCAGTTCTTTCCACTGGCCACCGTCAAAGATCCCAGAACAAGGAGAGGAAGTCACTCAGATGATGCTCGGCTTATCAAAAGAAGCTCGCAACCTAGATAAAGCGTCAGGTGGAAGTATGGATTGGGTTCGTAACAAATTGAATACACCGCTCGTTTTCACATACGAGCTACGCGGCAGTTCTTTCCACTGGCCACCGTCAAAGATCCCAGAACAAGGAGAGGAAGTCACTCAGATGATGCTCGGCTTATCAAAAGAAGCTCGCAACCTAGATAAAGCGTCAGGTGGAAGCATGGATTGGGTTCGTAACAAATTGAATACACCACTCGTTTTCACATACGAGTTACGCGGCAATTCTTTCCACTGGCCACCGTCAAAGATCCCAGAACAAGGAGAGGAAGTCACTCAGATGATGCTCGGCTTATCAAAAGAAGCTCGCAACCTAGGTTACTATTAA

Protein sequence:

>DPOGS212083-PA
MWVTRIFVFIAFVRLTRTFTVEKVKEYKNYKYYDVKGSSSTLERLKTLLEEKDDSLIYLDNKRTTQLLIAPELEAAFKEIIEKVKLNATLLHNDISEVLREEKPPTTQKIYRYSWNAYYDVNQANVSRSHPEWAEVIVGGKSYEGREIRGLRINTPVDGDDNPNKPVFFIESGIHAREWIAPATTTYFINQLLTSKDPNVTRLRDQFDWRIFPTVNPDGYHYSYMFVQIGNVSLEYGYKVNQKRYDGPGTAAETLYKASGGSMDWVRNKLNTPLVFTYELRGSSFHWPPSKIPEQGEEVTQMMLGLSKEARNLDKASGGSMDWVRNKLNTPLVFTYELRGSSFHWPPSKIPEQGEEVTQMMLGLSKEARNLDKASGGSMDWVRNKLNTPLVFTYELRGNSFHWPPSKIPEQGEEVTQMMLGLSKEARNLGYY-