Monarch geneset OGS2.0

DPOGS204785
TranscriptDPOGS204785-TA1272 bp
ProteinDPOGS204785-PA423 aa
Genomic positionDPSCF300217 - 124199-125835
RNAseq coverage1x (Rank: top 93%)
Annotation
HeliconiusHMEL0176813e-14759.76% 
BombyxBGIBMGA009487-TA5e-11646.84% 
DrosophilaCG18417-PA1e-5533.41% 
EBI UniRef50UniRef50_C9W8K41e-11648.95%Caboxypeptidase 4 n=2 Tax=Obtectomera RepID=C9W8K4_9NEOP
NCBI RefSeqNP_648119.13e-5433.41%CG18417 [Drosophila melanogaster]
NCBI nr blastpgi|2249245365e-11648.95%caboxypeptidase 4 [Mamestra configurata]
NCBI nr blastxgi|2249245361e-11448.95%caboxypeptidase 4 [Mamestra configurata]
Group
Gene OntologyGO:00065081.7e-71proteolysis
GO:00082701.7e-71zinc ion binding
GO:00041811.7e-71metallocarboxypeptidase activity
GO:00041805.1e-10carboxypeptidase activity
KEGG pathway 
InterPro domain[122-406] IPR0008341.7e-71Peptidase M14, carboxypeptidase A
[17-108] IPR0090201.1e-14Proteinase inhibitor, propeptide
[26-97] IPR0031465.1e-10Proteinase inhibitor, carboxypeptidase propeptide
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204785-TA
ATGAAGGTCTGGGTATTCTCATGCTTTATTTTTGCTGTTTCAGCCAAACACGAAGAGTTCGCAGGATGGAAGTCGTATTATGTAGAACCGTCAGATTTAGGTCAACTTAAATCTTTCAATAGTCTGATACAACATATAAAAGTTGATGTTTTTGCTCATCCAATTGTTGGACGACCTGGACTCATTTTAGTGGATCCATCGTTTCAAAACGAACTTATTGGTGGTTTGGAAAGATTGGGCATCAAATATAAAATCCATGCTGAAGATGTTAAAAGTCAATTTGATTTGGAAGATGACTTGTTCAAAAATGTTTATAAAAAAAGTCCTCAGAGCAATTTTGGAGGGAAGTTGCCATATGATTCGTATCAATCATATGAAACGATTAACAAATACTTAGATGATATTGCAAAAGAATATCCAGAGAAAGCTAAAGTAGTAACTCATACTTCTTTTAATGGGTTGCCTATTAAGTACTTGAGGGTGTCTACAACAAATTTTGAAGATCCAACGAAACCTATTATATACCTAGATGGAGGTATCCACGGTAGAGAATGGTTATCCATACCGCCTGTGACTTTTGCCATTAATAGACTTTTGGAAAATGGTACTGATTCTAGTCTATTAGAAAAGTTTGACTTTATTCTATTCCCTATCGTTAACACAGATGGATACCAATATAGCCGTGACAGAAGCGCGGCATGGAGAAAAACTCGTTCCTGGTACCAAGATCCCTGGAGCATTATGTGTCCTGGGGTAGACATTGATCGCAATTTTGATTTCCATTGGAACACTACAGGAGCTGGTAGTAGTAAATGCTCATACGTGTATCCTGGCGTTTCTGCATTCTCAGAAGCCGAAACAAGAGTTGTAAGAGAAATATTATTAGAAAATGATCGCATTCTTATGTACATATCCTTTAGTAGCGGGGGTAGTCTCATCATGTATCCATGGGCAGTTGACGGTTCCTTGTCCAGTGAAGTCTTTAGACTTCACAATGTCGGCGTGGCTATGGCTGATACTATCAATGCTTTACAAATGCCCGAGTTTTTCGATTATAAAGTAGGTAATGCAGCATTGGTTCGCCAACTACCGATGTCAGGAACGTCTATCGACTATGCTCATCATTTAGGGGTCCCATTGACCTTTACGATCGAATTGCCTGGACTTTTTGGAGGTTATTTTATGAATCCAATTTACATGGCAAGAATTTGTTTTGAAACTTGGGAAGGAGTAAAGGCTGGTATCAGAAAAGCAGAACAATTATATATGTAG

Protein sequence:

>DPOGS204785-PA
MKVWVFSCFIFAVSAKHEEFAGWKSYYVEPSDLGQLKSFNSLIQHIKVDVFAHPIVGRPGLILVDPSFQNELIGGLERLGIKYKIHAEDVKSQFDLEDDLFKNVYKKSPQSNFGGKLPYDSYQSYETINKYLDDIAKEYPEKAKVVTHTSFNGLPIKYLRVSTTNFEDPTKPIIYLDGGIHGREWLSIPPVTFAINRLLENGTDSSLLEKFDFILFPIVNTDGYQYSRDRSAAWRKTRSWYQDPWSIMCPGVDIDRNFDFHWNTTGAGSSKCSYVYPGVSAFSEAETRVVREILLENDRILMYISFSSGGSLIMYPWAVDGSLSSEVFRLHNVGVAMADTINALQMPEFFDYKVGNAALVRQLPMSGTSIDYAHHLGVPLTFTIELPGLFGGYFMNPIYMARICFETWEGVKAGIRKAEQLYM-