Monarch geneset OGS2.0

DPOGS207526
Transcript: DPOGS207526-TA | 1281 bp
Protein: DPOGS207526-PA | 426 aa
Genomic position: DPSCF300177 + 203632-209805
RNAseq coverage: 57x (Rank: top 69%)
Annotation
Heliconius: (none listed)
Bombyx: BGIBMGA001890-TA | 4e-89 | 46.94%
Drosophila: CG8564-PA | 1e-07 | 27.17%
EBI UniRef50: UniRef50_Q3T905 | 6e-08 | 26.64% | Carboxypeptidase B n=5 Tax=Noctuidae RepID=Q3T905_HELZE
NCBI RefSeq: XP_002046669.1 | 2e-08 | 26.69% | GJ12357 [Drosophila virilis]
NCBI nr blastp: gi|74831719 | 2e-07 | 26.64% | carboxypeptidase B precursor [Helicoverpa zea]
NCBI nr blastx: gi|82408203 | 4e-08 | 26.28% | Chain B, Structural Basis Of The Resistance Of An Insect Carboxypeptidase To Plant Protease Inhibitors
Group
Gene Ontology: GO:0006508 | 1.7e-16 | proteolysis
               GO:0008270 | 1.7e-16 | zinc ion binding
               GO:0004181 | 1.7e-16 | metallocarboxypeptidase activity
KEGG pathway: (none listed)
InterPro domain: [138-382] IPR000834 | 1.7e-16 | Peptidase M14, carboxypeptidase A
Orthology group: MCL44336 | Lepidoptera specific

Nucleotide sequence:

>DPOGS207526-TA
ATGGCCGCTGCAATCAAAGCGAGTGCATGCATCAGTCCGGCCATTGGTATTGCCTTGGATAACAATATGGCGGAAAGTCGTCATGCTATAAAATCGATATCCATAAAAAAACAAAATCGATTTGATTCGAAGAAACAAGAAGAATTGCGTGCGAACAGCGAAAAGATGAAAAAATTAATTATAAAGAATGACAAGATGAGTGCTCGTCAAAATTTAATAAAACTGAAAGATAAGAAACAACAGGGCACTACCCAATATCCGAATGCACTGCGACAAGGAAATTTAAAACATATACCACGAGTGACCAAAATGGATATGTTTCAAGATTATAGCGACCCAACAAGTCCTCCCTCAGCGTTCGCAGACAATTATGTTTCTCGAACATTTAGCTCATACAATTTGAAGCAACAACTGACAAGTATAGCTCAAGAAATACCTAATGCAAATATAACGATCGAAGTCATAGGTAGGACTCTGGAATACCAAGACATATTGATGCTCAGGATATCAGAAAATACTGGCCACAGATACTTTAGGGCAGATGATCAAAAATACGCTGAAGAACTGCCGGAGAAGAAGATAATATTTATAGTCCACGGTCTCAAAGTCATGGGAATGAGAGATATACCTTGCTTATCAACGACAGGATCGTTCCGGGTACTACTGTCATACTACACATCGCATTTAGATAAATTTAACATATTTTTAATACCTATGGCGAACCCGGACGGCTATACCTATATACGGTCAACATACAATGAGGAACTGGTTTGGCTTAAACGTGGAATTAATTTTATCGAGGTGCCCTTCTCAGAGGCAGAGACGAGGGCTGTGAGAGACATCTTCCACAAATATGGCCATAAAATTGTAGCTTTCTTTAACGTGCATGCTGGCTCATATCATAGTTCGGTTTTCAGGGGCGATGCTGTACTCTATCCGAAGGGTTACAGCGACACGGCGATAGACGATGATAAATATATTGATTTGAAGGGAGAGATTGACGAAGTCATTAGGAATGCAAGCTTCCAAGTATATTCGGTTACTGTTGATACTTTGTACAACTGGTACGGGCTTGTCAAAGGTTCCAGCGTGGACTACGCCTCAACGGTTTATGGGATACCTTATTCCATGGAATTAGTGATGCAGGGATATGATGAAGAGGGGAATTACTATGATGAAGACTACTCGTATCTTGCTCTAAATGAAGTTTGGTCAAAGGTCATTGACGTCATTATGTCATACATATGGAAATCTATACATGGAAACGATTCAAAACGATGA

Protein sequence:

>DPOGS207526-PA
MAAAIKASACISPAIGIALDNNMAESRHAIKSISIKKQNRFDSKKQEELRANSEKMKKLIIKNDKMSARQNLIKLKDKKQQGTTQYPNALRQGNLKHIPRVTKMDMFQDYSDPTSPPSAFADNYVSRTFSSYNLKQQLTSIAQEIPNANITIEVIGRTLEYQDILMLRISENTGHRYFRADDQKYAEELPEKKIIFIVHGLKVMGMRDIPCLSTTGSFRVLLSYYTSHLDKFNIFLIPMANPDGYTYIRSTYNEELVWLKRGINFIEVPFSEAETRAVRDIFHKYGHKIVAFFNVHAGSYHSSVFRGDAVLYPKGYSDTAIDDDKYIDLKGEIDEVIRNASFQVYSVTVDTLYNWYGLVKGSSVDYASTVYGIPYSMELVMQGYDEEGNYYDEDYSYLALNEVWSKVIDVIMSYIWKSIHGNDSKR-