Monarch geneset OGS2.0

DPOGS204786
TranscriptDPOGS204786-TA1287 bp
ProteinDPOGS204786-PA428 aa
Genomic positionDPSCF300217 - 119205-121997
RNAseq coverage74x (Rank: top 65%)
Annotation
HeliconiusHMEL0176816e-15962.59% 
BombyxBGIBMGA009487-TA8e-14957.35% 
DrosophilaCG18417-PA2e-7237.90% 
EBI UniRef50UniRef50_Q3T9054e-15061.41%Carboxypeptidase B n=5 Tax=Noctuidae RepID=Q3T905_HELZE
NCBI RefSeqNP_648119.15e-7137.90%CG18417 [Drosophila melanogaster]
NCBI nr blastpgi|748317192e-14961.41%carboxypeptidase B precursor [Helicoverpa zea]
NCBI nr blastxgi|748317192e-14861.41%carboxypeptidase B precursor [Helicoverpa zea]
Group
Gene OntologyGO:00065087.1e-93proteolysis
GO:00082707.1e-93zinc ion binding
GO:00041817.1e-93metallocarboxypeptidase activity
GO:00041803.6e-13carboxypeptidase activity
KEGG pathway 
InterPro domain[121-409] IPR0008347.1e-93Peptidase M14, carboxypeptidase A
[17-108] IPR0090202.9e-17Proteinase inhibitor, propeptide
[26-95] IPR0031463.6e-13Proteinase inhibitor, carboxypeptidase propeptide
Orthology groupMCL17548 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204786-TA
ATGAAGGTGCTGGCTGTATTGTTATTAGCTTCCTTGGTCTCGGCCAAACATGAAGTTTATTCAGGATGGAAGTCTTACTACGTCAATCCGTCCACTCAAGAGCAACTCGCATCCCTAGGACAGCTTATACCATATTTAGAATTGGATTTCATCAGCTATGCATCCGTGAACAGACCAGGAGTAGTTCTTGTTAAACCATACCATCAGGAAAAATTTATAAAGTTCTTAGAAGAAGAAAACATTGATCATTGGGTGCACAGTGAAGATGTTAAAGAATCTCTTGATATTGACGATGCTATCATAGAGGAAATAAATCAGAAGGAATCAAAATTTAATGGCGTCAGGATACCGTATAATAATTACCAACCTTTGGAAGTTATCTACCAATACATTGACATGATTGCGGAGAAATATCCTGAAGTTGCTACTTTAGTCACTCCGGCTAATTCATTCGGAGGAATCCCAATCAAATATTTGAAGATTTCAAAGAACAAATTTCAAGGAAACAAACCCGTTATAATCATTGACGCTGCGATGCATGCAAGAGAATGGATAACGCCACCAACAGTGACTTATGCTATCCACAAGCTGGTAGAAAATGTCACTGAACCGGACTTGCTCGAAAATTTCGACTGGATCTTGTTACCAGTTGCTAATCCCGATGGTTATAAATATTCTTTTGAGAAGGAACGTTTTTGGCGTAAGACTCGTTCAACTGACCAACATCCAACGAGTAGATTATGTCCAGGCGTAGATGGAAACAGGAATTTTAATTTTGTTTGGAATACGATTGGTACCAGCAATAACCCTTGTTCTGATATATACGCTGGCGCAAGGCCCCATTCAGAAATAGAAGTGAGGGTAGTCGAAAATATTATAACAGAACACTTAGACAATGCTCTTATGTACTTAACCATGCACAGTTTTGGAAGTTATATTCTTTACCCTTGGGGTCACGACGGTTCACTGCCACCCAACGCCTTTGCCTTACACCTTGTTGGAGTAGAAATGGCTGATGCGATAACCAATGTACAGTTACCTAATTTCCCAAAATACCGGGTCGGTAATGCTGTAACTACTCTTGGATATCCGGCTTCTGGTGCTGCAGAAGACTATGCTCATATGAGGGGCGTTCCTTTGTCGTATACCTACGAGTTGCCTGGATTGAGAAGTGGTTTCCAAGGTTTTCATTTAGATCCAAGGTATATAAGACAGGTCAGCGAGGAGACGTGGATCGGTATTGTTGCAGGTGTCAGAAGATCTCTTCAATTTGCTAGTAATAAATAA

Protein sequence:

>DPOGS204786-PA
MKVLAVLLLASLVSAKHEVYSGWKSYYVNPSTQEQLASLGQLIPYLELDFISYASVNRPGVVLVKPYHQEKFIKFLEEENIDHWVHSEDVKESLDIDDAIIEEINQKESKFNGVRIPYNNYQPLEVIYQYIDMIAEKYPEVATLVTPANSFGGIPIKYLKISKNKFQGNKPVIIIDAAMHAREWITPPTVTYAIHKLVENVTEPDLLENFDWILLPVANPDGYKYSFEKERFWRKTRSTDQHPTSRLCPGVDGNRNFNFVWNTIGTSNNPCSDIYAGARPHSEIEVRVVENIITEHLDNALMYLTMHSFGSYILYPWGHDGSLPPNAFALHLVGVEMADAITNVQLPNFPKYRVGNAVTTLGYPASGAAEDYAHMRGVPLSYTYELPGLRSGFQGFHLDPRYIRQVSEETWIGIVAGVRRSLQFASNK-