Monarch geneset OGS2.0

DPOGS205075
Transcript: DPOGS205075-TA  2217 bp
Protein: DPOGS205075-PA  738 aa
Genomic position: DPSCF300074 + 50186-58320
RNAseq coverage: 12x (Rank: top 83%)
Annotation
Heliconius: HMEL012120  2e-172  50.88%
Bombyx: BGIBMGA006871-TA  0.0  70.52%
Drosophila: CG8945-PC  1e-79  37.69%
EBI UniRef50: UniRef50_E2C9W2  2e-104  45.41%  Carboxypeptidase B n=2 Tax=Formicidae RepID=E2C9W2_HARSA
NCBI RefSeq: XP_001602586.1  4e-98  44.95%  PREDICTED: similar to molting fluid carboxypeptidase A [Nasonia vitripennis]
NCBI nr blastp: gi|307168677  1e-110  45.88%  Carboxypeptidase B [Camponotus floridanus]
NCBI nr blastx: gi|307168677  3e-110  45.65%  Carboxypeptidase B [Camponotus floridanus]
Group
Gene Ontology: GO:0006508  8.3e-113  proteolysis
  GO:0008270  8.3e-113  zinc ion binding
  GO:0004181  8.3e-113  metallocarboxypeptidase activity
  GO:0004180  1.6e-13  carboxypeptidase activity
KEGG pathway 
InterPro domain: [410-726]  IPR000834  8.3e-113  Peptidase M14, carboxypeptidase A
  [302-395]  IPR009020  7.3e-15  Proteinase inhibitor, propeptide
  [303-393]  IPR003146  1.6e-13  Proteinase inhibitor, carboxypeptidase propeptide
Orthology group: MCL17372  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205075-TA
ATGGCGCGTCGCTTGGCGCTGCTGGCGTTCCTGGCGTGCGCGGTCACAAGCCGCCCGCCTTCCAACTTCCATCAATTAACCAGTTCAAACTCTGACACCTTAGACACTATCGCTCTGTGGGCGGATCTCGATGATCTGGACTCAGATGAGATCCCTCCGCGATTTGATAGAGCTAAACACAATGCCAAAGTGATCATAAGGAATAACTTACCAGAACTAAACATTACCAGGAAAACTTCTAAGAGTGACAGTAAAAATAAAGTTATAAAAACGACGATTAATATCGAAAGTCATACAAACTTTATAGTGACTAAACGACCGGTAGTTGTGACTAGCTCAGTGACTTTATTGCCCGAAGAATCTAATATGAATGATGAGGTATTTTGGGTAGATGGAGTTGCTCATACTCCGAGACCCATCACAAGGAAAACAGCCAAGCCGAAATCATCAACAACACGGCGGCCGTCTACAACGAGACGATCTCAGACTTCGACGAAAAAACAAAAAGCAACCACACGTCGTTCAACAACGAGAACAACTCGGTCATCTCTAACAACTCGTGCAACCAAAGCAACGAAATCACCAAAGACTAAAGTGACTTGTGCAACCAAAAAGCCTGGAAGGGTACCATCACTCCATCAAGATAACAAAGTCAAAGAGAAACCAGTTAGTTCAGGTTGGTTTGTAGGTATCTTCTCCAGCTTATACAATTGGTTCGTAGAATCAAAAACGTCTTGGTTCAGCAGCAACTGGTTCATAGACGACTCAAAGATTACCACAACAACAACACCTTCCCCCCCTTCAACTTCTACTGGAAATTGGTTTGAAACGTTTTTAGATTCAGAAGATGTGGAAACAACTCCAGCAACAGAAACCATAAAAAAATCAGCTTCAAAATCACTTAAGAAAAATTACAAAGGATATCAGCTGATACGTGCTTATCCAGATGTTCTATGGAAAGTTCATTCTTTGTTAGACCTTAAGGAGGAAGCTGAAGGTTCGGGCTTGATGTGGTGGACATCTCCCTCCTTAAATGGGTCCACAGATCTTTTAGTTCCACCAGATCTGCTAGTTGATATTAAAGATAGTCTGAAATCTTCAAAAATAGAGTTCGATGTTGTAATTTGGGATTTGCAGAAAGCTATTTCATATGAGAACCCAAAACTGTCAAAAAAGCAGCGAATAGAACTTGAACAATTGAGAGGTCATCCAATGACTTGGAGGCGGTATCATCGTTATTCTGATATAATGCGTTACTTAGATTATTTACAACATTCATATTCTGATATAGTAGAACTGATACCCTTAGGGCTGTCTTCAGAGGGCCTACCTCTAGTAGCTGTTAAGGTGTCACTTCCTCGCAATGAAACTATAAAAAACAATAAGGTGAAAAGAAAATATAAATTAAAATCTCAACTGAAGCCTGCCGTGTGGTTGGAGGGCGGTGCCCACGCCCGTGAGTGGATAGCCCCAGCTGTCGCTCTATGGATGTTACACAACCTAGTGGAAGGAGAAAAAGGCTTCGGAACCGATCGGCGTATGCTTAAAATGGCAGATTTTTACATTATGCCAGTGTTGAATCCTGACGGGTACGAGCACTCGCACACGCACGACAGGCTTTGGAGGAAAACACGCTCACGAAGCTCAGAGCATTCAGACGATTACTATGTTGGCTGGTTTCCGTGGAATTGGGGACGGACGGAGTGCATCGGTGTTGACGCCGACCGCAATTGGGATTACCACTGGGGTGAAAAAGACTCTTCGCAAGATCCGTGCGCCGAAAACTACTCTGGTCCACATCCCTTCAGTGAGCCAGAAACACGGGCTGTTTCCCAGTTCTTAGCTGAAAATAGAGGCCATATCCAGGTATACATATCACTGCATGCGTACTCGCAAGCTTGGCTTTTGCCAAGCAGTCATTCGCATGCAACTTTTGCCGACGATGGAGTTTTAATGGAAATGGGCAAACTGGCGACGGCTGCTCTGGCAGACATGTACGG
AACCAAGTATCAGGTGGGGACAGCTGCGGAAATACGTCAACCAGCAACTGGGATGTCCCACGATTGGGCAAAAGTTCGAGCTGGTATCAAGTATGCCTACCACGTCGACCTCCGAGATTCCTACGGTCCCTACGGTTTTTTGTTACCAGGGTCGCAGATAGTGCCAACAGCGCGTGAAACTTATCAGGCTCTCAAAGCGATTGTTGAAAATTTATAA

Protein sequence:

>DPOGS205075-PA
MARRLALLAFLACAVTSRPPSNFHQLTSSNSDTLDTIALWADLDDLDSDEIPPRFDRAKHNAKVIIRNNLPELNITRKTSKSDSKNKVIKTTINIESHTNFIVTKRPVVVTSSVTLLPEESNMNDEVFWVDGVAHTPRPITRKTAKPKSSTTRRPSTTRRSQTSTKKQKATTRRSTTRTTRSSLTTRATKATKSPKTKVTCATKKPGRVPSLHQDNKVKEKPVSSGWFVGIFSSLYNWFVESKTSWFSSNWFIDDSKITTTTTPSPPSTSTGNWFETFLDSEDVETTPATETIKKSASKSLKKNYKGYQLIRAYPDVLWKVHSLLDLKEEAEGSGLMWWTSPSLNGSTDLLVPPDLLVDIKDSLKSSKIEFDVVIWDLQKAISYENPKLSKKQRIELEQLRGHPMTWRRYHRYSDIMRYLDYLQHSYSDIVELIPLGLSSEGLPLVAVKVSLPRNETIKNNKVKRKYKLKSQLKPAVWLEGGAHAREWIAPAVALWMLHNLVEGEKGFGTDRRMLKMADFYIMPVLNPDGYEHSHTHDRLWRKTRSRSSEHSDDYYVGWFPWNWGRTECIGVDADRNWDYHWGEKDSSQDPCAENYSGPHPFSEPETRAVSQFLAENRGHIQVYISLHAYSQAWLLPSSHSHATFADDGVLMEMGKLATAALADMYGTKYQVGTAAEIRQPATGMSHDWAKVRAGIKYAYHVDLRDSYGPYGFLLPGSQIVPTARETYQALKAIVENL-
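
Consistency check:

The lengths listed in this record can be cross-checked against each other. A 738 aa protein plus one stop codon (the trailing "-" in the protein sequence, matching the final TAA of the transcript) needs 3 x 739 = 2217 nt, which agrees with the listed transcript length if the transcript is assumed to be coding sequence only, with no UTRs. The sketch below is a minimal illustration of that arithmetic; the function name is hypothetical, the values are copied from the record, and the genomic coordinates are assumed to be 1-based and inclusive.

```python
def cds_length_nt(protein_aa: int, include_stop: bool = True) -> int:
    """Return the number of nucleotides needed to encode a protein.

    Each amino acid takes one codon (3 nt); optionally add one stop codon.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
transcript_bp = 2217
protein_aa = 738
genomic_start, genomic_end = 50186, 58320

# Expected coding length, assuming the transcript has no UTRs.
expected_bp = cds_length_nt(protein_aa)

# Genomic span, assuming 1-based inclusive coordinates.
genomic_span = genomic_end - genomic_start + 1

print("transcript length matches protein length:", expected_bp == transcript_bp)
print("genomic span (bp):", genomic_span)
```

Under those assumptions the genomic span is longer than the transcript, which would be consistent with the gene model containing introns.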