Monarch geneset OGS2.0

DPOGS201319
TranscriptDPOGS201319-TA1497 bp
ProteinDPOGS201319-PA498 aa
Genomic positionDPSCF300176 + 347670-350798
RNAseq coverage251x (Rank: top 42%)
Annotation
HeliconiusHMEL0172460.075.56% 
BombyxBGIBMGA003112-TA0.077.29% 
DrosophilaCG4572-PA5e-8340.98% 
EBI UniRef50UniRef50_C9WMM51e-9844.57%Venom serine carboxypeptidase n=16 Tax=Endopterygota RepID=VCP_APIME
NCBI RefSeqNP_001152775.12e-9944.57%venom serine carboxypeptidase [Apis mellifera]
NCBI nr blastpgi|3838576443e-10244.55%PREDICTED: venom serine carboxypeptidase-like [Megachile rotundata]
NCBI nr blastxgi|3838576441e-10144.42%PREDICTED: venom serine carboxypeptidase-like [Megachile rotundata]
Group
Gene OntologyGO:00065085.7e-114proteolysis
GO:00041855.7e-114serine-type carboxypeptidase activity
KEGG pathway 
InterPro domain[54-470] IPR0015635.7e-114Peptidase S10, serine carboxypeptidase
Orthology groupMCL12601 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201319-TA
ATGATATTGGAACTACTCATCCTTTTTACATTCACCCACCATGCTTTGGGTTCGGATCGATATGATTTGCCTGAACTGCCGCCTGGTGAGGCGCCGGAGCCGATCTTATCGTTTATGGAGAACACCTATAGACTGGGCCGCGGCAGAGCAACGCATGACGGTGATGCCGGAGAGCCTCTGCTCTTGACCCCATTGATAGAAGACAACAAGTTAGAAGAAGCTAGAGCTGCATCATATGTGAACCCGGATTATCTGCTCCCTGGCATGGATAGCTACGCGGGTTACTTAACCGTGAACAAGGAATACAATGCAAACCTTTGGTTCTGGTACTTTCCAGTATCTGACCAGCCAGTTGAAGAGACTCCTTGGATAATTTGGTTACAAGGAGGTCCGGGCGCTTCATCGCTATACGGACTCTTTACAGAAATTGGTCCATTGGTTGTCACGGATGAGAATCAATTGAAAGAACTTCAATACTCGTGGGGAAAAAATCATTCGCTTCTGTTCATTGACAATCCTGTAGGAACAGGTTTCAGTTTTACTTACGATGACAGAGGCTTTGCTACCAATCAAACAACAATCGGTGAAAACCTGTACACAGCCCTGCAGCAGTTCCTGACACTGTTCCCGGAGCTGCGGAAGGCTCCGCTGACTATAGCGGGAGAATCGTATGCCGGGAAACACATTCCTTCACTTGGGGTCCAAATACTTTGGAATAAGTATCAAGATAAGCCGATCAACTTACAGGGTTTAGCCATCGGTAACGGGTTCATTGACCCGATGTCGTTGCAAAGATATAGCTATTTCGTACGCGAAGTTGGTCTGGTTGACGATAAAGTAGCTAATGTTATGAACCAGTTGGAAACAGCTATCGTCCAGTTCATTAAAGCTGACCAAATGCTGAAAGCGTATGCCTACTACAATTATCTCTTGCAACTGTTTCTATCAGAATCAAAATTAAACAATCTGTATAACTATCTAGAAGACGATATTAGTTTGGACGGAGCGTATCTCGATTACATACAAAGGACAGACGTCAGAAAGGCTTTGCACGTAGGTAATACGAACTTCACATCAATAGGGGTTGTGTATAGAAAATTAGTGCCAGATTTCATGGCAAGTGCTAAATCCATGCTTGAAGAGCTGTTGGAAAACTACCGAGTGATGTTATTCAATGGTCATCTAGATATAATAGTGGCTTATCATCCGTCAGTTAACACCTACGAGTCTCTGTCCTTCTCTGGGACCATGGAATACAAAATGGCCAAACGTCGTTCCTGGTATCATGACGGGCAATTAGCTGGGTATTACAAAACAGCTGGTAATTTAACAGAGGTAATGATCCGTGGCGCTGGTCACATGGTCCCCGCAAATAAACCGGCCGCAGCGCTCGGACTCATCTCAGCTTTCGCCCGCGGTATAACTTTAGAAAAAGACACAGGTCCATTGGTGAATATAGAAAAAAACATCTCGAGAACTTATCCTCAGCCAGTCTGA

Protein sequence:

>DPOGS201319-PA
MILELLILFTFTHHALGSDRYDLPELPPGEAPEPILSFMENTYRLGRGRATHDGDAGEPLLLTPLIEDNKLEEARAASYVNPDYLLPGMDSYAGYLTVNKEYNANLWFWYFPVSDQPVEETPWIIWLQGGPGASSLYGLFTEIGPLVVTDENQLKELQYSWGKNHSLLFIDNPVGTGFSFTYDDRGFATNQTTIGENLYTALQQFLTLFPELRKAPLTIAGESYAGKHIPSLGVQILWNKYQDKPINLQGLAIGNGFIDPMSLQRYSYFVREVGLVDDKVANVMNQLETAIVQFIKADQMLKAYAYYNYLLQLFLSESKLNNLYNYLEDDISLDGAYLDYIQRTDVRKALHVGNTNFTSIGVVYRKLVPDFMASAKSMLEELLENYRVMLFNGHLDIIVAYHPSVNTYESLSFSGTMEYKMAKRRSWYHDGQLAGYYKTAGNLTEVMIRGAGHMVPANKPAAALGLISAFARGITLEKDTGPLVNIEKNISRTYPQPV-