Monarch geneset OGS2.0

DPOGS201318
TranscriptDPOGS201318-TA3303 bp
ProteinDPOGS201318-PA1100 aa
Genomic positionDPSCF300176 + 331157-344985
RNAseq coverage436x (Rank: top 28%)
Annotation
HeliconiusHMEL0172471e-12165.26% 
BombyxBGIBMGA003110-TA8e-10453.77% 
DrosophilaCG4572-PA2e-7136.51% 
EBI UniRef50UniRef50_E2AY017e-9137.03%Probable serine carboxypeptidase CPVL n=10 Tax=Formicidae RepID=E2AY01_CAMFO
NCBI RefSeqNP_001152775.13e-8138.24%venom serine carboxypeptidase [Apis mellifera]
NCBI nr blastpgi|3071686683e-9037.03%Probable serine carboxypeptidase CPVL [Camponotus floridanus]
NCBI nr blastxgi|3071686684e-8937.05%Probable serine carboxypeptidase CPVL [Camponotus floridanus]
Group
Gene OntologyGO:00065089.1e-103proteolysis
GO:00041859.1e-103serine-type carboxypeptidase activity
KEGG pathway 
InterPro domain[603-1024] IPR0015639.1e-103Peptidase S10, serine carboxypeptidase
Orthology groupMCL26488 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201318-TA
ATGGTGAGAACTGTCGTAATTATTGCATTATTTATATGCAGTGGCCGCGCTAAAGTTATCATTCATGAAGATTTGCCAGACATTACGAAGAATCTAGTGCAGAATGATGTAAAGAACGTCCCCTCAATTGTACTAGATCTGCCGGTGCACAAGGAACTGATAAGTATAATAGCTCCAAAAGATACTTCGAAGAGTGTACCAAAATTGAAAACATCTACGAACACAAATGAAAACGGAAACCAGAAAAGTTGTGCGGGCATAAATGATACTATATCAGATAATTTAATCGATTCAGATGAAAATGTACCTTGTGAGATTCCGTTTAAGATCGACAATGGCTCAGTTCTAATTCTTACGCCTTATATTAAAGATGGCCAAGTCGCAGAAGCGCGTAATGCGAGCTCCGTCAACCCTGATCTGTTTCTGGGGTATGAAAGCTATTCAGGTTTTATAACAGTGAATAAAACTTACGATTCGAACATATTCTTCTGGTATTTTCCTGTTCTCAAAAAACCGTATGGAGCCAATTTGGCTGAAACCGTACACCAGTTCCTTGAAATATTCCCTGAACTCAGGTCAGCCCCGTTATACGTAGCTGGAGAATCATATGCTGGCAAATATGTCCCGGCTCTCTCTATGGAACTCCACAAGCAAAAAGATAACCCGGAATTCAAAGTCAATTTAACGGGTATGATGTTGGGCAACGCATATATAGATCCCAGTATGATAGCGCAAGTGTCGTATCCCTTCTATTATTTTGGACTGCTCAGCAAAGAGCAAATTGATATAGTTGACCCGCTCTTAAAGTCCTTTCAACAGGACATAGCATCGAATAACAGCATCGCTGCTAAAAATAAGTGGAACAGCTTGATCGCCGTGTTGTTGTTTCTAACTCATCAAAAGCAGGCTTACAATTTCCTTAAGGACGATATATCTGTAGGCCACTACGTAAATTTTCTCAAAACATCAGAAGTAAAGAGGGCTCTACACGTAGGAGACATAAGATTCTCTTTTGTAAATCAGACTGTGAATTCAAAGATGGCGCCGGATTTCTTGAGCAGTTCAAAGCCCTTGTTTGAAGAACTTTTGGAACATTATAGAGTGCTGATATACTGTGGACATTTGGACCAGATGTTGCCATGTGTGTTCACGTCAGACAATTTCAGGACATGGACATGGAGTGGATCCAAGGAATTTCAAGAAGCAGCCAGATATCCTTATATTTACAAGGCTAAATTGTCTGGCTACCACAAGACAGGAGGTCAGCTGACGGAGGTCGTGGTACGAGGGGCAGGTCACATGGTACCGGTCGACCAGCCCGGACCTATACAGAACCTAGTGGCTCGCTTCACCCACAACAAACCACTCAGCCAGCGCTTTGGACTCCTCGAGGGATCGTTCATACAGGAGTTCATTAAGAACCAGACGGTTGTATATTTTGGACATTTGGACCAGATGTTGCCATGTGTGTTCACGTCAGACAATTTCAGGACATGGACATGGAGTGGATCCAAGGAATTTCAAGAAGCAGCCAGATATCCTTATATTTACAAGGCTAAATTGTCTGGCTACCACAAGACAGGAGGTCAGCTGACGGAGGTCGTGGTACGAGGGGCAGGTCACATGGTACCGGTCGACCAGCCCGGACCTATACAGAACCTAGTGGCTCGCTTCACCCACAACAAACCACTCAGCCAGCGCTTTGGACTCCTCGAGGGATCGTTCATACAGGAGTTCATTAAGAACCAGACGGTTGTATATTTTATAACACTCGCGGCTGTAGCAGATGCCGTACAAATAGATACACCTCTTTTTCTCACCGCTTTCATTAAAGAGAATAAAACTGCGGAGGCGAGAAACGCGTCTCTCGTAAATGCGGACGAATTTCTAAACGTCACAAGTTATTCAGGTTTTTTAACTGTTGACGATAACTATGATTCTAATTTATTCTTCTGGTACTTTCCCGTTGCTAATAAAGATGTAAAGAGAACTCCATGGATAATTTGGCTCCAAGGAGGTCCGGGAGCTACAAGCTTAGCCGGCCTTTTCGACGAAATGGGTCCATTCGAATTGGATAGCAATTTAAATTTAAAAAAACGCAAGTACACGTGGACGGATGACTTCTCTATGGTATACATAGATAATCCCGTGGGAGCGGGTTTCAGTTTCACGAAACATGATGAGGGTTATCCGAACAATATGGATATGTACACCGAAAGCCTATATAGAGCAGTGAATCAGCTGATCGTATTATATCCAGAGTTAAGTGAGGCGCCTCTGTATGTAGCTGGTGAGTCCTATGCTGGGCGGTACGTGCCAGCTTTAGCCGAGAGAATCATGAAAGATAAGGAGAAAGACGGCCACATTAATTTACAGGGTATCATGCTGGGTAATCCTTTACTAGACCGCGAGAGTGTAATTGATTATACTCGAGCGTTCTACTCTTGGGGACTCATAGACGAGCAGGGCGCTCTAGCAGCAGAACCTCTTCAGAAGCAGTTCCAAAAGGAAATCGATGAAGGGAATGCCCAAGAGGCATATAAGCTGCGTGACGAGCTTCTCGATAAGCTCCAAGGTATAGCGGAGCAGTCGTCTCTATACAACGTCATCACACCTATAGAAGGTTTGGAACACTTCATCAATTTCATCACCAGTTCGAAAATCAGGAACTTGATCCACGCCGGGAATGTGACCTTTCACTTTTCAAACGACAAGGTCCATAAACATCTCGTAGCTGATTTCTTGGCCCCCGTTTCCAGTAAAGTCCTAACTGTTCTCGAACACTACAGGGTTCTTATATACTGCGGCCAGTTGGACCTCACGACTCCCTGTGTTCTGAACAGCGAGGCTCGCAGGAAGAGGTGGATGTGGTCTGGGAGGGAAGAGTTTCTTAGATCACCGCGGACACCATGGTGGTTCAATAATACCGTGGCTGGCTTCGTGAAATCAGGCGGAGGCTTCACGGAGGTTCTCGTAAAGGGGGCCGGACATCTAGTACCCAAGGAAAAACCAGCTGAAGCCAAGGCACTAATATCATACTTCATCAATGGAACAGGTCTACCAACACCACCTTCATACAAAATACATCCGGAAGACACTCCATACTACGAGGAGTACTTTGACCTAAAAACATCAGGAGCTGTCCCGGCGGTGGGGCTAAGGGCTGGCTTAATCGCCAGTGTCGTAGTGAACGTTCTGCTGTTAGCTGGTATCGCTTTAGGAGTCTACAAGTTTCTGAAATGGAAGAGAGAATCCGATTATTTCTATTCGCCCTTAAACGACGGCATTTTAACTATGTCGTAG

Protein sequence:

>DPOGS201318-PA
MVRTVVIIALFICSGRAKVIIHEDLPDITKNLVQNDVKNVPSIVLDLPVHKELISIIAPKDTSKSVPKLKTSTNTNENGNQKSCAGINDTISDNLIDSDENVPCEIPFKIDNGSVLILTPYIKDGQVAEARNASSVNPDLFLGYESYSGFITVNKTYDSNIFFWYFPVLKKPYGANLAETVHQFLEIFPELRSAPLYVAGESYAGKYVPALSMELHKQKDNPEFKVNLTGMMLGNAYIDPSMIAQVSYPFYYFGLLSKEQIDIVDPLLKSFQQDIASNNSIAAKNKWNSLIAVLLFLTHQKQAYNFLKDDISVGHYVNFLKTSEVKRALHVGDIRFSFVNQTVNSKMAPDFLSSSKPLFEELLEHYRVLIYCGHLDQMLPCVFTSDNFRTWTWSGSKEFQEAARYPYIYKAKLSGYHKTGGQLTEVVVRGAGHMVPVDQPGPIQNLVARFTHNKPLSQRFGLLEGSFIQEFIKNQTVVYFGHLDQMLPCVFTSDNFRTWTWSGSKEFQEAARYPYIYKAKLSGYHKTGGQLTEVVVRGAGHMVPVDQPGPIQNLVARFTHNKPLSQRFGLLEGSFIQEFIKNQTVVYFITLAAVADAVQIDTPLFLTAFIKENKTAEARNASLVNADEFLNVTSYSGFLTVDDNYDSNLFFWYFPVANKDVKRTPWIIWLQGGPGATSLAGLFDEMGPFELDSNLNLKKRKYTWTDDFSMVYIDNPVGAGFSFTKHDEGYPNNMDMYTESLYRAVNQLIVLYPELSEAPLYVAGESYAGRYVPALAERIMKDKEKDGHINLQGIMLGNPLLDRESVIDYTRAFYSWGLIDEQGALAAEPLQKQFQKEIDEGNAQEAYKLRDELLDKLQGIAEQSSLYNVITPIEGLEHFINFITSSKIRNLIHAGNVTFHFSNDKVHKHLVADFLAPVSSKVLTVLEHYRVLIYCGQLDLTTPCVLNSEARRKRWMWSGREEFLRSPRTPWWFNNTVAGFVKSGGGFTEVLVKGAGHLVPKEKPAEAKALISYFINGTGLPTPPSYKIHPEDTPYYEEYFDLKTSGAVPAVGLRAGLIASVVVNVLLLAGIALGVYKFLKWKRESDYFYSPLNDGILTMS-