Monarch geneset OGS2.0

DPOGS204790
TranscriptDPOGS204790-TA2568 bp
ProteinDPOGS204790-PA855 aa
Genomic positionDPSCF300217 + 127391-140909
RNAseq coverage289x (Rank: top 38%)
Annotation
HeliconiusHMEL0176790.050.06% 
BombyxBGIBMGA009477-TA5e-14561.22% 
DrosophilaCG8560-PA6e-6836.87% 
EBI UniRef50UniRef50_O973897e-13056.87%Carboxypeptidase A n=9 Tax=Obtectomera RepID=O97389_HELAM
NCBI RefSeqNP_648120.32e-6636.87%CG8560 [Drosophila melanogaster]
NCBI nr blastpgi|222188712e-12956.87%carboxypeptidase A [Helicoverpa armigera]
NCBI nr blastxgi|491683921e-13356.77%carboxypeptidase [Helicoverpa armigera]
Group
Gene OntologyGO:00065083.7e-92proteolysis
GO:00082703.7e-92zinc ion binding
GO:00041813.7e-92metallocarboxypeptidase activity
GO:00041803.6e-13carboxypeptidase activity
KEGG pathway 
InterPro domain[124-407] IPR0008343.7e-92Peptidase M14, carboxypeptidase A
[441-530] IPR0090209.5e-16Proteinase inhibitor, propeptide
[445-530] IPR0031463.6e-13Proteinase inhibitor, carboxypeptidase propeptide
Orthology groupMCL17548 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204790-TA
ATGTTCAAGCCACTTTTATTTTTTTTGGCATCAATATATGTTGTCTATGGAAAGCATGAGGAATACGATGGGCACTCCCTGTACCAAGTCAGGGTAGAAAATGCTTACCAAGTGGATCAATTGATGAATGTAGTGGAACGCCTTCTGTTGGACGTGTGGACATACGCCCAACCAGAACAAGATGGACTCATCTTAGTTTCGAAGGAACTGAAACCTCTGTTCGAAGAGGAACTGAAGAGCATTGGTGTAGAGTTCGCAATTGATACAGAAAATATAAAAGATAAGTTAGATCTGGAAGATCAGCTTTTGGCTAACGCTAGTATCGCCGATGCTGGAAGGACACATCGAGGTCTTTCATTTGATGTGATCCACCGTTACGCTGTTGTTGACAGATATCTCTCCGATCTCGCACGGCAATACTCCAACGTGCGAGTTGCATCTGGTGGGAAAAGTGTTGAAGGTCGCGATATCAAATATTTAAGGATCTCAAGTAACAATTTCCAGTCAGGCAACAAGCCTGTTGTGGTTATCCAGTCGTTACTTCACGCCCGTGAATGGGTCACTCTCCCCGTGACTCTATACGCCATCCATAAACTTGTCATCGATGTGACTGAACAAGACCTGATAAGAGACATTGATTGGGTCATCATACCGATTGCTAACCCTGACGGTTACGAGTTCACTCATACTAGGACACGTATGTGGAGAAAGAACCGTAGAACTGGATTTGGAAGGTGTATTGGTGTAGATCTAAACAGAAACTTTGACTTCGCGTGGGGAATCGCATCCAGCAACCAAGCTTGTGCGGACACTTTCCATGGCCCCAAAGCTTTTTCGGAACCAGAGACCCAAATGACTAGAGATATCATCAATAGATACAGATCTCGCATAGAGCTATTCATTGATGTCCATTCATTCGGAAGCTTGATCCTGTACGGTTACGGCAACAGACAGCTGCCTCCTCATTCCAACACTCTCCAGCAGGTGGCTGTTGAAATGGCCCGAAGAATTGATGCCGTTAAATGGCCCGCAAATAGAAATTACAGAGTTGGTAACATCGCCCAAGTTCTATATCAAGCATCAGGTGGATGCAGTGACTATGCTCAGGCTCGGATTGGAAATAAACTGTCATATACATATGAATTACCTGCCTATAGAAACCTTAATAATATGAATGGTTTTCTAGTAGACCCGGCCTTCATCCGTCAAGCCGGATACGAAACATGGGAAGGCATCAACGGCTACAGCGACTGCTCTCCATACCCGTTCAGCGATTGGTCTTATTTCAGCCTCAGCTTCGTTTTGACTTTTTGTTTCGTACACGCAAAACATGAAATGTATGATGGACACTCGCTTTTCAAAATAAAAATTCAAGATGAGAAACAAATGAAATATTTGTACTCCTTAGCAGAAGTTTTGGATTTGGACGTGTGGGTTTTACCATTCCCAGGCAATGATGCTGCAGTGTTGGTCTCTAAAGAAAGCAATGAACCGTTTAGGAGTGAACTCTCTGAATACGGGTTTGAGTTCCATATAGAAACAGATAATATTAAAAGGGCTTTAGATTTAGAAGATACATTAAATGAGAAAGCGCTTCGCAAAAATCACTTCAAATCAGGATATAGAGGGAAAATTGGGTTTGACAGAATTTATAAACTATCTGAAGTCGACTCATACCTCGAAACTCTGGCGGAATCATATCCAAATACTGTCACACTTGTAAATGCTGGAAAATCTTTTGAAGGCAGAGACATAAAGTATCTCAAAATTTCATCAACTAATTTCCAGGATGTAAGAAAACCAATCGTTTTTGTTGAATCACTTCTGCATGCTCGCGAATGGGTGACACTCCCACCAACACTCTGGGCCATTGAGAAACTGGTTGTAAATGTCACAGAATCGGATCTGATAGACACCATAGATTGGATTATATTACCTGTTGCTAACCCAGATGGATATGAATTGACACACAATGAGGACCGGTTCTGGCGAAAGAACCGTGCTACTGGTTTTGTTCCTGGGGACTTTTGTGTCGGTGTAGACCTGAACAGAAACTTCGATATAAAGTGGGGAAGTGGCTCAAGCAGTAGTGTATGCTCCGAAATCTTCCATGGATCCCGTCCATCTTCTGAACCCGAGACAGTCATTATAAGCAGAATCATGGAAGAATATAAAAATCGTATCGATCTCTTCATAGATCTCCACAGCTATGGAAGTATGATTCTTTATGGTTGGGGAAGTGGAGATTTGCTGCCTAATGCATTCTCTTTGCAACAGATAGGTTTAAATATGGCTAAAGCTATTGACGACATGAAATTAATATTTAATGAAAACTATACAGTTGGGAACGCTGGACTTGTTTTATACCCAGCTTCAGGAACAGCCATGGACTATGCGAACCGGTTTAACATTCCATTCTCTTATGTTTATGAGCTACCTGGTTCTATATTTGGAACAGGTCTTTCAGGATTTCTGATTGAACCCGAACTCGTGGAACATTACGCAAAAGAGACCTGGGAAGGAATAAAGACTGGCGCCAGAAATGTACGCGATCTTCTCGGTAGATAG

Protein sequence:

>DPOGS204790-PA
MFKPLLFFLASIYVVYGKHEEYDGHSLYQVRVENAYQVDQLMNVVERLLLDVWTYAQPEQDGLILVSKELKPLFEEELKSIGVEFAIDTENIKDKLDLEDQLLANASIADAGRTHRGLSFDVIHRYAVVDRYLSDLARQYSNVRVASGGKSVEGRDIKYLRISSNNFQSGNKPVVVIQSLLHAREWVTLPVTLYAIHKLVIDVTEQDLIRDIDWVIIPIANPDGYEFTHTRTRMWRKNRRTGFGRCIGVDLNRNFDFAWGIASSNQACADTFHGPKAFSEPETQMTRDIINRYRSRIELFIDVHSFGSLILYGYGNRQLPPHSNTLQQVAVEMARRIDAVKWPANRNYRVGNIAQVLYQASGGCSDYAQARIGNKLSYTYELPAYRNLNNMNGFLVDPAFIRQAGYETWEGINGYSDCSPYPFSDWSYFSLSFVLTFCFVHAKHEMYDGHSLFKIKIQDEKQMKYLYSLAEVLDLDVWVLPFPGNDAAVLVSKESNEPFRSELSEYGFEFHIETDNIKRALDLEDTLNEKALRKNHFKSGYRGKIGFDRIYKLSEVDSYLETLAESYPNTVTLVNAGKSFEGRDIKYLKISSTNFQDVRKPIVFVESLLHAREWVTLPPTLWAIEKLVVNVTESDLIDTIDWIILPVANPDGYELTHNEDRFWRKNRATGFVPGDFCVGVDLNRNFDIKWGSGSSSSVCSEIFHGSRPSSEPETVIISRIMEEYKNRIDLFIDLHSYGSMILYGWGSGDLLPNAFSLQQIGLNMAKAIDDMKLIFNENYTVGNAGLVLYPASGTAMDYANRFNIPFSYVYELPGSIFGTGLSGFLIEPELVEHYAKETWEGIKTGARNVRDLLGR-