Monarch geneset OGS2.0

DPOGS201501
TranscriptDPOGS201501-TA2121 bp
ProteinDPOGS201501-PA706 aa
Genomic positionDPSCF300006 + 904448-911609
RNAseq coverage1045x (Rank: top 12%)
Annotation
HeliconiusHMEL0155050.075.33% 
BombyxBGIBMGA002593-TA0.080.81% 
DrosophilaCG5355-PA0.061.47% 
EBI UniRef50UniRef50_Q9VKW50.061.47%CG5355 n=29 Tax=Neoptera RepID=Q9VKW5_DROME
NCBI RefSeqXP_395364.20.064.81%PREDICTED: similar to prolyl endopeptidase isoform 1 [Apis mellifera]
NCBI nr blastpgi|3838570120.065.62%PREDICTED: prolyl endopeptidase-like [Megachile rotundata]
NCBI nr blastxgi|3838570120.065.81%PREDICTED: prolyl endopeptidase-like [Megachile rotundata]
Group
Gene OntologyGO:00042526.5e-118serine-type endopeptidase activity
GO:00700086.5e-118serine-type exopeptidase activity
GO:00065086.5e-109proteolysis
GO:00082363e-60serine-type peptidase activity
KEGG pathway 
InterPro domain[1-703] IPR0024700Peptidase S9A, prolyl oligopeptidase
[5-411] IPR0041066e-146Peptidase S9A/B/C, oligopeptidase, N-terminal beta-propeller
[74-422] IPR0233026.5e-118Peptidase S9A, oligopeptidase, N-terminal
[478-701] IPR0013753e-60Peptidase S9, prolyl oligopeptidase, catalytic domain
Orthology groupMCL11829 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201501-TA
ATGACCTTCGACTATCCAGAGGCGAGGAGGGATGAGACGGTGGTCGACAACTATCATGGTACCAACGTATCAGACCCATACCGCTGGTTGGAGGATCCAGATTCAGACGAGACCAAAGCGTTTATAGAAGCCGAAAACAATATAACTCGTCCGTTCTTAGACTCGTGTCCCGTTAAAACCGATATAAACAGTAGGCTCACAGAGCTCTGGAACTATCCCAAATACTCCTGTCCATTCAGAAGAGGAGATAGATACTTCTTCTTCAAGAACACCGGACTGCAGAATCAGAATGTGCTATACGTCCAAGATAGTTTGGACGGCGAGCCGCGCGTGTTCCTCGATCCGAATACGTTGTCTGAGGACGGTACGATCGCTTTGTCCGGGAGTCGCTTCACCGAGGACGGTTCAACCTTCGCGTACGGTCTGTCAGCCAGCGGATCGGATTGGATAACGATCCATTTGAAGGATGTTGCTACTGGCGTAGACTATCCCGAGGTTTTAGAGAAAGTTAAGTTCGCCTCAATGTCGTGGACAAAGGACAATAAGGGACTCTTTTATTCTCGGTATCCAGAGCAGACCGGCAAGACGGATGGTTCGGAGACGGACGTGAACAGAGATCAGAAGCTGTGCTACCACAGGCTGAACACGCCGCAGGAGGATGACGTCATCGTGGTAGAATTCCCCCAGGAACCTCTGTGGAGGATCGGTGCGGAGGTGTCGGACTGCGGCAGGTATCTCCTCGTGAGTCCGGTGAGAGACTGTCGCGACAACCTGCTGTTCTTCGCCGACCTGTCCTCCGCCTCGCTCACAGGACACCTCCAACTAACACAGATCGTGCACAAGTTCGAAGCCGACTATGAGTACATAACGAACGAGGGTTCCGTATGCATATTCCGGACAAACAAGAACGCACCCAACTACAGACTCATAAAAATCGACCTGAATAACCCAGCTGAGGAAAATTGGGAAACTTTAATAGCGGAACATCCCACTGATGTCCTGGACTGGGCTTCTGCGGTCGACAAAGATAAGTTAGTCATACACTACATAAGGGACGTTAAGAGCGTACTGCAGTTACACAGTATGAAGACGGGTGATTTGATGCAAAACTTCGATTTAGGTGTTGGCTCCATAGTGGGGTTCTCGGGGAAGAAAGAACAGAGCGAAATATTCTATCACTTCATGTCATTCCTTACACCCGGCGTCATCTATCACGTGGACTTCAAGAAACAACCGTACGCACCAACCATATTCAGAGAAGTTAAAGTGAAAGGCTTCGACGCTTCGCAGTATGAAGCCAAACAAGTTTTCTATAGCAGCAAAGATGGCACGAGAGTTCCTATGTTCATAGTATCTAAGAAAGGTTTACCGCGTGATGGGTCCCGCCCGGCGCTGCTCTACGGCTACGGCGGGTTCAACATCAACGTCCAGCCGAGCTTCAGCGTGACGCGGATCGTGTTCATGCAGCACTTCGAAGGTTCCGTAGCGGTTCCGAACATCAGAGGCGGCGGTGAATACGGCGAGCGGTGGCACAACGCCGGCAGACTGCTGAACAAGCAGAATGTCTTCGATGATTTCATATCCGCCGGCGAGTATTTGGTGCGGGAAGGGTACACCAGACCCGGCCTGCTCGCGGTCCAGGGCGGCTCAAACGGCGGGCTGCTGGTTGCAGCGGTCGCAAATCAGCGGCCCGACCTGCTGGGCGCAGCGATCGTTCAAGTCGGAGTGCTGGACATGCTGCGCTTCCAGAAGTTCACCATCGGACACGCCTGGATATCGGACTACGGCAGCTCAGATAATAAGACACATTTCGAAAACCTGCTTAAGTACTCGCCGCTGCACAACATCCAGTCGCCAGATAACGTAAGCCGTGCCGAGTACCCGGCGACGTTGGTGCTAACTGCGGATCACGATGACCGCGTAGTGCCGCTTCATTCCCTCAAGTATATAGCGACATTACAGCACGCTGTTAGAGGCACGCCGCAAAGACGACCGCTGTTAGCACGGATCGACACGAAGGCTGGTCACGGAGGAGGAAAACCGACCGCGAAAATAATCGATGAACACACAGACATCCTGTGCTTCCTCGCTCAAACCCTGGGACTTAAGTTCCTGAAGTGA

Protein sequence:

>DPOGS201501-PA
MTFDYPEARRDETVVDNYHGTNVSDPYRWLEDPDSDETKAFIEAENNITRPFLDSCPVKTDINSRLTELWNYPKYSCPFRRGDRYFFFKNTGLQNQNVLYVQDSLDGEPRVFLDPNTLSEDGTIALSGSRFTEDGSTFAYGLSASGSDWITIHLKDVATGVDYPEVLEKVKFASMSWTKDNKGLFYSRYPEQTGKTDGSETDVNRDQKLCYHRLNTPQEDDVIVVEFPQEPLWRIGAEVSDCGRYLLVSPVRDCRDNLLFFADLSSASLTGHLQLTQIVHKFEADYEYITNEGSVCIFRTNKNAPNYRLIKIDLNNPAEENWETLIAEHPTDVLDWASAVDKDKLVIHYIRDVKSVLQLHSMKTGDLMQNFDLGVGSIVGFSGKKEQSEIFYHFMSFLTPGVIYHVDFKKQPYAPTIFREVKVKGFDASQYEAKQVFYSSKDGTRVPMFIVSKKGLPRDGSRPALLYGYGGFNINVQPSFSVTRIVFMQHFEGSVAVPNIRGGGEYGERWHNAGRLLNKQNVFDDFISAGEYLVREGYTRPGLLAVQGGSNGGLLVAAVANQRPDLLGAAIVQVGVLDMLRFQKFTIGHAWISDYGSSDNKTHFENLLKYSPLHNIQSPDNVSRAEYPATLVLTADHDDRVVPLHSLKYIATLQHAVRGTPQRRPLLARIDTKAGHGGGKPTAKIIDEHTDILCFLAQTLGLKFLK-