Monarch geneset OGS2.0

DPOGS206562
TranscriptDPOGS206562-TA2784 bp
ProteinDPOGS206562-PA927 aa
Genomic positionDPSCF300108 - 643626-658220
RNAseq coverage654x (Rank: top 20%)
Annotation
HeliconiusHMEL0043697e-13857.08% 
BombyxBGIBMGA013746-TA2e-7848.97% 
Drosophilaea-PA6e-5838.63% 
EBI UniRef50UniRef50_Q49QW03e-12553.46%Prophenol oxidase activating enzyme 3 n=5 Tax=Obtectomera RepID=Q49QW0_SPOLT
NCBI RefSeqXP_312744.31e-10036.35%CLIP-domain serine protease subfamily B (AGAP003058-PA) [Anopheles gambiae str. PEST]
NCBI nr blastpgi|567183901e-12453.46%prophenol oxidase activating enzyme 3 [Spodoptera litura]
NCBI nr blastxgi|567183903e-12853.46%prophenol oxidase activating enzyme 3 [Spodoptera litura]
Group
Gene OntologyGO:00038247e-81catalytic activity
GO:00042527.2e-77serine-type endopeptidase activity
GO:00065087.2e-77proteolysis
KEGG pathway 
InterPro domain[656-926] IPR0090037e-81Peptidase cysteine/serine, trypsin-like
[665-921] IPR0012547.2e-77Peptidase S1/S6, chymotrypsin/Hap
[518-570] IPR0227002.6e-15Proteinase, regulatory CLIP domain
[518-571] IPR0066047e-14Disulphide knot CLIP
[299-314] IPR0013141.8e-12Peptidase S1A, chymotrypsin-type
Orthology groupMCL21029 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206562-TA
ATGGGAGGAGAGCTTGACTGTGCTGATGAAGTGCTCGTGATACCAATAGAGAAGATCATGGCGCATGAGGAGCATTGCGATTTATATGGCGCCAACTGCACATCAATCCACGATTGTGATGTACTGAAAAACTTGATTCATAAATCACCAAAACTGAGAGCTAGGAGCGTCGAATACATTTGCGGTTTTGACGGCGATGTAGCAAGGGTTTGCTGTCCGAGTACCCCGAGTGACGAATTCTTTATGGATTCACTTACAACTACAGACGGTGATTACTACGGTGAGGAGTACGATGAGAGCATGCAAAGCAATGAGCTGTCAGAAGCTAAATGTGGTATGGGAGTTAAGTGTGTATCAATTGAAGACTGCGATGTATTAAAGGAACTTATACATAAATCATCGCGAGATAGACTCACTGTTACCAAATTTCATTGCGGCTACGATGGAGATGTGCCCAGGGTTTGTTGCCCGCCGTCTAATATTAAAAGATGTCCGACACTAGAAGGTACTATAGGGACTTGCGTTAGTCTAGAAAACTGTCCTTATCTCGTGAAACGAAAGAGTACTGAGAAAGCCCAGGATATAGAATACGTTCAACGATCAAGGTGCCCCGGACCTAAAGTGTTGAGTGTTTGTTGCGACCTTTCCCTAAACATATCTAATATCGACACACACGTCCTTCCGGACCCCATGATAGGTGGTGTAGACAACACAGACAACAGATTCTTAACCGATTGCGCCGTCCTGAACGAAACCTTAGGTGTTATGCCATCAACGAGCGAGTGTTGTGGCATTGAAGCTAATAGTGGAAACAGAATTATTGGTGGCACAGAGACAGCTATAGACGAATATCCATGGCTGGCGATGCTTGAGTATGAACTACCATACTTGTGCGGTGGAACTTTGATAAGTAGACGTCACGTATTAACAGCGGCACATTGTCTTGTTGGTGACACCATGGCACCCAGAGGAGTGAAATCTGTACGACTTGGTGAATACGATACAGAGAATGATGGTCCAGACTGCGTGGAAGTCGAAGGTGGCGGTGATGATTGCACTGAGGGGGCCATTACATTTAAAATTAACAAAATCATCGTACATCCTGGGTATAACAGTGAGCTTGAGGTACTGAAAGCTAACGACATAGGTCTATTGAAGCTAGATGGAACTGTGCCATACAACGATTTCATTCGTCCCATCTGTTTGCCGAAGGCTGATCTGTATGAGATGCCCATATCTCCAACTCTTCGTTTCCATGTAGCCGGTTGGGGGGCTGTTAGTGAGACTAAAGAGAGGAGCAGAGTTAAGCTTCAAGTCGATCTACCTGTTATCAAGCAGGAGGAATGTAAAAAATTATATAACTTGAACGAAAAACCACTAGTGTGGAATAAGCAATTCTGCGCTGGTGGTGAACCAGATAAGGATACGTGTAGAGGAGATTCAGGTGGTCCACTCACATACGTGGACCCTGTCAAAAAAATCAACGAAATCATCGGCATCACCAGCTTCGGCATATTGAAATGTGGAACTCAAAGACGACCCAAAACCAATTGCACGAGTCCAGATGGTAAACGTGGTGTGTGTATTCCGTTAAATGACTGCAAATCAATCTTAGATATACTCGAAAAGAAGGAAATGACGGCCGGAGAAAAAAATTTCTTGAGATATTCAAGATGTGGACCCGTAGATAGTAAAATTTCGGTTTGTTGCGTAAAGGATACGTTGGATAATGCGTGTTTCAACGCTGACGCGATGCAAGGCGTGTGCATCGATGTTCGGTCATGTCCGTCCATCATTAAGCTACTTCAGCCTCCGGTGCCACAAGGCAGTCTGGATTTCATTAAAAACTCCAGGTGTTTAGGCAAAACCTCTCACAGCATATGCTGTGGACCAGATCATGTCGAGAAGGTCCCTATACGTATTTGTAATCAGTCAGCAGCCCCACCCGACGTGAGAACGGAATGCTGTGGTGTGGATCCTTCAACAGGGAATAAGATTTTTGGTGGGAACGCAACAGCGATAGATCAGTACCCATGGTTGGCTCTCATAGAGTACAGAGACAAAAACAATAAAATAAAACTACTCTGTGGAGCTGCTCTTATAAGTTCCAAGTACGTGTTGACAGCTGGACATTGCGTCATCGGACCAGTTCTGAACTCTGGAAAGCCAGAAAATATTAGGCTCGGAGAGTATGACACATCAAATTCTGAATCGGATTGCGTTGAAGGAGAGGGAGGAGGGGTTGACTGTGCTGATGAAGTGCTCGTGATACCAATTGAGAAGATTATAGCGCATGAGGGTTACGACCCAATGTCTCCCTTAAGAAGAAATGATATTGCTCTTATAAGGATGGCTACATCTGCTCCTTACACGGGTTTCATTCAACCAATCTGTCTTCCAACAACCGACGTAACATTATCCAAAGATAATTTAGTCTTCACAGCCGCTGGCTGGGGAGCTGTGTCAACCGAACAAAGTACAAGCAATGTTAAAATGCACGTCGATTTACCATTAAAGGGAGACGAGGAATGTCAAAAGGCCTACAACGTTTCCACCAGAAAATTGCAACTATGGAATCGTCAGTTGTGTGCTGGAGGTGTGAAGGGGAAGGACACCTGCAGAGGAGACTCCGGTGGGCCTCTGATGTATGACAACGGCAGGACTTACTCCGTCATAGGAGTCGTCAGCTTTGGCCCCTCGCCGTGCGGCTTAGAGAACGTGCCGGGAGTTTATACCAAGGTCTACGAATATCTGCCTTGGATAAGGACCAACATTAAACCCTGA

Protein sequence:

>DPOGS206562-PA
MGGELDCADEVLVIPIEKIMAHEEHCDLYGANCTSIHDCDVLKNLIHKSPKLRARSVEYICGFDGDVARVCCPSTPSDEFFMDSLTTTDGDYYGEEYDESMQSNELSEAKCGMGVKCVSIEDCDVLKELIHKSSRDRLTVTKFHCGYDGDVPRVCCPPSNIKRCPTLEGTIGTCVSLENCPYLVKRKSTEKAQDIEYVQRSRCPGPKVLSVCCDLSLNISNIDTHVLPDPMIGGVDNTDNRFLTDCAVLNETLGVMPSTSECCGIEANSGNRIIGGTETAIDEYPWLAMLEYELPYLCGGTLISRRHVLTAAHCLVGDTMAPRGVKSVRLGEYDTENDGPDCVEVEGGGDDCTEGAITFKINKIIVHPGYNSELEVLKANDIGLLKLDGTVPYNDFIRPICLPKADLYEMPISPTLRFHVAGWGAVSETKERSRVKLQVDLPVIKQEECKKLYNLNEKPLVWNKQFCAGGEPDKDTCRGDSGGPLTYVDPVKKINEIIGITSFGILKCGTQRRPKTNCTSPDGKRGVCIPLNDCKSILDILEKKEMTAGEKNFLRYSRCGPVDSKISVCCVKDTLDNACFNADAMQGVCIDVRSCPSIIKLLQPPVPQGSLDFIKNSRCLGKTSHSICCGPDHVEKVPIRICNQSAAPPDVRTECCGVDPSTGNKIFGGNATAIDQYPWLALIEYRDKNNKIKLLCGAALISSKYVLTAGHCVIGPVLNSGKPENIRLGEYDTSNSESDCVEGEGGGVDCADEVLVIPIEKIIAHEGYDPMSPLRRNDIALIRMATSAPYTGFIQPICLPTTDVTLSKDNLVFTAAGWGAVSTEQSTSNVKMHVDLPLKGDEECQKAYNVSTRKLQLWNRQLCAGGVKGKDTCRGDSGGPLMYDNGRTYSVIGVVSFGPSPCGLENVPGVYTKVYEYLPWIRTNIKP-