Monarch geneset OGS2.0

DPOGS206238
TranscriptDPOGS206238-TA2478 bp
ProteinDPOGS206238-PA825 aa
Genomic positionDPSCF300334 + 232927-263036
RNAseq coverage86x (Rank: top 63%)
Annotation
HeliconiusHMEL0112220.078.76% 
BombyxBGIBMGA009737-TA1e-14795.31% 
DrosophilaCG8170-PA6e-15368.11% 
EBI UniRef50UniRef50_A1Z7M79e-15168.11%CG8170, isoform A n=36 Tax=Eukaryota RepID=A1Z7M7_DROME
NCBI RefSeqXP_001870897.17e-15781.21%serine protease [Culex quinquefasciatus]
NCBI nr blastpgi|1700495191e-15581.21%serine protease [Culex quinquefasciatus]
NCBI nr blastxgi|1571093427e-15681.21%serine protease [Aedes aegypti]
Group
Gene OntologyGO:00038243.9e-88catalytic activity
GO:00042524.9e-80serine-type endopeptidase activity
GO:00065084.9e-80proteolysis
KEGG pathway 
InterPro domain[572-823] IPR0090033.9e-88Peptidase cysteine/serine, trypsin-like
[584-818] IPR0012544.9e-80Peptidase S1/S6, chymotrypsin/Hap
[610-625] IPR0013141.7e-10Peptidase S1A, chymotrypsin-type
Orthology groupMCL15927 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206238-TA
ATGTTGAGCGAAAGTCAATGGAGGTTCGTACTGATGATGCTGGCTGTGGGCACGGGCGTCTCCGGGGAGACGAGTCGCGCGCTCAGCATATACCAGAGACAGAGGAGTGTACCCATGTTCGTGCCCAGGAGTTACAGGAGGAGGTTCAGGAGAGTGCAGTTCCTGGACGAAGAAGAGAACAGCCAGCCTCTCGGGCGCGGAGAGATCGACCTGACGGAGAACAAACACTCCAGAATCAAGTTCGATGACACCACGGACTATGACTTCACCTCCGCGGACGTCACGGAGGAGCCGCGGAAGGTAGAAGCCTTGGGCACGGAGAGATTCTACATGCCGCGGATGTATCCCAAGTTCTATCAGGGCAGGTTCGGTGGACCGTACGCCCCGAACTACGCGTTCGAACCCCTGATTTCCTCACCACCGAGAGCGGGGTACTATCCTAACAGTGGCTGGAAGGCGCGCAGTCCTCGCGTAGTGTTCCCTTATTCCCCAGACGGTAACTCAGTGCTGACCAACTCTCACGCGGGACCTAATTTCAATGACAATGTCGTGTTCCGTGAACAAAACTTCGGCACCAGCGACATCGGCGCCGAAGACCAGACGCTCCAGGATATCTCGTCATCCCCAGTCAACAACGACGCCTTCTCTGAGAGAGATGCCAACGATCCACATCGAGCTCAGAGCAACGAAATAGAAAGATTGCGATACATTATAAGATCATACGAACCACGGCGCTATACACTAAAAAAACCAATGTTTGTTCAGACAGAGTTGACCGAGAGAATATTTCAGGAGCACGCGAACGTGAGTTCCGTGCTGCCAGAGTTCAAACCGATATCGGACGAGCAGATACAGAAGATAAACGACATTATAAAAGAAATCACGAGCACTAATAGAGATAGTGGGAGAAGAGGGAGAGTTTTGGACAAGCCTAACCCTGCGCTGATAGAGAGTAAGAAATTAGAGAGACAATATGAGAAAAACTGTCCCCAGGGCGGGACGTGCGAGTTCTTTTTCTATTGCTGGATGGTCGGAGGACTACTGGACGGTTCCTGCGGGAGCTTGCTCAAAGGCTGTTGTCACAGAGTAGCCAAGTCCGGCATCCTCGGGGTACAAGACTCCAACAGCTTAGAATTCACACCCAACGAAGGACTTAGCTATGGTCCGGTTATAAACGATGAGAATGCCAACGATCCACATCGAGCTCAGAGCAACGAAATAGAAAGATTGCGATACATTATAAGATCATACGAACCACGGCGCTATACACTAAAAAAACCAATGTTTGTTCAGACAGAGTTGACCGAGAGAATATTTCAGGAGCACGCGAACGTGAGTTCCGTGCTGCCAGAGTTCAAACCGATATCGGACGAGCAGATACAGAAGATAAACGACATTATAAAAGAAATCACGAGCACTAATAGAGATAGTGGGAGAAGAGGGAGAGTTTTGGACAAGCCTAACCCTGCGCTGATAGAGAGTAAGAAATTAGAGAGACAATATGAGAAAAACTGTCCCCAGGGCGGGACGTGCGAGTTCTTTTTCTATTGCTGGATGGTCGGAGGACTACTGGACGGTTCCTGCGGGAGCTTGCTCAAAGGCTGTTGTCACAGAGTAGCCAAGTCCGGCATCCTCGGGGTACAAGACTCCAACAGCTTAGAATTCACACCCAACGAAGGACTTAGCTATGGTCCGGTTATAAACGATGAGAGCTGTGGCGTGGCGGGGAATAAGCAGACGGCACAACGACGCATCGTCGGAGGTGATGACGCTGGCTTCGGCAGCTTCCCCTGGCAGGCCTACATCAGAATAGGATCCTCAAGGTGTGGGGGGTCCCTGATATCTCGTCGCCACGTTGTCACAGCTGGTCACTGCGTAGCGAGGGCTCAGCCCCGACATGTCCGCGTCACCCTCGGCGACTACGTCATCAACTCCGCGGCGGAACCCTTCCCAGCTTATACCTTCGGCGTCAGATCTATCAAGGTTCACCCGCTCTTCAAGTTCACACCTCAAGCTGATCGTTTTGACGTGGCCGTCCTCACTCTAGACAGAAACGTACAATACATGCCGCATATAGCTCCGATCTGTCTTCCGGAGCGAGGGTCCGACTTCCTGGGTCAGTACGGATGGGCGGCAGGCTGGGGCGCCCTCAGTCCCGGCTCGAGACTTCGCCCTCGGACCCTACAGGCCGTCGACGTGCCCGTCATAGACAACAGAGTGTGCGAACGATGGCATCGAGCTAATGGTATCAACGTGGTGATATATCCCGAGATGCTGTGTGCGGGATACCGCGGAGGCGGCAAGGACAGTTGTCAGGGAGACAGCGGCGGACCGCTCATGCTGGAGCGAGGTGGTCGCTGGACCCTGGTAGGAGTGGTCTCCGCCGGCTATTCCTGCGCCTCCCGCGGCCAGCCGGGCATCTACCACCGAGTGGCGCACACGGTGGACTGGATCTCACATGCCACCACTCTCACGTAG

Protein sequence:

>DPOGS206238-PA
MLSESQWRFVLMMLAVGTGVSGETSRALSIYQRQRSVPMFVPRSYRRRFRRVQFLDEEENSQPLGRGEIDLTENKHSRIKFDDTTDYDFTSADVTEEPRKVEALGTERFYMPRMYPKFYQGRFGGPYAPNYAFEPLISSPPRAGYYPNSGWKARSPRVVFPYSPDGNSVLTNSHAGPNFNDNVVFREQNFGTSDIGAEDQTLQDISSSPVNNDAFSERDANDPHRAQSNEIERLRYIIRSYEPRRYTLKKPMFVQTELTERIFQEHANVSSVLPEFKPISDEQIQKINDIIKEITSTNRDSGRRGRVLDKPNPALIESKKLERQYEKNCPQGGTCEFFFYCWMVGGLLDGSCGSLLKGCCHRVAKSGILGVQDSNSLEFTPNEGLSYGPVINDENANDPHRAQSNEIERLRYIIRSYEPRRYTLKKPMFVQTELTERIFQEHANVSSVLPEFKPISDEQIQKINDIIKEITSTNRDSGRRGRVLDKPNPALIESKKLERQYEKNCPQGGTCEFFFYCWMVGGLLDGSCGSLLKGCCHRVAKSGILGVQDSNSLEFTPNEGLSYGPVINDESCGVAGNKQTAQRRIVGGDDAGFGSFPWQAYIRIGSSRCGGSLISRRHVVTAGHCVARAQPRHVRVTLGDYVINSAAEPFPAYTFGVRSIKVHPLFKFTPQADRFDVAVLTLDRNVQYMPHIAPICLPERGSDFLGQYGWAAGWGALSPGSRLRPRTLQAVDVPVIDNRVCERWHRANGINVVIYPEMLCAGYRGGGKDSCQGDSGGPLMLERGGRWTLVGVVSAGYSCASRGQPGIYHRVAHTVDWISHATTLT-