Monarch geneset OGS2.0

DPOGS208138
Transcript: DPOGS208138-TA | 5430 bp
Protein: DPOGS208138-PA | 1809 aa
Genomic position: DPSCF300347 - 129562-141605
RNAseq coverage: 10x (Rank: top 84%)
Annotation
Heliconius: HMEL016514 | 0.0 | 55.21%
Bombyx: BGIBMGA014089-TA | 0.0 | 42.99%
Drosophila: ndl-PA | 4e-126 | 37.25%
EBI UniRef50: UniRef50_Q8WSJ2 | 0.0 | 50.36% | Ovarian serine protease n=3 Tax=Bombyx mori RepID=Q8WSJ2_BOMMO
NCBI RefSeq: NP_001037168.1 | 0.0 | 50.36% | ovarian serine protease [Bombyx mori]
NCBI nr blastp: gi|112984438 | 0.0 | 50.36% | ovarian serine protease [Bombyx mori]
NCBI nr blastx: gi|112984438 | 0.0 | 49.94% | ovarian serine protease [Bombyx mori]
Group
Gene Ontology:
GO:0004252 | 9.8e-82 | serine-type endopeptidase activity
GO:0006508 | 9.8e-82 | proteolysis
GO:0003824 | 5e-80 | catalytic activity
GO:0005515 | 1e-09 | protein binding
KEGG pathway:
InterPro domain:
[427-659] IPR001254 | 9.8e-82 | Peptidase S1/S6, chymotrypsin/Hap
[423-664] IPR009003 | 5e-80 | Peptidase cysteine/serine, trypsin-like
[1314-1571] IPR015420 | 9.9e-60 | Peptidase S1A, nudel
[1118-1164] IPR002172 | 1e-09 | Low-density lipoprotein (LDL) receptor class A repeat
Orthology group: MCL16591 | Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208138-TA
ATGGAAACATTGGAAATTGGTAGAAACATGGCTACAAGTACAAATGAAAAGTTAGCTGTAGGGCTCCCGATGGACTTGGCGGAGACAAATAAAGTTCGCAATGGTTTCAAGTGCATACCTGCTGTTCAAAAAGCTCTAGTTATCATCACACTAATTATATTTTCTATCGGTATCTATGTTTTGTTTCTGAGACTTTTAAAAGTCGACGGTGGTTATGGAGATATGTATTCAGATCATCAACTGCAATATCCGCACCACTCAAGGCCCATATCAAATTATAGGTCGCATTATGCCCAAACTCATAATTCAGAAGAACTTTTGACAAATCCTCACCCACAAGACATCGGAATCGCTATGAAATTCTTATCACCAAAATATGACATGACATCCAAAGACTCTGTTGGTTCAAATTCGACATTTAATGAAATAGAATGTCCAGTTGGAACTGTTTCATGTAATAACGGGGCCTTGTGTATAGATGAACACAAATGGTGCGATGGTAATGTTGAATGTGATGACGTCAGCGATGAATCAAAATGTGACTGTAGATCAAGGGTTGATGATTCTAGAATTTGCGATGGCTATTTTGATTGTCCTTTCGGTGAAGATGAAATGGGATGTAATGGTTGTAATGACAACACATTCAGTTGTGAGGATCTAAATGTTAACTCAAAGAACACATGTTTTTCCAAGGAGCAACGTTGTGACAACTTTGCGGATTGTCCAAATCAGAAAGACGAAATTGACTGTAGCTTGTTAGCACCGAGCTTACACAAAAAACCCTTGTTTGCCATTTCAAATACGGAAGGCTTTCTGCACAGAAATTTTAAAGGAAACTGGTATGCTGTCTGTAGTAATCCTTACATGTGGGCACATGATGTGTGTCGTCGGGAAACAGGGCTTATAATAAGGCCTCCTTATATTCAAGTTGTGCAAATAGATCCCTTAATAAAGGTTAAGTATATAAACACGGCGCTGGGAGGATCAATACACACTACAAATACTTGCGGGAACAATTCCGCAGTGTACGTCACGTGTCCTGATTTATTGTGTGGAACCCGAGTGCTATCTTCCTCAGAATTTTTAAGACAAAACGCTAATATGGAAGACAACCTGTTTGGCCGGAATAAAAGATTCCTTTTCCAAGAATCATACCCTGGGATATATTACGGTGACCGAAAAAAGAGATATACAATAAACAACGCCTGGCAATCCCAACCATTCCATTATTTGAGAAAAGATTTGGTTAATGATGTAAGGAACAAAAGATCAGACAGTAGAGTAGTGGGAGGAAAACCTAGTCAACCAGCTGCCTGGCCGTGGGTAGTAGCACTTTATAGAGATGGAATGTTTCATTGTGGAGGCGTTATTGTTAACCAAAATTGGATAATGTCTGCGGCACACTGCGTTAACAAATTTTGGGAACATTACTATGAAGTACAAGTCGGTATGCTCCGTCGGTTTTCATTCTCACCTCAAGAGCAGAACCACCGTGTTACTCACGTAATAGTGAATCAAAATTACAATCAGGAAGATATGAAGAATGACTTATCTTTATTGAGAGTTAAACCTGGCATTCAGTTTAGTCGCTGGGTACGACCTATTTGCTTACCTGGACCTGAAGTGGCTGGTGCTGACTGGATGTGGGGACCTCCTGCTGGTACGACTTGTACAGCTGTAGGCTGGGGAGCAACTGTAGAACGTGGCCCTGATCCGGACCATATGCGTGAGGTAGAAGTTCCTGTATGGGAGCACTGCAAACATGAGGAAGATCAAAGTGGCAGTGAAATGTGCGCAGGTCTTGCTGAGGGTGGTAGAGATTCTTGTCAAGGAGATAGCGGAGGACCACTTCTATGCACTAATCCTGCCAATCCGCAGCAATGGTATGTAGCAGGTATTGTGAGTCATGGCGATGGTTGTGCACGAAAAGGTGAACCAGGGGTTTATACAAGAGTCAGCGTTTTTGTTTCTTGGATACGATACCACATTGCATCAAAAGC
GTTACCGATAATTCAGCCTAAACAAGAATGTCCGGGATTTAGATGTGATTCTGGGATTTCAAAGTGCTTGCCAAAAAAGAGGATGTGTGATAAAATAATAGATTGTTTAGATGGCGAAGATGAACTAAATTGTGAAATAGTGAGATCAGCAGATATTTTTCCAAATAATTTATTTCTCAATCCATTGGCTAAAGTCGCAAATATAACAAACAACCAAGAAACTATCAACATAAGTGATAATGAAAAAAACAATAATCTTCCAAGTAATAATATTATCTTAGTCAATGATACCAATATGAATACGAAATTAAACTCATCTAATTTAAATGAATTTGAAACTACAACATTTTTATCGAATAAAGATATAATAACTGATACAAAAGTATATGACCAATCTACTTTGTTTGTAACCACGGAGAAATCTATCATCAAAAATGATGATAAACTTACAAATATTATACCACTCCCAACTGAAGTAAGTTTAGAACAAAGTTCCCTTTACGATGATATAAAACACGCTTCAACAATAGAAATGCCATCTCCTGATTATTCAGGAGAATCCATAGATGATCTGGATCCAAATGAATCAATTACATTAGATTCAATTATTACAACGACTACAAGTAAATATTTTGATGATATAAATATAAACCTAAACTCTAGATACGCAATAGAGTCTATGTCCTCGATTTTAGATAACCCTCAAAAAGAAGAGAAAATATTCACAAATATTAAGGTGACAGAGAAAACTTTGGTAACATCACCACTTGGCATTTCTAGATTAGATTCGGATATTGATTCTACAACAGTAACTATAAAATCAAGTGATTTTAATACATCCGACGAAAATAATTTAAATAACAATATATTCAATAATACTATTGATCAAAAGGATAACTTAAACTTAGCAGATAATAAATATAATAAGACTATTCCAACGACAAATGATTATGTGCCAACTTATAATGGTAGCAATAATTTTAATGAAAGTACTACTGAACCAAATATAGACGACCAAATAAATTCGGAAACGAATCATGAACAAAACTATTTAACACCCATTAGGCCGATGGATACTACTTACACTAATAGCGATAAAAATGAAGACACGTTTCTGACTGAACTCCAATCAGCAAAAAAGAAAAAATATATACCAACGCCTACTGAGTTTCAATGCAGGCGCATTTATCAAATCGTCCCCCACACGACTCGTTGTGATCATAAAGCAGACTGTGAAGATGGTTCAGATGAACAGGATTGTACGTGTGTTGACTACCTAACAACTTTTGATAATAGACTGTTATGTGATGGACACTTCGATTGTGCCGATGGACAGGACGAAGTGAATTGTTATACATGTGAAGAGGATAAGTTTCTATGTAAACTAAGTGAAATGTGTCTCGATTCAAAGTACGTTTGTGATGGTATACCACAATGTCCCTCAGGCGAAGACGAAATGGACTGCTTTGCTCTTACAAACGGCAATCATATTGAACGTGATATACACGGCAGACCAGAGGCAAAATTGGAGGGTTACTTGACTAAAAAGTATCAAAACAGCTGGCATGTTGTGTGTGAAGACAACATGTCGGTTTCAGAACAAGAAGAAGCTGCTACACATATATGCCGCTATTTGGGATTTAGCTCAGCAAATAAATATGTTATCAAATATATCAATGTGAAACAAAAACTTCATCATATGAAAGATAAAAGGTCGATACGAAATATCGATTTAAGGATGCCTGTTCACTTCAGCTATAGAACAGCTAGTGACAACAATGATTCCACGCATGTAGTCATAAATGAACCTCAAATAATTAAAGAGGAATGTGTTCCTAATATAACGAAAACCTGCATGTCGCTTTATGTTTTTTGCGATCATTCCTTGTACACTCATTTTGATAGCATTGATGAAGTGAACATCAAGAACGAAATAAAGAAGATGTCTGATCAAATGTGGCCATGGATTGCGAAATTATATGTGGACGGAAAATATAAAT
GCACTGGAGTTTTAGTTGATTTGTCTTGGGTTCTAATAAATCACGTATGCCTACCGAGTTCTGAGCTAAGTTATCACTATGTAACAGTTATACTTGGTTCTCACAAAACTCTTAAATCAACTGTTGGACCTTATGAGCAAGTGTATCGAGTTGATGCAAAGAAACATTTATATCAAAGGAAAGTTATGCTTCTGCATCTCAACGAACCCGCTGTATACACATCTATGGTGAAGCCGATGGTAGTGACGTCTCTATATTCCGATGATGCTGATAATACGATATGCGTAGCAGTTGGCCAGGATAGGAATAATAAAATGTCAAGCGTTTTTCTAAAAGAAACTGATAAATGCAATTCCCACAATCGATGTTTCGATCTTTTAGTCAATTCTAGCTATTGTAACTTTGAAGATGCAAAATGGGCCGGTATAATAAGTTGTCACAACAAACGTGGATGGTATCCCGCAGCGTCGTTTGTTAAAGACATGGGAATATGTAAAAATACTGATGGCATAAATGGAACAGACATTGGAAATTTAAAAATTGATATAAAATATTTCGAAGATAAACCATTACCTCTTTCCGATGGGCATTTGTTTACAAATTGCGAAGGAGTCCGGTGTCAAAGAGGGCATTGTGTGGGGTTACAAGATGTATGTAATGGGGTCACGAATTGTGAAGATTCTTCGGATGAATCTAAAGAATCATGTCGGAAAAAACATGATGTTTGTACACAAAATCCATTTTATCGTGGATGTGAATGTCCGGTTGGTCAGTTAAAATGTCATAATGGTCAATGTATACCCAAAGAATTGTTCAAGGACGGCCGCAATGATTGCGGAGATGGCACCGACGAGCCTGGTCAAACTTTATGTTCAGATTACTTGAGGAGGGTTATGCCTTCAAGACTTTGTGACGGAATTCTTCACTGTCACGACAGGAGTGACGAAGATCCCACATTCTGTAAATGTTTCGCAAAAAAGGCGTACAAGTGCACAGGAATGTCGATTGACGAGGACTACTGCGTAGCAACTGACATGGTTTGTGACGGTGTACTTGATTGTCCAAATGGAGATGATGAGCGAACCTGTATAGGTTTGAGCTCGGCTCAGGGAACACCGCACGGCATTGGCGAAGTAATAATACGCTCCCACGGCGTGTGGTATTCGAAATGCTATACCAAACAAAACCATACGAAATCAGAACTAGAAGCTATTTGTAGAGAGTTAGGTTTCATTGGCGGACACGCAAAACAACTGCCAGATCCTAAAGGAATACCAAATCCCTACAACAATATTGTTATCGACATGTTTTCTGATGTAATGCTAAATAATAACACAATAATAAAATTGAGAAACACACCGAATCCTATCGCCCGCGCTGTGACTCAAGATATAAAAGAGTGTTATCCAGTTTTCATAGAATGTCTCTAG

Protein sequence:

>DPOGS208138-PA
METLEIGRNMATSTNEKLAVGLPMDLAETNKVRNGFKCIPAVQKALVIITLIIFSIGIYVLFLRLLKVDGGYGDMYSDHQLQYPHHSRPISNYRSHYAQTHNSEELLTNPHPQDIGIAMKFLSPKYDMTSKDSVGSNSTFNEIECPVGTVSCNNGALCIDEHKWCDGNVECDDVSDESKCDCRSRVDDSRICDGYFDCPFGEDEMGCNGCNDNTFSCEDLNVNSKNTCFSKEQRCDNFADCPNQKDEIDCSLLAPSLHKKPLFAISNTEGFLHRNFKGNWYAVCSNPYMWAHDVCRRETGLIIRPPYIQVVQIDPLIKVKYINTALGGSIHTTNTCGNNSAVYVTCPDLLCGTRVLSSSEFLRQNANMEDNLFGRNKRFLFQESYPGIYYGDRKKRYTINNAWQSQPFHYLRKDLVNDVRNKRSDSRVVGGKPSQPAAWPWVVALYRDGMFHCGGVIVNQNWIMSAAHCVNKFWEHYYEVQVGMLRRFSFSPQEQNHRVTHVIVNQNYNQEDMKNDLSLLRVKPGIQFSRWVRPICLPGPEVAGADWMWGPPAGTTCTAVGWGATVERGPDPDHMREVEVPVWEHCKHEEDQSGSEMCAGLAEGGRDSCQGDSGGPLLCTNPANPQQWYVAGIVSHGDGCARKGEPGVYTRVSVFVSWIRYHIASKALPIIQPKQECPGFRCDSGISKCLPKKRMCDKIIDCLDGEDELNCEIVRSADIFPNNLFLNPLAKVANITNNQETINISDNEKNNNLPSNNIILVNDTNMNTKLNSSNLNEFETTTFLSNKDIITDTKVYDQSTLFVTTEKSIIKNDDKLTNIIPLPTEVSLEQSSLYDDIKHASTIEMPSPDYSGESIDDLDPNESITLDSIITTTTSKYFDDININLNSRYAIESMSSILDNPQKEEKIFTNIKVTEKTLVTSPLGISRLDSDIDSTTVTIKSSDFNTSDENNLNNNIFNNTIDQKDNLNLADNKYNKTIPTTNDYVPTYNGSNNFNESTTEPNIDDQINSETNHEQNYLTPIRPMDTTYTNSDKNEDTFLTELQSAKKKKYIPTPTEFQCRRIYQIVPHTTRCDHKADCEDGSDEQDCTCVDYLTTFDNRLLCDGHFDCADGQDEVNCYTCEEDKFLCKLSEMCLDSKYVCDGIPQCPSGEDEMDCFALTNGNHIERDIHGRPEAKLEGYLTKKYQNSWHVVCEDNMSVSEQEEAATHICRYLGFSSANKYVIKYINVKQKLHHMKDKRSIRNIDLRMPVHFSYRTASDNNDSTHVVINEPQIIKEECVPNITKTCMSLYVFCDHSLYTHFDSIDEVNIKNEIKKMSDQMWPWIAKLYVDGKYKCTGVLVDLSWVLINHVCLPSSELSYHYVTVILGSHKTLKSTVGPYEQVYRVDAKKHLYQRKVMLLHLNEPAVYTSMVKPMVVTSLYSDDADNTICVAVGQDRNNKMSSVFLKETDKCNSHNRCFDLLVNSSYCNFEDAKWAGIISCHNKRGWYPAASFVKDMGICKNTDGINGTDIGNLKIDIKYFEDKPLPLSDGHLFTNCEGVRCQRGHCVGLQDVCNGVTNCEDSSDESKESCRKKHDVCTQNPFYRGCECPVGQLKCHNGQCIPKELFKDGRNDCGDGTDEPGQTLCSDYLRRVMPSRLCDGILHCHDRSDEDPTFCKCFAKKAYKCTGMSIDEDYCVATDMVCDGVLDCPNGDDERTCIGLSSAQGTPHGIGEVIIRSHGVWYSKCYTKQNHTKSELEAICRELGFIGGHAKQLPDPKGIPNPYNNIVIDMFSDVMLNNNTIIKLRNTPNPIARAVTQDIKECYPVFIECL-