Monarch geneset OGS2.0

DPOGS214606
Transcript: DPOGS214606-TA (3732 bp)
Protein: DPOGS214606-PA (1243 aa)
Genomic position: DPSCF300050 - 135211-140721
RNAseq coverage: 607x (Rank: top 21%)
Annotation
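Heliconius: HMEL022503 (E-value 0.0, 70.54%)
Bombyx: (none listed)
Drosophila: (none listed)
EBI UniRef50: (none listed)
NCBI RefSeq: (none listed)
NCBI nr blastp: (none listed)
NCBI nr blastx: gi|116491877 (E-value 4e-21, 19.79%), subtilisin-like serine protease [Pediococcus pentosaceus ATCC 25745]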
Group
KEGG pathway: (none listed)
Orthology group: MCL35026 (Lepidoptera specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214606-TA
ATGTTTCAGCGCACATCAAAGTCCCTGTGCGTTTTGAACGTCGGCCGTAACCCCCTGGGCTCGGAGGCCGTGAGGTCTCTAGTGGGGCGAGGGTTGGTGTCCCTGGGCCTACAAGCCGCCAGGCTCGGACCGGACGCGGCCAGGGGACTGGCTGATATTATACGAGGGGGGGAGAGACTACAGGAGTCACAGACGGGCGTCCAGAGCACGGAGGCTGCGACAGTCGCGCGACTGTTGCGCGAAATTCGTGTCGTGTGTCGCGGGAACGAACCCGCGGCCCCCGACAGGTTAATGAGGAAGATCAGCCTCACTTGTCATACAGTCCCTATGATTAAGACTCCTGCCGCTGATGATGATCGTCGCGTCCGCCTCCGTTCCCCAGCGCCGTCTCCGGCCCCCTCGCCGGCTGGTAGCCCCGTACCAACACCTACTGGATCACGATTTTCGGTGACTCGAGTGACCCCCGAGCGGGAGATGTCAGATTCAACCCCCACAACCCCCACGACCCCCACCAGATGTACTTCATCAAGGTTTAAAGTTGTACAGGTGGTGGAACCGCAAGTAGTGATGCCCAGGAAGTCTGTCTCGAGATTCTCAGTTACGAGGAACTATGACAGTACTTACAATCCCACGTTACCACCGACCACGCCATCGCCATCACCATCGCCGTCTCCGACACCGTCACCAGTACCGGACAGGAGCGAGAAGATTGGCCAAACTCCTTTGAAAAAGGTTGGCCCAACCATCGACGGTAGTGGCGCAAAGGTTGGCCATATCGACAAGCAAGCAGCAAATGAACGATCGAATATTGAACACACACAGAGTGTTGACTTTAAAAAGACTAACACAGATAGTAAAGAAACCACGAATAAGAAAGAGACTGAGATGCTAGCGGACTTCAGTTACGACGAGGTCCGCATAAAAGATGTTATATTAAAAGATAAAGATAAAGATAAAGATGTAAAAGATATAGAAGGTAGTTTGATAATTATTGACGATGTCAGAGACGAGGACGAAGAACCGTCGAGTGAAGCGGCTAAGGATTTAGATGTGTGTGATGTGCTTGTGACAAGTGACTTCGGTGTCAAACGCCAAGTTAGTGACGATAGTGTACACGACTCGGACAACGACGTGTTTAGTGATAGCGGGGACTTTAAAAATTTAGATTTAGTGTATAGTGATACCTATAACGGTGATAGGAACGAACAAATGTCTCATAGTGAGACGGTTGGCGGCGGGATAGAGACGGGTGTAGCGAGGTGTATAGGTGTGAGCGAGACGGATAGAGATATGACAATTGAAACAGATAATAGTCTAACCAAAGTACATCGCGACCAAAATGTAAATAATATATATAAGGATGATAAAGTAGCCATAGAGGATGATGGGAAAGGTGTAGATGTTGTAGATTTAAACGTAACCGTTGTACCTGCTCGTGATAGTGTAGTGTTAAAGAAGAATAAGAGTGAATCGAGCCTCGACAGCCCGGACCTGGAGGTGTCGAGGCTGATGCGGAGACCTGTGAGTGCCTTCTGTGATAGCAGCTCGTCGCTGGAGATATCCGGCAGCTCGATGGAGAGTCTGAACACGGACAGACCCAGGCTGATAATCGACAAGCATTTGTCCAAAGACAACAGCGTCGAGTCCACCAGTGAGGTGACACCAGTGAATCTTAACGTATCGATAAGTTCCAACGAGAGCGTCTCGCCTATTATATTCGCTAAGAAGATCCACGGATCGCTATCCAGCCTGGAGGCGAGCGTCAGTTCGGTTGAATCCGCTAAAGAAAAGATAATGGTGACGTCGGCGGATTCAGGGATAGAATATTCGTTACAAAACCCATCCGAAATGAAAGACGACAGTTCGTCCAATGAAGGCACTCTGACGAACTGCAGTTCGAGTTTGAAGGAAACTATGAGGAAGGATTCGCAGGATACCGTGACGCCGAAGCGAACGTCCAGCTTGTTGGATGTACCGGCTCTGAAGTCCAAGGGCTTAGA
ACGGATGAGGAAGATATCCTGGGTAGCACCGTCAGCAAGCTTCCATCTCCCCAAAGCTGAGGAGAAAGTGGAATACAAGCTGCCGGGGAATCTGGAGAAATTGCTCAGCCTCTTCCAACATCCGAGCAGTCTGTTCTCCAGGAGTAGCAGTGACGATGAGAGAAAATCTAACTCCGGGACACCCCCGAGGAAGGATTCGTCTTTAACCAGCTCGTTCTGGTCCTGGGGGAGTGTCGCCGAGAAGAACGACGATGACAGTATATCGGATGCAACAGACTCGACGCTGTCCGAGCGCGTGCAGGTGTCCTTCGTCGACGAATCGTTCTCAAAGAAACTCGACAGCAAAACGCCTTCCACGGACACTGATAACACTCTAAGTGAATTTCAGTTTCCTAATACGGAGAAAGTTACCGTAACCACCGATAAATTAGTACAAAGTTTAGATGTAAGCGACCCGTGCTCGGCCAACCCGAGCGATGATCTCATTGTGCCCAACGATTTTGTATACGATGATAACTTAAAATCTGAAGATAAAGTTGATGTTAAACGAACCTTCGCCTCTGTATTGAAGTCGTCTGGTTCGGAGAATTCATTGGAGAGGCCGAATCCTGACGTGGGACAGACGGTCGAGAAGCTTCCCAGCAAGGTGATCAAAGGCATCAAGGAAAATATAAGCCCGGAGAATACTTTGACGTCCAGTATAGCGACGAAAGCCATGGCTATGGAAGTAGCGGAGAGACAGGCCAAAAATAAACAAATCGTCAACACAGTATGGGAGGTCACGAATCCGTTGACGGAGAAAAGTGATACTAAAGTGACTGAGAAGAAGACTCAAAGTGACTTGGCGCCGATAGCGAATATCGATGAAACTTGTGACGTATCAGCTGATGACGTCATCCAGCTGGCATACATCGATGATAAAGAAGATGGAGCCGATAAGGTTGAGAAGGTCCTGGAGAATATCGACCTGGGGAAGGACGCCTTATCGTATCTGATATATGAAAACCAAGATTACGAGGCGGATACAGAAACTGTGTTGGCGAATAGATCTCAGGAAGGATCTCTGGCTCAAGAACTGAGGGATGCTGAGATAAAGGAGATGCTAGATCTATCACCTGAGTTGGTGTTAGACGAGGCCCTGGAAATACCGGAGATATTCACTGTTGAAATCAAGGGACGAAAAAGCTCTCCAGTCATACCGGAGAGGGCGAAGATGAAGAAGTCCAACTCGCTGGAGGATTTGACGAAGAGACAGAATCTAGAAGAGAAAGAGAGTCCTAAGATGAAGACGATAGCGTTCAAAGTCCCCGAGAGCACCACTCCCAGAGACATACCAGAGAGACGAACGAAATTAAGATCTAGGAGCGGATCCAGTCCTAAATCATTACCGGAGAGCCTGAACAAACCTTGTCCCTTGACGAAGATGGATTCCATATTGAGCAAGAAGAAGAAAAAAGTGTCCTCGCTGGGGAAAATGGCGAAAGACTCGCTGCTAGCGTTGAACATGAGCGAGGAGGAAATCGCCGAGTTCAGACGCTCCTATAAACTGACGTCGGTTGAGAGTCTAAGGTCTTTGGAGTCCGTGTCCGAAGATGCGAACTCACACAGCGGGACCTCATACGATTCGAGATGCCGAGCCTGTCTCCGGACTTCACAAGAGAGTCTCATGTCGCTGGACTCCATCAACGAGGACTGCAGGTGTGCCGATGACGAGAAACGTCACCATAGATAA

Protein sequence:

>DPOGS214606-PA
MFQRTSKSLCVLNVGRNPLGSEAVRSLVGRGLVSLGLQAARLGPDAARGLADIIRGGERLQESQTGVQSTEAATVARLLREIRVVCRGNEPAAPDRLMRKISLTCHTVPMIKTPAADDDRRVRLRSPAPSPAPSPAGSPVPTPTGSRFSVTRVTPEREMSDSTPTTPTTPTRCTSSRFKVVQVVEPQVVMPRKSVSRFSVTRNYDSTYNPTLPPTTPSPSPSPSPTPSPVPDRSEKIGQTPLKKVGPTIDGSGAKVGHIDKQAANERSNIEHTQSVDFKKTNTDSKETTNKKETEMLADFSYDEVRIKDVILKDKDKDKDVKDIEGSLIIIDDVRDEDEEPSSEAAKDLDVCDVLVTSDFGVKRQVSDDSVHDSDNDVFSDSGDFKNLDLVYSDTYNGDRNEQMSHSETVGGGIETGVARCIGVSETDRDMTIETDNSLTKVHRDQNVNNIYKDDKVAIEDDGKGVDVVDLNVTVVPARDSVVLKKNKSESSLDSPDLEVSRLMRRPVSAFCDSSSSLEISGSSMESLNTDRPRLIIDKHLSKDNSVESTSEVTPVNLNVSISSNESVSPIIFAKKIHGSLSSLEASVSSVESAKEKIMVTSADSGIEYSLQNPSEMKDDSSSNEGTLTNCSSSLKETMRKDSQDTVTPKRTSSLLDVPALKSKGLERMRKISWVAPSASFHLPKAEEKVEYKLPGNLEKLLSLFQHPSSLFSRSSSDDERKSNSGTPPRKDSSLTSSFWSWGSVAEKNDDDSISDATDSTLSERVQVSFVDESFSKKLDSKTPSTDTDNTLSEFQFPNTEKVTVTTDKLVQSLDVSDPCSANPSDDLIVPNDFVYDDNLKSEDKVDVKRTFASVLKSSGSENSLERPNPDVGQTVEKLPSKVIKGIKENISPENTLTSSIATKAMAMEVAERQAKNKQIVNTVWEVTNPLTEKSDTKVTEKKTQSDLAPIANIDETCDVSADDVIQLAYIDDKEDGADKVEKVLENIDLGKDALSYLIYENQDYEADTETVLANRSQEGSLAQELRDAEIKEMLDLSPELVLDEALEIPEIFTVEIKGRKSSPVIPERAKMKKSNSLEDLTKRQNLEEKESPKMKTIAFKVPESTTPRDIPERRTKLRSRSGSSPKSLPESLNKPCPLTKMDSILSKKKKKVSSLGKMAKDSLLALNMSEEEIAEFRRSYKLTSVESLRSLESVSEDANSHSGTSYDSRCRACLRTSQESLMSLDSINEDCRCADDEKRHHR-