Monarch geneset OGS2.0

DPOGS213208
TranscriptDPOGS213208-TA3528 bp
ProteinDPOGS213208-PA1175 aa
Genomic positionDPSCF300114 + 226694-230308
RNAseq coverage166x (Rank: top 51%)
Annotation
HeliconiusHMEL0103020.064.36% 
BombyxBGIBMGA007401-TA3e-9567.73% 
DrosophilaCG12499-PA1e-3022.96% 
EBI UniRef50UniRef50_E2BQV32e-6527.42%Nucleolar pre-ribosomal-associated protein 1 n=3 Tax=Formicidae RepID=E2BQV3_HARSA
NCBI RefSeqXP_971852.29e-6029.21%PREDICTED: similar to GA11665-PA [Tribolium castaneum]
NCBI nr blastpgi|3838589161e-7124.21%PREDICTED: LOW QUALITY PROTEIN: nucleolar pre-ribosomal-associated protein 1-like [Megachile rotundata]
NCBI nr blastxgi|3838589162e-8723.44%PREDICTED: LOW QUALITY PROTEIN: nucleolar pre-ribosomal-associated protein 1-like [Megachile rotundata]
Group
KEGG pathway 
InterPro domain[109-398] IPR0217143.1e-34Ribosome 60S biogenesis N-terminal
Orthology groupMCL13720 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213208-TA
ATGGGCAAAAGAAAATACGAAGATAATTCAACTAGCGACAATAAGAAAACAAAGACCGAAAATAATTCAGAAAATATAAATGACACTGAAAATGATGTTGAGCAACCTCAAAAAGAACAAAATGTTAAGAGCCAGAATAGAAATATTAGTAAAAATGCATTATTTGATATTAAGCATTTCCGTAAAGAGTTAACAGCAAAGCAGGGCCAGACCATGGCATTAACTCAGTTCCTGCAAGTCTGCCTCAATCCAGACAGTGATGCAGACTATATGCTCGAATATTTAAAAGTCGGTGGTAACTCCCATGAGCTTCTCCGACAGATATCTCAAGACAACAAGAAAAACCTGTCACTAGCGACACCTGTATTTCATCTGTTCCACCTGAACATACTTAAGGTCCAGTCCTCTCTTCCCCACATGACATCCATCACAGAGGAAGCTTGCAGATACTTCTTAAATACATTTACACCAACAGTGGAAATTATGATCAGTGAAAGTTCAGGTCCACGGCATAGAAAAATTATCCTCAATCTTTTGACTTCCATGGTGACGTTGAGCTCCGACTTAGGTGTCGAAATATTAAACCAAATACCACTGACTCCAAAGAATTTACAATATATTCTAGAAAAGCCAAATTATAAAGAGAAGGATAATGTCAGAACATGTTTTGTGCGTTTCATGACATCTTTTCTTGTCGAAGGTCATCTGCCATTGATAAAAGCTTTATTGGAAAAACCAGGATTGCTATCTCTGGTGATCCCCGGACTAGTTCAAGACGAAGCAGATGCCGTCTTAATGTTTTTGAACATCCTGAAGAACAATGTGATTGATAATACTTTAATATCAAAAACTTTAAAATTGAAAACATTCAGTCACCAAGTGTTGATCAATATGTTTAAAGTCTACATGTGGAAGGGACCTCCCGATTCAGATTCAGACAAAAGTGACATTAAAGAAGAAATAATGAGGCTGTTGTCTGACATTATACTTACATTATTTACGTCTCACAAACTTGGACTTTATTTTATGGACCCTCAAAACTTGTACAAAGCTCTGCAGCTGTTAAAACGGCCTTGGGAGAATGATGACCAGAGCCAAGTGGTCCTGGAAATTATTTACCGATGTCCGGATCTACACAGAGCTATGGTAAATGTAATCGAACAAAGTTTCCAACCCCAACATTCGCCCATGTGGGGAAAGGCAGTGAATTTTGTTATATCTCTTCTTGATAAATTAAACCCTGATAAAATGGCGACTCGCTTACATAACCTGAGTCCCATCCAAACAGCAAACTTCATAAGATTCATCACCCTGCCCGTGCCATTACTTAAATTAATGAATGCTGCAATCGGAACAGATCTCACCATATCCACACATTGTATTAAAGTCATAGTGAAAATGTTACAATCACTCAGGAGATTTATGCACATACTGGAATTAGAAGATTCAAATGAAAGAATATCAGAACTGAAGAATAAGTTGGAGAATTTTCTTCCTAAACATATGCCATCGTCTAACATTATTGTGTCACTCATAAAGAAAGTATTAGAGGGTTCTGCAACTCCTGATAACGCTGATGCTCAGGATTACAGGCTGCCGAAGCCGGAGGCGGCAGATGCTCTATTAACATTCATTGACACCCTGCTATTGTACAACCACATTTACCCAGCCTCCTTTGAGTCTCTGGAAGGTAATATTGATATGAAGCGTCTGCTAGACTTTTCCATGACACTAACTGAAGGCAATATATCTTTGTTAAAGTTCAAGGTCGTTTCTCTTTGGCTAATATTAGACGGCTCAGCTCTGACCATTAAAAACACAATGTTCAAACAACTGTTCCTGATTATGTTGGATGTGTTCACGAGCGACGAAAACGAAACGTGGTTGGAAGCTAAAGAAACACTGTATATATTTTTTAAGAATACAGAAGTTTTTGAGGCCGATGAAGATGAAATAAATCTCATGTTGTATACTCTACGGAACTCGAAAGTAAATCCGATATCATTGATAGCGGACATCATTGAACACGTTCTGGCCAATGGCCAGGAATTGTCTGAATACGTCAGGAATCAAGTCGTCAACTTCGAGATAACAGACGAATGTAGCGAGGGCAATCTGGATAAACTGTTCAAAGACTTGATGGGCGGGAAGCAGCCCACGGAGAGTGTTTTTCTGGAAAACAAAATACCTTCGCCTTTCATCGTAGGCTGTATGCAGTTTATCCAAAGCAACAGAGACGCTAAAAAGAATTTAAAACAATTTCTCTGTTTGTACGTCGCCAATTTGCTTCATTGTAATAATTCTCCGGAACTCACCGAGGTCTTGATCGGTGACTCGAAGTTAGATATAAGGAGTTACGTTGCCGACTGGACAGTACGACCCGTAGTTATACCCGATAGCACCAGCAAGGATGATACGTTGAAGAAATTATCATACTCTATAATTGAAGCTACTGAAATACCGATAAATGCTATCTTTCCATTCTTATTGGAGACGGACGACGAACACGACTTGAAAGTCCTGGATGTTCCGTATAGAATAGATACAAGGAAAGCTATCAATGGCTCGGATTTGTTTGTCTGGGCGAAATACTTGATGTTCTGTATTATACGACTCTCTAATATGAAAGAGCTTTATGAAGAACAACAGAAGAAAATCGACGGCTACTTCCAAGTGATTATAATGACAGGCAAAAAGCATCTGACGTTGAACATGTGCAGGAATATAATCCTGAATTTGTTTAAGAACGCCCACTCCCTTAAAGTATTCCAGCCTGTCGATTTCCACAAAAATCCATCGAACACTCTGGCTACGAAGTTCATGTTACAAATTCTGGAATCTAATAAAGATGTCATTAATTATCTCAATCAAAAACATAACATCCTAAAGTCGTATCAACAAAAGACATTCAACGAACTAGTGAAGGCATTTGTTAAAGTGAACAAAAGAAAAACTATTGACAGCGAAGTGACTGTGAGAGTGTTGGAGACGATAGGATTGTCTAAGGAAAACGATTTACATCTATTCGATAATATATTTTCAGCGAATACTTTCGCTTGCTTCGGAGAAGACAAAGAACCAACTTTGGTTTTGCAACTGTTAAACATTTTGATAACGAAGTATTCGAAATCCATAGCACAAGAACTACCTCCGGATACGGTAACCAATTGCTTCGTAATGTACACGAAACTCCTGAACCTTAAAGATGTGACTCCTAATTTGACAGATCTCGAGGAGTCGCTCATACAGTTGTTTGAACATAAACCCCACTATACTGCGCACGTCACTCAGGAGGAATTTAGAATTTTTTTCAATGCCAATGCTATCAGAAAGTCAACCTCGACTCTAGCTGCACTGATACTTAAACATCAAATTAAGATGTGCGATGTTTTCGTTGAAGAATTAAACAGACCAGAAGTACTCAGTCAAAGGGAGATCACGTTACCTTTAGGTAACGCTATGATTGATCACGAGCAGTTTCTCTTACAGAACAAAAACGTTCTGGCCAAAATATTCGAAGAGTACTAA

Protein sequence:

>DPOGS213208-PA
MGKRKYEDNSTSDNKKTKTENNSENINDTENDVEQPQKEQNVKSQNRNISKNALFDIKHFRKELTAKQGQTMALTQFLQVCLNPDSDADYMLEYLKVGGNSHELLRQISQDNKKNLSLATPVFHLFHLNILKVQSSLPHMTSITEEACRYFLNTFTPTVEIMISESSGPRHRKIILNLLTSMVTLSSDLGVEILNQIPLTPKNLQYILEKPNYKEKDNVRTCFVRFMTSFLVEGHLPLIKALLEKPGLLSLVIPGLVQDEADAVLMFLNILKNNVIDNTLISKTLKLKTFSHQVLINMFKVYMWKGPPDSDSDKSDIKEEIMRLLSDIILTLFTSHKLGLYFMDPQNLYKALQLLKRPWENDDQSQVVLEIIYRCPDLHRAMVNVIEQSFQPQHSPMWGKAVNFVISLLDKLNPDKMATRLHNLSPIQTANFIRFITLPVPLLKLMNAAIGTDLTISTHCIKVIVKMLQSLRRFMHILELEDSNERISELKNKLENFLPKHMPSSNIIVSLIKKVLEGSATPDNADAQDYRLPKPEAADALLTFIDTLLLYNHIYPASFESLEGNIDMKRLLDFSMTLTEGNISLLKFKVVSLWLILDGSALTIKNTMFKQLFLIMLDVFTSDENETWLEAKETLYIFFKNTEVFEADEDEINLMLYTLRNSKVNPISLIADIIEHVLANGQELSEYVRNQVVNFEITDECSEGNLDKLFKDLMGGKQPTESVFLENKIPSPFIVGCMQFIQSNRDAKKNLKQFLCLYVANLLHCNNSPELTEVLIGDSKLDIRSYVADWTVRPVVIPDSTSKDDTLKKLSYSIIEATEIPINAIFPFLLETDDEHDLKVLDVPYRIDTRKAINGSDLFVWAKYLMFCIIRLSNMKELYEEQQKKIDGYFQVIIMTGKKHLTLNMCRNIILNLFKNAHSLKVFQPVDFHKNPSNTLATKFMLQILESNKDVINYLNQKHNILKSYQQKTFNELVKAFVKVNKRKTIDSEVTVRVLETIGLSKENDLHLFDNIFSANTFACFGEDKEPTLVLQLLNILITKYSKSIAQELPPDTVTNCFVMYTKLLNLKDVTPNLTDLEESLIQLFEHKPHYTAHVTQEEFRIFFNANAIRKSTSTLAALILKHQIKMCDVFVEELNRPEVLSQREITLPLGNAMIDHEQFLLQNKNVLAKIFEEY-