Monarch geneset OGS2.0

DPOGS203454
TranscriptDPOGS203454-TA1992 bp
ProteinDPOGS203454-PA663 aa
Genomic positionDPSCF300242 + 195675-201208
RNAseq coverage1428x (Rank: top 9%)
Annotation
HeliconiusHMEL0150257e-16050.91% 
BombyxBGIBMGA011103-TA8e-5172.44% 
DrosophilaNnp-1-PA6e-5132.75% 
EBI UniRef50UniRef50_UPI00022C9E298e-7029.26%UPI00022C9E29 related cluster n=2 Tax=unknown RepID=UPI00022C9E29
NCBI RefSeqXP_973402.13e-5937.30%PREDICTED: similar to AGAP002684-PA [Tribolium castaneum]
NCBI nr blastpgi|3504144963e-6929.26%PREDICTED: ribosomal RNA processing protein 1 homolog [Bombus impatiens]
NCBI nr blastxgi|3504144961e-7430.59%PREDICTED: ribosomal RNA processing protein 1 homolog [Bombus impatiens]
Group
Gene OntologyGO:00306883.5e-37preribosome, small subunit precursor
GO:00063643.5e-37rRNA processing
KEGG pathway 
InterPro domain[33-220] IPR0103013.5e-37Nucleolar, Nop52
Orthology groupMCL15178 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203454-TA
ATGAAACGACTGGAAAAGAAAAAGGTTAAAAATCCAAAACATAAAAAGAAATCTATAATAAAACCTAAAAAAGAGAAAGTACTCGTTGTGGCGCAGGAATTTAAATTTGCGCGTCTGCTTTCTGGAAACGAGAAAAAAACTAGAGATCGTGTTTTGAAAACATTGAAAAAATGGCTTTTGAACTGTTTTGAGAAAGGATATGAGTTCAAGGAAGATGACTTCATCCGAGTATGGAAAGGTTTATTTTATGCCGTGTGGATGTCTGACAAGCCTCTAGTACAGATCCTCGACCTGTTCCCTGTGGAACAGATACACCATGCTATCCTGATGATGAAGGCGGGATTTAGAGTGTTAGCCACCGAGTGGTACGGACTGGATCAACACAGGATGGATAAGTTCCTTATGCTGGTCCGACGCTACCTCCGCGGCGGCTGGCGCTGTCTGAGGAGAGCCGACTGGAGTCTGGACTCGTGTCACAGATTTGTTGACATGCTCACGGGACAGGACGGCCTGTTTGCTGTGAAGACTCCATCCTTTGCCAGGAACTCCCTGTCCCTGATGCTGCATGTCATTGACTGCTACCTGGAGGAGTTGTCCAAGGTATCCGAAGGTTCCATCCCCCTGGCCAGCATGGCGTGCCTCCTCCAGCCGTTCACGCTCCACGTGAGGTGTGACACTCCGCTCAGTGTGCCGGCCAGGAGACTCCTCACGCAGCTCATCACACAGACCGACCTGGGCTTGGAGTACGAACAGAAGGCTCGGGCCTGGAAAAAGATGGGTTGTCCCCCCGGAGGACCGGATGCCTTGGAGCTGGTGGAAGACGAGGATGATGAGGAACATGAGGATGACGAGCAGGAGGAGGTCCTGGAGGACGGGCCTGGGCCGCTGGACCCTCGCGCAGGTCGTGTGAGCGTGCTGCTGACGCCGCTGCCAGTGCCGGGAGCCGAGCTAGCGGACCAGCTGAGAGGAGCCGCGACCTCGCACCGTGCGAAGAGACGAGCAGAGATATGTGCTGAGAGGTTCTTAGCTCTATCATCATCAGAATATCCCCTGAAGGTTCCCGAGGCGGACATCCCGGAGGAGGTGCCTAAGGGCGTGTCCCCTGGCAAGGCAGCCAAGAGTCTGGTGAGGCTGGAGAGGAGCCTCGGCGCCACCAAGGACGAACTAGCTCTCAAAGCAATGAGTAAAAAACATCGCAAGAAGTTACTGGCGAAGGCGCGAGCTGGTCTCAGTATAGTGGAGGAAGTGACGACAGGATCCAGGGACGGGGAGTGGCAAGTTGAAGAAGCGGAAGATACAGAACACACCAAGGACGAGGACAAAGAAAACCTGAAGAAGAAAAAGAATAGAAAAAGAAAGATGAAGAACGACCAAGAAGACGGAAACAAGAAACGGAGAGTGACCGGAGATGACGTCATAGAGACAAGTGAAAGCAACCATAAGAATAGTATCACAGAAATTAAGAAAAAGAAACGAGATAAAGCAAAGATAAAGACAAAAAAAAATAATGAAAGAAAAGTCAGCGGAGGTGACGGAAGCAAGGCGGTGATCAGTATTAAGCCAATAAATAAAGAAAATAATACAAAAACACAAAAAGATAGAATAGAAAACAAAGAGCGGGCTGAGAGTGTAGCGAGTGATGAGAGGAAGAATGAAAAGATGAGTAAAGATAAGGAAAAGAAACAGGAAAACAAGAGAAACGAGAAGGACAGTGAGAGAGTTACACCGAGTAAGGTCAAGAACTATCAGAAGGCGACTCAAGCGAAGAAAGACAGTCCCAAGAAGGTGGTGACCTTCGACACGCCCAAGAAGGTGAAGTTCGTGTTAAAGAAGAACAGTATGCAACTGCCCGGGGAGTACTACAGGAGCGTGCAGAGAAGCCCCGACATCCCTTACGACGGGACCAGGCAGCCCGCCAAGACCAACCTCAAGCCCTCCACGCCCTCACCCATCAACCCGTTCTTTAAGAAGTACAAGCTGAAGATCAAGTGA

Protein sequence:

>DPOGS203454-PA
MKRLEKKKVKNPKHKKKSIIKPKKEKVLVVAQEFKFARLLSGNEKKTRDRVLKTLKKWLLNCFEKGYEFKEDDFIRVWKGLFYAVWMSDKPLVQILDLFPVEQIHHAILMMKAGFRVLATEWYGLDQHRMDKFLMLVRRYLRGGWRCLRRADWSLDSCHRFVDMLTGQDGLFAVKTPSFARNSLSLMLHVIDCYLEELSKVSEGSIPLASMACLLQPFTLHVRCDTPLSVPARRLLTQLITQTDLGLEYEQKARAWKKMGCPPGGPDALELVEDEDDEEHEDDEQEEVLEDGPGPLDPRAGRVSVLLTPLPVPGAELADQLRGAATSHRAKRRAEICAERFLALSSSEYPLKVPEADIPEEVPKGVSPGKAAKSLVRLERSLGATKDELALKAMSKKHRKKLLAKARAGLSIVEEVTTGSRDGEWQVEEAEDTEHTKDEDKENLKKKKNRKRKMKNDQEDGNKKRRVTGDDVIETSESNHKNSITEIKKKKRDKAKIKTKKNNERKVSGGDGSKAVISIKPINKENNTKTQKDRIENKERAESVASDERKNEKMSKDKEKKQENKRNEKDSERVTPSKVKNYQKATQAKKDSPKKVVTFDTPKKVKFVLKKNSMQLPGEYYRSVQRSPDIPYDGTRQPAKTNLKPSTPSPINPFFKKYKLKIK-