Monarch geneset OGS2.0

DPOGS201905
TranscriptDPOGS201905-TA1644 bp
ProteinDPOGS201905-PA547 aa
Genomic positionDPSCF300191 + 602701-607218
RNAseq coverage1136x (Rank: top 11%)
Annotation
HeliconiusHMEL0140250.092.27% 
BombyxBGIBMGA006085-TA0.087.37% 
Drosophilanop5-PA0.077.70% 
EBI UniRef50UniRef50_E2B2155e-17665.54%Nucleolar protein 5 n=10 Tax=Eukaryota RepID=E2B215_CAMFO
NCBI RefSeqXP_395309.30.077.21%PREDICTED: similar to nop5 CG10206-PA [Apis mellifera]
NCBI nr blastpgi|1954716330.078.02%GE18393 [Drosophila yakuba]
NCBI nr blastxgi|910789000.070.14%PREDICTED: similar to nop5 CG10206-PA [Tribolium castaneum]
Group
KEGG pathwaynve:NEMVE_v1g1931358e-29 
 K12844 (PRPF31)maps-> Spliceosome
InterPro domain[254-401] IPR0026871.2e-57Pre-mRNA processing ribonucleoprotein, snoRNA-binding domain
[162-214] IPR0129761.1e-29NOSIC
[1-66] IPR0129743.6e-21NOP5, N-terminal
Orthology groupMCL13607 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201905-TA
ATGTTGGTGCTATTCGAAACGCCGGCGGGATACGCTATATTTAAGTTACTTGATGAGTCAAAATTATCACAAATAGATGATCTGTACCAGGAGTTCAACACGCCTGAAGGGGCTTCATCAGTAGTGAAACTGAAGAATTTTATTAAATTTGAAGACACTACGGAAGCTTTAGCAGCAACTACCGCCGCTATTGAAGGCAAGCTTTCAAAGACATTGAAGAAAGGTTTGAAGAAGCATCTGCTCAAAGATGTTCAGGATCAGCTGTTAGTCGGGGACGCCAAGCTAGGCAGTGCCATCAAAGAGAAGTTTGACTTACAATGTGTATCAAATTCAAATGTACAAGAATTGTTGAGATGTATCCGTTCCCAAATGGACAGTTTGCTGACGGGCCTGCCCAAGAAAGAAATGACAGCTATGGCTTTAGGTCTTGCCCATTCACTGTCCAGATATAAACTGAAATTCTCTCCAGACAAAATTGATACTATGATAGTACAAGCTCAATGTCTATTGGATGATTTGGATAAGGAATTGAACAACTACGTCATGAGATGCCGGGAATGGTACGGCTGGCACTTCCCGGAGCTCGGGAAGATTATAACAGACAATACTTCATTCGTGAAGATCGTTAAGCTCATGGGTACCCGAGATCACGCGGCCACGACTGATATGTCGGACATTCTTCCGGAAGATCTGGAAGAGAAAGTCAAAGAAGCAGCCGAGATATCCATGGGAACGGAGATCTCTGATGACGATATTATTAATATACAGAACTTATGTGATGAGATTGTATCTATCACGGACTATAGAGCACATCTGACGGACTATTTAAAGGCGAGGATGATGGCCATGGCACCGAACCTGACAGTTCTGATCGGGGAGCATATAGGAGCTAGACTAATAGCCCACGCTGGTTCATTAATGAATCTAGCTAAACATCCGGCTTCCACTTTGCAAATATTCGGTGCTGAGAAAGCTTTATTCAGAGCTTTGAAGACTAAGAAGGATACTCCAAAGTACGGTCTCATATACCACGCTCAGCTGATTGGACAATGTAGCACCAAAAACAAGGGCAAAATGTCGAGAATGTTGGCGGCCAAGGCGGCGCTGGCGACACGTGTCGACGCCTTCGGTGATGATGTGACATTCGAGTTGGGAGCGAAACACAAAGTGAATCTTGAGAATAAGCTGCGGTTACTAGAAGAAGGTAACTTGAGGAGAATCAGCGGCACGGGCAAGGCGAAGGCCAAGTTCGAGAAATATCACAGTAAATCTGAAGTGTTCTCGTACCCGACGGCAGCGGACAGCACCTTGAAGGCAGTGAAGAGGGAACACGAGCCGGAAGAAGAAGCCGCTCCGGCCAAGAAGATGAAGCTGGAGAACGATGTCAAAGTAAAGAAAGTGAAATCAGAACCGTCAGATGAAGTGGACGGACAAGAGAACGGTGATTCAGAGCTGACTGAGAAGAAGAAGAAGAAAAAGAAGAAGTCCATGGAACCAGAACTGGCACAGGCTGGAGAACAATCTCCGGTCAGTGAGAAGAAAAAGAAAAAGAGCATGGAACCAGAACTAGCACAGAGCGAAGAAGCACCCGCGAGCGAAAAGAAGAAAAAGAAGAAAAGACAATCTCAGCCCCAAGAAGAATGA

Protein sequence:

>DPOGS201905-PA
MLVLFETPAGYAIFKLLDESKLSQIDDLYQEFNTPEGASSVVKLKNFIKFEDTTEALAATTAAIEGKLSKTLKKGLKKHLLKDVQDQLLVGDAKLGSAIKEKFDLQCVSNSNVQELLRCIRSQMDSLLTGLPKKEMTAMALGLAHSLSRYKLKFSPDKIDTMIVQAQCLLDDLDKELNNYVMRCREWYGWHFPELGKIITDNTSFVKIVKLMGTRDHAATTDMSDILPEDLEEKVKEAAEISMGTEISDDDIINIQNLCDEIVSITDYRAHLTDYLKARMMAMAPNLTVLIGEHIGARLIAHAGSLMNLAKHPASTLQIFGAEKALFRALKTKKDTPKYGLIYHAQLIGQCSTKNKGKMSRMLAAKAALATRVDAFGDDVTFELGAKHKVNLENKLRLLEEGNLRRISGTGKAKAKFEKYHSKSEVFSYPTAADSTLKAVKREHEPEEEAAPAKKMKLENDVKVKKVKSEPSDEVDGQENGDSELTEKKKKKKKKSMEPELAQAGEQSPVSEKKKKKSMEPELAQSEEAPASEKKKKKKRQSQPQEE-