Monarch geneset OGS2.0

DPOGS209485
TranscriptDPOGS209485-TA1296 bp
ProteinDPOGS209485-PA431 aa
Genomic positionDPSCF300127 - 487535-490635
RNAseq coverage905x (Rank: top 14%)
Annotation
HeliconiusHMEL0162642e-14971.33% 
BombyxBGIBMGA007440-TA3e-11759.42% 
DrosophilaCG10341-PA4e-4550.00% 
EBI UniRef50UniRef50_UPI0002060A583e-4347.16%UPI0002060A58 related cluster n=1 Tax=unknown RepID=UPI0002060A58
NCBI RefSeqXP_001961640.17e-4949.15%GF15068 [Drosophila ananassae]
NCBI nr blastpgi|1947587871e-4749.15%GF15068 [Drosophila ananassae]
NCBI nr blastxgi|1953879869e-5633.97%GJ17682 [Drosophila virilis]
Group
Gene OntologyGO:00311203.2e-33snRNA pseudouridine synthesis
GO:00305153.2e-33snoRNA binding
GO:00422543.2e-33ribosome biogenesis
KEGG pathwaybmy:Bm1_308302e-18 
 K12398 (AP3M)maps-> Lysosome
InterPro domain[146-297] IPR0075043.2e-33H/ACA ribonucleoprotein complex, subunit Gar1/Naf1
[164-243] IPR0090001.8e-12Translation elongation/initiation factor/Ribosomal, beta-barrel
Orthology groupMCL17817 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209485-TA
ATGGAAAACCTTGATCAAAGTAACAATAAAAACGTGTCGCTCAGTCTAATAGCTGAATATGGCTCGGAATCGGATTCGGAGGATACCCGATCGAATGAGCACACCGCACAAGATGACTCGGATGCTGTCGAATTAAGTGAAATGGTTTTGAAAAATAACATTATCAATGCTTGTGCCCACGGACCCCTTGCTGAAGGTAGTAGTGAGAATGTGGTCACCATGAGACACATGATCATCGACAGTTCAGAGAGCGGTGTTATGGAGTATGATGTGACTGAGTACCGCGAGCTGGACAGCGACTCCGACTCGAGTTCGGACGACTCGAGTGATATTGATTCTGTGAAGGATATTGAGGAACTTTCTAGCGGCGATGAACAAGAAGGTAGTAACCGGCCCGGTAAGTTAGAAACTCCCAAAGTGCACGGCGAACTAGGTTTGGATGACTTGCCGCCGATCGAGGATCTTGCCATTAGTCTGCCGGCCCAGGAGACTATAAAAATTGGAAAAATTGCAAGCATAGTTGATAGATTAGTTATAGTTCGTGCTTTCGAAGCAACCCCAGCCGTTGATCTGGATAGTGTACTATTTTTGGATAACGGTGCCAAAGCATTGGGCAAAGTATTTGATGTTTTTGGACCGGTAACAGAACCTCATTACTGCGTCCGTTTCAATTCGTTGGAGCACGTCCGAGAGCGGGGCGTTGTTACGGGTGCGGACGTGTACATAGCGCCTCGCAGCGCGCACACCAGCTACGTGTTTCTAGCTGAGCTCATGAAAGTCAAAGGCTCAGATGCGTCGTGGCTGAACGATATAGAGCCGCCTCCGAGCCACGTGGATTATTCTGATGATGAGGAGGAGAGACGAGCCAACAGAACTAGGAAGGAACAGCGACAGAACAAACAAGAGGACTCCGGGGACGGGGGCACGTCCGACAACCAGCCGAGGAGAGTACTCGAGGCCAAACGACACCAGCGACCCTCCGAGTCGAGTAGTCGCTTTGGCGGAAACTACCGAGGACCGCCAGGCTTCAGGAGGAACCCTTCGAGCTTCATAAGAAACACCAGACCCTGGGACGACCAGACCAGGACGCCGCCCCACCCCGTCGATCCCAACCAGCCGTTCTTTCCTACATTCAATCCGTTCATTCCGTTCATGAACGGCAACCCGTTCGGCAGGTCCCGGATGCCAGCCATGCCGCCGTACCGCCAACACGTGCCGATGTTCGGCGCCAATTACAACATGTCCGGGCCGAAGCCCCAGTGGAGTCCCGGCCCTCCGCCCCCGCCCGGGACCTAA

Protein sequence:

>DPOGS209485-PA
MENLDQSNNKNVSLSLIAEYGSESDSEDTRSNEHTAQDDSDAVELSEMVLKNNIINACAHGPLAEGSSENVVTMRHMIIDSSESGVMEYDVTEYRELDSDSDSSSDDSSDIDSVKDIEELSSGDEQEGSNRPGKLETPKVHGELGLDDLPPIEDLAISLPAQETIKIGKIASIVDRLVIVRAFEATPAVDLDSVLFLDNGAKALGKVFDVFGPVTEPHYCVRFNSLEHVRERGVVTGADVYIAPRSAHTSYVFLAELMKVKGSDASWLNDIEPPPSHVDYSDDEEERRANRTRKEQRQNKQEDSGDGGTSDNQPRRVLEAKRHQRPSESSSRFGGNYRGPPGFRRNPSSFIRNTRPWDDQTRTPPHPVDPNQPFFPTFNPFIPFMNGNPFGRSRMPAMPPYRQHVPMFGANYNMSGPKPQWSPGPPPPPGT-