Monarch geneset OGS2.0

DPOGS208807
TranscriptDPOGS208807-TA1440 bp
ProteinDPOGS208807-PA479 aa
Genomic positionDPSCF300036 - 201814-203253
RNAseq coverage100x (Rank: top 61%)
Annotation
HeliconiusHMEL0150950.073.11% 
BombyxBGIBMGA007961-TA1e-10642.17% 
DrosophilaCG1316-PA7e-7334.46% 
EBI UniRef50UniRef50_E2AVF62e-10241.96%RNA-binding protein 45 n=9 Tax=Endopterygota RepID=E2AVF6_CAMFO
NCBI RefSeqXP_395582.21e-10242.57%PREDICTED: similar to CG1316-PA [Apis mellifera]
NCBI nr blastpgi|3072020223e-10442.65%RNA-binding protein 45 [Harpegnathos saltator]
NCBI nr blastxgi|3072020222e-10042.45%RNA-binding protein 45 [Harpegnathos saltator]
Group
Gene OntologyGO:00001667.9e-16nucleotide binding
GO:00036762.4e-11nucleic acid binding
KEGG pathway 
InterPro domain[29-104] IPR0126777.9e-16Nucleotide-binding, alpha-beta plait
[31-85] IPR0005042.4e-11RNA recognition motif domain
Orthology groupMCL11668 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208807-TA
ATGGATAACCGGCGTAAATTCCCCCGAAATGAAGAGAGAGATCGCGAAGATGTGCCGGTATATTCAAGACTTTTTATAGTTTGTGATCGCAATTTAAAAGAAGATAATTTTAGAGAAGTTTTCTCTAAGTTCGGTTATATAGAAGATGTACGTATTCCCAGAGATCATAAATCAGGAGAACCTAAAGGCGTGGTATTCATAAAATTTTCTAAAACGTCCGAAGCTGCTCTGGCATTGGAAGAAATGAATCTTAAAGTAATGCCATATTCTAGCAGACCACTTAAAGTGATGGTGGCGGCTAATAAGTCAGATATACAATCTGAAGACCACAGTAATGAGAAATATCGTAGATTATTCTTACATATTCCCAAAGACATGAATGAAGATATGCTCGAGGAAAATTTCAAAAAATACGGACATATTGATGATGTTCTCATACAGAGAGACAGGAATACGAGAGAACCGAAAGGTTTCGCTTATATAAAATTTCGAAAATTTTCAGAAGCGGCATTTGCATTTGAACAATGTGAGAAAAAATACAGAGCTATCTTCGCTCAACCTAAAGGTGCTAACAGACGTCCGGAAACAAGTTACGAGACTAATATTAATCATTTAGCTATGTCTTCTTCAAATCAGCGGAACTCTATCATGACAATGATGAATGTACACCCCAGGGGGTACACCCGTGTTAATTTCATGTGCAGTCCATACCTCACACAAATGCATGTAGAATCGTTATTTGATATTGTGCCTGGGCTTGTAGATTTTCGTTACTTTGTCGACTTGGTAAGGAATTTTAGTAAAGGTTCAGCCCAGTATTCAAATCCGGTGTCAGCTGCGTACGCTGTTGAGAAGTTAAATGAATTTGAATACCCTCCTGGTCAAAAAATATTCGTTAAGCCAGCTGATACAAACTTTGACAGCCATCCACGTAACCAGGAACAATCATTCTCCGATATACCGAACGCGGTCAGTAATTTGAGAAACGCTATCGCTTCAACTGCAAATTCCACCTCACCTGATCTGGTGCAACTCGCTGAGGCTATAGCTGAAGCGTCAAAATTAATAAAAATGGCCACAGCCGGAGTCTCAGATGACAATATTCCCGACAGCAATGATTTGAATTATTGTAGCGTTAAATTACCACCCACCCAACCGTTAGCTGATATCGACAGTCCAGTAGCTAAGAGGTGCTTTCTGGTCTGCAAACCTCAGCCACCTCCGCTCACGGTGCTGCGGGATATATTCTGTCGTTTCGGTAATCTCATCAACGTATACACACTACCGCAGAAAACTGTAGGATATGCTCGTTACTCGAAAGCCGAGTCAGCCGATAACGCAATACAAACGTTACACGGCGCCGAAATCTGCGGCATCCGTATGAAAGTTCTAGAAGCGGAGGACGAAGCGCCCTCCAAAAGAATGAGATATGATCATTAA

Protein sequence:

>DPOGS208807-PA
MDNRRKFPRNEERDREDVPVYSRLFIVCDRNLKEDNFREVFSKFGYIEDVRIPRDHKSGEPKGVVFIKFSKTSEAALALEEMNLKVMPYSSRPLKVMVAANKSDIQSEDHSNEKYRRLFLHIPKDMNEDMLEENFKKYGHIDDVLIQRDRNTREPKGFAYIKFRKFSEAAFAFEQCEKKYRAIFAQPKGANRRPETSYETNINHLAMSSSNQRNSIMTMMNVHPRGYTRVNFMCSPYLTQMHVESLFDIVPGLVDFRYFVDLVRNFSKGSAQYSNPVSAAYAVEKLNEFEYPPGQKIFVKPADTNFDSHPRNQEQSFSDIPNAVSNLRNAIASTANSTSPDLVQLAEAIAEASKLIKMATAGVSDDNIPDSNDLNYCSVKLPPTQPLADIDSPVAKRCFLVCKPQPPPLTVLRDIFCRFGNLINVYTLPQKTVGYARYSKAESADNAIQTLHGAEICGIRMKVLEAEDEAPSKRMRYDH-