Monarch geneset OGS2.0

DPOGS215323
Transcript: DPOGS215323-TA (1272 bp)
Protein: DPOGS215323-PA (423 aa)
Genomic position: DPSCF300120 + 196865-198136
RNAseq coverage: 341x (Rank: top 34%)
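The lengths and coordinates listed above can be cross-checked against each other. The following is a minimal sketch, assuming the genomic coordinates are 1-based and inclusive and that the transcript is one uninterrupted coding sequence ending in a stop codon (the record states neither explicitly). The function names are illustrative and not part of any database tooling.

```python
def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS assumed to include one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# Values taken from the record above.
span = span_length(196865, 198136)
print(span, expected_protein_length(span))
```

Under these assumptions the genomic span is 1272 bp and the expected protein length is 423 aa, which agrees with the transcript and protein lengths in the record.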
Annotation
Heliconius      HMEL010027        6e-153  64.24%
Bombyx          BGIBMGA007961-TA  3e-118  48.94%
Drosophila      CG1316-PA         4e-44   29.64%
EBI UniRef50    UniRef50_E2AVF6   4e-61   33.79%  RNA-binding protein 45 n=9 Tax=Endopterygota RepID=E2AVF6_CAMFO
NCBI RefSeq     XP_395582.2       1e-59   35.05%  PREDICTED: similar to CG1316-PA [Apis mellifera]
NCBI nr blastp  gi|307170204      1e-60   33.79%  RNA-binding protein 45 [Camponotus floridanus]
NCBI nr blastx  gi|307170204      2e-56   33.87%  RNA-binding protein 45 [Camponotus floridanus]
Group
Gene Ontology   GO:0000166  4.7e-15  nucleotide binding
                GO:0003676  1.7e-13  nucleic acid binding
KEGG pathway 
InterPro domain  [24-98]  IPR012677  4.7e-15  Nucleotide-binding, alpha-beta plait
                 [40-96]  IPR000504  1.7e-13  RNA recognition motif domain
Orthology group  MCL11668  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215323-TA
ATGAAAGTTTTAAATAATGACAATAAACCAACAAAAGTTATGGTGGCTGTTAATAAAAATGAGAACCCTGCACAAAACGAAAATGTAGATAGATATAGGCGACTCTTCATCAAAGTCCAGAAAGATGCTAGTGAATCTGAACTCAGGCACCACTTCTCAACATTTGGACAAATAGAGTCGATACACCTACAAAGAGACAAAGTGACTGATACATGTAAAGGTTTCGCATACGTCCAATATAAAACTTTCTATGATGCAGCTAAAGCTTTTGAAGAATGTGATAAGAAATATAGACCAGTATTTGCTACACCGAGAGACGATTTGAAAAGAAGCAGAAATTGCCTTGATATTGAGCATCATAATATTAACGGATATAGTAATACAAAGAATAATCATAATCATTACACTGACAGACACGGCCAGCTGAAACCAGAATACAATAACGAAAATATGAATGGGACAATAACATGTAGCTCTTACGACTATAACACTATCTCAGTTAAATGTAGCCCACAAGTAGCCAAGAAGTACATAGAACAATTATTTAATGTTATACCTGGCATGGTTCAATTTCAATATTATTTGGACACATTTAATGGAATTTCCAAAGCCGTTATAACATACGAGGAAAGTAGATCTGCAGCACATGCAGTTGACAGGTTGAATAAATTTGAATTTCCGTCTGGGGAGATACTAACTGTCAAGCCGGATAAGAATCCTTTGGTCAAAGCTGCTAATGATCTCACAGATATTGTCAATAATTTTAGAAATGCTGTTGATTATGGTGCACCCGATATAAAACAACTGGCCGAAGCGATTGCTAAAGCGTCCACTTTAATAAAAGCTTCGACGACAGGTCAGATTTATTCACCGAGAGATCAGCATGACTACAATTATTGTGATGTTATTTTACCACCTCACAAACCGATGGCGGATAACAATAGCAGAGTTGCCCAAAGGTTATTCATTATCTGCAAACCCCAGCCTCCGCCGATGTCGACTTTACAAGATGTGTTCTGTCGTTTCGGAGATCTCATTAATGTTTCCACTATACCAAACAAGACATTCGGGTTTGTTAAATACGCCTCTGTGAATGCCGCACAGGAAGCAATGAGAGTTCTTAACGGAGCTACTGTCACTGGTGTGAAGTTAAAAGTTTTAGAGGCTGATGAGAAGCCCAACAAGGAAGATAAAGTAGGTCAAACGGAACAAGCAGAGAACACAGACTATGATATGGACAGCAAGAGAATGAGACTAGACGACAAAGACTAG

Protein sequence:

>DPOGS215323-PA
MKVLNNDNKPTKVMVAVNKNENPAQNENVDRYRRLFIKVQKDASESELRHHFSTFGQIESIHLQRDKVTDTCKGFAYVQYKTFYDAAKAFEECDKKYRPVFATPRDDLKRSRNCLDIEHHNINGYSNTKNNHNHYTDRHGQLKPEYNNENMNGTITCSSYDYNTISVKCSPQVAKKYIEQLFNVIPGMVQFQYYLDTFNGISKAVITYEESRSAAHAVDRLNKFEFPSGEILTVKPDKNPLVKAANDLTDIVNNFRNAVDYGAPDIKQLAEAIAKASTLIKASTTGQIYSPRDQHDYNYCDVILPPHKPMADNNSRVAQRLFIICKPQPPPMSTLQDVFCRFGDLINVSTIPNKTFGFVKYASVNAAQEAMRVLNGATVTGVKLKVLEADEKPNKEDKVGQTEQAENTDYDMDSKRMRLDDKD-
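The protein sequence above can be checked against the nucleotide sequence by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code and assuming the trailing "-" in the protein record marks the stop codon. The `translate` helper is illustrative, and only the first 30 and last 18 bases of DPOGS215323-TA are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame DNA sequence; stops are written as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 and last 18 bases of the transcript shown above.
head = "ATGAAAGTTTTAAATAATGACAATAAACCA"
tail = "CTAGACGACAAAGACTAG"
print(translate(head))
print(translate(tail))
```

The two fragments translate to the start (MKVLNNDNKP) and end (LDDKD-) of the listed protein. Passing the full nucleotide sequence to the same function gives a complete comparison.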