Monarch geneset OGS2.0

DPOGS210855
TranscriptDPOGS210855-TA2022 bp
ProteinDPOGS210855-PA673 aa
Genomic positionDPSCF300027 + 622925-628635
RNAseq coverage16x (Rank: top 81%)
Annotation
HeliconiusHMEL0079882e-14774.12% 
BombyxBGIBMGA006982-TA3e-16353.01% 
DrosophilaCG17838-PB3e-3136.41% 
EBI UniRef50UniRef50_UPI0001CBAAE52e-4347.62%UPI0001CBAAE5 related cluster n=3 Tax=unknown RepID=UPI0001CBAAE5
NCBI RefSeqXP_001196183.15e-4748.15%PREDICTED: similar to RIKEN cDNA 1810073H04 gene [Strongylocentrotus purpuratus]
NCBI nr blastpgi|720080101e-4548.15%PREDICTED: similar to RIKEN cDNA 1810073H04 gene [Strongylocentrotus purpuratus]
NCBI nr blastxgi|2976745466e-4448.29%PREDICTED: probable RNA-binding protein 46-like isoform 1 [Pongo abelii]
Group
Gene OntologyGO:00001665.1e-12nucleotide binding
GO:00036767.4e-10nucleic acid binding
KEGG pathway 
InterPro domain[374-470] IPR0126775.1e-12Nucleotide-binding, alpha-beta plait
[407-470] IPR0005047.4e-10RNA recognition motif domain
Orthology groupMCL18322 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210855-TA
ATGTCAACATGTCAACTGAAAAATGACATTCGTCTGACAGTAGATAAGAGTACAAGAGCTGGTGAAAGACAAACAAAATCCTTAGAAGAATTGGAACTAGATTTCGAAGATGAAAAGAATTCAATGTTAGCTACACCAGTATTCGAAGAAACGACTTATGAAGATTTTTATGACGAAATTGATAACAGAATTGAATTAGATGATTCCATGAATATGATATTTTTTGGTCCTCCTAGTTCTTTCTTAAGACTTGATGCAGAATCTATTTTACACATACTGACAAATACCAAGAAAGACTTGATACCGATTTCTCGGACATTGTTAGCTTGGGATGTAGTGACAGATGGAAAAACTTCTGCTTTTTTACAAGGTCGGGAGTTTCTTGCCTTTGGAAGCCTTTTAAACGATATACCCGAAGAAGATTTATATTATGTAAATTTTGGAGATCCATCTGTTTATAAATATTTCGCGAAAAATTATGTCAATTTAAATAATAGAAAGTTCGGAGTTTTGGCTGCAGCGTATAGACGATATTATGGCAACAACTGGTATAAAAACGGTTTCCTTGTGAACGATCTCGGATACATGTTGTGTGGATTTCCGTCACACGATTTGAAACGCATAACGCCATCTACTTTTAAGGAGCTTACATTTGACGTTCTCAGTAAACTTGGTAGATGTAGTACTCATCAAAAGCTGACTTTATATAATATAGCCACACATCCTGATGCTTACGGAGAGCCGCACAAATGGTCTTCGCATGAAGTTGGCAGGCTAAATTCTTTATTTAATTGTATACCAAGAGAGGAAATAAGTTCAATTCAATTAGAAGCTATAGCAGCTATTAGTCCAAATGTAATGAAACTTATAAAACAAGAAAAGCTATCGTTCTTTACTAAACCGCAAATACTAAGAATGAACCCAAAGACGCGACGTATTTATATTTTAAGAGTACAATTAAAAAACAGCCTCGATATAAGACAAATTACTAAAAAAAGTGGTAGCTCCAGAACATGTGTGGATATGACTAAATATGTCCAATATGTTCATTTTACTCATTACTCTTCAAGTGAAGTGAATCAGTATGAATTATCTTCTAGATTATTATCATTAACCGAAAACAATACTTCTTATACCCTTACACAAATAAATGGCCAAAGGATTTACAGAAAAGCACCCCATGCTTGGAGTGGTCCGGAACCATCAAGGAATTGCGAGGTATTCATTGGTCGCATACCACACGATTGCTTCGAGGACACTTTGGTGCCTCTGTTCCGCCAAGCGGGGGAACTGTTCGAGTTTCGTCTCATGATTAATTTTTCCGGTTGGAACAGAGGATATGCATTTGCTATGTATACCACCGAAGAAGAGGCGAGCAACTCCATACGAATGTTCAACAATTATATGATCCGACCCTCCTGGCAACTCGGTGTCTGCCCATCGATCAACAACTGCCGCATCTTCATATCTCGTATCCCGGCCACGACGCCGACGTCCGAGATAGTACGGTTGGTGTACGCTCTGACGGAGGAGGTGCAGGAGGTGCGCGTGCGGCGCTCGGCGGCGGCCTGCGCGGCCATCGTGGAGTACCGCTCGCACCGCGGGGCGGCCATGGCGAGGAAGGCGCTGGTGGCGGCGGCCGCGGCGGCCTGGGTGGGTCGCTGGAGCCCAGAGCGTGGCGTGGAGCTGACTAGGGCAGGTGACGATCCTCCGGCGGCGGGCATGGTGTCAGCGGGCGGCGCGGGCGGCGGCTCCTACACGCTGGAGCGCTGGTCGCAGGCTCGACGGCTAAGGATGATACACGAGGCTCAGCGCAACGCACACGCCGCCGCCGCCACTCAGGCCGCACCCTTACCAACCGCTCACGATTGGTCCCCCCAGGGCATAGATTCTCTGGTGTCGTCTCTGTCTTCACTGGGTGTGTCTCGAGTGTGGGCGGCGCCGCCCCCGCCCACTCGTGACTTCAACCCCTGGACCGCTCGTCACCCGCTGGAGGAGTTCATACCGCACCGCTCACAGTGA

Protein sequence:

>DPOGS210855-PA
MSTCQLKNDIRLTVDKSTRAGERQTKSLEELELDFEDEKNSMLATPVFEETTYEDFYDEIDNRIELDDSMNMIFFGPPSSFLRLDAESILHILTNTKKDLIPISRTLLAWDVVTDGKTSAFLQGREFLAFGSLLNDIPEEDLYYVNFGDPSVYKYFAKNYVNLNNRKFGVLAAAYRRYYGNNWYKNGFLVNDLGYMLCGFPSHDLKRITPSTFKELTFDVLSKLGRCSTHQKLTLYNIATHPDAYGEPHKWSSHEVGRLNSLFNCIPREEISSIQLEAIAAISPNVMKLIKQEKLSFFTKPQILRMNPKTRRIYILRVQLKNSLDIRQITKKSGSSRTCVDMTKYVQYVHFTHYSSSEVNQYELSSRLLSLTENNTSYTLTQINGQRIYRKAPHAWSGPEPSRNCEVFIGRIPHDCFEDTLVPLFRQAGELFEFRLMINFSGWNRGYAFAMYTTEEEASNSIRMFNNYMIRPSWQLGVCPSINNCRIFISRIPATTPTSEIVRLVYALTEEVQEVRVRRSAAACAAIVEYRSHRGAAMARKALVAAAAAAWVGRWSPERGVELTRAGDDPPAAGMVSAGGAGGGSYTLERWSQARRLRMIHEAQRNAHAAAATQAAPLPTAHDWSPQGIDSLVSSLSSLGVSRVWAAPPPPTRDFNPWTARHPLEEFIPHRSQ-