Monarch geneset OGS2.0

DPOGS206425
TranscriptDPOGS206425-TA1194 bp
ProteinDPOGS206425-PA397 aa
Genomic positionDPSCF300181 + 161254-162956
RNAseq coverage32x (Rank: top 75%)
Annotation
HeliconiusHMEL0032830.097.15% 
BombyxBGIBMGA013823-TA0.096.87% 
Drosophilafne-PB2e-12768.63% 
EBI UniRef50UniRef50_F4W7V75e-14473.71%ELAV-like protein 2 n=5 Tax=Acromyrmex echinatior RepID=F4W7V7_ACREC
NCBI RefSeqXP_971256.27e-14075.07%PREDICTED: similar to RNA-binding protein, putative [Tribolium castaneum]
NCBI nr blastpgi|3800270758e-14876.31%PREDICTED: ELAV-like protein 2-like [Apis florea]
NCBI nr blastxgi|3800270757e-14277.97%PREDICTED: ELAV-like protein 2-like [Apis florea]
Group
Gene OntologyGO:00037237e-104RNA binding
GO:00001662e-27nucleotide binding
GO:00036762.8e-21nucleic acid binding
KEGG pathway 
InterPro domain[15-256] IPR0065487e-104Splicing factor ELAV/HuD
[18-33] IPR0023434.2e-44Paraneoplastic encephalomyelitis antigen
[14-113] IPR0126772e-27Nucleotide-binding, alpha-beta plait
[271-344] IPR0005042.8e-21RNA recognition motif domain
Orthology groupMCL10538 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206425-TA
ATGTCGAAGGGCGACAGTGAACAGCAAAATGGCTCCGGCGAAGAGTCCAAGACCAACCTCATCATAAACTACCTGCCACAGAGCATGACCCAAGAGGAAATACGAAGCCTGTTCTCGAGTATCGGCGAAGTAGAGTCCTGCAAGCTGATCCGGAACAAAGGAGCGGCCTTCCCGGACGCCTTGAACCACGCGCTACACGGCGGCGGACAGAGTCTTGGGTATGCCTTTGTTAACTATCACCGACCCGAAGACGCCGAGAAGGCTATCGCCACCCTCAATGGACTGCGGCTCCAGAACAAAACCATTAAGGTCTCGTACGCCAGGCCAAGCAGCGAAGCCATCAAAGGCGCCAATCTCTACGTTTCCGGCCTCCCGAAGACAATGACCCAAATAGAGTTGGAGAGATTATTCAGTCCATACGGTCGTATTATCACATCTCGTATACTCTGTGAGAATTCAGGCGGACGTCCATTCACCGGCGGCGAGCAGGGCCTGTCTAAGGGAGTCGGCTTCATCAGGTTCGATCAACGCGTCGAGGCCGAGAGAGCTATCCAGGAGTTAAACGGAACGGTCCCGAAGGGAGCCACTGAACCTATAACAGTGAAATTCGCAAATAATCCAAGCAACAACGGAAAAGCGTTAGCACCGCTCGCAGCGTACCTTCCCGCAGCGCTTCGCTTCCCGGCGCCACTGGGAAGATTCAGCTCAGGCAAGTCCCTACTCGCTATAAATAAAGGTCTACAGCGCTACAGCCCTCTTGCCGGCGAACTGCTCGGCGGGGTGCTACCAGGGGCGGTCGGCTCCGAGTGGTGTATCTTTGTTTACAACCTAGCCCCGGAGACCGAGGAAAACGTCCTCTGGCAGCTGTTCGGGCCATTCGGCGCCGTCCAGAGCGTGAAAGTGATCCGCGACTTGCAGACCAACAAATGCAAAGGTTACGGCTTCATAACAATGACGAATTACGACGAAGCCGTGGTAGCCATACAGTCCCTGAACGGCTATACGCTCGGGAACAGAGTGCTCCAGGAGATCATATCTTACGAAGTTCAGCGGCCGGCTCGGCTCGATTCGATAATGTTTGCCAAATATATTGTCACTATAATCGCGTCCGTCGCTCGCGGGTCGCGGCTAAGTCTTAGCCCAAGACTATCGGCTATACAGCCATCCGCGGTGAACGCGTCGATCCAACTATAA

Protein sequence:

>DPOGS206425-PA
MSKGDSEQQNGSGEESKTNLIINYLPQSMTQEEIRSLFSSIGEVESCKLIRNKGAAFPDALNHALHGGGQSLGYAFVNYHRPEDAEKAIATLNGLRLQNKTIKVSYARPSSEAIKGANLYVSGLPKTMTQIELERLFSPYGRIITSRILCENSGGRPFTGGEQGLSKGVGFIRFDQRVEAERAIQELNGTVPKGATEPITVKFANNPSNNGKALAPLAAYLPAALRFPAPLGRFSSGKSLLAINKGLQRYSPLAGELLGGVLPGAVGSEWCIFVYNLAPETEENVLWQLFGPFGAVQSVKVIRDLQTNKCKGYGFITMTNYDEAVVAIQSLNGYTLGNRVLQEIISYEVQRPARLDSIMFAKYIVTIIASVARGSRLSLSPRLSAIQPSAVNASIQL-