Monarch geneset OGS2.0

DPOGS204365
Transcript: DPOGS204365-TA; 963 bp
Protein: DPOGS204365-PA; 320 aa
Genomic position: DPSCF300040 + 484220-485182
RNAseq coverage: 144x (Rank: top 54%)
Annotation
Heliconius: HMEL005295; 0.0; 96.25%
Bombyx: BGIBMGA005888-TA; 4e-173; 90.00%
Drosophila: fne-PB; 2e-111; 63.24%
EBI UniRef50: UniRef50_F4W7V7; 4e-127; 67.93%; ELAV-like protein 2 n=5 Tax=Acromyrmex echinatior RepID=F4W7V7_ACREC
NCBI RefSeq: XP_971256.2; 4e-126; 69.79%; PREDICTED: similar to RNA-binding protein, putative [Tribolium castaneum]
NCBI nr blastp: gi|345493619; 3e-130; 68.18%; PREDICTED: ELAV-like protein 3-like isoform 2 [Nasonia vitripennis]
NCBI nr blastx: gi|345493619; 2e-126; 68.47%; PREDICTED: ELAV-like protein 3-like isoform 2 [Nasonia vitripennis]
Group
Gene Ontology: GO:0003723; 4.1e-87; RNA binding
GO:0000166; 1.3e-26; nucleotide binding
GO:0003676; 1.3e-24; nucleic acid binding
KEGG pathway 
InterPro domain: [1-223] IPR006548; 4.1e-87; Splicing factor ELAV/HuD
[55-70] IPR002343; 2.4e-29; Paraneoplastic encephalomyelitis antigen
[208-319] IPR012677; 1.3e-26; Nucleotide-binding, alpha-beta plait
[239-312] IPR000504; 1.3e-24; RNA recognition motif domain
Orthology group: MCL25182; Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204365-TA
ATGTCCCAAGAAGAAATCAGATCACTTTTTTCAAGTGTCGGCGAAGTGGAAAGTTGCAAGTTAATAAGAGATAAAGTTACTGTTTTTCCTGATCACATTCTTAATGGCCAAAGTCTCGGGTATGCATTCGTCAATTACCATAAGCCTGAAGATGCCGAAAAAGCTGTGAACACACTGAATGGTCTACGATTACAGAACAAAATTATAAAAGTGTCGTACGCTCGTCCCAGCTCAGATGCTATAAAAGGCGCTAATCTTTACGTTTCTGGTCTCCCTAAGCACATGACGCAGCAGGATTTGGAGAAGTTATTCAGTCCGTTCGGTACAATAATAAGTTCGCGCATCTTACACGAAAACATGAACGTCGGGCATTTATTGCAAGGGGGCATGGAAGAACAAGGGATTCAAGGACCTTCCAGAGGGGTTGCATTTATTCGTTACGATCAAAGAATTGAAGCTGAGAATGCTATAAGAGAGTTAAATGGCACTATACCGCCTGGAGGTACGGGTCCCATTACAGTGAAATGTGCAAACAATCCCAGTAACCAAAACAAGGCCCTTGCACCGCTGGCTACGTACTTAGCACCACCCACCGTCCGTCGCTTCTTAGGGCCCGCTGGTAAGGCTCTGCTTGCCATTAATAAAGGACTTCAACGATACTCGCCGCTTGCGGATCCTCTCGTCCAAGGTAACGCATTAGGCGGTTCAGGTTGGTGCATTTTTGTTTACAATATCGGTGCCGATACAGAAGAAAGTGTGCTATGGCAGCTGTTTGGCCCCTTCGGAGCCGTTCAGAGTGTTAAAATTATAAGAGATCCCACCACCAACAAGTGCAAAGGCTACGGTTTCGTTACTATGACCAACTACGACGAGGCTGTAGTTGCCATCCAGTCATTGAATGGCTATTCCCTCAATGGACAAGTGCTGCAAGTCAGCTTCAAAACAAACAAGAGTAAATCCTAA

Protein sequence:

>DPOGS204365-PA
MSQEEIRSLFSSVGEVESCKLIRDKVTVFPDHILNGQSLGYAFVNYHKPEDAEKAVNTLNGLRLQNKIIKVSYARPSSDAIKGANLYVSGLPKHMTQQDLEKLFSPFGTIISSRILHENMNVGHLLQGGMEEQGIQGPSRGVAFIRYDQRIEAENAIRELNGTIPPGGTGPITVKCANNPSNQNKALAPLATYLAPPTVRRFLGPAGKALLAINKGLQRYSPLADPLVQGNALGGSGWCIFVYNIGADTEESVLWQLFGPFGAVQSVKIIRDPTTNKCKGYGFVTMTNYDEAVVAIQSLNGYSLNGQVLQVSFKTNKSKS-
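
Consistency check (illustrative):

The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is not part of the original record. It assumes that the genomic coordinates are 1-based and inclusive, and that the transcript is a complete coding sequence ending in a single stop codon (shown as "-" at the end of the protein sequence). The function and constant names are hypothetical, chosen only for this example.

```python
# Values copied from the record; helper names are illustrative only.
GENOMIC_START = 484220
GENOMIC_END = 485182
TRANSCRIPT_BP = 963
PROTEIN_AA = 320


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(cds_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming it ends with exactly one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Compare the genomic span with the transcript length.
print(span_length(GENOMIC_START, GENOMIC_END) == TRANSCRIPT_BP)
# Compare the codon count (minus the stop) with the protein length.
print(coding_codons(TRANSCRIPT_BP) == PROTEIN_AA)
```

Under these assumptions both comparisons hold. A genomic span equal to the transcript length is what a single-exon gene model would produce, although the record itself does not state the exon structure.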