Monarch geneset OGS2.0

DPOGS210364
Transcript        DPOGS210364-TA  1290 bp
Protein           DPOGS210364-PA  429 aa
Genomic position  DPSCF300025 + 467687-468976
RNAseq coverage   45x (Rank: top 71%)
Annotation
Heliconius      HMEL013833              1e-154  62.36%
Bombyx          BGIBMGA011921-TA        2e-141  60.83%
Drosophila      Snp-PI                  2e-38   33.64%
EBI UniRef50    UniRef50_UPI00017930EC  1e-69   40.22%  UPI00017930EC related cluster n=1 Tax=unknown RepID=UPI00017930EC
NCBI RefSeq     XP_001948838.1          2e-70   40.22%  PREDICTED: similar to RNA-binding protein 40 (RNA-binding motif protein 40) (RNA-binding region-containing protein 3) [Acyrthosiphon pisum]
NCBI nr blastp  gi|193683830            5e-69   40.22%  PREDICTED: RNA-binding protein 40-like [Acyrthosiphon pisum]
NCBI nr blastx  gi|189235800            1e-71   40.74%  PREDICTED: similar to RNA-binding region (RNP1, RRM) containing 3 [Tribolium castaneum]
Group
Gene Ontology    GO:0000166  3e-20    nucleotide binding
                 GO:0003676  6.2e-12  nucleic acid binding
KEGG pathway
InterPro domain  [332-423] IPR012677  3e-20    Nucleotide-binding, alpha-beta plait
                 [340-418] IPR000504  6.2e-12  RNA recognition motif domain
Orthology group  MCL14078  Single-copy universal gene

Nucleotide sequence:

>DPOGS210364-TA
ATGTCTGCGGTCTTAATTATAAAACACCTGCCACCAAACTTATCTTTCCAAGATAAGGAAAAGTTATTGCATCATTTCGGAGCTGAGAAGGTATGGGGACCCGAGAAAAGAGATTATGTGTTTGCTTCGTTTTCTACTAAAGAAAAAGCAAAAATATCTTTGCAGCGTCTCCATCAACTGGAAATAGCTAATAGACGTTTAGTCGTTGAATATTCGTTCGAAAAAGAACCGACGCTACAGACAAAACAGAATGAAGACTCAGTTTCAGACACCACAAAACATATAAGAGAGTTCTTGCGGACTCTTAACGCTTGGAATCCTTCTGTGGATTTTTACCAACCACCACCAATACATCTTAAATATAAATATCCTACAGCAAATTCTGTAGTATGTATAAATATTTTGTATTCACTCCTTCTACATAAACCGTTTTACATTCAAACATTACATCTCATGAACAAAATGAGCTTAGAGCCACCCTTTGAAGAAAATGAAAAAGCTTTAAATTATTTTAAGGAGACTTTTAGGGAATATTTTCTGGATGAGATTTATATTACTGCCCCTAATGAACCCAGCGAGTCAGAAATATCCAGTGATGATAATACAGTGCAGCCAAAGGAACATTTACCTTCAATGGTGAAAAGGAAGCATTTGCTGCCTAAGACAAGGAAACGGGCTGCAGCAGTGTTGTCGACAGCCAATCTACCGATGTCTAAAAGAGAAAAACCTACAAACCAGGAAGATGTGTTTGAAGTCGTCGCTCCTGTTGCTGAAGCCAAAAAAATATCATTGGTTGTGTCACATGATGCTCTACAGAAGCAGACTGAAATCCCTGAAGTAGTGGGTGAGTTGGGAAAGTTTCAGAAGGAAGAACAAAGCTCAGTTCAAGAAGAGAAAGTTGAGGAACCCGATAAACCATCAATAACAAAAAAGGAAATTTTGAAGAATAGACTATCTTACAGGGAAATGAAGGGGTTACCAGTTTTCAAAAACTATCATCCTGGGGAACCATCTATGAGACTATACATAAAAAATCTGGCAAAAAACGTAACAGAACAAGATGTACAAAGAATCTATAAAAGATACATGGAAGACATCCCTGAAGAGGAGCGGGTTGGATTTGATGTAAGGGTCATGCAAGAGGGACGAATGAAGGGACAGGCTTTTGTTACGTTTCCATCAATAAAGTTGGCAGAACAGGCTCTCAGTGAAACAAATGGGTTCATATTGAATGACAAACCGATGGTTGTACAATTTGCTAGGGCTGCTATTAAAAAAACTGTTGAATAA

Protein sequence:

>DPOGS210364-PA
MSAVLIIKHLPPNLSFQDKEKLLHHFGAEKVWGPEKRDYVFASFSTKEKAKISLQRLHQLEIANRRLVVEYSFEKEPTLQTKQNEDSVSDTTKHIREFLRTLNAWNPSVDFYQPPPIHLKYKYPTANSVVCINILYSLLLHKPFYIQTLHLMNKMSLEPPFEENEKALNYFKETFREYFLDEIYITAPNEPSESEISSDDNTVQPKEHLPSMVKRKHLLPKTRKRAAAVLSTANLPMSKREKPTNQEDVFEVVAPVAEAKKISLVVSHDALQKQTEIPEVVGELGKFQKEEQSSVQEEKVEEPDKPSITKKEILKNRLSYREMKGLPVFKNYHPGEPSMRLYIKNLAKNVTEQDVQRIYKRYMEDIPEEERVGFDVRVMQEGRMKGQAFVTFPSIKLAEQALSETNGFILNDKPMVVQFARAAIKKTVE-
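
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code and assuming the trailing "-" in the protein record marks the stop codon. The function name translate and the stop_symbol parameter are illustrative and not part of the gene set tooling. To keep the example short, only the first ten and last six codons of DPOGS210364-TA are translated.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third position varies fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, one residue per codon.

    Stop codons are written as stop_symbol; "-" is assumed here to
    match the convention of the protein record above.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 and last 18 nucleotides of DPOGS210364-TA, copied from the record.
first_codons = "ATGTCTGCGGTCTTAATTATAAAACACCTG"
last_codons = "AAAAAAACTGTTGAATAA"

print(translate(first_codons))  # MSAVLIIKHL
print(translate(last_codons))   # KKTVE-

# Length bookkeeping from the record: the genomic span equals the
# transcript length, and 1290 bp is 429 codons plus one stop codon.
assert 468976 - 467687 + 1 == 1290
assert 1290 == 3 * (429 + 1)
```

Both translated fragments agree with the start (MSAVLIIKHL) and end (KKTVE-) of DPOGS210364-PA. The genomic span equals the transcript length, so the listed coordinates cover the 1290 bp transcript without gaps.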