Monarch geneset OGS2.0

DPOGS211059
Transcript        DPOGS211059-TA  1047 bp
Protein           DPOGS211059-PA  348 aa
Genomic position  DPSCF300446  +  166927-169462
RNAseq coverage   904x (Rank: top 14%)
Annotation
Heliconius      HMEL004432        5e-143  81.77%
Bombyx          BGIBMGA009577-TA  1e-122  75.57%
Drosophila      msi-PC            3e-88   75.13%
EBI UniRef50    UniRef50_Q8MS04   4e-86   75.13%  RH49436p n=31 Tax=Coelomata RepID=Q8MS04_DROME
NCBI RefSeq     XP_001607438.1    1e-92   55.17%  PREDICTED: similar to RH49436p [Nasonia vitripennis]
NCBI nr blastp  gi|156538565      2e-91   55.17%  PREDICTED: hypothetical protein LOC100123736 [Nasonia vitripennis]
NCBI nr blastx  gi|156538565      2e-99   53.63%  PREDICTED: hypothetical protein LOC100123736 [Nasonia vitripennis]
Group
Gene Ontology    GO:0000166  4.7e-23  nucleotide binding
                 GO:0003676  2.5e-21  nucleic acid binding
KEGG pathway
InterPro domain  [153-248]  IPR012677  4.7e-23  Nucleotide-binding, alpha-beta plait
                 [169-241]  IPR000504  2.5e-21  RNA recognition motif domain
Orthology group  MCL13889  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211059-TA
ATGATGGTATATTCCAGCGGCGCGCTGGGCGGGCTGGAGGTGGTGGAGGGCGGTCTGGTAGGTGGCGAGCTGGCGGCTCACGTGCTGAGCGCGCAGGCCGCCGCCCACGCCGCGGCCACTGCGGCCGCGCAACAACAACAGATGGCAGTGCAGCAAATCATGTGTCCGTCAGAGAACTCACCATCTTCTGGTCGTTCCACTCCTGTGACGACCGCCACTGGGAACACTTCACCATCACCCAGCAAGCTGTTTGTGGGCGGCCTCAGCTGGCAGACGAGCTCGGAGAAGCTGAGAGAGTATTTTGCCATGTTTGGAGCTGTAACCGACGTTTTGATTATGAAGGACCCCGTGACACAGCGCTCGCGCGGCTTCGGCTTCATCACGTTCCAGGAGGCGGCGTCCGTGGACAAGGTGCTGGCGGTGCCCGTCCACACGCTGGACGGCAAGAGGATCGACCCCAAGCACGCCACGCCCAAGTCGGCGCCCAAGCCGGCCAAGACCAAGAAGATCTTCGTGGGCGGCGTCGGCCAGGACACGTCGGCGGACGAGGTGCGCGCCTACTTCGCGCAGTTCGGAGCCGTGGAGGACGCCGTCATGCTCATGGACCAGCAGACCAAGAGACACCGCGGCTTCGGCTTCGTCACCTTCCACTCCGAGGAGGCCGTGGAGCGCGTGTGCGACATCCACTTCCACACCATCAAGAACAAGAAGGTGGAGTGCAAGCGAGCTCAGCCCAAGGAGGCGGTGGCGGCCGCCCCGCTGGCGCTCGGCAAGCGGCTGGTGCTGCGGCCGGGACGCGGGCTGGTTTACGCGGGAGGCGTGGGTGGAGTGGGAGCCGTCGGCGGCGTTGGCGGGGTGCCGGCCGTGGGCGCGCACGCCTACCGCTACGCGCCGTACGCCTTGCCGGGGTCGCTGGTGGCCCCGCAGCCCGCCCCAGCCCCCGCCCTGCCCCAGTTCGGCGCGGCGTACTCCCTGGCCGGCGTGGACATGTCTTCGTTCCAGGGCGTGGACTGGAGCGCCATGTACGGCGTGCCGATGTACATCTGA

Protein sequence:

>DPOGS211059-PA
MMVYSSGALGGLEVVEGGLVGGELAAHVLSAQAAAHAAATAAAQQQQMAVQQIMCPSENSPSSGRSTPVTTATGNTSPSPSKLFVGGLSWQTSSEKLREYFAMFGAVTDVLIMKDPVTQRSRGFGFITFQEAASVDKVLAVPVHTLDGKRIDPKHATPKSAPKPAKTKKIFVGGVGQDTSADEVRAYFAQFGAVEDAVMLMDQQTKRHRGFGFVTFHSEEAVERVCDIHFHTIKNKKVECKRAQPKEAVAAAPLALGKRLVLRPGRGLVYAGGVGGVGAVGGVGGVPAVGAHAYRYAPYALPGSLVAPQPAPAPALPQFGAAYSLAGVDMSSFQGVDWSAMYGVPMYI-
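
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, that assumes the standard genetic code and that the trailing "-" in the protein sequence marks the stop codon. The function name translate and the stop_symbol parameter are illustrative choices. Only the first and last few codons of DPOGS211059-TA are translated here; the full sequence can be passed in the same way.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First six and last four codons of DPOGS211059-TA, copied from the record.
head = translate("ATGATGGTATATTCCAGC")
tail = translate("ATGTACATCTGA")
print(head, tail)

# Length check: 348 residues plus one stop codon should account for 1047 bp.
print(3 * (348 + 1) == 1047)
```

The translated fragments can be compared with the start (MMVYSS) and end (MYI-) of DPOGS211059-PA. The length check reflects that the 1047 bp transcript is fully accounted for by 348 codons plus the stop codon.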