Monarch geneset OGS2.0

DPOGS210553
Transcript        DPOGS210553-TA    1392 bp
Protein           DPOGS210553-PA    463 aa
Genomic position  DPSCF300304 + 119938-122530
RNAseq coverage   877x (Rank: top 14%)
Annotation
Heliconius      HMEL006425        5e-74   95.49%
Bombyx          BGIBMGA013447-TA  3e-86   72.58%
Drosophila      rin-PB            2e-64   70.76%
EBI UniRef50    UniRef50_D6W949   2e-98   49.09%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W949_TRICA
NCBI RefSeq     XP_975463.1       3e-99   49.09%  PREDICTED: similar to rasputin CG9412-PB [Tribolium castaneum]
NCBI nr blastp  gi|91076984       6e-98   49.09%  PREDICTED: similar to rasputin CG9412-PB [Tribolium castaneum]
NCBI nr blastx  gi|91076984       4e-107  48.28%  PREDICTED: similar to rasputin CG9412-PB [Tribolium castaneum]
Group
Gene Ontology    GO:0006810  3.7e-32  transport
                 GO:0005622  3.7e-32  intracellular
                 GO:0000166  3.7e-13  nucleotide binding
                 GO:0003676  2.7e-12  nucleic acid binding
KEGG pathway 
InterPro domain  [12-132]   IPR002075  3.7e-32  Nuclear transport factor 2
                 [326-434]  IPR012677  3.7e-13  Nucleotide-binding, alpha-beta plait
                 [346-408]  IPR000504  2.7e-12  RNA recognition motif domain
Orthology group  MCL13089   Single-copy universal gene

Nucleotide sequence:

>DPOGS210553-TA
ATGGAGGCGTCCCCGTCCCCGTCCCCGCAGAGCGTCGGCCGCGAGTTCGTCCGCCAATACTACACTCTGCTCAACAAAGCGCCCGCACACCTGCATAGATTCTACAATAACTACTCGTCGTTCGTGCATGGCGGGCTGGACGCGCCCAACCGCGAGACGCTGCCTGTCGTCGGACAGAAGCAGATCCACAACCGTATCCAGCAGCTCAACTTCCGCGACTGTCACGCCAAGATCAGTCAGGTGGACGCGCAGGCCACGCTGGGCAACGGAGTCGTGGTGCAGGTCACCGGCGAGCTGTCCAACGCCGGCGCCCCCATGAGGCGCTTCACGCAGACCTTCGTGTTGGCGGCGCAGTCGCCCAAGAAGTACTACGTGCACAACGACATCTTCCGCTACCAGGACGTGGTGTTCTCCGACGAAGAAGGCGAAGGCTCCGGCCGCTCGGACGCCGAGGAGGAGGACGCGGCCGCCGGCGGGTACTTCCCGCCCGCTTTTCCGCCCGCGCCCTTCCCCGCGCCGCCGCACGCCCAGCTCGTGTCGCTGCCCGCGTCGCCGCATCTCAACGGACACCCTCACGACGACCCCGCCAGGCACCTCGCGGCCGCGCTGCAGGCTGACCCCTCCGCCATGTGTCCCGCGACACCGGCCGGAACCACCTCGACTCTGCGCCCCGTCTCCTCGGCCGGCACGATCGCACCCGCCGCAGCCGCAGTCCCCGAGCGGGAAGAGGAACCGGCCCCGGAGCCCGAGCGTGAGCCGACGCCTCCGCCCCAGCAGCCGACGCCTCAACCGATGCCTCAACCCGCGGCGGCCGCCCCTCCGGAACCCAAAACGTACGCCAACCTGCTGAAGTCTGGATCGAGTGCTAGTACAACGCGTGGCTCACCCCCTGCGCCCCCCGCACCCGTTCCCGCTCCCTCATCTGCGCCGGCGCCCGCCTCGCACGAGCCCCGCGCACGCCCGCCCAGGAGCGCTCAGCCACAGCAGCAGGCCGGCAACCAGGACGGCGGTCGTCGGTACTCGGACGCCCAGCAACTGTTCCTGGGCAACCTGCCGCACTCCGCCACGGAGGAGAGTCTGCGTGCTATCTTTTCTCGGTTCGGTCCCGTGGCCGAGCTGCGCGTGCACAGCAAGCCCGCCGCGCCCGGAGCGCCGCGCCACCCTAATTACGGCTTCATCACCTACGAGACGGCGCAGGCCGCCGCCGACTGTCTGCTGGCCGCCGCCAACGAGCCGCTGTACTTCCCTGGCGAGGGCGGTGAGGGCGGCGAGGGCGCCGGTGTCAAGCTCAACGTGGAGGAGAAGAAGACCCGCGGACGCGAGCCCCCGCGCCGCCGCCCGCTGTCGTCTCACCGCGCCTCCTTCCAGCCGCGCCAGAACTTCCGCCGCTAA

Protein sequence:

>DPOGS210553-PA
MEASPSPSPQSVGREFVRQYYTLLNKAPAHLHRFYNNYSSFVHGGLDAPNRETLPVVGQKQIHNRIQQLNFRDCHAKISQVDAQATLGNGVVVQVTGELSNAGAPMRRFTQTFVLAAQSPKKYYVHNDIFRYQDVVFSDEEGEGSGRSDAEEEDAAAGGYFPPAFPPAPFPAPPHAQLVSLPASPHLNGHPHDDPARHLAAALQADPSAMCPATPAGTTSTLRPVSSAGTIAPAAAAVPEREEEPAPEPEREPTPPPQQPTPQPMPQPAAAAPPEPKTYANLLKSGSSASTTRGSPPAPPAPVPAPSSAPAPASHEPRARPPRSAQPQQQAGNQDGGRRYSDAQQLFLGNLPHSATEESLRAIFSRFGPVAELRVHSKPAAPGAPRHPNYGFITYETAQAAADCLLAAANEPLYFPGEGGEGGEGAGVKLNVEEKKTRGREPPRRRPLSSHRASFQPRQNFRR-