Monarch geneset OGS2.0

DPOGS205084
TranscriptDPOGS205084-TA1344 bp
ProteinDPOGS205084-PA447 aa
Genomic positionDPSCF300074 + 198804-248735
RNAseq coverage636x (Rank: top 20%)
Annotation
Heliconius% 
BombyxBGIBMGA006880-TA2e-6849.55% 
DrosophilaA2bp1-PH1e-3284.21% 
EBI UniRef50UniRef50_B7QEL32e-4955.36%RNA-binding protein, putative n=11 Tax=Arthropoda RepID=B7QEL3_IXOSC
NCBI RefSeqXP_967336.11e-5541.23%PREDICTED: similar to AGAP006089-PA [Tribolium castaneum]
NCBI nr blastpgi|3504118508e-5542.60%PREDICTED: LOW QUALITY PROTEIN: RNA binding protein fox-1 homolog 2-like [Bombus impatiens]
NCBI nr blastxgi|3287802354e-7246.79%PREDICTED: RNA binding protein fox-1 homolog 2-like [Apis mellifera]
Group
Gene OntologyGO:00001663.3e-13nucleotide binding
GO:00036767.1e-11nucleic acid binding
KEGG pathwayspu:5922019e-07 
 K12897 (TRA2)maps-> Spliceosome
InterPro domain[229-289] IPR0126773.3e-13Nucleotide-binding, alpha-beta plait
[229-277] IPR0005047.1e-11RNA recognition motif domain
Orthology groupMCL22142 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205084-TA
ATGAACACATTAGCGGTATACAAAGGCTCCGCAGGTGGGTCGGCGATTAATCACAGTGAGGAATTCAGCGGGGGCGGAGGCGGCGCGACTGCCCTAGTGCCGCCTCGGGAGCCCGACGTGGCGCACGCCCGCCCCTCGTTCGGAACACATGCCCCGCAGCTCACCACCCCTACATATCTTTCAAACCCGAACATAAAGCACATGGTGGGCACGGGGATGGCGGGCCCGTTCCCTGCGGCGGCGGCGGCTGCGGCTGCGGCGGCTGTCAAGCGGCGGTTGCCTCTCCTCGCCGCCGTCAAGGCCGATAACGGAGCCGCTGCCTCGCAACAGTCCTCCCTCGTCAAGAGCGAAGGACCTCCGGCACAAGCACCTCTACCGCCGACCAACTTTTCACCCCAACCGACACAGCAACACACCCAGAACTCGATAGAAAACCAAAATCAGAATTCACTATCGGTCCCTCAGCAAAGCGAAGCGGACTCGAACGACGAGAGCACGCCGGTGAGTGTGGCGGCGGCGGCTGCGGCTGCACACGCGGCGGCGGCTGCGGTCACGGCGGCGGTGGCGGCCGGCGCTGCACGCGACCGCCGACAAGAGCCTCGTGCCCGTCACGGCCTCGCAGCCCAAGCGACTGCACGTCTCCAACATCCCCTTCAGATTCAGAGACCCGGACCTCAGGAACATCAATATGGAACGATACTTGATGTAGAAATTATCTTCAACGAACGCGGCTCCAAGGGATTCGGTTTTGTAACATTCGCAAATAGTGGTGATGCGGAGCGAGCACGAGAGCGTCTTCACGGCACCGTGGTTGAGGGCAGAAAGATAGAGGTTAATAACGCCACAGCCAGGGTGCAGACAAAGAAACCACCCGCGGTGCCAAACGCTCCTGCTCTGCGCGGCGCCGCCGTCCTCCGCGGCCGCTCCCCGCGCCCCCCCTCCGCGCCGGCCGCTCACCCTCACCCCCTCGCACCCGCGGTGATGCCACGCGCCAACCCATTCGCCTCGCCGTTGCACGCATACGCACATGTGTATTACGACCCGTTCTTAGCTGCAGCGGCGACTGCTGACTCCAACTACAGGCTACAGGCGGCGGCGGCGGCGGCAGCGGCGGCGGCTCCATTGCTGAAGTCCCCGCTGACGACGGCGCAGCATGCGGCAGCAAATAACCAAAGTCAAAATATCAGCTTTCTACTAGTTAGTATAGTACGACATATCGAGTCGAATAAGAGATACTACGGTCGAGAGTATGCGGACCCGTATCTAGGACACAGCATCGGTCCGGTTACAGGATACGGGACCGCGGTCTACAGAAGCGGATACAACAGATTCGCTCCGTACTAA

Protein sequence:

>DPOGS205084-PA
MNTLAVYKGSAGGSAINHSEEFSGGGGGATALVPPREPDVAHARPSFGTHAPQLTTPTYLSNPNIKHMVGTGMAGPFPAAAAAAAAAAVKRRLPLLAAVKADNGAAASQQSSLVKSEGPPAQAPLPPTNFSPQPTQQHTQNSIENQNQNSLSVPQQSEADSNDESTPVSVAAAAAAAHAAAAAVTAAVAAGAARDRRQEPRARHGLAAQATARLQHPLQIQRPGPQEHQYGTILDVEIIFNERGSKGFGFVTFANSGDAERARERLHGTVVEGRKIEVNNATARVQTKKPPAVPNAPALRGAAVLRGRSPRPPSAPAAHPHPLAPAVMPRANPFASPLHAYAHVYYDPFLAAAATADSNYRLQAAAAAAAAAAPLLKSPLTTAQHAAANNQSQNISFLLVSIVRHIESNKRYYGREYADPYLGHSIGPVTGYGTAVYRSGYNRFAPY-