Monarch geneset OGS2.0

DPOGS215326
TranscriptDPOGS215326-TA1083 bp
ProteinDPOGS215326-PA360 aa
Genomic positionDPSCF300120 + 216570-220268
RNAseq coverage1440x (Rank: top 9%)
Annotation
HeliconiusHMEL0100212e-13169.75% 
BombyxBGIBMGA007965-TA1e-5661.42% 
Drosophila% 
EBI UniRef50UniRef50_G0S9S96e-0827.76%Splicing factor (Prp24)-like protein n=5 Tax=Sordariales RepID=G0S9S9_CHATD
NCBI RefSeqNP_502291.16e-0628.57%hypothetical protein F11A10.7 [Caenorhabditis elegans]
NCBI nr blastpgi|3787323601e-0730.29%nucleolin, variant [Exophiala dermatitidis NIH/UT8656]
NCBI nr blastxgi|1255604294e-1124.32%hypothetical protein OsI_28114 [Oryza sativa Indica Group]
Group
Gene OntologyGO:00001663.8e-11nucleotide binding
GO:00036764.1e-09nucleic acid binding
KEGG pathway 
InterPro domain[69-166] IPR0126773.8e-11Nucleotide-binding, alpha-beta plait
[90-164] IPR0005044.1e-09RNA recognition motif domain
Orthology groupMCL25410 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215326-TA
ATGGTCGCTCAGAAGAGAGGTCGTGGTGCCAAAAATCAAGCGAACAAAGCTAAAGCCAATCAGCCAACTGAAGAAGTCGTTGCGACGGAAGTGACCGAACAAGAACCTGAAGCTGCCGAAGAAAACAATGGAGATTCACAGCCCGAGAATAATGAAGAACAAGCGACAGGGGAGGAAGCTGCGGAGGAACAGTCAGAAACAGAACAGACCCAAGAAGATGAACCCAAAGCTGAAGAACTCAAAGAAGATAAAGTAGAATCTGGGAAACTACTGGTGGAGAATCTGCCTTCAAGCTATCTATTCGACTATCAAGACAAACTCAAGGAGCTGTTCAGTAAGCATGGAGAAGTCGTCAGTGTGAAACGTGGTCCAATCATTGTAACCGAACTGACCACAACTCCAACATTATCGGCTATAGTTGAATTCAAAAACAAAGATTCCCTGGAAAAGGCGCTGTCAGAGAACGGCACGTCCTTGGATGGTAGCGTAATCTCTGTGGTGGTGGAATCCCGAGCGGACACCGGCATCCTGGTGGGAGTCCCTTACGAGGCCAGCTGCGACTACGTGAAGGCGCTGTTCGAACAGTGCGGCCCGGTCGCGCACGTACATGAGTTCAGCAAGACCAAGTACAAGATATTACGAGTAACCTTCAATGAGAAGGAGTCAGTGGAGCGTGCTCTGAAGTTGGACAGAGACCTCCGCATCAACGGGTTCCTGGTTACAGTATCCAAGTACAGGGACGACGACGAGTTCAAGGCACACTCCGCTAACTCAAACAAGAACAAGAGACAGCATGGCCAACAGAATCGGAACTCCGGTAACACGAACGTGAACAACTCGGGAGGAACCAGCCCCGCATCACGCACCAACAGTTACCGCGCGAGAGGAGGCGCCTTCAACGCCAGAGGAGGAGGCAACTATAGAGGCGGCCGAGGTCGCGGGTTCTCGGGGCGCGGCTCGTTCCCTCGCGGCGGCTTCACTCCGCTCAACACGTACATGCCGCCACAGCCGTACATGGGCCGACCGCAGGGAGGCTTCATGAACGACGGTCAGCGGCCGGTCAAGCGACTCCGACAGATGTAG

Protein sequence:

>DPOGS215326-PA
MVAQKRGRGAKNQANKAKANQPTEEVVATEVTEQEPEAAEENNGDSQPENNEEQATGEEAAEEQSETEQTQEDEPKAEELKEDKVESGKLLVENLPSSYLFDYQDKLKELFSKHGEVVSVKRGPIIVTELTTTPTLSAIVEFKNKDSLEKALSENGTSLDGSVISVVVESRADTGILVGVPYEASCDYVKALFEQCGPVAHVHEFSKTKYKILRVTFNEKESVERALKLDRDLRINGFLVTVSKYRDDDEFKAHSANSNKNKRQHGQQNRNSGNTNVNNSGGTSPASRTNSYRARGGAFNARGGGNYRGGRGRGFSGRGSFPRGGFTPLNTYMPPQPYMGRPQGGFMNDGQRPVKRLRQM-