Monarch geneset OGS2.0

DPOGS208500
Transcript        DPOGS208500-TA    1926 bp
Protein           DPOGS208500-PA    641 aa
Genomic position  DPSCF300064 - 863192-870795
RNAseq coverage   3258x (Rank: top 4%)
Annotation
Heliconius        HMEL008753        0.0     78.35%
Bombyx            BGIBMGA005177-TA  0.0     68.72%
Drosophila        nonA-PB           2e-119  57.83%
EBI UniRef50      UniRef50_Q0ZAL3   0.0     70.49%  Splicing factor proline-and glutamine-rich n=5 Tax=Endopterygota RepID=Q0ZAL3_BOMMO
NCBI RefSeq       NP_001037408.2    0.0     70.34%  PTB-associated splicing factor [Bombyx mori]
NCBI nr blastp    gi|109706829      0.0     70.49%  splicing factor proline- and glutamine-rich [Bombyx mori]
NCBI nr blastx    gi|109706829      0.0     72.06%  splicing factor proline- and glutamine-rich [Bombyx mori]
Group
Gene Ontology     GO:0000166  1.3e-15  nucleotide binding
                  GO:0003676  2.4e-15  nucleic acid binding
KEGG pathway
InterPro domain   [421-473]  IPR012975  1.3e-24  NOPS
                  [271-346]  IPR012677  1.3e-15  Nucleotide-binding, alpha-beta plait
                  [275-342]  IPR000504  2.4e-15  RNA recognition motif domain
Orthology group   MCL11218  Multiple-copy universal gene
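
The InterPro ranges above can be used to cut the corresponding residues out of the protein sequence. The sketch below assumes the bracketed coordinates are 1-based and inclusive on DPOGS208500-PA, which this record does not state explicitly. The names `domain_slice`, `domain_lengths`, and `DOMAINS` are hypothetical helpers introduced for illustration.

```python
# Assumption: InterPro ranges such as [421-473] are 1-based, inclusive
# residue positions on the protein. The record itself does not say so.

def domain_slice(protein: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) of a protein string."""
    if not (1 <= start <= end <= len(protein)):
        raise ValueError(f"range {start}-{end} is outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return protein[start - 1:end]


# The three InterPro ranges listed in this record, keyed by accession.
DOMAINS = {
    "IPR012975": (421, 473),  # NOPS
    "IPR012677": (271, 346),  # Nucleotide-binding, alpha-beta plait
    "IPR000504": (275, 342),  # RNA recognition motif domain
}


def domain_lengths(domains: dict) -> dict:
    """Length in residues of each inclusive range."""
    return {acc: end - start + 1 for acc, (start, end) in domains.items()}
```

To extract a domain, pass the protein sequence with its trailing `-` removed, for example `domain_slice(seq.rstrip("-"), 275, 342)`.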

Nucleotide sequence:

>DPOGS208500-TA
ATGCACGCCATGAACCAATATCGAGGATTTGGAGGCCCACCTCAGCCCCAAAGACAAGGCGGTCGTCGGGGCAATAATCGAGGCGGATTCCGCAATTCCCGCTTTGAGCATAATCAGAATCAGCGAGGGCATAACTTCCATGGGGGTCAACGTCCCCATATTGAGCCTGTAAAACAAGGTCCCCAAAATGAACCGCCACCACCGCAACCACAGCCTCAACCGCAGGCTCAGCAGGAGAAGAAAGAATCCTCTCCTCAACAGACTCCACAGACTCCACAACAGCCACCTCCAAAGCCTCAACCACCTCAACAAAAGCCATTGCAACAGCAACCAGCTCAAAAACCACCTCAAAATCCAACCCAACCTCAACAGACACCATCTCAACCACCGGTTAAACCGGAATCCCAAGACCATGTTGATAAAGCTGCACCTGAACAGAGACCTCAAAACCAAAATCAAAACCAGGGTGCTTATCAAAAGGGTCCAATAGCTAGGCTGATGGACATAAAGCAGGAGCAAGGTCCCAATAATGTCAATGGTGGCAACTTTGGTGCTAACAAACAGGGTGGTTGGAGTCAGGGGGGGCCGGGTAATAAGCCAGGTGGCAATTTTGGTCCAAAAGGTTTTGGTCCAAAACAACAACAAGGACAGGGTCCAGCTCGTAACCAACAGAGGAATAACATGGGGCGTGGACCTCAAGATGGCGGGAAGCCACAACAAAGAGAAGAACAAATGCTTGCTAATAAATTGAAAGATCTCATGGGCCCATTGAATGACATACCTCCAATTGAACAACCGGAGGTCAAATTTAATGGCAGAAGTCGTTTATACATTGGAAATCTTACAAATGATGTTACCGACGAGGAAATACTAACCATGTTTGGACAGTTTGGTGAAACTGCGGAACTGTTTCTGAACAAAGAGAAGAACTTTGGATTTATAAAAATGGACTATAGAGTGAATGCAGAAAAAGCTAAACGTGAAATTGATGGCAGAATGAGAAATGGCAGAAACATCAGAGTAAGATTTGCTCCACATAACAGTGCGGTGCGCGTTAAGAACCTGCCACCGTTTGTGTCAAATGAGTTACTGTACAAGGCATTTGAAATATTTGGCAAGATTGAGAGGGCCTATGTCAAAGTTGATGACAGAGGAAAAACTCTCGGTGAGGGCATTGTTGAGTTTGCCAGAAAGCCCAGTGCATTAGGTGCCATCCGTAACTGCACTGAAAGATGCTTCTTCTTAACTTCATCTTTGCGTCCTGTCATTGTGGAGAGTTTGGAGGAACCTGACGAATTTGATGGTTATCCTGAGAAGAATTTACCAAAGAAACATCCAGAATTCTTACGTGCCAGAGAAATGGGCCCACGTTTCTCCGAGCCAGGCAGTTTCGAGCACGAGTACGGTACCAGGTGGAAGCAACTCCATGAACTGCACAGACAGAAGGAGGAGGCTCTGAAGAAGGAACTGGCTGCCGAGGAGGAGAAACTTGAAGCACAGATGGAGTATGCTAAATACGAACATGAAACGGAGCTGCTCCGAGAACAGCTGCGGCAGAGGGAACAGGACCGAGAGAGACAGAAGAGAAGCTGGGAGATGGCGGAGAGGGCGGCGGACGAGAGGAGGGAGGCGGAGCGCGCGCAGCTGCTGCGGCAGGAGGAGGAGCTCAGCAACCGGATGCGGCTGCAGGACGACGAGCTGCGGCGGCGGCAGCAGGAGAACACGCTGTTCGTACAGGCTCAGAGGTTGAACTCGATGTTGGATCGTCAAGAGCAAGGCATGTTCGACCACCAGCAGCCTATGGATGGAGGCTTCAGAGAGCAGTTCGACATGCCACGTGGAGGTTATGACGAGATGGCACAGAATCGCGGTTGGGACGGACCGCGCCAAATGGATGATTTTCCTAATAAAAGACGTCGATTTTAA
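
The transcript and protein lengths in this record (1926 bp and 641 aa) agree if the transcript is read as a complete coding sequence whose final codon is a stop. A minimal sketch of that arithmetic follows. The function name `expected_protein_length` is hypothetical, and the complete-CDS reading is an assumption, not something the record states.

```python
def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS of cds_length bases.

    Assumes the CDS starts at the first base, has no frameshift,
    and ends with exactly one stop codon (which encodes no residue).
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # subtract the terminal stop codon


# Values taken from this record: 1926 bp transcript, 641 aa protein.
print(expected_protein_length(1926))
```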

Protein sequence:

>DPOGS208500-PA
MHAMNQYRGFGGPPQPQRQGGRRGNNRGGFRNSRFEHNQNQRGHNFHGGQRPHIEPVKQGPQNEPPPPQPQPQPQAQQEKKESSPQQTPQTPQQPPPKPQPPQQKPLQQQPAQKPPQNPTQPQQTPSQPPVKPESQDHVDKAAPEQRPQNQNQNQGAYQKGPIARLMDIKQEQGPNNVNGGNFGANKQGGWSQGGPGNKPGGNFGPKGFGPKQQQGQGPARNQQRNNMGRGPQDGGKPQQREEQMLANKLKDLMGPLNDIPPIEQPEVKFNGRSRLYIGNLTNDVTDEEILTMFGQFGETAELFLNKEKNFGFIKMDYRVNAEKAKREIDGRMRNGRNIRVRFAPHNSAVRVKNLPPFVSNELLYKAFEIFGKIERAYVKVDDRGKTLGEGIVEFARKPSALGAIRNCTERCFFLTSSLRPVIVESLEEPDEFDGYPEKNLPKKHPEFLRAREMGPRFSEPGSFEHEYGTRWKQLHELHRQKEEALKKELAAEEEKLEAQMEYAKYEHETELLREQLRQREQDRERQKRSWEMAERAADERREAERAQLLRQEEELSNRMRLQDDELRRRQQENTLFVQAQRLNSMLDRQEQGMFDHQQPMDGGFREQFDMPRGGYDEMAQNRGWDGPRQMDDFPNKRRRF-
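
The protein sequence can be reproduced from the nucleotide sequence by codon-wise translation. The sketch below assumes the standard genetic code and writes the stop codon as `-`, matching the trailing character of DPOGS208500-PA. `translate` and `CODON_TABLE` are hypothetical names, and this is not necessarily how the geneset itself was produced.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering:
# first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1 using the standard code."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    # The table marks stops with '*'; this record uses '-' instead.
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)


# First 30 and last 15 bases of DPOGS208500-TA.
print(translate("ATGCACGCCATGAACCAATATCGAGGATTT"))  # MHAMNQYRGF
print(translate("AGACGTCGATTTTAA"))                 # RRRF-
```

The two printed fragments match the start and end of the protein sequence above.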