Monarch geneset OGS2.0

DPOGS212023
Transcript: DPOGS212023-TA (1233 bp)
Protein: DPOGS212023-PA (410 aa)
Genomic position: DPSCF300054 - 671314-677504
RNAseq coverage: 2014x (Rank: top 6%)
Annotation
Heliconius: HMEL017955; 1e-11; 41.89%
Bombyx: BGIBMGA010178-TA; 3e-99; 94.21%
Drosophila: B52-PA; 1e-78; 79.26%
EBI UniRef50: UniRef50_E1ZYC0; 9e-74; 66.82%; Serine-arginine protein 55 n=8 Tax=Formicidae RepID=E1ZYC0_CAMFO
NCBI RefSeq: NP_001037676.1; 2e-93; 91.58%; splicing factor arginine/serine-rich 6 [Bombyx mori]
NCBI nr blastp: gi|112982956; 3e-92; 91.58%; splicing factor arginine/serine-rich 6 [Bombyx mori]
NCBI nr blastx: gi|112982956; 6e-122; 77.36%; splicing factor arginine/serine-rich 6 [Bombyx mori]
Group
Gene Ontology: GO:0003676; 9.6e-21; nucleic acid binding
    GO:0000166; 2.4e-19; nucleotide binding
KEGG pathway: dgr:Dgri_GH18528; 3e-78
    K12893 (SFRS4_5_6) maps-> Spliceosome
InterPro domain: [5-70] IPR000504; 9.6e-21; RNA recognition motif domain
    [4-79] IPR012677; 2.4e-19; Nucleotide-binding, alpha-beta plait
Orthology group: MCL11585; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212023-TA
ATGGTTGGATCCCGGGTGTACGTTGGCGGGCTGCCTTTTGGTGTTAGAGAAAGAGACTTAGAAAAGTTTTTTAAAGGATTTGGAAGAATAAGAGACATCCTTATTAAGAATGGATATGGTTTTGTGGAATTTGAAGACTACAGAGATGCTGATGATGCAGTCTATGAATTAAATGGAAAAGAATTGCTTGGTGAAAGGGTGGTGGTGGAGCCGGCGCGGGGCATCGACCGCAGCGCGGACCGCTACCGTCGCGACCGCTACTACGAGCGCGACCGCGGCCGATCGAGATACGAATTTTCCGCCCGCAGTGACTACAATTATAGATACGGGCCGCCGACGCGTACGGAGTATCGACTTATAGTTGAAAACCTATCCAGCCGCATTAGCTGGCAGGATTTGAAGGATTACATGCGTCAGGCTGGCGAAGTTACTTACGCGGATGCTCACAAGCAACATAGAAACGAAGGGGTTGTGGAGTTCGCAACTCATTCAGACATGCGAGCTGCTATCGAGAAATTGGACGGTACTGAGCTGAACGGCCGCCGCGTCCGCCTGGTGGAGGACCGACGTTCGTCCAGACGACGCAGCCGCTCCTCTTCCTCAAGGAGCCGCTCACGGTCACGAGACAGGCGCCGCTCACGATCCAGGTCTCGTTCTCGTGGCTCCCGCAGCCGCTCCAAATCCAAATCTCGTCCAAAGAGCAAGAGCCCAGCTGCCAAATCTCATTCGAGATCTCGCTCCAAAGACCGCAGCCGTTCCAGATCCGCTTCCCGCAAGTCTGAGCGCGGGTCGGCATCACGTCCGTCCCGTGAGCGTTCCGCGGGACGGAAGTCCGCAGAGCGGAACGGAAGGTCCGCATCGCGCTCCAAGTCTCGCTCACCTATGGATGATAAGTTAGTATATGACCACCCAAGCACACATACGAGGAGCGATCTCGCTCACGCAGCAAGGAGGCGGGATCACCCAAACGAGAGGAAGAGCGTCGTGAGAGCAAGTCTCGTTCAAGGTCTCGCTCCCGGTCTGGATCGCGCGAGCGGTCAGTCTCACGCGAGCGGTCGCGCTCCGCCTCGCCCAGGCAAAATGGAGACGAACGGGCCGCCGACGAGCGCTCCCCGCGCAGCGGAGACTGAGGGGGCACTAGGGAGTGATGCGGGGGCGGGAGGGGGGAAGCTCACAGTAGACTGGACGGACGGTTACATGGCAAAGAAAAATCGTACATCTGTTACTATCTGA

Protein sequence:

>DPOGS212023-PA
MVGSRVYVGGLPFGVRERDLEKFFKGFGRIRDILIKNGYGFVEFEDYRDADDAVYELNGKELLGERVVVEPARGIDRSADRYRRDRYYERDRGRSRYEFSARSDYNYRYGPPTRTEYRLIVENLSSRISWQDLKDYMRQAGEVTYADAHKQHRNEGVVEFATHSDMRAAIEKLDGTELNGRRVRLVEDRRSSRRRSRSSSSRSRSRSRDRRRSRSRSRSRGSRSRSKSKSRPKSKSPAAKSHSRSRSKDRSRSRSASRKSERGSASRPSRERSAGRKSAERNGRSASRSKSRSPMDDKLVYDHPSTHTRSDLAHAARRRDHPNERKSVVRASLVQGLAPGLDRASGQSHASGRAPPRPGKMETNGPPTSAPRAAETEGALGSDAGAGGGKLTVDWTDGYMAKKNRTSVTI-
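
The transcript and protein entries above are related by the standard genetic code: 410 residues plus one stop codon account for the 1233 bp transcript length. The following is a minimal Python sketch of that relationship, not part of the database record. The function names `translate` and `expected_cds_length` are hypothetical, the standard genetic code (NCBI translation table 1) is assumed, and only short excerpts of the nucleotide sequence are used.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_cds_length(protein_length: int) -> int:
    """Nucleotides needed for the residues plus one stop codon."""
    return 3 * (protein_length + 1)


# Excerpts copied from DPOGS212023-TA: first 24 and last 12 nucleotides.
first_codons = "ATGGTTGGATCCCGGGTGTACGTT"
last_codons = "GTTACTATCTGA"

print(translate(first_codons))   # start of the protein
print(translate(last_codons))    # end of the protein, including the stop
print(expected_cds_length(410))  # length implied by a 410 aa protein
```

The sketch writes the stop codon as `*`, whereas the protein record above marks it with a trailing `-`.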