Monarch geneset OGS2.0

DPOGS203665
TranscriptDPOGS203665-TA1185 bp
ProteinDPOGS203665-PA394 aa
Genomic positionDPSCF300010 - 2487969-2490661
RNAseq coverage342x (Rank: top 34%)
Annotation
HeliconiusHMEL0133530.085.61% 
BombyxBGIBMGA003462-TA2e-16083.00% 
DrosophilaSpf45-PA6e-9046.00% 
EBI UniRef50UniRef50_E2B7V42e-11359.70%Splicing factor 45 n=8 Tax=Arthropoda RepID=E2B7V4_HARSA
NCBI RefSeqNP_001040441.10.082.66%RNA binding motif protein 17 [Bombyx mori]
NCBI nr blastpgi|1097068230.083.92%splicing factor 45 [Bombyx mori]
NCBI nr blastxgi|1097068230.083.92%splicing factor 45 [Bombyx mori]
Group
Gene OntologyGO:00036763.3e-21nucleic acid binding
GO:00001663.5e-18nucleotide binding
GO:00056222.5e-12intracellular
KEGG pathwayame:4096962e-114 
 K12840 (RBM17, SPF45)maps-> Spliceosome
InterPro domain[1-395] IPR0169672.1e-130Splicing factor, SPF45
[300-380] IPR0039543.3e-21RNA recognition motif domain, eukaryote
[299-382] IPR0126773.5e-18Nucleotide-binding, alpha-beta plait
[210-249] IPR0004672.5e-12D111/G-patch
Orthology groupMCL13409 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203665-TA
ATGTCGTTATATGATGATCTTGATACTATAAAAGCCCGTACGTCTGATAAAGTCGCAGGATGGTCTTCGGGAATAAAACTTTTGCAATCTCAATTACAATTAAAAAAGGCTGCTGTAACACAACCAAAGAGAGAAGCTTTGCGAAGATCTAATCAGGTACTAACTCCTGTAATAGACTTGAAAAGTAAGTCTAAGGATGATGACGAGCCTAACACGAATAGCCCAAAAATTCAACCAAAAACAATTACAGCTACTTTGAATGTTCGTGATTTTGATTGGAATGTTGCAAATGAATATGATCCTATGTGGCCTAATGATTATGAAAAAGTAGCCAAAGAAATGCAAGCTAAAAGGCTCAACCTAGATGGAAGCAATGAAAGAACAGAAAGGAGTGAGAGATCAGAAAGGAAACGTAAAACAAGATTTAATGATGAAGAAGATGTTATACCAGAAAAAACATTAGTGCCAATGCAACCTGAAGAGGAAGAAGTGGAAGAGAAAACCAAACGGACAGCAGGAGTTGCTATTGCACCGCCACCTTCTTTGACTATAGAGAGTCTCTCACCCCCACCTGTCATACCAACTCCAACTGCAAGCCAAGGTTTCTCACTTGGTGGCTATGGTGCTAGTTCGGTTGCAGCAAAAATAATGGCAAAATATGGTTTTAAAGAGGGTCAAGGTTTAGGCAAGAAAGAACAAGGAATGTCAGTAGCATTGCAGGTTGAGAAAACCTCCAAGCGTGGCGGTCGCATTATTCACGAAAAGGACAGCACAAACATGATGCCACCTAGTTTTGCCATGACATCATATTCAGGACCGGACTCACCCAATGCTTCAAATTCACCACATTCAAGACAGGAACCGTCTATTACAGAAATAATGAAAACACCAAGTAAAGTGGTTTTATTGAGGAATATGGTTGGACCAGGTGATGTTGATGAAGAACTTGAGCCGGAGGTCAAAGATGAGTGCAACACCAAGTATGGTGAAGTAGTAAAAGTCCTGATCTTTGAAATGCCCAATGCACCAAGTGATGAAGCTGTCAGAATATTTGTGGAATTCAAGAGGATTGAAAGTGCTATTAAAGCAGTTGTTGATTTGAATGGGAGATTTTTTGGTGGAAGACAGGTCAAGGCTGGCTTTTATGATGTAGAAAAGTTTGCATCTTTGCAATTAAATGAATGA

Protein sequence:

>DPOGS203665-PA
MSLYDDLDTIKARTSDKVAGWSSGIKLLQSQLQLKKAAVTQPKREALRRSNQVLTPVIDLKSKSKDDDEPNTNSPKIQPKTITATLNVRDFDWNVANEYDPMWPNDYEKVAKEMQAKRLNLDGSNERTERSERSERKRKTRFNDEEDVIPEKTLVPMQPEEEEVEEKTKRTAGVAIAPPPSLTIESLSPPPVIPTPTASQGFSLGGYGASSVAAKIMAKYGFKEGQGLGKKEQGMSVALQVEKTSKRGGRIIHEKDSTNMMPPSFAMTSYSGPDSPNASNSPHSRQEPSITEIMKTPSKVVLLRNMVGPGDVDEELEPEVKDECNTKYGEVVKVLIFEMPNAPSDEAVRIFVEFKRIESAIKAVVDLNGRFFGGRQVKAGFYDVEKFASLQLNE-