Monarch geneset OGS2.0

DPOGS200700
TranscriptDPOGS200700-TA1005 bp
ProteinDPOGS200700-PA334 aa
Genomic positionDPSCF300274 - 178493-181135
RNAseq coverage90x (Rank: top 63%)
Annotation
HeliconiusHMEL0118963e-13099.55% 
BombyxBGIBMGA005282-TA2e-9298.17% 
DrosophilaSpx-PA4e-11890.23% 
EBI UniRef50UniRef50_UPI000194DCF32e-11287.68%UPI000194DCF3 related cluster n=1 Tax=unknown RepID=UPI000194DCF3
NCBI RefSeqNP_001037646.18e-13098.22%spliceosomal protein on the X [Bombyx mori]
NCBI nr blastpgi|1129833281e-12898.22%spliceosomal protein on the X [Bombyx mori]
NCBI nr blastxgi|1129833280.094.44%spliceosomal protein on the X [Bombyx mori]
Group
Gene OntologyGO:00001664.6e-29nucleotide binding
GO:00036766.2e-24nucleic acid binding
KEGG pathwaycqu:CpipJ_CPIJ0141492e-120 
 K12831 (SF3B4, SAP49)maps-> Spliceosome
InterPro domain[8-89] IPR0126774.6e-29Nucleotide-binding, alpha-beta plait
[14-87] IPR0005046.2e-24RNA recognition motif domain
Orthology groupMCL13531 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200700-TA
ATGGCTGCAGGACCAATTGCAGAAAGAAATCAAGATGCAACGATTTACGTTGGTGGTTTGGACGACAGAGTCACAGAAAGCCTGCTTTGGGAACTATTTGTTCAAGCTGGACCTGTCGTGAATGTTCATATGCCAAAGGACCGAGTGACACAAACTCATCAAGGATATGGATTTGTTGAATTTATGGGAGAAGAAGATGCCGATTATGCAATAAAGGTCATGAATATGATAAAACTATATGGAAAGCCCGTAAGGGTAAATAAAGCATCAGCACACCAGAAGAACCTTGATGTGGGAGCTAATGTGTTCATTGGTAATCTTGATCCCGAGGTAGATGAAAAACTATTGTATGATACATTTTCTGCATTCGGTGTCATATTGCAGACGCCTAAGGTTATGAGAGACCCCGAGACTGGTAACTCGAAAGCATTTGCATTTATAAACTTTGCATCCTTCGAAGCCTCCGATGCGGCCATAGAGGCTATGAATAACCAGTACTTGTGCAACCGTCCTATATCTGTCTCTTATGCATTTAAGAAAGATGTTAAGGGAGAAAGGCATGGTTCCGCAGCTGAAAGGTTACTGGCAGCACAAAATCCTTTATCGCATGCAGACAGACCTCACCAGCTGTTTGCAGATGCGCCACCTATAATGGGTCCACTGCTGATGGCTCCTCCTCCACCACCGACGCCAAGTCCGATGCCGCCTGGCCCTCCGTTGACATCACGACCTCCATTGCCAATGGCTGCTCCCCCACCACCACCCTCAACAATGCCTCCACCTGGGCCACCTCCACCCCCAGGGCCCCCACCCCCTCCGCCGCCATTTCATCATTTCCCCCCTCCTCCATTCGGGCCACCTGGCTTTGGCCCGCCACCCCCTCCTGGAGCGAGGCCCCCACCATGGAGGCCACCCCCACCGTCTTTCAGGCCCCAATACCGACCGCCACCGTTTGGTCATCCCCCCTTCCCGCACCACCCGCCTGAACCAAACTATAACTATTAA

Protein sequence:

>DPOGS200700-PA
MAAGPIAERNQDATIYVGGLDDRVTESLLWELFVQAGPVVNVHMPKDRVTQTHQGYGFVEFMGEEDADYAIKVMNMIKLYGKPVRVNKASAHQKNLDVGANVFIGNLDPEVDEKLLYDTFSAFGVILQTPKVMRDPETGNSKAFAFINFASFEASDAAIEAMNNQYLCNRPISVSYAFKKDVKGERHGSAAERLLAAQNPLSHADRPHQLFADAPPIMGPLLMAPPPPPTPSPMPPGPPLTSRPPLPMAAPPPPPSTMPPPGPPPPPGPPPPPPPFHHFPPPPFGPPGFGPPPPPGARPPPWRPPPPSFRPQYRPPPFGHPPFPHHPPEPNYNY-