Monarch geneset OGS2.0

DPOGS200021
TranscriptDPOGS200021-TA1218 bp
ProteinDPOGS200021-PA405 aa
Genomic positionDPSCF300337 - 219818-222842
RNAseq coverage574x (Rank: top 22%)
Annotation
HeliconiusHMEL0080350.087.16% 
BombyxBGIBMGA002363-TA0.086.59% 
DrosophilaCG14641-PA1e-14958.35% 
EBI UniRef50UniRef50_Q9NW641e-15162.74%Pre-mRNA-splicing factor RBM22 n=98 Tax=Eukaryota RepID=RBM22_HUMAN
NCBI RefSeqXP_395009.26e-16269.71%PREDICTED: similar to RNA binding motif protein 22 [Apis mellifera]
NCBI nr blastpgi|3800203691e-16069.71%PREDICTED: pre-mRNA-splicing factor RBM22-like [Apis florea]
NCBI nr blastxgi|3838572853e-17572.06%PREDICTED: pre-mRNA-splicing factor RBM22-like [Megachile rotundata]
Group
Gene OntologyGO:00001661.6e-18nucleotide binding
GO:00036762.8e-18nucleic acid binding
GO:00082701.1e-05zinc ion binding
KEGG pathwayame:4115382e-161 
 K12872 (RBM22, SLT11)maps-> Spliceosome
InterPro domain[218-307] IPR0126771.6e-18Nucleotide-binding, alpha-beta plait
[233-301] IPR0005042.8e-18RNA recognition motif domain
Orthology groupMCL14340 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200021-TA
ATGGCTGTGTCGAAGTCTACGAATACATACAATCGTCAGAATTGGGAAGATTCCGATTTCCCGATTTTATGTCAAACATGCCTTGGTGATAACCCCTACATTCGAATGACTAAAGAGAAATATGGCAAGGAGTGTAAGATCTGTGTACGCCCTTTCACCGTATTTAGATGGTGTCCGGGAGCAAGAATGCGTTTCAAGAAAACAGAAATCTGTCAAACATGTTCAAAATTGAAAAATGTTTGCCAAACATGCCTCTTAGATCTTGAATATGGATTACCAATACAAGTCAGAGATGCTGCTCTTAAAATACAAGACGATCTACCTCGTAATGAGGTCAATAAAGAATATTATATTCAGAACTTAGATAGTCAAATGTCTAAATTTGATCCAAGCCAGCCGAGCAATTCCGCTCTGAAATCAAAGGGTGCATCAGATCTACTGATGCGGCTAGCTAGAACGGCTCCTTACTACAAGCGTAATAGACCTCATGTGTGTTCGTTTTGGGTTAAAGGAGAGTGTAGGAGAGGTGAGGAGTGTCCTTACCGACACGAGAAACCAACAGATCCCGACGACCCCTTGGCAGATCAAAATATTAAGGATAGATACTATGGTGTAAATGATCCCGTGGCGGAGAAGTTGATGCGTCGTGCTGCGGCTATGCCAGCACTGCCACCTCCCGAAGACAGAACAGTCACTACATTATACATCGGCAATCTGCCGGAAAATATTACTGAAGAGGAATTAAGAGGCCACTTCTATCAATACGGTGAAATAAGGTCATTGACTCTAGTCCCGAGGGCTCAATGTGCATTTGTTCAATATACTACTCGGAGTGCGGCCGAGCATGCAGCCGAGAAGACCTTCAATAGGTTGGTGATAGCTGGCAGGAGACTTGCCATTAAGTGGGGAAAATCACAAGGACGACAAGGTCCATCGGAGGCTATCGAGGCAGGTGTTCCATTGGAGCCAGTTCCTGGTCTGCCGGCTGCTCTGCCACCACCACCTGCCTTCCTCCATCCCTTCCCGCCACAGATGCCGCCGCCGCCGAATCGTCCGAACGACTTCTTCAATCTGCATCACTACGGCGGTCCTCGGGCGTGGGCCTGGCCTCCTCCCGGGCCTCCGCCTTCCCACCCCGCGCCCCCCGCGCCCTTGCACTACCCCAGCCAGGATCCGTCAAGACTCGGACACGCCCACTCAGCGACGCCACAGACATAA

Protein sequence:

>DPOGS200021-PA
MAVSKSTNTYNRQNWEDSDFPILCQTCLGDNPYIRMTKEKYGKECKICVRPFTVFRWCPGARMRFKKTEICQTCSKLKNVCQTCLLDLEYGLPIQVRDAALKIQDDLPRNEVNKEYYIQNLDSQMSKFDPSQPSNSALKSKGASDLLMRLARTAPYYKRNRPHVCSFWVKGECRRGEECPYRHEKPTDPDDPLADQNIKDRYYGVNDPVAEKLMRRAAAMPALPPPEDRTVTTLYIGNLPENITEEELRGHFYQYGEIRSLTLVPRAQCAFVQYTTRSAAEHAAEKTFNRLVIAGRRLAIKWGKSQGRQGPSEAIEAGVPLEPVPGLPAALPPPPAFLHPFPPQMPPPPNRPNDFFNLHHYGGPRAWAWPPPGPPPSHPAPPAPLHYPSQDPSRLGHAHSATPQT-