Monarch geneset OGS2.0

DPOGS204992
Transcript: DPOGS204992-TA (2457 bp)
Protein: DPOGS204992-PA (818 aa)
Genomic position: DPSCF300123 + 133962-142430
RNAseq coverage: 927x (Rank: top 14%)
Annotation
Heliconius: HMEL009494; 3e-164; 81.96%
Bombyx: BGIBMGA010228-TA; 2e-99; 69.08%
Drosophila: Saf-B-PB; 2e-38; 39.74%
EBI UniRef50: UniRef50_UPI0002246E5D; 1e-50; 47.77%; UPI0002246E5D related cluster n=1 Tax=unknown RepID=UPI0002246E5D
NCBI RefSeq: XP_001606003.1; 2e-51; 47.77%; PREDICTED: similar to scaffold attachment factor B2 [Nasonia vitripennis]
NCBI nr blastp: gi|345488332; 4e-50; 47.77%; PREDICTED: hypothetical protein LOC100122397 [Nasonia vitripennis]
NCBI nr blastx: gi|383861384; 5e-96; 33.81%; PREDICTED: SAFB-like transcription modulator-like [Megachile rotundata]
Group
Gene Ontology: GO:0000166; 2e-21; nucleotide binding
  GO:0003676; 8.2e-21; nucleic acid binding
KEGG pathway: osa:4345455; 1e-13
  K12897 (TRA2) maps-> Spliceosome
InterPro domain: [320-421] IPR012677; 2e-21; Nucleotide-binding, alpha-beta plait
  [340-413] IPR000504; 8.2e-21; RNA recognition motif domain
  [8-49] IPR003034; 1.2e-11; DNA-binding SAP
Orthology group: MCL25720; Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204992-TA
ATGTCGGACCGAAAACGTTTAATATCTGATCTTCGTGTAATAGATCTACGCGCTGAGTTAGAAAAGCGCAATCTAGATAAAAGCGGTGTACGCGGTGTGCTCGTACAACGTTTGTCTAAGTATTTAGAAGAGAACGGGGAAGATCCGGCTACTTTTACTTTCGAATTAGCTACAAATGAAGTAAAGACTCCATCAAAGAAGACCAGACGCACTGAAAGTGCTGTAGAAACTGTTGAAGAAGAAACCCCAATTATGGAAGATATGATAGTCCAAGATACAGCTGGTGAGGAAGAGGATACGGAACAATCGGAAAATAATCAGAAAAAAAAGGCGGCGAAAGAATCTCCGAAAAAAACAGATGAAATGGAGGTTGACGATGATGAGAAGGTTAACCGCAAACGTGAAAGTAAAGACGAGGCTGAAGCTAGTCAAGCAAAGAAACCCTGCTTAGAAAAGGATGTTAGAAATGAGGAAGAACATAAGGCAGAAAATAATACAGACGCTGAAGACAGTATCAACTTAGATTTGGGTGAAGACGAACTTTTGAATGAAGAGACTGACATCTCAGCTAAAAACAAGAAAGATGACGCCGCGCACGTGGATCCTGCAGCGGCCGCGAGCGTCGAGCCGTCGCAGACTTCGGAGCCAGCTGACTCCTCAGAATGTCAAGTAATCACCGAAGCGGCTGCCACTAGGTTGGTGATCACCGAACGTCGCGACGCCTCGGATGCAACGGGCATCGTTGGAGCTCCGGATCACCATGGACATCTTGAACTGATAGACTTCACGAAGAATATCACATGTTCTGATCGGCGCTGCTCACGTGAGATATCTGAACGTGAAGAAGATATGTATATTAAAGAAGAGCTTTGGACTTCAGCCGACAGTCATCGCACAAGCAACACACAACGCAATGAGTCTGATAAGGTTGATAGAGAAGACAAGGATGACAAAGAGAAAGATGGGGATGTTGACAAAAAGGAAAAGAGAGACGAAAACAAAGAGGGTGGCGCGAGGAATTTATGGGTTAGTGGATTGTCTGGGGACACGAGAGCCAAGGATCTGAAGCAGCTATGTAGCAAGCACGGCAAGGTGATTGGAGCCAAAGTGGTAACCAACGCCCGTACCCCAGGATCTCGTTGCTATGGTTACGTTACTATGGCCAGCTCTCAGGATGCCGAAAACTGTATTAAAAATCTACATAGAACTGAATTACACGGTCGCATGATATCTGTGGAGAAGGCTAAATCTGAATCCGAGTCCGCAGCTCGCCGCCAGCCGTCCCTCAGGCCCGACAGGCGGAACAGCAAGGAACGGAAAGACGACGCCAAGGAAAACCAGGATACAAAAGACGGTAATGAGAAATCTGAATCTAAACCTAAAAAGGAAGGCGAGATCTCCGGCGCTGAGAGCACTAGATCTACGTCACGTACTCGTGAAAAGTCGCACCGTTCCGATAAGGATAGAGTAAGGCGTAGCTCAAGAAGCCGCGAACGGAGGCGGTCACCACGGGAGGTGCTCTCGTTTTCAAAGATATGGAAGGAGCGTGATGTAGCCCGCGCTCGTGAGCGTTCGAGGGCGGCTCGTGAAGAAGAACGGAGAAGGCGGGCGGCGGAAGACGCGCTCAGAGAAAGAGAGAGGAGACAAAGACAAGAGAAACATAGACTCACCATAGAGAGGGAGAAGTTACGAGCTGAGAGGGAAAAGATAGAACGAGAGAAGAATGAACTTTTGAGGTTAGAGAGGGAGAGACATAGATTAGAAAGAGAAAAACTGGAATTGGAGAGATTGGAATTGAAAAGGGCTCAACTAAGGTTAGAAGAGGAGCGTCGCAAGCGGGGCTATGAGAGTGCGGCCTATCGCAAGGCGGCCAGTCCACCCGAACCTTCCTACGACAGGGACGCAAGACACAAAAGGCCGCCGCCGCCTACTTTGCAGGCGTCGAGCCGCGGTCAGTTCGAAGCGCCGCCGCCGCCGAGGTTCGAATTGGCCGGAGGATA
CGACCGCACCGACAAACGGGATAGGGATTACAAGAGAGACTATCCCCGACACGTAGCATCTAACAATATGAAATATCCACCAAACGGCAGCGCTAGCGAGGACACGAGACAGCAGCTGCCCCCCGGCACTAGACCCAAAGAGCCAAGTCGATCGTACGACTCTAGAGAAGGTCGTTCGTACCGTCCCTCTCCTCCGGGGAAACCTGAACCGCGCAGTTGGAACGCCTCCAGTCGATACCCAGAAGCCGCAGCGCCCACTAAAGGTTGTATTCCAATCATAAAAAAAGCCATGGCGGGCAGCGGTAGCGGCAGCAGTAGCGAACCGTGGTCGGGCGAGGCTCGGTATGGCGGGTCATACGAGACGCGCTATTCTCCCGCTTACCAACCTCAGCCCCCCGGCGCCGCTTACCCGGACCGGTACGTCCCCGCCGCAAGGGACTACGCCAGGAAATACTGA

Protein sequence:

>DPOGS204992-PA
MSDRKRLISDLRVIDLRAELEKRNLDKSGVRGVLVQRLSKYLEENGEDPATFTFELATNEVKTPSKKTRRTESAVETVEEETPIMEDMIVQDTAGEEEDTEQSENNQKKKAAKESPKKTDEMEVDDDEKVNRKRESKDEAEASQAKKPCLEKDVRNEEEHKAENNTDAEDSINLDLGEDELLNEETDISAKNKKDDAAHVDPAAAASVEPSQTSEPADSSECQVITEAAATRLVITERRDASDATGIVGAPDHHGHLELIDFTKNITCSDRRCSREISEREEDMYIKEELWTSADSHRTSNTQRNESDKVDREDKDDKEKDGDVDKKEKRDENKEGGARNLWVSGLSGDTRAKDLKQLCSKHGKVIGAKVVTNARTPGSRCYGYVTMASSQDAENCIKNLHRTELHGRMISVEKAKSESESAARRQPSLRPDRRNSKERKDDAKENQDTKDGNEKSESKPKKEGEISGAESTRSTSRTREKSHRSDKDRVRRSSRSRERRRSPREVLSFSKIWKERDVARARERSRAAREEERRRRAAEDALRERERRQRQEKHRLTIEREKLRAEREKIEREKNELLRLERERHRLEREKLELERLELKRAQLRLEEERRKRGYESAAYRKAASPPEPSYDRDARHKRPPPPTLQASSRGQFEAPPPPRFELAGGYDRTDKRDRDYKRDYPRHVASNNMKYPPNGSASEDTRQQLPPGTRPKEPSRSYDSREGRSYRPSPPGKPEPRSWNASSRYPEAAAPTKGCIPIIKKAMAGSGSGSSSEPWSGEARYGGSYETRYSPAYQPQPPGAAYPDRYVPAARDYARKY-
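
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code and that the transcript is a complete coding sequence ending in a stop codon. The names `translate` and `expected_cds_length` are illustrative helpers and not part of the geneset or of any library. The sketch writes a stop codon as `*`, whereas this record marks the end of the protein with `-`.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def expected_cds_length(protein_length: int) -> int:
    """Nucleotides needed for the protein plus one stop codon."""
    return 3 * (protein_length + 1)


# First 18 nt and last 9 nt of DPOGS204992-TA, copied from the record above.
print(translate("ATGTCGGACCGAAAACGT"))  # MSDRKR
print(translate("AAATACTGA"))           # KY*
print(expected_cds_length(818))         # 2457
```

The translated ends match the start (MSDRKR) and end (KY followed by the stop marker) of DPOGS204992-PA, and 818 residues plus one stop codon account for the listed 2457 bp.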