Monarch geneset OGS2.0

DPOGS214942
TranscriptDPOGS214942-TA2583 bp
ProteinDPOGS214942-PA860 aa
Genomic positionDPSCF300280 - 113346-120568
RNAseq coverage759x (Rank: top 17%)
Annotation
HeliconiusHMEL0155930.071.84% 
BombyxBGIBMGA004823-TA0.080.73% 
DrosophilaCG3605-PA0.056.72% 
EBI UniRef50UniRef50_F4WDL60.056.07%Splicing factor 3B subunit 2 n=5 Tax=Formicidae RepID=F4WDL6_ACREC
NCBI RefSeqXP_975513.20.060.53%PREDICTED: similar to CG3605 CG3605-PA [Tribolium castaneum]
NCBI nr blastpgi|2700095550.060.53%hypothetical protein TcasGA2_TC008829 [Tribolium castaneum]
NCBI nr blastxgi|2420113710.057.05%Splicing factor 3B subunit, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00063971.2e-59mRNA processing
GO:00056341.2e-59nucleus
KEGG pathwaytca:6644130.0 
 K12829 (SF3B2, SAP145, CUS1)maps-> Spliceosome
InterPro domain[428-555] IPR0071801.2e-59Domain of unknown function DUF382
[560-618] IPR0065681.3e-29PSP, proline-rich
Orthology groupMCL12203 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214942-TA
ATGGACGGTCCACCGGGAACTACGTCTGGAGGTAGTACCAGTTCTATGGGCCCACCGCCAGGAATGCCGAGTTTTCCTCCCATGCCTCCATCGTCAGGCCCCATGGGTCCTGGAAGTATGCCGCCTCCTCCAGTGGGTCCACCTGGTACAATGCCTGCAGTCACAACATCTGGTGGTCCACCAAACATGCCGCCACCAGGAATGGGTCCACCTCCTAACATGATGGGTATGGGTCCTCCTGGTATCGGTCCTCCACCCCCGCCCGGTTTGGGACCCCCAGGAATCAACATGGGACCTCCTCCGATGGGACCGCCAGGCCTTCCGTCACGAATGCCTCCTAACATGATGCGGGGAACATCTAATATGAAGAGTAACTACAATCAAACTATAGATATGGGACCGCCTGGTATGGTGCCACCCTCTAGTATGAATCCTTGGGACAATCAATGCCCTCCTGGTTGGGGGCGACAAGGGAGAGGGGATGGCCCTCCAGGATGGGACGATCAGGACGATGATGAAGATGATAATGATGATGAAAGTGATCCTTCAGGACCTCCACTACCATCCTTGTTGACCATGAAAATAGATACACCCGAGGAGTTCAGAAATAAACCCCCTTCTGCTGTGGGTGGTGTTGTGCTACCAAAAGCCTTGGAGGAGGCACTCGCTTACAAAGATCAAAGACAAGCTGCCTTAGGAGATGAAGCAGATAAAGTAACAGAGCAAACAAAGAAACCTGAACCTCCACCGGCACCTGTGATCAGTACAGAGTATGATGGTGAAGAAGAAGGAGACTCGGATGAAGATAACATACCAGAAGCTCCCTTACCACCAATAATATCTAAGCAAGAGAATCAAACCAAAGCGAGTAAAACTAAACGGAAAAAGAAGAAGAAGAAGGCGGCGAAACAGAAGAGAAAAGAAGCAAAGTCGGCCGACGAAAGTAGCAAAGAAGCCCAGAAGACCAGCGACAAAGAAAACGAAAAGGAAGCTGAAATCGAATACGTCCAAGAGAACATACAGTTCCACGAACTGGAGCCCATGTACCGTCAGTTCCACCGCATCCTGGAATCGTTTAAGATAACGGAGAGGAAGGAGGAGATCAAGGATGAACCCGGGAAAGATGCACCGAAACCGAGCAAGCCGCTGGAGAAAGTTACCGACCAATTTGCAGCTGACGAAGAGGCTGTTGAGAAACATGCAGCCGATGAGAAGGAGCGGCTCTCAAAACGCAAGTTAAAGAAGCTGTCTCGTCTGTCCGTGGCGGAGCTGAAGCAACTGGTGGCCCGGCCGGATGTAGTGGAGATGTACGACGTCACCGCCAGGGACCCCAAACTGCTGGTACAGCTGAAGGCTCACAGGAACACTGTCCAAGTGCCGCGCCACTGGTGTTACAAACGGAAGTATCTGCAAGGCAAGCGCGGTATCGAGAAGCCGCCGTTCGACCTGCCGGACTTCATCAAGAAGACCGGCATCATGGAGATGAGAGCCTCGCTCCAGGACAAGGAGGAAACTAAGACATTGAAGGCGAAGATGAGGGAGAGGACGCGACCCAAGCTCGGGAAGATTGACATCGACTACCAGAAGCTGCACGACGCGTTCTTCAAGTGGCAGACGAAACCTCGCATGACCATCCACGGTGACCTCTACTACGAGGGTAAGGAGTATGAAACTCGACTGAGAGAAAAGAAACCGGGAGATCTCTCAGAGGAACTGAGAACCGCACTGGGCATGCCGGTGGGACCTGGCTCTCATAAGGTGCCGCCGCCGTGGCTGATCGCCCAGCAGCGTTACGGACCGCCTCCGTCTTACCCAAACCTCAAGATCCCGGGCCTGAACGCTCCTATACCCGAGGGTTGCGCCTTCGGGTACCACGCGGGCGGCTGGGGTAAGCCTCCCGTCGATGAAGCCGGCAAACCTCTCTACGGAGACGTGTTCGGACATCAGAGCAGCGGCCAAGATGATGCCGAGGATCAAGATATAGACAGGACCATGTGGGGTGAACTGGAGTCGGAGTCAGAGGAGGAATCGGAAGAAGAGGAATCAGATGAGGGCGAGAAGGCCGGTGAGGGTGAGGCCGTGGCAGCGGGCGTGGCGACTCCTGGTGAGGGACTCGTCACACCGCTGGGCACCAGCTCTGTACCGCCCGGACTGGAGACACCTGACACCATCGAGCTCAGGAAGAAGAAGATGGAGGATCTAGAAGGCGGTGAGACACCGGCCTTGTATCAAGTGGTCCCCGAGAGACGAGTTGGTCTCACGTCTGGTATGATGGCGTCCACACATGTGTATGACATCAATGCCGCAAATCCTGGTAAACGAGCTCCGACCGGTGCAACCAGTGAGGTTGGTCCCAGCGCTGCAGCTGGTGTAGAAGTGGCGCTGGACCCCTCGGAGCTGGAGCTGGAGCCCGAGGCTGTGGCGGCCAGGTACGAGAGACACCTGCGGGAACACAGGCCCAAGGGACGCGAGGACCTCTCAGATATGTTGGCCGACCACGTCGCCAGACAGAAGAATAAACGAAAGCGTCAACAAAACACAGATTCCAAGCAAGCGAAGAAATACAAAGAATTCAAGTTCTAA

Protein sequence:

>DPOGS214942-PA
MDGPPGTTSGGSTSSMGPPPGMPSFPPMPPSSGPMGPGSMPPPPVGPPGTMPAVTTSGGPPNMPPPGMGPPPNMMGMGPPGIGPPPPPGLGPPGINMGPPPMGPPGLPSRMPPNMMRGTSNMKSNYNQTIDMGPPGMVPPSSMNPWDNQCPPGWGRQGRGDGPPGWDDQDDDEDDNDDESDPSGPPLPSLLTMKIDTPEEFRNKPPSAVGGVVLPKALEEALAYKDQRQAALGDEADKVTEQTKKPEPPPAPVISTEYDGEEEGDSDEDNIPEAPLPPIISKQENQTKASKTKRKKKKKKAAKQKRKEAKSADESSKEAQKTSDKENEKEAEIEYVQENIQFHELEPMYRQFHRILESFKITERKEEIKDEPGKDAPKPSKPLEKVTDQFAADEEAVEKHAADEKERLSKRKLKKLSRLSVAELKQLVARPDVVEMYDVTARDPKLLVQLKAHRNTVQVPRHWCYKRKYLQGKRGIEKPPFDLPDFIKKTGIMEMRASLQDKEETKTLKAKMRERTRPKLGKIDIDYQKLHDAFFKWQTKPRMTIHGDLYYEGKEYETRLREKKPGDLSEELRTALGMPVGPGSHKVPPPWLIAQQRYGPPPSYPNLKIPGLNAPIPEGCAFGYHAGGWGKPPVDEAGKPLYGDVFGHQSSGQDDAEDQDIDRTMWGELESESEEESEEEESDEGEKAGEGEAVAAGVATPGEGLVTPLGTSSVPPGLETPDTIELRKKKMEDLEGGETPALYQVVPERRVGLTSGMMASTHVYDINAANPGKRAPTGATSEVGPSAAAGVEVALDPSELELEPEAVAARYERHLREHRPKGREDLSDMLADHVARQKNKRKRQQNTDSKQAKKYKEFKF-