Monarch geneset OGS2.0

DPOGS207257
Transcript        DPOGS207257-TA   3651 bp
Protein           DPOGS207257-PA   1216 aa
Genomic position  DPSCF300008 - 735600-742804
RNAseq coverage   1646x (Rank: top 8%)
Annotation
Heliconius        HMEL002183        0.0  92.60%
Bombyx            BGIBMGA000829-TA  0.0  93.51%
Drosophila        CG13900-PA        0.0  74.39%
EBI UniRef50      UniRef50_Q15393   0.0  77.14%  Splicing factor 3B subunit 3 n=139 Tax=Eukaryota RepID=SF3B3_HUMAN
NCBI RefSeq       XP_002429717.1    0.0  79.33%  Splicing factor 3B subunit, putative [Pediculus humanus corporis]
NCBI nr blastp    gi|307205956      0.0  79.27%  Splicing factor 3B subunit 3 [Harpegnathos saltator]
NCBI nr blastx    gi|307205956      0.0  79.27%  Splicing factor 3B subunit 3 [Harpegnathos saltator]
Group
Gene Ontology     GO:0005634  1.6e-91  nucleus
                  GO:0003676  1.6e-91  nucleic acid binding
KEGG pathway      phu:Phum_PHUM449060  0.0
                  K12830 (SF3B3, SAP130, RSE1)  maps-> Spliceosome
InterPro domain   [859-1182] IPR004871  1.6e-91  Cleavage/polyadenylation specificity factor, A subunit, C-terminal
Orthology group   MCL12724  Single-copy universal gene

Nucleotide sequence:

>DPOGS207257-TA
ATGTATCTTTACAATTTAACGCTGCAAGGCTCAACGGCCATTTCGCACGCCGTGCATGGGAATTTTTCTGGCACGAAACAGCAAGAAATAATTATATCTCGAGGCAAAACTTTAGAATTGCTAAGACCTGATCCAAATACTGGAAAAGTGCACACTTTAATGAAAGTGGAGATTTTCGGCGTCATTCGTTCTATGATGTCATTTCGACTCACCGGCGGCACTAAAGATTATATAGTTGTTGGATCTGACTCGGGGCGCATTGTTATACTTGAATATATACCTGCTAAGAACATTCTCGAAAAAGTACATCAAGAGACATTTGGGAAATCGGGATGTAGAAGAATAGTACCAGGGCAGTACCTAGCAATTGATCCGAAAGGCAGAGCTGTTATGATAGGTGCAATTGAAAAGCAAAAATTGGTATACATTTTGAACAGAGATGCTGAAGCTAGATTAACAATCTCGTCACCGCTTGAAGCTCACAAATCTAACACATTAGTATACCACATGGTGGGAGTTGATGTTGGCTTTGAAAACCCCATGTTTGCTTGCTTGGAGATAGACTATGAAGAGGCAGACTCTGATCCCACAGGGGAAGCAGCTCAAAAGACACAGCAGACATTAACATTTTATGAGCTGGATTTGGGTTTAAATCATGTAGTAAGAAAATATTCAGAACCTCTTGAAGAACATGCCAATTTCCTTATAACGGTACCTGGTGGTAATGATGGCCCGTCAGGTGTACTTATCTGTTCAGAAAATTATCTAACCTACAAAAATTTGGGAGACCAGCATGATATTAGATGTCCTATTCCTAGAAGGAGAAATGATTTAGACGACCCAGAAAGGGGTATGATCTTTGTCTGCTCGGCCACACACAAGACAAAATCGATGTTCTTTTTCCTGGCACAAACTGAACAGGGTGATATATTTAAAATCACTATAGAAACCGATGAAGATATGGTGACGGAAATTAAACTGAAATACTTTGATACTGTACCAGTTGCAACTGCTATGTGCGTTCTGAAGACTGGCTTTCTTTTTGTTGCTTGTGAATTTGGCAACCACTATTTATACCAAATTGCTCACTTGGGTGATGAAGATGATGAACCAGAATTCAGTTCTGCAATGCCATTAGAGGAAGGAGACACATTTTTCTTTGCTCCCCGACCTCTCAGGAACTTGGTGCTGGTTGATGAATTGGATTCCCTCTCACCCATACTCGCTTGCCATGTGGCAGACTTAACTGGTGAAGATACACCTCAAGTGTATTTAGCATGTGGCAGAGGACCAAGATCTTCACTGAGAGCCTTAAGACATGGTTTAGAAGTAGCAGAGATGGCTGTATCAGAACTACCCGGTTCACCAAATGCAGTATGGACTGTGCGACGGCACAAAGATGATGACTATGATTCGTACATCATAGTGAGTTTCGTAAACGCTACGTTGGTGCTATCTATCGGTGAGACTGTGGAAGAGGTGACGGACTCTGGTTTTCTCGGAACCACACCGACATTGAGCTGCCACGCACTTGGAAGTGATGCATTGGTTCAAGTATATCCTGATGGTATAAGACATATCAGGGCTGACAAACGAGTTAACGAGTGGAAGGCACCCGGCAAGAAGTCTATTGTGAAATGTGCCGTCAATCAAAGACAAGTTGTCATAGCACTGACTGGAGGTGAACTGGTGTACTTTGAAATGGACCCGACTGGCCAATTGAATGAGTACACTGAACGAAAGAAGTTGTCATCTGATGTATCCTGTATGGCACTGGGATCAGTAGCTACTGGAGAACAGAGAGCTTGGTTCCTAGCTGTTGGTTTAGTTGACAATACTGTCAGAATTATTTCACTGGATCCTGCTGATTGTCTAGCACCTCGTTCAATGCAAGCCCTGCCTGCCAGCCCCGAGTCCTTGTGTATTGTTGATCAACCCTTTGAGTCTGGTGCCAAATCTGCTTTACACCTTAACATTGGCTTAAGTAATGGAGTATTACT
ACGTACAACTCTGGACTCTGTTAGTGGTGATTTAGCTGATACAAGAACAAGATACCTGGGATCTCGCCCTGTGAAACTTTTCAAAGTTAGAGTGCAGTCAGCGGAAGCAGTGCTGGCTGTGTCTTCGAGGACATGGCTCGGTTATCAATATCAGAACAGATTCCATCTAACGCCATTGTCATATGAATGTCTAGAGTATGCTGCGGGATTTAGCTCTGAACAATGTACCGAGGGTATAGTGGCCATTTCATCAAATACACTAAGAATTTTAGCCCTAGAAAAATTGGGTGCCGTATTCAATCAAACATTCCAACAATTAGATTACACACCAAGAAAGTTTGTTATAAATAGTGATAACAATCACATCATAGTTTTGGAGACTGACCACAATGCTTACACTGAAGAAATGAAGAAGCAAAGAAGAGTGCAAATGGCACAAGAAATGAGAGAAGCTGCTGCTGGGGGAACTCCCGAGGAACAACAACTAGCAAATGAAATGGCCGACGCGTTCCTTTCAGATGTGTTGCCAGAAAATATATTTTCTTCCCCGAAAGCTGGTGCCGGCATGTGGGCGTCTCAGATCCGTATACTGGACATGAGTGGCGGCGTGGGCGGGTGTAGCACTGTGTGTCTACTACCGCTGGAACAGAACGAGGCGGCCGTGTCTTTGTGTGTAGTACGATGGGCCGCTCTCACTGACAACACACCACATCTAGTAGTAGGGGTTGCCAAGGACGCTCTGCTGTCACCACGTAGCTGCTCTGAGGGCAGTCTACATGTTTATAAGATTTATAATACTGGAAAATTGGAATTGGTACATAAAACACCAATAGATGAATACCCTGGAGCGTTGGCAGCATTCAATGGCAAGCTGCTGGCAGGAGTGGGGCGGATGTTGAGGTTGTACGACATTGGTAGAAGGAAACTATTACGGAAGTGCGAAAACAGACACATTCCAAACCTCATAGCGGATATCAAAACTATAAGGCAGAGAATATTCGTATCGGACGTCCAAGAATCCGTGTTCTGTGTTAAATACAAGAAGAGGGAAAACCAGCTGATTATTTTCGCCGACGACACCAATCCCAGGTGGATCACCAACACTTGTATTCTAGACTACGACACGGTCGCTATGGCCGACAAGTTTGGCAACGTAGCCGTTTTGAGACTGCCTCAGTCTGTGAGCGACGATGTGGATGAGGATCCGACTGGAAACAAAGCGCTCTGGGACAGAGGTCTTCTGAATGGAGCGTCTCAAAAGGGTGACATCACTGTTAATTTCCACGTTGGAGAGACTGTGACGTCTTTGCAAAGAGCTACTCTAATCCCGGGCGGTTCGGAGGCGCTCTTGTACGCCACAGTGAGCGGAGCACTGGGAGTGTTCCTACCGTTCACCTCCAGGGAAGATCACGACTTCTTCCAGCACCTTGAAATGCACATGAGGAGTGAAAACTCACCTCTGTGCGGACGAGACCACTTGTCATTCAGAAGCTACTATTATCCAGTAAAGAATGTGATAGACGGCGACCTCTGCGAACAGTTCAACTCGCTGGAGCCGGCGAAACAGAAAGCCATCGCCGGAGACCTGGAGCGAACTCCGGCCGAGGTGTCCAAGAAGCTGGAGGACATCAGAACTAGATACGCCTTTTAA

Protein sequence:

>DPOGS207257-PA
MYLYNLTLQGSTAISHAVHGNFSGTKQQEIIISRGKTLELLRPDPNTGKVHTLMKVEIFGVIRSMMSFRLTGGTKDYIVVGSDSGRIVILEYIPAKNILEKVHQETFGKSGCRRIVPGQYLAIDPKGRAVMIGAIEKQKLVYILNRDAEARLTISSPLEAHKSNTLVYHMVGVDVGFENPMFACLEIDYEEADSDPTGEAAQKTQQTLTFYELDLGLNHVVRKYSEPLEEHANFLITVPGGNDGPSGVLICSENYLTYKNLGDQHDIRCPIPRRRNDLDDPERGMIFVCSATHKTKSMFFFLAQTEQGDIFKITIETDEDMVTEIKLKYFDTVPVATAMCVLKTGFLFVACEFGNHYLYQIAHLGDEDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDELDSLSPILACHVADLTGEDTPQVYLACGRGPRSSLRALRHGLEVAEMAVSELPGSPNAVWTVRRHKDDDYDSYIIVSFVNATLVLSIGETVEEVTDSGFLGTTPTLSCHALGSDALVQVYPDGIRHIRADKRVNEWKAPGKKSIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERKKLSSDVSCMALGSVATGEQRAWFLAVGLVDNTVRIISLDPADCLAPRSMQALPASPESLCIVDQPFESGAKSALHLNIGLSNGVLLRTTLDSVSGDLADTRTRYLGSRPVKLFKVRVQSAEAVLAVSSRTWLGYQYQNRFHLTPLSYECLEYAAGFSSEQCTEGIVAISSNTLRILALEKLGAVFNQTFQQLDYTPRKFVINSDNNHIIVLETDHNAYTEEMKKQRRVQMAQEMREAAAGGTPEEQQLANEMADAFLSDVLPENIFSSPKAGAGMWASQIRILDMSGGVGGCSTVCLLPLEQNEAAVSLCVVRWAALTDNTPHLVVGVAKDALLSPRSCSEGSLHVYKIYNTGKLELVHKTPIDEYPGALAAFNGKLLAGVGRMLRLYDIGRRKLLRKCENRHIPNLIADIKTIRQRIFVSDVQESVFCVKYKKRENQLIIFADDTNPRWITNTCILDYDTVAMADKFGNVAVLRLPQSVSDDVDEDPTGNKALWDRGLLNGASQKGDITVNFHVGETVTSLQRATLIPGGSEALLYATVSGALGVFLPFTSREDHDFFQHLEMHMRSENSPLCGRDHLSFRSYYYPVKNVIDGDLCEQFNSLEPAKQKAIAGDLERTPAEVSKKLEDIRTRYAF-