Monarch geneset OGS2.0

DPOGS206070
TranscriptDPOGS206070-TA4023 bp
ProteinDPOGS206070-PA1340 aa
Genomic positionDPSCF300028 - 390053-396799
RNAseq coverage764x (Rank: top 17%)
Annotation
HeliconiusHMEL0050390.096.95% 
BombyxBGIBMGA006851-TA0.095.52% 
DrosophilaCG2807-PA0.081.04% 
EBI UniRef50UniRef50_E0VK590.082.79%U2 snRNP component prp10, putative n=49 Tax=Eukaryota RepID=E0VK59_PEDHC
NCBI RefSeqXP_623732.10.084.44%PREDICTED: similar to CG2807-PA isoform 1 [Apis mellifera]
NCBI nr blastpgi|3838570660.084.21%PREDICTED: splicing factor 3B subunit 1-like [Megachile rotundata]
NCBI nr blastxgi|1892408850.084.00%PREDICTED: similar to U2 small nuclear ribonucleoprotein [Tribolium castaneum]
Group
Gene OntologyGO:00054885.4e-142binding
KEGG pathwayame:5513310.0 
 K12828 (SF3B1, SAP155)maps-> Spliceosome
InterPro domain[940-1216] IPR0119895.4e-142Armadillo-like helical
[529-1332] IPR0160242.3e-95Armadillo-type fold
[329-477] IPR0150161.9e-37Splicing factor 3B subunit 1
Orthology groupMCL11450 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206070-TA
ATGGATAAAATACCGCGTACTCATGAAGCCATCGAGGCCCAAATAAAAGAAATACAATCGAAGAAAAAAGAGCTCCCTGAAAATGGTTCAGGCAAGGGTGTTTCTCTGGGTGATGCCTTCTATGACAGTGACATTTATGACAATTCTGGCCAAGGAGGGAAATCCAGATACGATGGCTACGTGACTTCTATTGCTGCGAATGATGAAGTGGAGGATGAGGATGTAGAGAATGTTCCAATATCACAGAAGAGACCTGGCTATACTGCTCCCGCTTCTTTATTAAATGATATAGCTCAGAGTGATAAAGATTATGATCCTTTTGCTGATAAGAGAAGACCTACTATTGCAGACAGGGAAGATGAGTATCGCCAGAAAAGGCGTAGGATGATTATATCACCAGAACGTTCCGATCCTTTTGCTGAAGGTGGAAAGACACCAGATGTTGGTTCTAGGACATACACAGAAATTATGAAGGAACAGTATTTGCGGGCTGAAGAAACAGAGTTACGGAAAAAGCTCCTGGAAAGAGCACGAGAAGGCACACTCAAGGCAGTATCACAGTCAAATGGTGAGGCAACAAAACCAGCAGCGAAACGTAAAGGTCGCTGGGACCAGAGCTCAGAGGATACACCGTCTGTGAAAAAACCTGTGGTTCAAGCTACACCAAGCTCTCAGGCTACACCATCCTGGGAAAATGAGCGAGGTGCATGGGAAGAAACACCAAGTGCAGGTGGCCGTGGTGGTGAGACACCTGGTGCGACGCCGTCTGCACGTGTGTGGGACGCCACGCCGGCTCATCTTACACCGGGCCATGCTACACCCGGTCGTGAGACGCCCGCTCACCATGCCTCCAGACGAAACCGGTGGGATGAGACACCCAAAACTGATAGAGAAACGCCTGGACATGCCAGCGGTTGGGCCGAAACTCCTCGTACAGATCGCGGCGTAGGTGTCGATACTATACAAGAGACGCCCACGCCGGGCACTAAGAGACGATCGCGCTGGGATGAGACTCCTGGCGCCACTCCCGCCGCAGCTACACCCACACCCTCACACGCGACACCTTCACACGCCACGCCCTCACATGCTACACCCTCCATGGGCACACCGACACCTCATACACCAATGTTTACTCCAGGCGGGTCAACACCGGTGGGTGTTAAGGCAATGGCCATGGCGACACCAACGCCGGGCCACATCGCAGCTATGACACCAGAGCAGTTGCAAGCGTATCGCTGGGAGAAAGAAATCGACGAACGAAATAGACCGTACACTGATGAAGAACTGGATGCCATGTTCCCACCTGGGTACAAGGTTTTGCCTCCACCGGCCGGTTATGTTCCTATTCGGACCCCGGCTCGTAAGCTGACCGCGACGCCCACACCTTTGGCTGGTACCCCAATCGGCTTTTTCATGCAGACGGAGGAAGTAGGCGGGAGTGCTGCAGCAGCGGCGCGGCTCCTCGACCCGCAGCCCAAAGGCAGTCAGCAGCTGCCGTTCATGAAGCCCGAGGACGCTCAGTACTTCGACAAACTTCTTATCGACGTCGACGAAGAAACACTGTCACCCGAAGAACTGAAGGAGAGAAAGATCATGAAGTTGCTGCTTAAGATTAAGAATGGAACACCGCCTATGTGCAAAGCAGCTCTCCGTCAAATCACAGACAAAGCTCGGGATTTCGGCGCCGGACCGCTCTTTAATCAAATCCTACCGTTATTGATGAGTCCTACACTCGAAGATCAAGAACGTCATCTCTTAGTAAAAGTTATAGATCGAATTCTTTACAAATTAGATGATTTGGTCCGCCCATATGTACACAAAATTTTGGTCGTCATAGAACCTCTGCTTATTGATGAAGATTACTACGCCCGTGTCGAGGGTCGAGAGATCATATCCAACTTGGCAAAAGCAGCTGGTTTAGCCACAATGATCTCTACAATGAGACCAGATATTGATAATATCGATGAATATGTTCGAAACACCACGGCCAGGGCCTTCGCTGTTGTTGCATCTGCTTTAGGTATACCGTCATTATTGCCGTTTTTAAAGGCCGTGTGCAGATCGAAGAAGTCATGGCAGGCTCGTCACACCGGTATCAAAATCGTGCAACAAATCGCAATTCTAATGGGATGTGCCATTTTGCCCCATCTGAAGTCGCTCGTGGAAATCATTGAGCATGGCTTGGTCGACGAACAACAAAAAGTTAGGACAATCACGGCGTTGGCGAGCGCCGCTTTAGCCGAAGCAGCCACGCCGTACGGTATCGAGTCCTTTGACTCTGTGCTAAAACCATTATGGAAGGGTATCAGAACCCATCGCGGTAAGGGTCTAGCGGCCTTCCTTAAAGCTATCGGCTACCTCATACCTCTCATGGACGCCGAATATGCAAACTATTACACCCGTGAGGTGATGCTTATATTGATCCGTGAGTTCCAGTCGCCCGACGAGGAAATGAAGAAGATTGTATTGAAGGTGGTGAAGCAGTGCTGCGGAACAGATGGTGTTGAACCTCAGTATATAATGGATGAAATCTTACCTCACTTCTTCAAACATTTCTGGAATCACAGAATGGCTTTGGACCGTCGCAACTATCGCCAACTTGTCGATACAACACAGCTTTATCGATTGTTCCATCAAGTTGGGGCGTCCGAAATAATAAACAGAATCGTAGACGATCTCAAGGATGACAACGAACAGTATAGGAAAATGGTTATGGAGTCCATTGAAAAAATTCTAGCCAACTTGGGCGCAGCTGATATAGATTCTAAGCTTGAGGAAGCCTTGATTGACGGCATTCTATACGCCTTCCAAGAACAGACCACTGAGGACGTGGTGATGTTGAATGGATTTGGTACAATAGTGAATCAACTCGGTAAGCGAGTCAAGCCTTATTTACCACAAATCTGTGGTATAATTCTGTGGCGTATGAACAACAAGTCGGCAAAGGTGAGGCAACAAGCTGCCGATCTTATTTCTCGTATCGCCGTAGTCATGAAAACTTGTCAGGAGGAAAAACTTATGGGGCATCTCGGTGTAGTGCTATATGAATATCTCGGGGAGGAATATCCTGAAGTACTCGGTTCTATTCTGGGTGCCTTAAAGGCTATAGTGAATGTGATCGGTATGACCAAAATGACACCACCCATCAAGGATTTACTTCCTAGATTAACGCCAATTCTCAAGAACAGACATGAAAAAGTGCAAGAAAATTGCATTGATCTGGTCGGACGTATTGCAGACAGGGGTCCCGAATTCGTGTCAGCGAGAGAGTGGATGAGGATTTGCTTTGAACTGCTGGAATTGCTCAAAGCACACAAGAAAGCTATCAGGAGAGCCACAGTCAATACATTTGGTTACATCGCCAAAGCTATCGGTCCGCATGACGTACTTGCTACACTGCTCAATAATCTTAAAGTTCAAGAGAGACAGAACAGAGTGTGCACAACAGTTGCAATTGCCATTGTAGCTGAGACATGTTCTCCATTCACAGTCTTGCCAGCGCTGATGAATGAGTACAGAGTTCCAGAATTAAATGTTCAGAATGGTGTTTTGAAATCGTTGTCATTTTTGTTTGAATACATCGGAGAAATGGGTAAAGATTACATATATGCTGTGTGCCCGTTACTAGAGGACGCACTTATGGACAGAGATTTAGTGCATCGACAAACTGCATGTGCCGCAATAAAACATATGGCATTGGGAGTGTATGGTTTCGGCTGTGAGGATGCTCTAATACATTTGCTGAACCATGTTTGGCCGAATATATTTGAAACCTCGCCTCATCTTGTACAAGCTTTTATGGACGCGGTTGAGGGCATGAGAGTTGCACTTGGCCCAATAAAAATACTCCAGTACGCATTACAGGGCTTATTCCATCCAGCTCGAAAGGTCCGTGATGTTTACTGGAAGATATATAACACATTATATATCGGAGGCCAAGACGCCCTGGTCGCTGGTTACCCACGGATACAAAATGATCCCAACAATCATTTTGTCAGATACGAGTTAGACTATTTGTTGTAG

Protein sequence:

>DPOGS206070-PA
MDKIPRTHEAIEAQIKEIQSKKKELPENGSGKGVSLGDAFYDSDIYDNSGQGGKSRYDGYVTSIAANDEVEDEDVENVPISQKRPGYTAPASLLNDIAQSDKDYDPFADKRRPTIADREDEYRQKRRRMIISPERSDPFAEGGKTPDVGSRTYTEIMKEQYLRAEETELRKKLLERAREGTLKAVSQSNGEATKPAAKRKGRWDQSSEDTPSVKKPVVQATPSSQATPSWENERGAWEETPSAGGRGGETPGATPSARVWDATPAHLTPGHATPGRETPAHHASRRNRWDETPKTDRETPGHASGWAETPRTDRGVGVDTIQETPTPGTKRRSRWDETPGATPAAATPTPSHATPSHATPSHATPSMGTPTPHTPMFTPGGSTPVGVKAMAMATPTPGHIAAMTPEQLQAYRWEKEIDERNRPYTDEELDAMFPPGYKVLPPPAGYVPIRTPARKLTATPTPLAGTPIGFFMQTEEVGGSAAAAARLLDPQPKGSQQLPFMKPEDAQYFDKLLIDVDEETLSPEELKERKIMKLLLKIKNGTPPMCKAALRQITDKARDFGAGPLFNQILPLLMSPTLEDQERHLLVKVIDRILYKLDDLVRPYVHKILVVIEPLLIDEDYYARVEGREIISNLAKAAGLATMISTMRPDIDNIDEYVRNTTARAFAVVASALGIPSLLPFLKAVCRSKKSWQARHTGIKIVQQIAILMGCAILPHLKSLVEIIEHGLVDEQQKVRTITALASAALAEAATPYGIESFDSVLKPLWKGIRTHRGKGLAAFLKAIGYLIPLMDAEYANYYTREVMLILIREFQSPDEEMKKIVLKVVKQCCGTDGVEPQYIMDEILPHFFKHFWNHRMALDRRNYRQLVDTTQLYRLFHQVGASEIINRIVDDLKDDNEQYRKMVMESIEKILANLGAADIDSKLEEALIDGILYAFQEQTTEDVVMLNGFGTIVNQLGKRVKPYLPQICGIILWRMNNKSAKVRQQAADLISRIAVVMKTCQEEKLMGHLGVVLYEYLGEEYPEVLGSILGALKAIVNVIGMTKMTPPIKDLLPRLTPILKNRHEKVQENCIDLVGRIADRGPEFVSAREWMRICFELLELLKAHKKAIRRATVNTFGYIAKAIGPHDVLATLLNNLKVQERQNRVCTTVAIAIVAETCSPFTVLPALMNEYRVPELNVQNGVLKSLSFLFEYIGEMGKDYIYAVCPLLEDALMDRDLVHRQTACAAIKHMALGVYGFGCEDALIHLLNHVWPNIFETSPHLVQAFMDAVEGMRVALGPIKILQYALQGLFHPARKVRDVYWKIYNTLYIGGQDALVAGYPRIQNDPNNHFVRYELDYLL-