Monarch geneset OGS2.0

DPOGS207852
TranscriptDPOGS207852-TA7134 bp
ProteinDPOGS207852-PA2377 aa
Genomic positionDPSCF300042 + 1369369-1385921
RNAseq coverage1496x (Rank: top 9%)
Annotation
HeliconiusHMEL0153230.098.77% 
BombyxBGIBMGA009816-TA0.097.98% 
DrosophilaPrp8-PA0.092.11% 
EBI UniRef50UniRef50_Q6P2Q90.090.92%Pre-mRNA-processing-splicing factor 8 n=97 Tax=Opisthokonta RepID=PRP8_HUMAN
NCBI RefSeqXP_624014.20.092.39%PREDICTED: similar to CG8877-PA [Apis mellifera]
NCBI nr blastpgi|3407162040.092.34%PREDICTED: pre-mRNA-processing-splicing factor 8-like [Bombus terrestris]
NCBI nr blastxgi|3838551490.092.64%PREDICTED: pre-mRNA-processing-splicing factor 8-like [Megachile rotundata]
Group
Gene OntologyGO:00056812.7e-222spliceosomal complex
GO:00003982.7e-222nuclear mRNA splicing, via spliceosome
GO:00055153.2e-26protein binding
KEGG pathwayame:5516200.0 
 K12856 (PRPF8, PRP8)maps-> Spliceosome
InterPro domain[433-841] IPR0125922.7e-222PROCN
[1800-2030] IPR0219831.9e-128PRP8 domain IV core
[1482-1641] IPR0195803.9e-94Pre-mRNA-processing-splicing factor 8, U6-snRNA-binding
[69-277] IPR0125916e-92Pre-mRNA-processing-splicing factor 8
[1249-1383] IPR0195812.4e-69Pre-mRNA-processing-splicing factor 8, U5-snRNA-binding
[2251-2375] IPR0129846.9e-49PRO, C-terminal
[1026-1117] IPR0195826.8e-45RNA recognition motif, spliceosomal PrP8
[2139-2273] IPR0005553.2e-26Mov34/MPN/PAD-1
Orthology groupMCL15129 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207852-TA
ATGGCCGCTAATAGAAATATTGCGGTAACCGGCGCAGCGATGTCGCTGCCGCCATACCTATTGGGGCCCAACCCCTGGGCCACCATGATGGCGCAGCAGCAGCTAGCGGCAGCTCAACAAGCAGCGCTCCAAGCGCATGCTGCCGCTGCTGCTGCTGCACCGCCCGTGCCGCCGACCCAGCCACCTAAACCTCACCACATACCAGAAGAAAAGATCAAAGAGAAAGCTCAAAAATGGCTTCAGCTGCAATCAAAGCGTTTCTCGGACAAGAGGAAATTTGGTTTTGTGGACGCCCAAAAGGAAGATATGCCTCCGGAGCACATTCGAAAGATAATCCGAGATCATGGTGATATGACCAGTCGCAAGTATCGTCATGACAAACGAGTGTATCTGGGAGCCCTTAAGTATATGCCACATGCTGTAATGAAGCTTCTAGAAAACATGCCCATGCCCTGGGAACAGATCAGAGATGTCAATGTCCTGTACCACATCACCGGTGCTATAACATTTGTCAATGAGATTCCCTGGGTCATAGAGCCAGTGTATATCGCGCAGTGGGGCACAATGTGGATTATGATGCGTAGAGAGAAACGTGATCGTCGGCATTTCAAGCGTATGAGATTCCCACCATTTGATGATGAAGAACCACCTTTGGATTATGCTGACAACATTTTGGATGTTGAACCTCTGGAACCCATACAAATTGAATTAGATCCGGAAGAGGACGGAGCTGTGGCCTCATGGTTTTACGACCACAAACCTTTATTGGGAACAAAACACGTGAACGGCTCGACATACAGGAAGTGGAATCTTAGCTTACCACAGATGGCTACACTGTATCGTCTTGCAAATCAGCTTCTAACTGACTTAGTAGACGATAATTACTTCTACCTATTTGATTCTAAAAGTTTCTTTACCGCGAAAGCTCTAAACATGGCAATTCCCGGAGGTCCCAAGTTTGAACCACTTGTCAAAGACAACTCTGCTGGTGATGAAGACTGGAATGAATTCAACGATATCAACAAGATTATTATTCGTCAGCCGATCAGAACAGAGTACAGAATAGCTTTCCCATACCTTTACAACAACTTGCCGCATTTCGTCCAATTATCCTGGTATCATACTCCCAATGTGGTGTATATAAAAACAGAAGATCCCGACTTGCCAGCCTTCTACTTTGATCCGCTTATCAATCCAATCTCTCACCGTCATACCGTGAAGTCATTAGATCCAATTCCGGAAGAGGAAGATTTCTTGCTACCTGAAGAAGTAACGCCATTCCTGCAGGAAACGGCTTTGTACACAGACAACACCGCTAACGGGATCGCTTTGCTGTGGGCTCCGCGACCTTTTAGTATGAGATCAGGTCGTTCCCGGCGAGCGATCGACGTTCCTCTCGTGAAGACATGGTACAAGGAGCACTGCCCGCCAGGACAACCCGTGAAAGTGCGTGTGTCATATCAAAAACTACTCAAGTACTACGTGCTTAATTCGCTCAAACATAGGCCACCTAAGCCACAGAAGAAAAGATATCTCTTCCGTTCGTTCAAATCGACGAAGTTCTTCCAAACTACAACTTTGGATTGGGTGGAGGCTGGTCTCCAAGTATGCAGACAGGGCTACAACATGCTCAATCTGCTGATACACAGAAAGAATCTTAATTATCTGCATTTAGATTATAATTTCAACTTGAAACCAGTCAAGACTCTCACTACTAAAGAGAGAAAGAAGTCTCGGTTTGGTAATGCATTCCACTTGTGCCGCGAGATCCTCCGCCTCACTAAACTGATAGTGGATTCTCATGTTCAATATCGTCTGAACAACGTGGACTCGTTCCAGCTAGCGGACGGTCTACAGTACATTTTCGCTCACGTCGGCCAACTCACCGGCATGTACAGATACAAGTACAAACTCATGAGACAGATACGAATGTGCAAGGACTTGAAACATCTCATCTACTACAGATTTAATACGGGTCCAGTATCTAAGGGTCCAGGATGTGGTTTCTGGGCACCTGGTTGGCGTGTGTGGTTGTTCTTCATGCGAGGTATCACGCCGCTACTTGAGCGATGGCTTGGAAACCTATTGTCGAGACAATTCGAGGGTCGCCATTCGAAAGGGGTCGCAAAAACGGTGACGAAACAGCGCGTAGAGTCGCACTTCGACCTGGAACTGCGAGCATCCGTCATGCACGATATTGTGGACATGATGCCGGAAGGTATCAAACAGAATAAGGCCAGAACAATCCTACAGCATCTCTCTGAAGCCTGGAGATGCTGGAAAGCTAATATTCCCTGGAAGGTCCCAGGTCTTCCTACTCCCATAGAGAACATGATACTTCGTTACGTTAAAATGAAGGCGGACTGGTGGACAAATACAGCGCACTACAACAGGGAGAGAATACGTCGCGGTGCCACTGTAGACAAAACCGTCTGTAAGAAGAACTTGGGAAGACTAACACGATTGTACTTAAAGGCTGAGCAGGAAAGACAGCATAATTATTTAAAGGATGGTCCATACATATCCCCTGAAGAAGCAGTCGCTATTTACACGACAACAGTCCATTGGCTCGAGTCCCGACGTTTCGCGCCCATACCATTCCCGCCCCTGTCATACAAACACGACACCAAACTACTCATATTGGCTTTGGAGAGACTGAAAGAAGCTTACAGCGTTAAGTCGAGGCTCAATCAAAGTCAAAGGGAAGAACTGGGTCTTATAGAACAGGCGTATGATAACCCACACGAGGCGCTGTCTAGGATAAAACGTCATTTGCTCACACAGAGGACTTTCAGAGAAGTGGGCATAGAGTTTATGGACCTCTACTCACATCTAGTGCCAGTATACGACGTGGAACCTCTAGAGAAGATAACGGACGCGTACCTCGATCAATATCTTTGGTATGAAGCTGACAAACGACGTCTTCTACCGCCGTGGGTGAAACCCGCTGACACAGAGCCCAGTCCGCTCCTCGTCTATAAATGGTGTCAAGGTATCAACAATCTTCAAGATGTATGGGAGGTCGGCGAAGGTGAATGCAACGTTTTGCTAGAGTCGAGATTTGAAAAACTCTATGAGAAGATTGATCTGACACTGCTGAATCGTCTCTTGCGTTTGATAGTGGACCACAACATTGCTGATTACATGACGGCTAAGAACAACGTCGTCATTAATTACAAGGATATGAATCATACAAATTCCTATGGTATCATTCGGGGTTTGCAATTTGCTTCCTTCATAGTTCAATACTATGGTCTTGTACTGGATCTGTTAGTGCTGGGTCTGCAACGGGCCAGCGAAATGGCTGGACCTCCCCAACTACCAAACGACTTCTTGTCTTACCAAGAGAGGCCGGCGGAGCAGGCGCATCCTATAAGACTGTATTGCAGATACATTGACAGAATACATATTTTCTTCAGATTCACAGCAGAAGAAGCTCGCGACCTCATCCAAAGGTACCTGACGGAACATCCCGACCCCAATAATGAGAATATCGTCGGCTACAACAATAAAAAGTGCTGGCCGCGTGACGCCAGAATGAGACTCATGAAACACGATGTTAACTTGGGTCGAGCGGTGTTCTGGGATATTAAGAACCGTCTTCCACGTTCCGTCACCACTATACAGTGGGAGAATAGTTTCGTCTCGGTCTACTCCAAGGACAACCCCAACTTGTTATTCAACATGGCCGGATTTGAGTGCAGGATATTGCCTAAATGCCGTAGTCTTCACGAAGAGTTGTCACATCGCGATGGTGTTTGGAATCTACAAAACGAAGTTACCAAGGAACGCACAGCTCAATGCTACCTGAGAGTGGATGACGAATCACTGGCACGGTTCCACAACCGTGTTAGACAGATATTGATGGCCTCAGGTTCAACGACCTTCACCAAAATTGTCAACAAATGGAATACCGCTTTAATCGGTCTCATGACGTACTTCCGTGAAGCCGTAGTAAACACTCAAGAGCTATTAGACCTGCTAGTGAAATGCGAGAATAAAATTCAAACTCGTATTAAAATTGGTTTGAACTCAAAAATGCCTTCGCGTTTCCCGCCTGTTGTGTTCTACACGCCCAAAGAGTTAGGCGGGCTTGGGATGTTGTCTATGGGTCATGTTCTGATTCCACAGTCGGATCTGCGTTGGTCAAAACAAACAGACGTTGGCATCACTCACTTCCGATCGGGAATGTCACATGATGAAGATCAGCTGATTCCTAATCTTTATCGTTACATACAACCATGGGAGGCTGAGTTTGTCGACTCACAGAGAGTATGGGCTGAGTACGCTCTCAAGAGACAGGAGGCCAATGCTCAGAACAGGCGTCTCACACTCGAAGATTTGGAAGACTCCTGGGATAGAGGTATACCAAGAATAAATACACTCTTCCAAAAGGACAGACACACACTTGCATATGACAAAGGATGGCGTATTCGTACCGAGTTTAAACAGTATCAAGTACTGAAACAAAACCCGTTCTGGTGGACACATCAGAGACACGACGGAAAATTATGGAATCTGAACAACTACCGTACTGATATGATACAGGCTTTGGGAGGAGTAGAAGGAATTCTGGAACACACATTGTTTAAGGGCACCTACTTCCCTACTTGGGAGGGTTTGTTCTGGGAGAAGGCATCCGGTTTCGAGGAGTCGATGAAATATAAAAAACTGACAAACGCTCAACGATCTGGTTTGAACCAGATTCCAAACCGACGGTTCACCTTATGGTGGTCACCGACCATCAACAGAGCCAATGTGTATGTTGGTTTCCAGGTGCAATTAGATTTGACAGGTATATTCATGCACGGCAAAATACCAACACTCAAGATATCTCTTATCCAGATATTTAGAGCTCACTTGTGGCAGAAAGTCCATGAGTCAATTGTTATGGACTTGTGTCAAGTGTTTGATCAAGAATTGGATGCTCTGGAAATAGAAACAGTACAAAAGGAAACCATTCATCCTCGAAAATCATACAAGATGAACTCCTCATGTGCAGACATTTTACTCTTCTCAGCCTACAAGTGGAATGTCTCCCGTCCCTCACTGCTGGCTGACACAAAGGATACAATGGATAATACCACAACCCAGAAATATTGGTTGGATATACAATTACGTTGGGGAGACTATGACTCGCACGATGTCGAGAGATACGCTCGAGCGAAGTTCTTGGACTACACCACGGATAACATGTCCATATATCCTTCGCCCACTGGACTGCTGATCGCTATAGATTTGGCTTATAACTTGCACAGTGCATATGGTAATTGGTTCCCGGGATGCAAGCCGCTCATACAACAGGCGATGGCGAAAATCATGAAGGCAAATCCAGCCCTTTATGTGCTAAGGGAGCGTATACGGAAGGCTTTACAGTTGTACTCGTCTGAACCTACCGAGCCATACTTGTCCAGTCAGAATTATGGAGAGCTGTTCTCAAATCAGATTATTTGGTTTGTCGACGACACGAACGTGTACCGTGTAACTATACACAAGACCTTTGAAGGAAATCTCACAACTAAACCTATTAACGGAGCCATCTTCATATTCAACCCTCGGACTGGACAACTGTTCCTCAAGATCATCCACACCAGCGTGTGGGCCGGTCAGAAACGTCTTGGACAGCTCGCTAAATGGAAAACAGCTGAAGAAGTGGCCGCCCTGATTCGTTCCCTGCCTGTTGAAGAACAACCCAAACAGATTATTGTCACAAGAAAGGGAATGTTGGATCCACTTGAGGTGCACTTGCTAGACTTCCCCAACATTGTCATCAAAGGTTCAGAACTGCAGCTACCTTTCCAAGCGTGTCTTAAAGTGGAGAAATTCGGAGACCTCATCCTCAAGGCCACAGAGCCACAGATGGTGCTCTTCAACTTGTATGATGATTGGTTAAAGACTATATCTTCTTATACCGCATTCAGCAGATTGATACTCATTCTGAGAGCGTTACACGTGAACACTGAGCGTACTAAGGTACTTCTGAAACCAGACAAGACTACACTCACTGAACCACATCACATCTGGCCCACACTCACCGATGATGACTGGATCAAGGTGGAAGTGCAACTCAAGGACCTTATATTGGCTGACTACGGCAAAAAGAATAACGTAAACGTGGCATCACTGACACAATCAGAAATCCGCGACATTATACTTGGTATGGAAATATCAGCTCCGTCAGCACAGAGGCAGCAGATAGCCGAGATTGAGAAACAGAGCAAGGAACAGAGCCAGCTCACAGCAACCACGACCAGGACTGTTAACAAACACGGAGACGAGATCATCACCTCCACCACCAGCAACTACGAGTCGCAGACCTTCAGTTCCAAAACCGAATGGCGTGTGAGAGCGATATCAGCGACCAATCTTCACTTGAGGACAAACCACATCTATGTAAGCTCTGATGACATCAAGGAAAGTGGCTATACTTATATATTGCCAAAGAACTTGCTCAAGAAGTTTGTCACCATATCCGATTTGAGAGCACAGATCGCCTGCTACCTGTACGGCACATCGCCTCCTGACAACCCTCAAGTCCGTGAAGTACACTGCGCGGTTCTTCCTCCTCAATGGGGAACACATCAGACTGTACATCTACCGCGACAACTTCCTAAACATCCAGCTTTAGCCCACCTTCAACCATTGGGATGGATGCACACTCAGCCTAACGAACTGCCACAACTTTCGCCACAGGATATAACCACTCACGCCAAAATAATGGCGGAAAATCAGACGTGGGACGGTGAGAAGACGATCATAATCACGTGCTCCTTCACACCGGGGTCGTGTTCGCTGACTGCATACAAGTTGACACCGAGCGGATATGAATGGGGCGCCAAGAACACGGACAAAGGCAATAATCCCAAGGGATATCTCCCTAGCCACTATGAGCGAGTGCAAATGTTACTGTCCGATCGATTCCTAGGATACTTCATGGTGCCTTCACAGGGTAGCTGGAATTATAACTTCATGGGTGTCCGTCACGATCCCAACATGAAGTATGGCGTTCAGCTGGGGAATCCCCGCGAGTTCTACCACGAGGTGCATCGACCTGCACACTTTATGAACTTCGCGGCAATGGAGGATTCAGTCGCGCCCATACCAGCTGCTGACCGAGAGGATTTCTTCGCCTAG

Protein sequence:

>DPOGS207852-PA
MAANRNIAVTGAAMSLPPYLLGPNPWATMMAQQQLAAAQQAALQAHAAAAAAAPPVPPTQPPKPHHIPEEKIKEKAQKWLQLQSKRFSDKRKFGFVDAQKEDMPPEHIRKIIRDHGDMTSRKYRHDKRVYLGALKYMPHAVMKLLENMPMPWEQIRDVNVLYHITGAITFVNEIPWVIEPVYIAQWGTMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDYADNILDVEPLEPIQIELDPEEDGAVASWFYDHKPLLGTKHVNGSTYRKWNLSLPQMATLYRLANQLLTDLVDDNYFYLFDSKSFFTAKALNMAIPGGPKFEPLVKDNSAGDEDWNEFNDINKIIIRQPIRTEYRIAFPYLYNNLPHFVQLSWYHTPNVVYIKTEDPDLPAFYFDPLINPISHRHTVKSLDPIPEEEDFLLPEEVTPFLQETALYTDNTANGIALLWAPRPFSMRSGRSRRAIDVPLVKTWYKEHCPPGQPVKVRVSYQKLLKYYVLNSLKHRPPKPQKKRYLFRSFKSTKFFQTTTLDWVEAGLQVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVKTLTTKERKKSRFGNAFHLCREILRLTKLIVDSHVQYRLNNVDSFQLADGLQYIFAHVGQLTGMYRYKYKLMRQIRMCKDLKHLIYYRFNTGPVSKGPGCGFWAPGWRVWLFFMRGITPLLERWLGNLLSRQFEGRHSKGVAKTVTKQRVESHFDLELRASVMHDIVDMMPEGIKQNKARTILQHLSEAWRCWKANIPWKVPGLPTPIENMILRYVKMKADWWTNTAHYNRERIRRGATVDKTVCKKNLGRLTRLYLKAEQERQHNYLKDGPYISPEEAVAIYTTTVHWLESRRFAPIPFPPLSYKHDTKLLILALERLKEAYSVKSRLNQSQREELGLIEQAYDNPHEALSRIKRHLLTQRTFREVGIEFMDLYSHLVPVYDVEPLEKITDAYLDQYLWYEADKRRLLPPWVKPADTEPSPLLVYKWCQGINNLQDVWEVGEGECNVLLESRFEKLYEKIDLTLLNRLLRLIVDHNIADYMTAKNNVVINYKDMNHTNSYGIIRGLQFASFIVQYYGLVLDLLVLGLQRASEMAGPPQLPNDFLSYQERPAEQAHPIRLYCRYIDRIHIFFRFTAEEARDLIQRYLTEHPDPNNENIVGYNNKKCWPRDARMRLMKHDVNLGRAVFWDIKNRLPRSVTTIQWENSFVSVYSKDNPNLLFNMAGFECRILPKCRSLHEELSHRDGVWNLQNEVTKERTAQCYLRVDDESLARFHNRVRQILMASGSTTFTKIVNKWNTALIGLMTYFREAVVNTQELLDLLVKCENKIQTRIKIGLNSKMPSRFPPVVFYTPKELGGLGMLSMGHVLIPQSDLRWSKQTDVGITHFRSGMSHDEDQLIPNLYRYIQPWEAEFVDSQRVWAEYALKRQEANAQNRRLTLEDLEDSWDRGIPRINTLFQKDRHTLAYDKGWRIRTEFKQYQVLKQNPFWWTHQRHDGKLWNLNNYRTDMIQALGGVEGILEHTLFKGTYFPTWEGLFWEKASGFEESMKYKKLTNAQRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKIPTLKISLIQIFRAHLWQKVHESIVMDLCQVFDQELDALEIETVQKETIHPRKSYKMNSSCADILLFSAYKWNVSRPSLLADTKDTMDNTTTQKYWLDIQLRWGDYDSHDVERYARAKFLDYTTDNMSIYPSPTGLLIAIDLAYNLHSAYGNWFPGCKPLIQQAMAKIMKANPALYVLRERIRKALQLYSSEPTEPYLSSQNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEVAALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKVEKFGDLILKATEPQMVLFNLYDDWLKTISSYTAFSRLILILRALHVNTERTKVLLKPDKTTLTEPHHIWPTLTDDDWIKVEVQLKDLILADYGKKNNVNVASLTQSEIRDIILGMEISAPSAQRQQIAEIEKQSKEQSQLTATTTRTVNKHGDEIITSTTSNYESQTFSSKTEWRVRAISATNLHLRTNHIYVSSDDIKESGYTYILPKNLLKKFVTISDLRAQIACYLYGTSPPDNPQVREVHCAVLPPQWGTHQTVHLPRQLPKHPALAHLQPLGWMHTQPNELPQLSPQDITTHAKIMAENQTWDGEKTIIITCSFTPGSCSLTAYKLTPSGYEWGAKNTDKGNNPKGYLPSHYERVQMLLSDRFLGYFMVPSQGSWNYNFMGVRHDPNMKYGVQLGNPREFYHEVHRPAHFMNFAAMEDSVAPIPAADREDFFA-