Monarch geneset OGS2.0

DPOGS209847
Transcript: DPOGS209847-TA, 984 bp
Protein: DPOGS209847-PA, 327 aa
Genomic position: DPSCF300117 + 1024745-1043604
RNAseq coverage: 540x (Rank: top 23%)
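The genomic position field packs a scaffold name, a strand, and a coordinate range into one string. A minimal sketch of how it could be split apart, assuming the coordinates are 1-based and inclusive (the record does not say which convention applies); the names `GenomicPosition` and `parse_position` are hypothetical and not part of any database API:

```python
import re
from typing import NamedTuple


class GenomicPosition(NamedTuple):
    """Hypothetical container for a 'scaffold strand start-end' field."""
    scaffold: str
    strand: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_position(text: str) -> GenomicPosition:
    """Split a string such as 'DPSCF300117 + 1024745-1043604' into its parts."""
    match = re.fullmatch(r"\s*(\S+)\s+([+-])\s+(\d+)-(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised genomic position: {text!r}")
    scaffold, strand, start, end = match.groups()
    return GenomicPosition(scaffold, strand, int(start), int(end))


pos = parse_position("DPSCF300117 + 1024745-1043604")
print(pos.scaffold, pos.strand, pos.span)  # DPSCF300117 + 18860

# Consistency check on the lengths listed in this record:
# 327 codons for the protein plus one stop codon give 984 bp.
assert (327 + 1) * 3 == 984
```

Under that assumption the locus spans 18,860 bp on the scaffold, far more than the 984 bp transcript, which would be consistent with a gene model containing introns.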
Annotation
Heliconius: HMEL004338, 4e-91, 79.50%
Bombyx: BGIBMGA008073-TA, 1e-145, 86.38%
Drosophila: CG6015-PA, 7e-164, 77.88%
EBI UniRef50: UniRef50_Q9VD52, 1e-161, 77.88%, CG6015 n=50 Tax=Opisthokonta RepID=Q9VD52_DROME
NCBI RefSeq: XP_396966.2, 4e-167, 82.12%, PREDICTED: similar to CG6015-PA [Apis mellifera]
NCBI nr blastp: gi|383865872, 5e-167, 82.42%, PREDICTED: pre-mRNA-processing factor 17-like [Megachile rotundata]
NCBI nr blastx: gi|380011695, 4e-170, 82.67%, PREDICTED: pre-mRNA-processing factor 17-like [Apis florea]
Group
Gene Ontology: GO:0005515, 2.2e-64, protein binding
KEGG pathway: ame:413523, 1e-166
 K12816 (CDC40, PRP17) maps-> Spliceosome
InterPro domain: [11-327] IPR015943, 2.2e-64, WD40/YVTN repeat-like-containing domain
 [24-327] IPR011046, 2.9e-63, WD40 repeat-like-containing domain
 [156-194] IPR019781, 1.4e-06, WD40 repeat, subgroup
 [155-194] IPR001680, 1.9e-06, WD40 repeat
Orthology group: MCL13221, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209847-TA
ATGGGCCGATCCTGGATTGAGGCTCCGAGAAGTGAAACACAATTACGCAGTGACACACCACCAGACAAATGTTTCCTACCGAAAGCTCATATATTCACGTGGAAGGGTCACACCAAGGGTGTGTCCGCTGTTCGTTGGTTTCCTCGTACAGCTCACCTCATGTTGTCAGCCGCTATGGATTGCAGGGCAAAGATCTGGGAGGTGTATGGTGATCGTCGTTGTATAAGAACATACTTTGGTCACAGACAGGCGGTTAGAGATGTCAACTTCAATAACACTGGCACACTAAAGAACACATATGATCGATACATCAAGTTATGGGACACGGAGACCGGTGACTGTGTGTCCCGGTTCACCTCCAGGAAGGTCCCGTACTGTGTGAAGTTCAACCCGGACGAGGACAAACAACACCTCATCGTTGCCGGAACCTCAGATAAAAAAATTATATGTTGGGACACTCGTAGCGGTGAGATAGTTCAGGAGTATGACCGTCACCTGGGCGCTGTCAACACTATCACCTTCGTGGATGACAACAGACGCTTCGTCACCACCTCCGACGACAAAAGTCTAAGGGTCTGGGAGTGGGACATCCCGGTTGACATGAAGTATATAGCGGACCCATCGATGCATTCCCTACCAGCGGTGACTGCGGCGCCAAACGGCAAATGGCTGGCTTGTCAGTCGATGGACAATAAAGTTGTGGTCTTCTCAGCGCTGAACAGGTTCAAGATGAACAGGAAGAAGACTTTCACCGGACATATGGTTGCCGGGTACGCTTGCAGTGTGGACTTTTCACCAGATATGAGTTATCTGGTGTCGGGAGACGCGGACGGCAAGGCGTATATCTGGGATTGGAAGACAACCAAGCTTTATAAAAAGTGGAAAGCGCATGATGGTGTTTGTATCTCGTCTCTGTGGCATCCCCACGAGCCGAGCCGCCTCCTGACCGCGGGCTGGGACGGCCTCATCAAATACTGGGATTAA

Protein sequence:

>DPOGS209847-PA
MGRSWIEAPRSETQLRSDTPPDKCFLPKAHIFTWKGHTKGVSAVRWFPRTAHLMLSAAMDCRAKIWEVYGDRRCIRTYFGHRQAVRDVNFNNTGTLKNTYDRYIKLWDTETGDCVSRFTSRKVPYCVKFNPDEDKQHLIVAGTSDKKIICWDTRSGEIVQEYDRHLGAVNTITFVDDNRRFVTTSDDKSLRVWEWDIPVDMKYIADPSMHSLPAVTAAPNGKWLACQSMDNKVVVFSALNRFKMNRKKTFTGHMVAGYACSVDFSPDMSYLVSGDADGKAYIWDWKTTKLYKKWKAHDGVCISSLWHPHEPSRLLTAGWDGLIKYWD-
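
The protein record above appears to be the conceptual translation of the transcript, with the trailing `-` standing for the stop codon. A minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1) and a coding sequence that starts in frame; the `translate` helper is hypothetical and written here without external libraries:

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    residues = []
    for pos in range(0, len(cds), 3):
        # Codons containing anything other than A, C, G, T become 'X'.
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First seven codons and last four codons of DPOGS209847-TA.
print(translate("ATGGGCCGATCCTGGATTGAG"))  # MGRSWIE
print(translate("TACTGGGATTAA"))           # YWD-
```

Both fragments match the corresponding ends of DPOGS209847-PA. Passing the full 984 bp sequence should give 327 residues followed by the stop symbol.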