Monarch geneset OGS2.0

DPOGS208319
TranscriptDPOGS208319-TA2874 bp
ProteinDPOGS208319-PA957 aa
Genomic positionDPSCF300293 + 146872-158898
RNAseq coverage583x (Rank: top 22%)
Annotation
HeliconiusHMEL0114990.076.83% 
BombyxBGIBMGA002214-TA0.087.60% 
DrosophilaCG34133-PC0.053.04% 
EBI UniRef50UniRef50_E0VKN10.050.97%WD repeat domain-containing protein, putative n=12 Tax=Metazoa RepID=E0VKN1_PEDHC
NCBI RefSeqXP_001604905.10.053.36%PREDICTED: similar to wd-repeat protein [Nasonia vitripennis]
NCBI nr blastpgi|3800162470.054.57%PREDICTED: WD repeat-containing protein 44-like [Apis florea]
NCBI nr blastxgi|3800162470.055.63%PREDICTED: WD repeat-containing protein 44-like [Apis florea]
Group
Gene OntologyGO:00055158.3e-66protein binding
KEGG pathway 
InterPro domain[920-940] IPR0159438.3e-66WD40/YVTN repeat-like-containing domain
[547-941] IPR0110468.6e-52WD40 repeat-like-containing domain
[671-710] IPR0016801.8e-08WD40 repeat
[632-668] IPR0197811.3e-07WD40 repeat, subgroup
Orthology groupMCL14423 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208319-TA
ATGGACGACAGTAGCGATTCTGAAGAGTTTTTTGATGCTGAAGAATTCACGCCTGTCAAGGGTTCGAAGAGATCCTCATTATGTAAGAGTATCAGTTTGTCAGAGAAGAGTAATGGTGAAGATAAAACGAGCAAAGCTGAAGAGACGCCACAAGTGGTTGCAGCCAGCACTCCCATAGAAACAGTTCAGGAAGATAGGAATACAGTCAAAAATGTGGTGCAAGAAAGACGACGCTTCCAGGAGTTGAGGAGATGCATGCAGACGGAGGAGGAAGATGAGCCGGAGGCGGGGGCACCACTGGCAGCACCCATGGAACACCCCTTCAAAATAATATCACACGACACAATGAGTCTTCAGAGCATGACGTCACTTGGTAGGATCGGGAGAATTCTAAGCGGAGCCGCTGAATCGCATCCGAATATCCGAGATTCCACGCACACAGTGACCGCGCCCCGGGAACTCTCACACATCACTGACGAACCCATCATGGTGGATAGGACGCAAAGCGTGGAGCCGGATGTGATCGCCAGCACCAAGGCGAACAGTTTGAAGGTATGCCGCGGGACCACCGTCAGCGTTAGCGCCCCACGAGCTGGTCCTATAGCGCCACCTAGGAGAAGAAAACGAAACAAACTTCACGGCTCCCAATCCCTGTCTACGACCGACGGGGACAATCTCAGCGTCTGCACAACAGACACTGATGATATCGGTTACATGAGGAGGATAGACAAACCGCAGCTAGAGACTCTGATTGTCGAGCCAGCGTCTTCGGTAAGCGCGGCGTTACCGAGTCCCGCCAGCACCATCGAGTCGCTCACCAGAGAGTTCAGAGACGAACTGGAACTGTCCACCAAATACGCGGATTCTTCAATAACTCGCGAGTCTCGTGATTCTACTCTGAAGCCCCATTCCACGTTGGACATTCGGGAGGCCGTCAAAGGAGATTATGTGGTGAAGCCTCAGGATGAAGAGAAAGCTTATAGTGGTGATAGTATCGAGGCCTCTTGCGGTACCGGCAGGACTCACAGCGAGGGCCGGCCGCCGACGGGCACCTCCAGCTTAGGGTCGGTCTCCAAACGTACAGAGTGCCGCGCACGTCGGCGCTCGGCCGGGGATTCCCCTCAGCACGCCGCGCGACCGCTCTCAGACATGGAAATACTGGAACAGGTCACCGTGCTCAACCTGGACACAGGTGAGCGCATGCCTCTCAGCGTGGCCGAACACAAGCTCCCTCAGTGCATCAACCCTCTCAGTCTGCACATCATGAGGCTCACTTCGGAGTACATCGGGCGGACGAACGACGGCGAGAACACCAGAGAAACTGATGAAGAGAGTGAGAGTGTGTGTGTGGGGGCGGAGCAGGAGGCCAGGGACGCCGGCAAGGAGGGCGCGGACCGACCCACCGGCATCAAGAGGCGGACAGACAAGTTGAAGCGTTTCTTCGGTTCCACGGTGAAGAAGACCGTGGACGCGGCCAAGTCCCTGGCGCAGGAGGTGTCCCACGCGCGTCACAAGGAGGACGTGGCGGACATAGTGGACGAGGTCAGGGGAGAGCATAACGTTAAACTGAAGGCCTCTAACTCACACAAAGGACCTTACGACTTCGACGGATGTGTCAGATATGTTCAGGAGATGGGTTCGGCGCACGCCGGGGCGGTGTGGTGTTGTAAGTTCAGCGTGTGCGGGCGATTGCTGGCCACGGCGGGCCAGGACCGGCTACTGAGGATCTGGGTCACCAGGGACGCCTACCACCTGTTCCAGGATATGCGAACCAAATACAACGCAGAGAAGAAATCCTCGCCGACTCCTTCCCAGGAGTCCCTGCCGTCTATGGCGGCGCCACCCCCGTCCCCGGAGGACACGCCCCTCGGCCCCTCCGCCCCCTTCTGTCCCAAGCCGTTCTGCACGTACTCCGGCCACACGTCCGACCTGCTGGACGTGTCGTGGTCCAAGAACTACTTCGTGCTATCCAGCTCCATGGACAAGACGGTCAGGTTGTGGCACATCTCCCGGGGCGAGTGTCTGTGCTGCTTCCAACACATCGACTTCGTGACCGCCATCGTCTTCCATCCGAGGGACGACAGGTACTTCCTCTCGGGCAGTCTGGACGGGAAACTGAGGCTGTGGGACATACCCGATAAGAAGGTGGCCGTGTGGAACGAGGTGGACGGCAAGACGAAGCTAATAACGGCAGCAAACTTTTGTCAGAACGGCAAATTCGCCGTGGTCGGGACGTACGACGGCAGGTGCATCTTCTACACGACGGACCAGCTGAAGTATCACACGCAGATAGACGTGCGCTCCACGCGGGGCAAGAACTCCACGGGACAGAAGATCAGCGGCATAGAACCCATGCCCAACGACGACAAGATACTCGTCACCTCCAACGACAGCCGCATCAGGCTCTACGACCTGCGGGACCTCAACCTGTCCTGCAAGTACAAGGGATACGTGAACGTGTCCAGTCAGATAAAAGCGTCTTTCTCCCACGACGGGAAGTACATAGTCAGCGGCTCCGAGAACCAGTGCATCTACATCTGGAAGACCTGCCACGACTACTCCAAGTTCACGTCCGTGAGGAGGGACAGGAACGACTTCTGGGAGGGCATCAAGGCGCACAACGCGGTGGTCACGTGCGCTGTGTTCGCGCCCAACCCCGACCACATGATACGGACCATCACGGAGCGCGAGGAGCGCGGCAGGGCAGCCAGCGGCGCGGCCGGGGACGCCGGGGACGCCAAGGGCGAGGGTCTGGTCTCGTCGTCGCAGGCCGGTCACGTGCTGGTGAGCGCCGACTTCAGCGGGGTCATTAAGGTGTTCGTGCACCGGGCGAGACCCAAACACAGCTCGCTGCCGGCCTCCGCGCTACAGTAG

Protein sequence:

>DPOGS208319-PA
MDDSSDSEEFFDAEEFTPVKGSKRSSLCKSISLSEKSNGEDKTSKAEETPQVVAASTPIETVQEDRNTVKNVVQERRRFQELRRCMQTEEEDEPEAGAPLAAPMEHPFKIISHDTMSLQSMTSLGRIGRILSGAAESHPNIRDSTHTVTAPRELSHITDEPIMVDRTQSVEPDVIASTKANSLKVCRGTTVSVSAPRAGPIAPPRRRKRNKLHGSQSLSTTDGDNLSVCTTDTDDIGYMRRIDKPQLETLIVEPASSVSAALPSPASTIESLTREFRDELELSTKYADSSITRESRDSTLKPHSTLDIREAVKGDYVVKPQDEEKAYSGDSIEASCGTGRTHSEGRPPTGTSSLGSVSKRTECRARRRSAGDSPQHAARPLSDMEILEQVTVLNLDTGERMPLSVAEHKLPQCINPLSLHIMRLTSEYIGRTNDGENTRETDEESESVCVGAEQEARDAGKEGADRPTGIKRRTDKLKRFFGSTVKKTVDAAKSLAQEVSHARHKEDVADIVDEVRGEHNVKLKASNSHKGPYDFDGCVRYVQEMGSAHAGAVWCCKFSVCGRLLATAGQDRLLRIWVTRDAYHLFQDMRTKYNAEKKSSPTPSQESLPSMAAPPPSPEDTPLGPSAPFCPKPFCTYSGHTSDLLDVSWSKNYFVLSSSMDKTVRLWHISRGECLCCFQHIDFVTAIVFHPRDDRYFLSGSLDGKLRLWDIPDKKVAVWNEVDGKTKLITAANFCQNGKFAVVGTYDGRCIFYTTDQLKYHTQIDVRSTRGKNSTGQKISGIEPMPNDDKILVTSNDSRIRLYDLRDLNLSCKYKGYVNVSSQIKASFSHDGKYIVSGSENQCIYIWKTCHDYSKFTSVRRDRNDFWEGIKAHNAVVTCAVFAPNPDHMIRTITEREERGRAASGAAGDAGDAKGEGLVSSSQAGHVLVSADFSGVIKVFVHRARPKHSSLPASALQ-