Monarch geneset OGS2.0

DPOGS209985
Transcript: DPOGS209985-TA, 2940 bp
Protein: DPOGS209985-PA, 979 aa
Genomic position: DPSCF300148 + 323252-326688
RNAseq coverage: 314x (Rank: top 36%)
Annotation
Heliconius       HMEL013551        0.0     49.27%
Bombyx           BGIBMGA011270-TA  0.0     46.41%
Drosophila       CG33172-PA        2e-39   23.80%
EBI UniRef50     UniRef50_E2BQG6   2e-104  30.12%  WD repeat-containing protein 6 n=1 Tax=Harpegnathos saltator RepID=E2BQG6_HARSA
NCBI RefSeq      XP_002430448.1    3e-88   28.53%  WD-repeat protein, putative [Pediculus humanus corporis]
NCBI nr blastp   gi|307202733      8e-104  30.12%  WD repeat-containing protein 6 [Harpegnathos saltator]
NCBI nr blastx   gi|307170194      1e-105  29.08%  WD repeat-containing protein 6 [Camponotus floridanus]
Group
Gene Ontology    GO:0005515  2.5e-22  protein binding
KEGG pathway     cnb:CNBA6530  6e-06
                 K03361 (CDC4) maps-> Ubiquitin mediated proteolysis
                                      Cell cycle - yeast
InterPro domain  [89-424]   IPR011046  2.5e-22  WD40 repeat-like-containing domain
                 [88-325]   IPR015943  1.4e-17  WD40/YVTN repeat-like-containing domain
                 [166-203]  IPR019781  6.4e-06  WD40 repeat, subgroup
                 [164-203]  IPR001680  8.4e-06  WD40 repeat
Orthology group  MCL16854  Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209985-TA
ATGTCCACTCTAATACGAACTGATGTAACATCAGTAAAATTGTGTAAAGACATTATTTTAGCAGGTATTGGTAGTTTCCTATGCACATTCTATACTAAGAATAGCAAACCAATTCAGAAAATTCAAGCATTGAATGGACAAAAGATACGTGGACTTATCCCTTCAAAGTGTCTCACAAAATTACTTATATTTGGTGGAAAACAGTTTACAATATTTAATGAAATTGAATCTATATTTAAATCTCAAATTGATGCTGTGGTTTATGATGATTGGATTCACACAGCAATTTGGTTGTCAGAGAACAAAGTAGCCCTCTTGAGTGCACATAATGTGGTGACGACTTGGGACATTACAACTACAAAACTCCTACAGCAGCATATTAATAAAGACAACTCTATACTCTACAGTGGGCTACTTCAGCCACTACAACATGACATCCTGGTGTTTAGTGGTACTGTGTACTCACAGGTCATCCTGCAATGGTTTGGTGACGAACAGCCTTTGCACTATTTAAAGGGACATAAGGGTGTCATATTCTCTATAAGCTGTAACCTACAAAGAGGCATTATAGTTACCACCTCAGATGATAGATCAGTGAAGATTTGGTCTGTGACATCCGTTCACTCCGACTATAATATTAAAACTTACTGGCAAAATGCTCATATAGATTGTGTTCATGATCTGTATGGTCACTTGGCACGGGTGATGAGAAACACTCTTACTAATTTATACATAATATCAGTGGGTGAAGACTCGGCTATATGCTTTTGGGATTACAACGGGAACCTGTTGAAGAAAACTATATCCCATAAAAACTCGTGCATTTGGTCCTTAGATGCTGATGAATCTAATTTAGTCACTGGAGGGGGCGACTGTGGGATAATGATGCATCCCCTCTCATCTGTAACTTACAATAGTCATGGTGAGGTCATCAACACCAGTGTGACACCAAAAAAAGTTCTCTTCACTGCTCGAAACAATATTGTCATAACCACGGTTGGTAACGTCTTAAATTATTACAACTCTAACTTAAATAAAATACAAGAAATTCGATTAAATCACACATCTACCTACCAACTGGTCGGCCTGTCTTCATGTAAACAACTTATAGCGGTGGTCGACATGGACGGTAACTTAGACATATTTATGGAAAATTGTAAAGGAGACCCAGGACTCAAGAAGATTATAGAAACGCGGCTGCACCTAGGAAAGATTCTATCGATGCAGTGGGCTGGCAACAGACATTTAGTTTTCTGTTCTGAAGGTGGCGTTATTACAGTCGGAGCCTCCAAAGGAAATACTATCGAGATCATCGCTAACTACCTCCTACCGCCTTGCAAAGAAAGGTGGCTGACTGCCAGCGCCCTCCATGACACCAAAGACACATTGATCGTAGGGGACCGATGTGGACACATACACTTGTATGAGTGGCGACGACAACAACCAGCTTATACCATGAAAAGAGTTCACGGAAGATACGGCCCTACCTCCATCGATATAAGGAATGATATCGTCAGAACGACCGGCAGAGACGGGACGGTCAGGTACTTGAAGATTATCAATTCAGGATTCAAGTATATGAGTTGTAAAGACTTGGAGTTCGAGTGGGTTGAAAAGTTTCTAGACGTACAAGGGAAGTACGTCTGCGGCTTCAGGGAGAGGAGTTTGGTCGTTTATGATGTAGAGAATGATCTGAAGGTAGTGGATGTGTCGTGCGGAGGAGGGCACCGGTCATGGGACGTTGTGCGGTATATCGAAAACAACGGCGGATGTTACGAGGAGTGTCTCAGACTCATGTTTGTAAAGAATACGCAGGTCTATGTCAACACGTTCCGGCTCCGCGACATCGTGTCCACAGTTATATTGCCCGGGACACATTCTAAGGAGATAAACTGTTTGAGAACGTACCGCCGTCGTAACGACGACCCAGTCACGTGGTTCATAACAGGAGGAGAGGACACCACACTGAGAGTGTCCACGTCAGAACAGGAAGCGGAGTT
CTGGGACCGAGTGATCTTCAGACATCTGTCGAACGTACGGGCGTTGAAACTGTTGAGTGTGTCCCATGACGAAGTGTTGGTGGTGTCGGCGGGAGGCAGGGCGCAAATATGTATCAGAACCATCGGCTTCGTTGATAAGAATGTAACGGCGGAGGAACTCATTGACTATCAGATAAAAGGAACAGACAGGGAAAGAAGGGGAAACCAAAACTGGAGAAACTGTTCCGTAGACTTCGACCCGGAGACCAGGATCATGGATGTGGAAGTGGACGAACTGAATGAAGCTAAAGTCATGATATACACGGCGTGCTCTGACGGTGAGGTCCGGGTCTTCGAGTGGAACAGACGCGGTGGACAGTTCACTATGATCCAGGAAGTCAGGCATCACAAAACCTGTATACTGAAACTGAAGATGTTTACATGCTCTAATAAAAAAATAATAACTACGTGCGGGACCCGAGGAGACGTGGCCTTTTGGGAGGTCAGCTCAGAGGACGGGACACTGGCGGAGGGACCGGCTCTCGTCCTCAGGACCAACGAATCGGGGATCAATAGTGTCGACATTAAAGTGACCGGAGGCTGTCAATTCGTGTTGGCGACCGGCGGAGATGATAACGCGGTCCATATGAGCCTTGTGAGATTGGGCGGAGACGGGGGCTGGGCGGCGGTGACGTCACACGCGTACCTGAACGCGCATTGCTCACAAGTAACGGGACTGGCACTGGTCGAGGGTTTGTGTGTGACGACCGGCGTGGACCAGAGAGTGACCTCGGTCTCATGGCGCCTGGAGGGAGAGGACATAAAAACAGAGTTCATCGACCAGATGTACAGCGACGTCTCCGACATCCACGGAATGGATGTCGTGCGGGACTCGGGAGACCGGCTCACAGTGTGCGTCTACGGTAAAGGTATCCAAGTCATCGAACTACTGAAACCGTAA

Protein sequence:

>DPOGS209985-PA
MSTLIRTDVTSVKLCKDIILAGIGSFLCTFYTKNSKPIQKIQALNGQKIRGLIPSKCLTKLLIFGGKQFTIFNEIESIFKSQIDAVVYDDWIHTAIWLSENKVALLSAHNVVTTWDITTTKLLQQHINKDNSILYSGLLQPLQHDILVFSGTVYSQVILQWFGDEQPLHYLKGHKGVIFSISCNLQRGIIVTTSDDRSVKIWSVTSVHSDYNIKTYWQNAHIDCVHDLYGHLARVMRNTLTNLYIISVGEDSAICFWDYNGNLLKKTISHKNSCIWSLDADESNLVTGGGDCGIMMHPLSSVTYNSHGEVINTSVTPKKVLFTARNNIVITTVGNVLNYYNSNLNKIQEIRLNHTSTYQLVGLSSCKQLIAVVDMDGNLDIFMENCKGDPGLKKIIETRLHLGKILSMQWAGNRHLVFCSEGGVITVGASKGNTIEIIANYLLPPCKERWLTASALHDTKDTLIVGDRCGHIHLYEWRRQQPAYTMKRVHGRYGPTSIDIRNDIVRTTGRDGTVRYLKIINSGFKYMSCKDLEFEWVEKFLDVQGKYVCGFRERSLVVYDVENDLKVVDVSCGGGHRSWDVVRYIENNGGCYEECLRLMFVKNTQVYVNTFRLRDIVSTVILPGTHSKEINCLRTYRRRNDDPVTWFITGGEDTTLRVSTSEQEAEFWDRVIFRHLSNVRALKLLSVSHDEVLVVSAGGRAQICIRTIGFVDKNVTAEELIDYQIKGTDRERRGNQNWRNCSVDFDPETRIMDVEVDELNEAKVMIYTACSDGEVRVFEWNRRGGQFTMIQEVRHHKTCILKLKMFTCSNKKIITTCGTRGDVAFWEVSSEDGTLAEGPALVLRTNESGINSVDIKVTGGCQFVLATGGDDNAVHMSLVRLGGDGGWAAVTSHAYLNAHCSQVTGLALVEGLCVTTGVDQRVTSVSWRLEGEDIKTEFIDQMYSDVSDIHGMDVVRDSGDRLTVCVYGKGIQVIELLKP-
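
The lengths reported in this record can be cross-checked with a few lines of Python. The sketch below is illustrative only and is not part of the database: the function names are hypothetical, it assumes the transcript is a complete coding sequence ending in a single stop codon (which the trailing "-" in the protein sequence appears to mark), and it assumes the genomic coordinates are 1-based and inclusive.

```python
def parse_fasta(text):
    """Return {header: sequence} for a FASTA-formatted string.

    Sequence lines that follow a ">" header are concatenated, so a
    sequence wrapped over several lines is joined back into one string.
    """
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def expected_protein_length(cds_length):
    """Amino acids encoded by a complete CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # the final codon is the stop, not a residue


def genomic_span(start, end):
    """Span in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


# Values copied from the record above.
print(expected_protein_length(2940))  # 979, matching the reported protein length
print(genomic_span(323252, 326688))   # 3437
```

Under these assumptions the 2940 bp transcript encodes 979 residues plus a stop codon, which agrees with the reported protein length, and the genomic span of 3437 bp is 497 bp longer than the transcript.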