Monarch geneset OGS2.0

DPOGS207390
TranscriptDPOGS207390-TA4770 bp
ProteinDPOGS207390-PA1589 aa
Genomic positionDPSCF300267 + 106500-116316
RNAseq coverage251x (Rank: top 42%)
Annotation
HeliconiusHMEL0122450.084.65% 
BombyxBGIBMGA008884-TA0.078.95% 
DrosophilaCG1109-PA0.065.17% 
EBI UniRef50UniRef50_F4WM590.062.72%WD repeat-containing protein 33 n=8 Tax=Formicidae RepID=F4WM59_ACREC
NCBI RefSeqXP_969187.10.077.24%PREDICTED: similar to wd-repeat protein [Tribolium castaneum]
NCBI nr blastpgi|910819970.077.24%PREDICTED: similar to wd-repeat protein [Tribolium castaneum]
NCBI nr blastxgi|910819970.058.98%PREDICTED: similar to wd-repeat protein [Tribolium castaneum]
Group
Gene OntologyGO:00160201.1e-95membrane
GO:00071661.1e-95cell surface receptor linked signaling pathway
GO:00055151e-88protein binding
KEGG pathwayaag:AaeL_AAEL0066690.0 
 K06226 (SMO)maps-> Basal cell carcinoma
    Pathways in cancer
    Hedgehog signaling pathway
InterPro domain[1062-1393] IPR0005391.1e-95Frizzled protein
[131-426] IPR0159431e-88WD40/YVTN repeat-like-containing domain
[119-419] IPR0110461.4e-78WD40 repeat-like-containing domain
[908-1025] IPR0200671.6e-21Frizzled domain
[291-328] IPR0197811.3e-08WD40 repeat, subgroup
[289-328] IPR0016802.1e-08WD40 repeat
Orthology groupMCL12511 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207390-TA
ATGGATTTTGGGACCCCTCCTCCAAACATGGGAATGCCTCCACCCGGCATGGGAGGACCTATGGGTCCACCAGGTCGAAACAATATTCGCCATAATTTTAGGCCTTTCAATTATCAAATGCGATTTCCACCACAGGGTCCTTTGAACATGACTCAAGATGATTTTGATGGTAAACGTTTACGTAAATCAGTTATGCGTAAGACTGTAGACTATAATTCAGCTATTATAAAAGCTTTAGAATGTCGCGTTTGGCAGAGGGATTGGCGGGATAGGCATGCGTTACAACCGGATGCTATGTACACACCTGATCTATTACCACCACCTTCATACCCAGACAATCCAATAAATGCTGTAACCACACGATTTGTTAAGACTGCAACAAATAAAATGAGATGTCCAATATTCGCTGTTGCCTGGACACCTGAAGGTCGGAGATTAATAACTGGAGCTTCATCGGGTGAATTTACTTTATGGAATGGTCTTACATTTAATTTTGAAACAATTTTGCAAGCACATGACTCTCCAGTCAGATCTATGGTTTGGTCTCATGGAGAGGGCTGGATGGTAACAGGAGACCATTCCGGCTTCATAAAATATTGGCAAAGCAACATGAACAATGTTAAAATGTACCAAGCCCATAAAGAAGCTGTTAGAGGTATAAGTTTTAGCCCCAGTGATGCTAAGTTGGTAACTTGTTCTGATGATGGTACATTGCGCATATTTGATTTCTATAGATGTCAAGAAGAGAGAATATTAAGAGGCCACGGAGCTGACGTAAAATGTGTACAGTGGCATCCTACAAAAGCACTTATTGTGTCAGGAAGTAAAGATAATCAGCAACCAATAAAATTGTGGGATCCTAAATCTGGTACGGCATTGTGTACTCTTCATGCACATAAATCAACAGTTATGGATTTGAAATGGAATGATAATGGAAACTGGTTAATAACTGCATCTCGTGATCATTTGTTAAAGCTCTTTGATATTCGCAAGTTAGGCACAGAAGTACAAGTGTTTAGAGGTCATAAGAAAGAAGCATCAAGTGTTGTATGGCACCCAACACACGAAGGACTGTTTTGTTCTGGTGGCTCCGATGGATCTATTTTATTCTGGAATGTAGGTACTGATAAAGAAGTAGGATGTATAGAAGGCGCCCATGAATCAATTGTTTGGACTATGGCTTGGCATCCACTCGGTCATATATTGTGTTCAGGATCAAATGACCACACCTCAAAATTTTGGACGCGCAATCGACCTGGAGATCAAATGAGAGACAAGTATAATCTCAATACCCTACCACCAGGTGTACAAGACGATAATGACCTTGAAGAGCCAGCAGCAATCCCTGGAATGGGTCCTGAAGATAAGGTTGATATATTCTCATCTGACACCGATAAGGTGATTCCTGGATTAGATTTAGATACTACATTAATTCCAATGGAGTTTGAGAAGAAAGTTAAGAAGGTGCCTTATAGCAAACCTATTCCAAGGAATTTCCAAGCCCAGTGGAATCATGGCCCAGCTGTAGACGATGCAGCTGCCGAGGCATTTACACAGGCCCTAGTTGAATCTGTTCCAGGAGCTGTTCCACTGCAGCAGATCAATCCTTCTGCCATCTTTATTTATGGTAAACTAATACCAGTAGAACCTGGCTCAAAATTAGAAAAAGCTATAACAGAAGGTCATGTTGCACTAAAGAGATACATAGCTACTGGCGAAATTGAAGAATTGGATGAAATGATGGCCCATCTGGAGGAGGACGCAGACGACGACAACTTCCCTCCATTTGAATATCCAACTCAAGAGAATAATGAAAATGAAGGTGAGAAAAGCGAAAATGAAAATGCGGAAAAAGAAAACTATGAACAAGAAGAATATTTTAACAATGAGATGGAAACTAGTGAACAGAATCCAGAAGATTATAATAATTATGATAATGGACCCGATATGTCAATGATGCCTCCCATGGTTCCTATGGGTAATATGCCACCAATGGGTTTGTTAAGACCTCCAATGAATATGGGACCTATGAGACCGCCTTTAGGGCCGCCACCACTTCATGCCATGCACATGCCACCACCACACATGGGAGGCATGCCACCAATGGGTGGAATGCCTCCTATGGGCCCCATGGGAAATATGCCTCCGATGGGCAATATGAACCAATTACCGCCTTTCTCAGACAACGGCTACAATAAAGGACACTTTGAGGGTGGTTATGAAAGTAACCAGTCGTTTTCACAAGAGAATAGTGAATATAATGAAGAAGAATCATTCGACCAAAATTATGAGGAGGAGAATTATAATGAGGAAGAGGACGGTGATGATAGCTATGGAAGAAATCAGAGGTGGATTGGTGGTTTTAGAGGCAGGGGTCAAGAGAGAAACAATAGAGGTCGAGGTCACGACCGTGGGAGAGGTCAGGATCGAGGTCGCGGTAGAGGTATGAATGGTTTCGGTAGGAATATGCGCAGAGGAGCTAATAGGATGTTCCGCTCGGGGTGGTTGTGGGGTATGCTGGTTGTATCGAGCTGTTGGGCGAGCCAGTACGATAACGGCGGCGAGAAGAACTCGCTAGGCGCGACTGGAACCAGCGAAAACACGACCGTCAGACTCGAAGCCATAGAGGGAACTCTGTATTACAGAGTGATTAAGGCAGAAAAAAGTCCTCAATGGTTCCCAGAACGGGAGTTGAAGCTAGACAGCTGCGTCAGGAGAGCCCAGTGCGAACCTTTGACCAAAACCACGTGCTTGGGAGTTAGGTTACCATACAATAGGACAAGCGTCCGCCTGACCTTCTACGACAGTCAGTACAAAATACAAAACCAACTGGAGCTATACAGGGAGTTGATAAACGTTCCTAAATGCTGGGCGGTTATACAGCCACTGCTCTGTGCTACGTTCATGCCCAATTGCGAGAGTATTAACGGGCAGGACATGGTGCACCTGCCATCGTACGAGATGTGCAAAATAACTATGGAGCCGTGTGCGATCCTGTATAACACCTCATACTTTCCGTCCTTTCTGAAATGCAACGCCACATTGTTCCCGCCGAAATGCGAGAACGCTGCCAGGGAGATGAAATTCAACACGACCGGTAAATGTCTGCCGCCGCTTATACACACGGACAAACGACATCACTTTTACGAAGGTATATCGGGCTGTGGCGTACCTTGCCGCGATCCCCTGTACACGGAGGACGAGCACGCGCAGATCCACCGTCTGATCGCATGGGGTGCGGGTTCATGTCTAGCTCTCAACCTGCTCACCGTGGCCACCTTCCTTATAGACTGGCGCAGCGCTAATAAGTACCCCGCGCTCGTCATTTTCTACATCAACGTGTGCTTCGCGGTAGCGTCCATGGGTTGGCTGGTACAATTCGGAGTGGGTTCGAGGGATGATATAGTTTGCTCGAAAGATGGCACTAGACGCCAAGGAGAGCCTTCAGCTGAAGAGAATCTGTCTTGTGTTGTTGTCTTTGTATTGGTTTATTATTTCATGATGGCGGCATGCGTTTGGTTCGTGATATTCACGTACGCGTGGCACATGAGCTTCAAAGCGTTGGGTAAAATTCAAGATCGTATAGACAAGAAGGCGGCATATTTTCACCTGGTGGCGTGGTCCCTACCACTGATACTCACCATCACGACCATGGCGTTCGGTGAGATCGACGGTAACAGCGTAACCGGCATCTGCTTCGTGGGTTACGTCAACCATCCGATGAGAGCGGCCTGGTTGTTGGCACCACTGTCAGTAGTATTGTTACTCGGCGGTTATTTCCTACTGAGAGGTGTGTTCTCCTTGATAGCGGTCCGCGTGTCCAGTAAGGACGTGATCTCTCCGCGCGCCTCCAACAAGATCCGTCAGACTATCACCCGCTGTTCGTTGACCGCTGCCCTGGTGGCCGTGTTCATCTGCGTCACGTTCGCGTGTCACGTGTACGAGTTCAGAAACGCCGAGGCTTGGAAGGAAGCGTTCAAGAATAACATCATCTGTCGCCTAGAGTCGTGGCGTGATCCGTCACTGGCTGGCCGCGAGTGTTCGCAAGGCGCTCGTCCGTCTGTGTCGGTGTTACAACTACGCCTACTGTGCTGCTTCGCGTCGGGAGCGCTAATGGCCTCGTGGACTTGGACGCCCAGCACCATGATGTTGAATGGCTCGATGATAACGACGAAATGTGGTTGCTCTGTAGAAGCCGATATGACGCGGCGTGCTCATAAACACGAGCTGATAGCGCGCGCGTACAGGCGGCGCAACGAGTTCATTACTAGAGGTAGACTTTCCATATCGCTCGGAGGTTCCAGGCAAGATCCCGTGGGCTTCTGTTTGGACAATTCGCCCGCTGATTACCCGGAAGACGCCAAACATGAGAGCGGCGAGTTGTCGTCGTCGTGGGCAGCGAACTTGCCGCGTTTCGTCCGACGTCGCGACGCCCTAGTGCTGCCTCAACACGCGCACCACTCGCACGACATGTCCTCTACTCCGGACCGCAGGAACTCACAAGACTCACAAATAAGCATCAGCCTCCGCCACGTGTCCGTCGAATCGCGCCGTAACTCGCTCGACAGTCAACTTTCGGTGAAAATAGCTGAAATGAAGACTAAGGTCGGAAGGCGGCGGACAAAACACAGTAAAGCCAAACGTAAACGAGCTTCAGTGCGTAAAGAAAGTACTCCCTCGATTGAGAGTCAGATAAGTCGGTACTGGTTACAAGCGGTCGCAGCTAACGCGGACCCCTCGCGCGAGGAGGTCAAATTTAGTTTCGACTGA

Protein sequence:

>DPOGS207390-PA
MDFGTPPPNMGMPPPGMGGPMGPPGRNNIRHNFRPFNYQMRFPPQGPLNMTQDDFDGKRLRKSVMRKTVDYNSAIIKALECRVWQRDWRDRHALQPDAMYTPDLLPPPSYPDNPINAVTTRFVKTATNKMRCPIFAVAWTPEGRRLITGASSGEFTLWNGLTFNFETILQAHDSPVRSMVWSHGEGWMVTGDHSGFIKYWQSNMNNVKMYQAHKEAVRGISFSPSDAKLVTCSDDGTLRIFDFYRCQEERILRGHGADVKCVQWHPTKALIVSGSKDNQQPIKLWDPKSGTALCTLHAHKSTVMDLKWNDNGNWLITASRDHLLKLFDIRKLGTEVQVFRGHKKEASSVVWHPTHEGLFCSGGSDGSILFWNVGTDKEVGCIEGAHESIVWTMAWHPLGHILCSGSNDHTSKFWTRNRPGDQMRDKYNLNTLPPGVQDDNDLEEPAAIPGMGPEDKVDIFSSDTDKVIPGLDLDTTLIPMEFEKKVKKVPYSKPIPRNFQAQWNHGPAVDDAAAEAFTQALVESVPGAVPLQQINPSAIFIYGKLIPVEPGSKLEKAITEGHVALKRYIATGEIEELDEMMAHLEEDADDDNFPPFEYPTQENNENEGEKSENENAEKENYEQEEYFNNEMETSEQNPEDYNNYDNGPDMSMMPPMVPMGNMPPMGLLRPPMNMGPMRPPLGPPPLHAMHMPPPHMGGMPPMGGMPPMGPMGNMPPMGNMNQLPPFSDNGYNKGHFEGGYESNQSFSQENSEYNEEESFDQNYEEENYNEEEDGDDSYGRNQRWIGGFRGRGQERNNRGRGHDRGRGQDRGRGRGMNGFGRNMRRGANRMFRSGWLWGMLVVSSCWASQYDNGGEKNSLGATGTSENTTVRLEAIEGTLYYRVIKAEKSPQWFPERELKLDSCVRRAQCEPLTKTTCLGVRLPYNRTSVRLTFYDSQYKIQNQLELYRELINVPKCWAVIQPLLCATFMPNCESINGQDMVHLPSYEMCKITMEPCAILYNTSYFPSFLKCNATLFPPKCENAAREMKFNTTGKCLPPLIHTDKRHHFYEGISGCGVPCRDPLYTEDEHAQIHRLIAWGAGSCLALNLLTVATFLIDWRSANKYPALVIFYINVCFAVASMGWLVQFGVGSRDDIVCSKDGTRRQGEPSAEENLSCVVVFVLVYYFMMAACVWFVIFTYAWHMSFKALGKIQDRIDKKAAYFHLVAWSLPLILTITTMAFGEIDGNSVTGICFVGYVNHPMRAAWLLAPLSVVLLLGGYFLLRGVFSLIAVRVSSKDVISPRASNKIRQTITRCSLTAALVAVFICVTFACHVYEFRNAEAWKEAFKNNIICRLESWRDPSLAGRECSQGARPSVSVLQLRLLCCFASGALMASWTWTPSTMMLNGSMITTKCGCSVEADMTRRAHKHELIARAYRRRNEFITRGRLSISLGGSRQDPVGFCLDNSPADYPEDAKHESGELSSSWAANLPRFVRRRDALVLPQHAHHSHDMSSTPDRRNSQDSQISISLRHVSVESRRNSLDSQLSVKIAEMKTKVGRRRTKHSKAKRKRASVRKESTPSIESQISRYWLQAVAANADPSREEVKFSFD-