Monarch geneset OGS2.0

DPOGS212420
Transcript          DPOGS212420-TA    3099 bp
Protein             DPOGS212420-PA    1032 aa
Genomic position    DPSCF300258 - 24160-29779
RNAseq coverage     86x (Rank: top 63%)
Annotation
Heliconius          HMEL005865                3e-165    54.84%
Bombyx              BGIBMGA002819-TA          0.0       58.43%
Drosophila
EBI UniRef50        UniRef50_UPI0000E493B2    2e-117    29.63%    UPI0000E493B2 related cluster n=1 Tax=unknown RepID=UPI0000E493B2
NCBI RefSeq         XP_967922.1               1e-148    33.50%    PREDICTED: similar to WD repeat domain 66 [Tribolium castaneum]
NCBI nr blastp      gi|91089785               2e-147    33.50%    PREDICTED: similar to WD repeat domain 66 [Tribolium castaneum]
NCBI nr blastx      gi|91089785               1e-147    33.50%    PREDICTED: similar to WD repeat domain 66 [Tribolium castaneum]
Group
Gene Ontology       GO:0005515    2.8e-16    protein binding
                    GO:0005509    1.3e-07    calcium ion binding
KEGG pathway
InterPro domain     [93-490]     IPR011046    2.8e-16    WD40 repeat-like-containing domain
                    [750-795]    IPR015943    4.6e-15    WD40/YVTN repeat-like-containing domain
                    [857-954]    IPR011992    1.3e-07    EF-hand-like domain
Orthology group     MCL12862    Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212420-TA
ATGAACGGAACATCTTCGAAGTTACAATCTCAGTCCACCGTCCGATCGTTCGGTGCTAGCGAGTCGGACTTAAGAAGGCTTTACACTCTGTCTCTATCCCAGATAAGGAATCGTTGCGAGCGTGGAGAAAATAGTAACTACAAACCTTCACCCTTCAATATACGTTGGGTTCATGGCTACAATCCCAGAGTTGGGGTCATCAACCTTGTCAAGGATGACTCTCCTACCCTCGCTTTCTACGCGGCAGGGTATTGCGGCGTGATTTACGATTGGATGGAAAATACTATGATGATACTCCAGGGTCATAAACATACAATTATTTGTATAGACTCAGACGCGAAAGGGGATTGGCTTGTTACTGCCGATTCGGGACCGGAGAACGTGGTTATTGTGTGGGATAGAAAAGATTGTTTTCCTCAAAGGACTATTTTCAATCCTCACGGAAATGTAAAATTGGCTGACGTGGTCATCAGCTCGGATGCTAAGTACCTTCTAACCCTGGGTTATCATGAAAAGGCTACTATTAACTGGTGGATATGGTCGATTGGTTCAGATGTCCCTCACGCTTCATTAGAAGTAGACATTCCAAAAGGGGGAGTGGTGGATATGGGATTTAATCCTTACAAAAGTGAGCAGTTTCTTTTAATGACGAAACATGACATTTGGCTCTTTGTCTCAAGTAAAATATTTGTCTTCGAGAGAGGACTTATAAAAGAAACCGATGATTACGAACTTAAAATAAAAATACCCAACAAGAAAATTAATCCCGATAACGGTTTGCTGACCGCTTTCACGTTTGTGGAAGAAACCTCTCAGATACTCGTGGCCACAAGCCGGGGATCCGTTCTCGTTTATGGATACACGATAGAGTTCACGGACAACGTGGATTCATCTTCCTTTGAAAACTTAAGATTCATTAAGGTGCTTAAAGTACAGACGAAGAGAATTAACGTCATCCAAAACATAGATGGAGTTGTGGTGACGGGCAACAACGCGGGTGAGATTCACTTCTATGACAACCAGGTGAAATTGCTGTATTGGATCCACGGCTTCACTGTCGACTCTGTCAAACAATTGAGTTTCAACATATCCCGAAGAAGCTACCAGATACTAGACCCAAAATGCAGAAAAGTCTGCCAATGCTGGGATGATGTCCAAACTGAATTCGATGAAACGACAGGCCAACCGTGTCATAAAATTATGAAAAAAGATGTCCCATTAGATGCGACTACCGGAAACAAGCCGTTCATCATCAGAGATTTCATTGTTTGTACAAACAATGAGGGAGTATATTTTGTTGATTTTCTAACAGAGAAGCTGACCACAGTTTTAGATAATAATGTTTCACACGCCTTATCCTTGTCTGTGAATCCGGAAAAAACGTTTGTTTGCATCGGATATGGAAACGGTGTTATTGAGTTATTCAATTACGTGTCCCACAAATTGTTCGTGAGGGTCGATTTGAGAGAACATTTTAAAACAACTATCCCGCCCAAGGACGACTCCCTCAAAGATGAACAAGAAGTCACTAAGCCAGAGATTTCAGTTACATGTTTGAAATATTCGCCATCAGGATTACACCTAGCCTGCGGTCTAGATAATGGCGAACTAATATTCCTAGACCCCACCACCATCAGCATTAAATCTAAAACCCCCCATAAAGACACCAGCTTCGCGATCACTCAGATTAACTACAGCTGTGATTCTCGGACTTTGGCCTTGGCTGACTCTGGCAGAACTGTACTAGTTTATAAATATGATTGCTCAAACTTTTTATGGACATTCATTGGGAAGCACAGAGCACATTATAAGGACGTAACTTCCGTTTTTTTCCTTCCAAAGAAAAACGTGAATGGGGAATACAAGTTGTTATCCCTTGGAATGGATCGAATCATGGTTGAGTACGACATTGGCGAAAGTTCTGAAGAGTACCTCGAGGTTTTGAGTTTAGACAGAGTGGATCAGACTGCCATACCTCTATTTGGTATCCCATGGCCAAA
TCCCCCGGATATTGATCCAGAAATACATCGGACCGATCTACCCCTGATTCTTATTGCTAATGATGAGTTCAAATACAAAATTGTTAACTATGGGACAACTATGACGTTATCTACCATACTGGGTCCAAGATACGAGAGCCCCGTGTGCCGTATGCAATTAGTTACAATCACTAAGGACGATAGACAGATGCAATACCTTCTCTACGCTACCAAAAATGTGGTTGGCCTGCAGAAGATGCCGTTAGACGGCAATCCTTGGAAGCACACAGCTCTGCTGGGACATCCTACTCACATTATCGACATGTGCTTCCGAGAAGATAGCGGAACGTTGTTTACGCTCGGAGCAAAGGATAACTGTGTCTATCAATGGGCTGCTAATTACAGGTCAGTGGAGACGACCACGAAGCTAGGTGGCGGTTATCTGGACCCTTACTACTGCCTGATGGAAGGCGGTAGACCAGGCTGGCTGTTCCAAGAGATTCGTGATCTATTTTACTATATACAGATTCTTTGTCAAGGAACCTTCTCACCTGCCATGCGACGCGTTAAGGATTTTATTCCAATTGATTCGCTGTCTGATCTGATGCGGGCTTTAGGATATTTCCCGTCAGAGTACGAGGTAGAAAACTTAATAATAGAAGCGAAATATAAGGTTTTTCTCAAAAAACCAATGACTGAGATTGATTTCGACGACTTTGTCAAATTATATATAAATCATCGGCCAGCTCTTGGGGATAATTTCAAGAGAATTAAAAACGCTTTCCGTCGTTTTGCTGACGCGGATAACAGCAATCTTACCATAAGTCGCGACGAGTTTATCCGAATATTATGTACAAATGGTGAAAGCTTCAGTAACCAGTTGTTGTGGTACCTCTTGTCAATATTATATGGACACAGTTTTGAAGATAGAACAGCCATGATGCCCGATGACTTTTCCTTTTTACCCGAGGAGATAACATTGGAAGAGCTAGCAATGAACGTAATAGGAATACAAGACCTGGAAGTTCTATCCGAGCAGTACTCCATGAAGGAATCCTTTGGATCTCAACAAACCGGAGACACTTCTACAGAGTCTGCAATTAGTAGCAGATTATTTTAA

Protein sequence:

>DPOGS212420-PA
MNGTSSKLQSQSTVRSFGASESDLRRLYTLSLSQIRNRCERGENSNYKPSPFNIRWVHGYNPRVGVINLVKDDSPTLAFYAAGYCGVIYDWMENTMMILQGHKHTIICIDSDAKGDWLVTADSGPENVVIVWDRKDCFPQRTIFNPHGNVKLADVVISSDAKYLLTLGYHEKATINWWIWSIGSDVPHASLEVDIPKGGVVDMGFNPYKSEQFLLMTKHDIWLFVSSKIFVFERGLIKETDDYELKIKIPNKKINPDNGLLTAFTFVEETSQILVATSRGSVLVYGYTIEFTDNVDSSSFENLRFIKVLKVQTKRINVIQNIDGVVVTGNNAGEIHFYDNQVKLLYWIHGFTVDSVKQLSFNISRRSYQILDPKCRKVCQCWDDVQTEFDETTGQPCHKIMKKDVPLDATTGNKPFIIRDFIVCTNNEGVYFVDFLTEKLTTVLDNNVSHALSLSVNPEKTFVCIGYGNGVIELFNYVSHKLFVRVDLREHFKTTIPPKDDSLKDEQEVTKPEISVTCLKYSPSGLHLACGLDNGELIFLDPTTISIKSKTPHKDTSFAITQINYSCDSRTLALADSGRTVLVYKYDCSNFLWTFIGKHRAHYKDVTSVFFLPKKNVNGEYKLLSLGMDRIMVEYDIGESSEEYLEVLSLDRVDQTAIPLFGIPWPNPPDIDPEIHRTDLPLILIANDEFKYKIVNYGTTMTLSTILGPRYESPVCRMQLVTITKDDRQMQYLLYATKNVVGLQKMPLDGNPWKHTALLGHPTHIIDMCFREDSGTLFTLGAKDNCVYQWAANYRSVETTTKLGGGYLDPYYCLMEGGRPGWLFQEIRDLFYYIQILCQGTFSPAMRRVKDFIPIDSLSDLMRALGYFPSEYEVENLIIEAKYKVFLKKPMTEIDFDDFVKLYINHRPALGDNFKRIKNAFRRFADADNSNLTISRDEFIRILCTNGESFSNQLLWYLLSILYGHSFEDRTAMMPDDFSFLPEEITLEELAMNVIGIQDLEVLSEQYSMKESFGSQQTGDTSTESAISSRLF-
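
The lengths listed for this record (3099 bp transcript, 1032 aa protein) agree if the transcript is a coding sequence that ends in a stop codon: 3 x (1032 + 1) = 3099. The following is a minimal sketch of how that check could be run on the two FASTA records. It assumes that the transcript holds only coding sequence and that the trailing "-" in the protein marks the stop codon. The function names `read_fasta` and `lengths_consistent` are hypothetical helpers written for this illustration and are not part of any MonarchBase tooling.

```python
def read_fasta(text):
    """Parse FASTA text into {record_id: sequence}, joining wrapped lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            # The record id is the first token after '>'
            name = line[1:].split()[0]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def lengths_consistent(transcript, protein):
    """Return True if len(transcript) == 3 * (residues + 1 stop codon).

    Assumption: a trailing '-' or '*' in the protein marks the stop codon
    and is not counted as a residue.
    """
    residues = protein.rstrip("-*")
    return len(transcript) == 3 * (len(residues) + 1)


# Toy example (not the real sequences): ATG AAA TAA encodes M, K, stop.
toy = read_fasta(">toy-TA\nATGAAA\nTAA\n\n>toy-PA\nMK-\n")
print(lengths_consistent(toy["toy-TA"], toy["toy-PA"]))
```

To apply it to this record, pass the text of both FASTA blocks to `read_fasta` and compare the `DPOGS212420-TA` and `DPOGS212420-PA` entries.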