Monarch geneset OGS2.0

DPOGS201590
TranscriptDPOGS201590-TA1476 bp
ProteinDPOGS201590-PA491 aa
Genomic positionDPSCF300152 - 36609-38402
RNAseq coverage105x (Rank: top 60%)
Annotation
HeliconiusHMEL0101646e-10044.83% 
BombyxBGIBMGA012166-TA2e-12944.49% 
Drosophila% 
EBI UniRef50UniRef50_E2BE552e-2925.06%WD repeat-containing protein 34 n=5 Tax=Formicidae RepID=E2BE55_HARSA
NCBI RefSeqXP_972059.14e-3023.43%PREDICTED: similar to WD repeat domain 34 [Tribolium castaneum]
NCBI nr blastpgi|3838518043e-3126.81%PREDICTED: WD repeat-containing protein 34-like [Megachile rotundata]
NCBI nr blastxgi|3838518041e-3126.69%PREDICTED: WD repeat-containing protein 34-like [Megachile rotundata]
Group
Gene OntologyGO:00055151e-26protein binding
KEGG pathwaytgu:1002232893e-09 
 K10409 (DNAI1)maps-> Huntington's disease
InterPro domain[125-474] IPR0110461e-26WD40 repeat-like-containing domain
[124-482] IPR0159431.2e-25WD40/YVTN repeat-like-containing domain
Orthology groupMCL17029 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201590-TA
ATGTCCTGTTTGAGCAGTTTTGATAGCGAGGCTGTTGGTTTTGATTCGGCTTCAGATGACAAGAAGCCGATGAGGACGAATTCCTCACAGACAACCGAGTTTAGTGAGGGAGGGATGGGTTCACAGACTCACATGTCTAAAGATATTGGATGTCTGGCCCAACCGGAAGATATAAAACCTACTGAGGCCCAGGAATATCCACCGCCAGGCCTAAACGAGTTCCTCAGGAGGGTGGTGCCAGCAATGATGGACCAATTGGACCAAAATAATACGGAGTTCCTCAATAATTCATCAGATTCTGACGAGGAAGAGGTTATAGGTGCTAGGTTGTTCCAGGAAATGCAGCTTAGAGACGGAATCGGGGCCGGCGATCACGAAACCTCCATATTGGGAGTTACTTGGAGCAGCGCTGGCAACTCCCTGGCTGTTTCCATCGGCCAATTGCACCATGAAACTTTCTGCCAGACGTCCGGCATAATTAAAGTGTTTACGTTGAAGAGAAGCGAAGATAAATTTGTTCATTCGTTGGATATCAGTGAGAATAATTGCGTAACTGTTCTTAAGTACCATCCTAACGTGGCAGCTCTTCTTGCGTATGGCACAACTTCCGGGGAAGTTGTATTATGTAATTTGAGGAACGGAAGTTTGGATGAAGGCACCCAATTGACGTCTCCGGCGGGTTGTCACGGGTCGAGGCGAGTATCGGGCTTGCAGTGGGCGGACGCGCCGTTAGCTAACATATACTTGTTGATGCAGATCCATAATAAAGGCAAGCGCCGTGGCGCAGCAGATCAGGTTCTGTTCTCGTCTGGTTGCGACGGGACTCTGAACGCGTGGCAAGTGAACTCTCATTCTAAAGTTTTCGAAAACATAATTTGTTACAATATAAACGGCTCCAAAAAGGTGCCGGTGCCAGATATAACCTGCTTCGACTTCTTCAAGAGCTACCCGCTGCGATCAGCGGAGGAAAAGAGTTACAATGACGTGTTTGTTGTTGGAGCTAGAAGCGGCCAGCTCTTTTTGTGCAAAGTAACGCCCCAAACCGACAGTGACCCGGTCTACGAGGTTCTGCAACCTCACGGGACGTGCGTCCTGGATGTGTCGTTCAGTTTCCAAAGACCAGGGATCTTCGTCTCCATTTCCACGGATTCGGAAGTGCGGGTGTACGATATCAACCAGAACAGTCCGTTGAGGATAATATGTCTGGATATTCCGATCTGTTGCATGAGCTGGCTGTGTGGCCCGTGCGTGGTGCTGGGTCTGGCTGGGTGTGACGAGCTGTTGCGTGTTTACAACATCTGTAGCGGACGGGCGGTGCCTGTCAGCGGTTTGGCGGGAAACGCTACTGTCACTTGCGTCGCTGTCAACCAGAGCGGTTCCTGTCGCATCGCGGCCGGAGACGGCAGCGGCGCTCTCCACCTATACGAACTGCCGTCTCGACGGATGAGGCTGACGGCCGAGGACTTGGACTTTTGA

Protein sequence:

>DPOGS201590-PA
MSCLSSFDSEAVGFDSASDDKKPMRTNSSQTTEFSEGGMGSQTHMSKDIGCLAQPEDIKPTEAQEYPPPGLNEFLRRVVPAMMDQLDQNNTEFLNNSSDSDEEEVIGARLFQEMQLRDGIGAGDHETSILGVTWSSAGNSLAVSIGQLHHETFCQTSGIIKVFTLKRSEDKFVHSLDISENNCVTVLKYHPNVAALLAYGTTSGEVVLCNLRNGSLDEGTQLTSPAGCHGSRRVSGLQWADAPLANIYLLMQIHNKGKRRGAADQVLFSSGCDGTLNAWQVNSHSKVFENIICYNINGSKKVPVPDITCFDFFKSYPLRSAEEKSYNDVFVVGARSGQLFLCKVTPQTDSDPVYEVLQPHGTCVLDVSFSFQRPGIFVSISTDSEVRVYDINQNSPLRIICLDIPICCMSWLCGPCVVLGLAGCDELLRVYNICSGRAVPVSGLAGNATVTCVAVNQSGSCRIAAGDGSGALHLYELPSRRMRLTAEDLDF-