Monarch geneset OGS2.0

DPOGS214748
TranscriptDPOGS214748-TA1698 bp
ProteinDPOGS214748-PA565 aa
Genomic positionDPSCF300022 + 693719-696515
RNAseq coverage307x (Rank: top 37%)
Annotation
HeliconiusHMEL0060070.080.27% 
BombyxBGIBMGA004742-TA0.071.87% 
DrosophilaCG6420-PA1e-12257.97% 
EBI UniRef50UniRef50_E0V9270.055.47%WD-repeat protein, putative n=1 Tax=Pediculus humanus corporis RepID=E0V927_PEDHC
NCBI RefSeqXP_001603422.10.059.08%PREDICTED: similar to wd-repeat protein [Nasonia vitripennis]
NCBI nr blastpgi|1565526510.059.08%PREDICTED: WD repeat-containing protein 20-like isoform 1 [Nasonia vitripennis]
NCBI nr blastxgi|3838637970.058.57%PREDICTED: WD repeat-containing protein 20-like [Megachile rotundata]
Group
Gene OntologyGO:00055155.1e-32protein binding
KEGG pathway 
InterPro domain[532-548] IPR0159435.1e-32WD40/YVTN repeat-like-containing domain
[81-365] IPR0110461.6e-31WD40 repeat-like-containing domain
Orthology groupMCL11044 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214748-TA
ATGGCCGTGCAATCTGACTCTGGCGGGAAGGATGATGTGAAGACACAATTCGTGACTCGAGAGGGGACGTACAGATTGATGACATTGTCCGAGTATTCGAGGCCTAATCGTGTGGGGTACACCAGTGGCTCAGGCTCCTCTCACGTTCGAGTGTCTCTCGTTACCTTGCCGCCGGGGCCGGAGGATGGAGCACCCGGTTCCGACGGACAGGGCTCCGATGACAGAATATGTTTTAACCATGGGAAGGAGCTTTATGTTTACGTTTATAGAGGAATCAAGAAGGCTGCAGATCTCACCAAGCCGGTGGACAAAAAGATATACAAAGGCACGAACCCAACATGCCATGACTTCAACACGGTCACAATGACCTCGGAAACGGTGTCCTTGATAATAGGTTTCTCCACCGGTCAGATACAACTCATAGATCCTATTAAGAAGGAGTTAAGTAAATTATATAATGAAGAGAGGCTCATAGACAAGACTCGAGTGACTTGTATCAAATGGGTGCCGGGTTCCAGCAACCTGTTTGTGAGCGCACATGCCTCGGGCCAGTTGTATGTTTACAACGAGGAACTGGTCTGTGGGGCTGCACCGCACTATCAGCTGTTCAAGACGGGGGATGGTTACTCCATCCACACCTGCAGGACGAAGTCCACTAGGAACCCCTTGTACAGGTGGGTGATAGGTGCGGAGGGTAGTTGTATAAACGAGTTCGCGTTCTCGCCGTGCGGCGCCAACCTGGCCGTGGTGTCCCAGGACGGGTTCCTGCGAGTGTTCCACTACGACACTATGGAGCTGATCGGCCGAGCCCGGTCTTACTTCGGCGGCTTCCTGTGTGTGTGCTGGTCGCCGGACGGCAAGTACGTCGTGGTGGGCGGCGAGGACGACCTCGTCACCGTGTGGTCCTTCGGCGAGCGACGAGTTGTGGCGCGGGGTCAAGGTCACCGCTCCTGGGTCTCTGTCGTCGCCTTTGACCCCTACGTCGTCGGCTTCGTGGAGGCCGACCGCGACCCCGGCGGCGAGGGCGGCCAGTGCTACAGACTCGGCAGCGTCTCGCAGGACACGCAGCTCTGTCTGTGGGACCTCACCGAGGACGTGCTCCGGCCACCGCCCCGCGCCCGCGCCTCCGCACACCTGTCCCCAAACAGCCAGCACCTGTCGCCGAACGGCGTGCCCGGCACGAAGGCGCCGCGCTGTGGCAAGACCAACAAGATCGGCCACAAGCACGTCGAGCTAAGCGGCGCCAACACGGCGGGCGAGCCGTCAGGGACGAAGGCGGCCGTCCGCGCCAAGACCAACAACACGCTGGCCTCCAACACGAGCGCGGCGCGTGGCAAGGACGAGGCATCCGTTTCGCTGGGCGCGCACCTGTCCCAGCGACTGGCCGGCTTCTCTTTCGGCGAGCGTCGCTCCGAGCACCGTCGGAACTTCAGCCTGGGGAAGCCGGAGCGGCCCGCGGCGCCCCGGACGCCCGCCGCGGACCCGCTGCGACTTATCGGCTCGCCGGCGTGTCCGCGATTCGACGAGTGTCCCGTGTTGGAGCCCCTGGTGTGCAAGAAGATCGCCCACGAGCGACTGACGGCGCTGCTGTTCCGCGCGGAGTGCCTGGTGACGGCGTGCGCGGACGGCGGTGTCAACACGTGGGCGAGACCGCCGCGGCCCGAGCGCCGGCGGGACAGGAGCGACCTCGTGGACTAG

Protein sequence:

>DPOGS214748-PA
MAVQSDSGGKDDVKTQFVTREGTYRLMTLSEYSRPNRVGYTSGSGSSHVRVSLVTLPPGPEDGAPGSDGQGSDDRICFNHGKELYVYVYRGIKKAADLTKPVDKKIYKGTNPTCHDFNTVTMTSETVSLIIGFSTGQIQLIDPIKKELSKLYNEERLIDKTRVTCIKWVPGSSNLFVSAHASGQLYVYNEELVCGAAPHYQLFKTGDGYSIHTCRTKSTRNPLYRWVIGAEGSCINEFAFSPCGANLAVVSQDGFLRVFHYDTMELIGRARSYFGGFLCVCWSPDGKYVVVGGEDDLVTVWSFGERRVVARGQGHRSWVSVVAFDPYVVGFVEADRDPGGEGGQCYRLGSVSQDTQLCLWDLTEDVLRPPPRARASAHLSPNSQHLSPNGVPGTKAPRCGKTNKIGHKHVELSGANTAGEPSGTKAAVRAKTNNTLASNTSAARGKDEASVSLGAHLSQRLAGFSFGERRSEHRRNFSLGKPERPAAPRTPAADPLRLIGSPACPRFDECPVLEPLVCKKIAHERLTALLFRAECLVTACADGGVNTWARPPRPERRRDRSDLVD-