Monarch geneset OGS2.0

DPOGS210472
TranscriptDPOGS210472-TA1845 bp
ProteinDPOGS210472-PA614 aa
Genomic positionDPSCF300062 + 419528-422903
RNAseq coverage11x (Rank: top 84%)
Annotation
HeliconiusHMEL0215720.070.59% 
BombyxBGIBMGA002764-TA0.068.12% 
Drosophila% 
EBI UniRef50UniRef50_D6WSV95e-14046.00%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WSV9_TRICA
NCBI RefSeqXP_967756.19e-14146.00%PREDICTED: similar to Bbs2 protein [Tribolium castaneum]
NCBI nr blastpgi|910861072e-13946.00%PREDICTED: similar to Bbs2 protein [Tribolium castaneum]
NCBI nr blastxgi|910861071e-13546.00%PREDICTED: similar to Bbs2 protein [Tribolium castaneum]
Group
Gene OntologyGO:00055151.6e-10protein binding
KEGG pathway 
InterPro domain[62-279] IPR0110461.6e-10WD40 repeat-like-containing domain
Orthology groupMCL16188 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210472-TA
ATGCAAGTTTCTAGTGTCCAGCCAGTATTTAAATTAGAATTAAACCACAAAGTAACACCTGGTATAGTAACTATCGCTAAATATGATGGTACACATTATTGTCTTACTGCGTCTGCTGGATATGATAAAATAATTATTCATCACCCTCACGGTGGTATGAGTGTTGGACGTGCTCAACGTTCCCAGGCACATGGAGAAGTGTCTGAACTCAACCTTAGCCAAGCTGTGATAGCTTTAGAAGCTGGGCCTATAAAACCAGACTGTGCGCGAGATATGCTTTTGATTGGCTCGCCCACCCAAATTTTAGCCTACGATGTCCATAACAATTCAGACATATTTTATAAAGAAGCTCCAGATGGTATAAATGTAATAATTGCAGCCCATTTTAGTAAATACTCAGATATGTTGGTCATGGTTGGCGGAAATAGTTCTGTTTGTGGTATAAATTCGGAAGGCGAAGAGGTGTTTTGGAATGTTGTTAGTGGAAAAGTTTTTTCCATGATAACATTTGATTTTGATAAGGATGGCAAAAATGAGCTGCTTATAGGTTGCGAAGATTCTTATATAAAAGTACTAAAGGATGATCATTTTATCATGGAGATTGCGGAAACGGGGCCTGTTTCTTGTTTATCCTATATAAATGAAGTGAGATTTGCTTACGGACTAGCAAATGGAACTATTGGTATATATGAGGATGGCATCCGTCTTTGGAGAGTAAAGTCGAAACAAAACGCAAGAAATCTTCAATGGTCAGGAGACAACTTAGTCTGTTGTTGGGCGAATGGCCGAATAGACTGGAGAGATTGCACAGGGAGAGTACTAAGAAGAGTACAGTTACGATCTGACGCAGCGGCAATGATTTTAGCGGATTATCGCACAGTTGGCATCCCTGACCTTGTTTGTGTATCAACTAAAGGCGAAGTGCTTGGATTCCCGCCGATCCAAGAAAATGGTGGACCAAACACAAAAAAAATTGCTCCATCGGAAGAGGATAGGCTTGCAGTAACTGAGTTGCTGAATAAGAAGCAGGCGTTAATGATAGAACTGCAACACTACGAGGGCAACGCTGCTAACACATCCTTAGACATAGACCGACCTGACAGCGCTATGCCGACAAATACAAGGCTTCAAGTCGCAGTAGCCGCTGACACAGAAGAAGGATGCCTGCAGTTGGCTGTATCGACAAACAATGACACAATTGTGCGTATGGCACTAGTGTTAGCGGAAGGTATTTTTGATTCTGGAGAAACGCTCGCGCGTCATCCTCATCCGGCTAAACTGAGATCCGTTCTTTACATACCACTAAAGCCACCGAGAGATGTTCCCGTTGATGTTCATATTAAGGCACTGGTTGGTTATCCAGAGAGTGAACGGTTTCACATATTTGAACTTACAAAGCAGTTGCCACGTTTTTCGATGTATACTTTAGTTTCGTCATCTATCGCAAAGTCAAAGGTTGTAAACTACGTGACTTTCCGCATTACTGAGAGAGTTCAGAGAATATGCATATGGATAAATCAAAACTTTTTATTAGACGAAGAAATTGAAATTAACAACGAGGAGACAAAAGAGCTGCATATTAGTTTTATGTGTCTCAGAGATATGTCTCGCTTGGATTTGGATTTTTGCCCAGATGGTCAAGTGAAAATTACAACTCACGACATTAGACTTGCTGGAGATTTGATTCAGAGTTTGGCTGTTTTTTTGAACTTGTCTGATTTACAGGTATTTCGAATACCTATTTTATTTTTTGTTATTAATTTTTTTTTTATAATTTTTTATTACATGTGTGACCAATACAAAATCGAACGAAATATATTTGTCTTTATAAGGTTCTGTTGGTAG

Protein sequence:

>DPOGS210472-PA
MQVSSVQPVFKLELNHKVTPGIVTIAKYDGTHYCLTASAGYDKIIIHHPHGGMSVGRAQRSQAHGEVSELNLSQAVIALEAGPIKPDCARDMLLIGSPTQILAYDVHNNSDIFYKEAPDGINVIIAAHFSKYSDMLVMVGGNSSVCGINSEGEEVFWNVVSGKVFSMITFDFDKDGKNELLIGCEDSYIKVLKDDHFIMEIAETGPVSCLSYINEVRFAYGLANGTIGIYEDGIRLWRVKSKQNARNLQWSGDNLVCCWANGRIDWRDCTGRVLRRVQLRSDAAAMILADYRTVGIPDLVCVSTKGEVLGFPPIQENGGPNTKKIAPSEEDRLAVTELLNKKQALMIELQHYEGNAANTSLDIDRPDSAMPTNTRLQVAVAADTEEGCLQLAVSTNNDTIVRMALVLAEGIFDSGETLARHPHPAKLRSVLYIPLKPPRDVPVDVHIKALVGYPESERFHIFELTKQLPRFSMYTLVSSSIAKSKVVNYVTFRITERVQRICIWINQNFLLDEEIEINNEETKELHISFMCLRDMSRLDLDFCPDGQVKITTHDIRLAGDLIQSLAVFLNLSDLQVFRIPILFFVINFFFIIFYYMCDQYKIERNIFVFIRFCW-