Monarch geneset OGS2.0

DPOGS214987
TranscriptDPOGS214987-TA1236 bp
ProteinDPOGS214987-PA411 aa
Genomic positionDPSCF300256 - 207950-211602
RNAseq coverage598x (Rank: top 21%)
Annotation
HeliconiusHMEL0101702e-15464.12% 
BombyxBGIBMGA012165-TA3e-6560.85% 
DrosophilaCG7845-PA9e-4534.01% 
EBI UniRef50UniRef50_UPI00021A83D37e-5733.17%UPI00021A83D3 related cluster n=2 Tax=unknown RepID=UPI00021A83D3
NCBI RefSeqXP_002430137.17e-5035.00%WD-repeat protein, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3838623578e-5733.71%PREDICTED: WD repeat-containing protein 74-like [Megachile rotundata]
NCBI nr blastxgi|3838623574e-5733.41%PREDICTED: WD repeat-containing protein 74-like [Megachile rotundata]
Group
Gene OntologyGO:00055157.9e-23protein binding
KEGG pathway 
InterPro domain[49-302] IPR0110467.9e-23WD40 repeat-like-containing domain
[47-309] IPR0159436.6e-19WD40/YVTN repeat-like-containing domain
Orthology groupMCL11891 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214987-TA
ATGCAGATCAAAGAAGATAAAGAGATTGACCTGTTTGTAGCGAGCAGAATAGGATCTTTTAAACATATAAAGTATCACACGGATCCATCGAAAAACAGTAAGAAGTGCATAGAAAACCTCGTAGATATTAAGACCTTACAAAAAGATGATAACATAACGTGTATGGTGTGGGGTAGTCCTGAACAGACAGACATACTGATCGGAAGGAAAACCCAACAGATTCAGGTGTACAACACTCTGCACGGTTTCACAAAGTCATACACAGCTGACTTCGGTTCCGGGGATGTGGTGGGCTTGGGTCGGCACGGACGACGTCTGGTCGCCGCAGTGTCAGAGGGAGTGGTGCAGGTGTATGGCAAGAAGGAGAATGTCACCTACAACGTTGGAAAGATTGATAGGATGAAGATCTTCGATGGAGACACCACTATGTTCGCTACCGGTGGCGAGGAAAACGACCTCAAGGTATATAGAATCGGAGAAGCTGAGCCTTTATTTGTGGCGAAGAATCTTCCTCACGACTGGCTCCAACTACGGAAGTCGGTGTGGGTCAGTGACCTCACATTTTTGTCACCGAGCGAGCTGGCTGTGTGCTCTAGACACGGATATATCAGATTATACGACACTAGAGCTCAGAGACGGCCGGTTTGTAACGTTGAATGCGACAAAATGGCCGCCACGTGTATATCAAAGGGTTTCGACGAGAGGCAGGTGTTCGTCGGATTCGGCCGCGGACAGCTGCACCAGGTGGATCTGCGCCGCGGACACCTGGACAAGGGCTATAAAGGAGCCGCCGGGGCCATCACAGGAGTGGTCATCAGCCACGGGTCCGTCATCAGCTGCAGCCTGGACAGACACCTGCGGGTGCACCGCGCTGACACTAAGGAACTGCTATATAAGCAATACCTGACGTCTAAGCTGAGCTGCGTCCTGGTGCAGACAGCTTCCAGCACGCCCATGAAGGACGTGCAGCCGGAGATGAAGGAGGAGCTGGAAATGAAGGAGGAAACAGCGCTGGAAGACCTGGAGTCTGCCAGTGAGAAGCCTCAAAAACGTTCGGATGGCGAACAATCACAGGAAATAGACGCGAAGAAGATGAAGCCGTCAACAGAGGGCAGCACCGTCGCTGACGATGAGGACGCCATCATCAGCCTGCTGCGGAGCACAGAGAGACAGAAGAAGAAAAGAGACAAGATGAAGAAGAACAAGAAGGCAAAGAGCGTGTTCCATAATGCCTGA

Protein sequence:

>DPOGS214987-PA
MQIKEDKEIDLFVASRIGSFKHIKYHTDPSKNSKKCIENLVDIKTLQKDDNITCMVWGSPEQTDILIGRKTQQIQVYNTLHGFTKSYTADFGSGDVVGLGRHGRRLVAAVSEGVVQVYGKKENVTYNVGKIDRMKIFDGDTTMFATGGEENDLKVYRIGEAEPLFVAKNLPHDWLQLRKSVWVSDLTFLSPSELAVCSRHGYIRLYDTRAQRRPVCNVECDKMAATCISKGFDERQVFVGFGRGQLHQVDLRRGHLDKGYKGAAGAITGVVISHGSVISCSLDRHLRVHRADTKELLYKQYLTSKLSCVLVQTASSTPMKDVQPEMKEELEMKEETALEDLESASEKPQKRSDGEQSQEIDAKKMKPSTEGSTVADDEDAIISLLRSTERQKKKRDKMKKNKKAKSVFHNA-