Monarch geneset OGS2.0

DPOGS200738
TranscriptDPOGS200738-TA1638 bp
ProteinDPOGS200738-PA545 aa
Genomic positionDPSCF300030 + 156463-158734
RNAseq coverage142x (Rank: top 54%)
Annotation
HeliconiusHMEL0089620.072.24% 
BombyxBGIBMGA001038-TA5e-15173.64% 
DrosophilaPoc1-PA4e-7241.02% 
EBI UniRef50UniRef50_E2BSC15e-8347.47%WD repeat-containing protein 51A n=6 Tax=Formicidae RepID=E2BSC1_HARSA
NCBI RefSeqXP_624420.16e-8449.83%PREDICTED: similar to TUWD12 [Apis mellifera]
NCBI nr blastpgi|3838519314e-8449.50%PREDICTED: POC1 centriolar protein homolog A-like [Megachile rotundata]
NCBI nr blastxgi|3838519317e-8247.35%PREDICTED: POC1 centriolar protein homolog A-like [Megachile rotundata]
Group
Gene OntologyGO:00055152.6e-79protein binding
KEGG pathway 
InterPro domain[6-291] IPR0110462.6e-79WD40 repeat-like-containing domain
[10-302] IPR0159437.2e-78WD40/YVTN repeat-like-containing domain
[57-89] IPR0197811.3e-10WD40 repeat, subgroup
[252-291] IPR0016801.3e-09WD40 repeat
[35-49] IPR0204725.4e-06G-protein beta WD-40 repeat
Orthology groupMCL12605 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200738-TA
ATGGAAAATAATTATATGTCTCTTGAACCTTCTTTAGAAAAACAACTTAAAGGGCATCGCAAAAGTATAACATCATTATTTTTCAATCCTAATGAACAACAGTTAGCAAGTGGCTCTTTAGATAATAGCATATTGTTATGGGATTTACGTGGGACGATGCGAAGTTACAGATTCCAAGGCCACGATGAAGCTGTTATGGATGTAACATTTTCTCCAACGGGAAAATATATGGCTTCCGCCTCTCGAGACAAAACTGTTCGTCTTTGGGTACCAACTGTAACAGGAAGTACTGGTACATTTAAAGCGCATTCTCAGACTGTGCGATCCATAATAACAGCATCAGATGATAAAATTGTTAAGTTGTGGTCGAGTGAGAAACATAAGTTTTTGGCTTCATTTGTTGGTCACACAAATTGGGTTCGTCGGGCTCGCATCTCTCAAGATGGATCACTGATAGCATCATGTTCTGATGATAAAACAACAAAATTGTGGAACATAGAAACTGGTGTATGTATAAATACATATAAAGATCAGAGCGCACATGGGTTACATCTGGCCTGGCATCCATCGAGTTGCTATGTGGCTATTGGTACTTCTAAGGGAAATATTAAATTGTATGATGTTAGGACACACAATCTTGTACAATTCTACAGTATACACAATGATGCTGTCACACAGCTTGTCTTTCATCCGAGCGGAAGTTATATATTAACTTCCAGTAAAGACGGGACCATGAAGATTCTTGACCTGCTGGAAGGTCATCCAATATTTACATTAACGGGACACTCTGGGCCGATCAATGCCGTAGCATTCTCTCCTAGTGGACAAAAATTTACCTCTGCTGGTGATGACAAGCTGGTCTTTATTTGGAAGACTAAATTCACTGAACTTTCAAATCAAACAAAAGAAAATGTCGGACAAAATCAATTATCGACTCCCAGATCACAGACCTCTAAAGGTGTGAAGATGTGGTATCAGTGCTATCACAACACTAAGTATCCTGTGAAAACTTTTGTCCACATTGATAATGATCCTCATGATTCTGTAAGAAGCATCAGCTTCCAAGATGCACCTAATGCAAACAGTACTATGATAGAACCTGGCGATAATCAAGTGAATGTGACAAGAACTGCTTCAAATGTCGTAAGAAACAACTCAAATTCTCAATTTCACGGGGATTGTCAATCTAGTGAATGTTCGTCATACAGTGTGGGGCATATCAGCCATATTCCTCGTCAACCTCCTACACCGGCTACATATACAATTCCATCAATTGTCGACAAGTCGATTCAATTCCATAATGATGAAGACATCTTTGGTATGATCAAACTAGGAGATGACGATGTATGTTATACAAGTGGTACTATTGAATGGGTAGGTCGTAAACGAAGCTTCCCATTGAATAGTGGTTTTAAAATTCTAAGTCACAAAGAGACTAGTGAGCCGTTCTGTAAGAGAAAGAAACAAGTTGAGAATGAAACTTACATCAGTGGTGTTAACAAATTAAAGGAGTGCGATTGTGCTGATGTTTTGCCCACTGTGAATGCGATTGTAGATCATCTTAATGCGTTACACGAAGCGGTAGATCTTGTTGATTTACGACTCAACACATTGGAGGAAGCCGTGAATCCCAATTGA

Protein sequence:

>DPOGS200738-PA
MENNYMSLEPSLEKQLKGHRKSITSLFFNPNEQQLASGSLDNSILLWDLRGTMRSYRFQGHDEAVMDVTFSPTGKYMASASRDKTVRLWVPTVTGSTGTFKAHSQTVRSIITASDDKIVKLWSSEKHKFLASFVGHTNWVRRARISQDGSLIASCSDDKTTKLWNIETGVCINTYKDQSAHGLHLAWHPSSCYVAIGTSKGNIKLYDVRTHNLVQFYSIHNDAVTQLVFHPSGSYILTSSKDGTMKILDLLEGHPIFTLTGHSGPINAVAFSPSGQKFTSAGDDKLVFIWKTKFTELSNQTKENVGQNQLSTPRSQTSKGVKMWYQCYHNTKYPVKTFVHIDNDPHDSVRSISFQDAPNANSTMIEPGDNQVNVTRTASNVVRNNSNSQFHGDCQSSECSSYSVGHISHIPRQPPTPATYTIPSIVDKSIQFHNDEDIFGMIKLGDDDVCYTSGTIEWVGRKRSFPLNSGFKILSHKETSEPFCKRKKQVENETYISGVNKLKECDCADVLPTVNAIVDHLNALHEAVDLVDLRLNTLEEAVNPN-