Monarch geneset OGS2.0

DPOGS202234
Transcript: DPOGS202234-TA, 1260 bp
Protein: DPOGS202234-PA, 419 aa
Genomic position: DPSCF300149 + 591479-593974
RNAseq coverage: 112x (Rank: top 59%)
Annotation (database: best hit; E-value; identity; description)
Heliconius: HMEL009173; 2e-115; 87.33%
Bombyx: BGIBMGA013533-TA; 1e-178; 74.16%
Drosophila: CG7568-PC; 2e-110; 45.88%
EBI UniRef50: UniRef50_E0VCG1; 2e-139; 55.37%; WD-repeat protein, putative n=11 Tax=Neoptera RepID=E0VCG1_PEDHC
NCBI RefSeq: XP_394888.3; 1e-141; 56.32%; PREDICTED: similar to CG7568-PA [Apis mellifera]
NCBI nr blastp: gi|110759973; 2e-140; 56.32%; PREDICTED: WD repeat-containing protein 69-like [Apis mellifera]
NCBI nr blastx: gi|110759973; 1e-137; 56.32%; PREDICTED: WD repeat-containing protein 69-like [Apis mellifera]
Group
Gene Ontology: GO:0005515; 3.3e-90; protein binding
KEGG pathway: (none listed)
InterPro domain:
[82-419] IPR011046; 3.3e-90; WD40 repeat-like-containing domain
[86-266] IPR015943; 1.5e-48; WD40/YVTN repeat-like-containing domain
[381-418] IPR019781; 1.7e-10; WD40 repeat, subgroup
[380-419] IPR001680; 9.5e-10; WD40 repeat
[109-123] IPR020472; 1.9e-07; G-protein beta WD-40 repeat
Orthology group: MCL14890, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202234-TA
ATGAAGTTATTAAAGTTCCACCTTAGGTATCATCCTCCTGGCATCGTGCTGGAGTATCTGCAGAAGGGTATCGTCAAGAACAAGGACATCGATCTCCTAGATCTCAATAGCAACTCAGACACGAAAGCAATCGCTAACGAGCTGTATCATAAAGAACTCCTGATAACGGAAGACGTTCGTGAGCAGTTGGAGCAATGTCTCGAAACGCTGAAGAAGAGGATCGAGCACGAGGGACCCACGGGGAAGAGGTTCTACATCTACAAAACCCTACACACACACGTCCTGCCACTCACTAATGTTTGTTTTGATAAGACAGGTCAACGATGCCTCTCTGGAAGCTATGACAGAACGTGCAAAGTATGGGATGTTGAATCTGGGAAAGAATTAAAGACTTTATCGGGCCACCAAAACGTCGTCTACGCTGTCGGGTTTAATTTTCCCTCGTGTAACAGGGTTTTGACTGGTTCCTTCGACAAAACTGCTAAAATCTGGAATGCGGAGACAGGGGAATGTCTCGCCACACTGTGGGGTCACACAGGGGAGGTGGTTGCCGCTCAATTTAACTCCAAAGGAGACTATGTAGGAACTGGATCTATGGACCATCTTGCCAAATTATATGACTCTGGAACAGGCGCAGAGATTCAGACTTATGCTGGACATACAGCAGAGGTGATTGCTTTACAATTCGATCCAAATGAAGGCCAAAAACTAATCACGGGGTCCTTCGATGGAACCATATCCCTGTGGGATACGAGAGTCAGAGATCGTGTGGGGGTGCTCCGCGGTCATTCTGGGGAGATATCGAGCGTGCAGTACAACTGGGACAGCACTCTCGTGGGGTCCGCATCGTTGGACGGCAGCGCTCGTCTGTGGGATGCGAGACAGAACACCTGCCTGGCGACCGTCGCCTCGCATTCTGATGAAGTGCTGGATATCTGTTTTGACTGGGCTGGTCAACGCATGGCCACCTCGTCCAGTGACTGTTCAGCTCGCGTGTACGATGTGCGAGCTGAATTTAAGGAACTGGCTGTCATGAAGGGGCATAGAGAGGAAGTGTCGAAGGTGTGTTTCAGTCCAGCTGGTGGATGCTTGCTCACGTCCTCCGCTGACAGGAGCGCTCGCATCTGGAACACTAACACCGGTAACTGCATACAGGTGTTGTCCGGTCACCAGGGGGAGATCTTCTCCTGTGCTTTCTCGTACGCCGGCGACGCTATCGTGACCGCCTCCAAGGATAACACATGCAGGATCTGGAGATGA

Protein sequence:

>DPOGS202234-PA
MKLLKFHLRYHPPGIVLEYLQKGIVKNKDIDLLDLNSNSDTKAIANELYHKELLITEDVREQLEQCLETLKKRIEHEGPTGKRFYIYKTLHTHVLPLTNVCFDKTGQRCLSGSYDRTCKVWDVESGKELKTLSGHQNVVYAVGFNFPSCNRVLTGSFDKTAKIWNAETGECLATLWGHTGEVVAAQFNSKGDYVGTGSMDHLAKLYDSGTGAEIQTYAGHTAEVIALQFDPNEGQKLITGSFDGTISLWDTRVRDRVGVLRGHSGEISSVQYNWDSTLVGSASLDGSARLWDARQNTCLATVASHSDEVLDICFDWAGQRMATSSSDCSARVYDVRAEFKELAVMKGHREEVSKVCFSPAGGCLLTSSADRSARIWNTNTGNCIQVLSGHQGEIFSCAFSYAGDAIVTASKDNTCRIWR-
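
Consistency check (illustrative sketch):

The 1260 bp transcript corresponds to 419 codons plus one stop codon, and the protein record writes that stop as a trailing "-". The Python sketch below shows one way to check that a coding sequence translates to its listed protein. It assumes the standard genetic code; the function name `translate` and the `stop` parameter are illustrative and are not part of the database or its tools.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop` (the record above uses "-").
    Codons containing anything other than A, C, G, T become "X".
    A trailing partial codon is ignored.
    """
    cds = cds.upper().replace("U", "T")
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop if aa == "*" else aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 21 nt and last 9 nt of DPOGS202234-TA, copied from the record.
    print(translate("ATGAAGTTATTAAAGTTCCAC"))  # MKLLKFH
    print(translate("TGGAGATGA"))              # WR-
```

Passing the full DPOGS202234-TA sequence to `translate` and comparing the result with the DPOGS202234-PA string is the intended use; the two short fragments only illustrate the start and end of the record.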