Monarch geneset OGS2.0

DPOGS202735
TranscriptDPOGS202735-TA1026 bp
ProteinDPOGS202735-PA341 aa
Genomic positionDPSCF300284 + 39342-56638
RNAseq coverage193x (Rank: top 48%)
Annotation
HeliconiusHMEL0126741e-12275.27% 
BombyxBGIBMGA005355-TA1e-2972.37% 
Drosophila% 
EBI UniRef50UniRef50_E9IVQ68e-9850.76%Putative uncharacterized protein (Fragment) n=1 Tax=Solenopsis invicta RepID=E9IVQ6_SOLIN
NCBI RefSeqXP_001814019.13e-10253.10%PREDICTED: similar to WS beta-transducin repeats protein [Tribolium castaneum]
NCBI nr blastpgi|1892401176e-10153.10%PREDICTED: similar to WS beta-transducin repeats protein [Tribolium castaneum]
NCBI nr blastxgi|1892401172e-10053.10%PREDICTED: similar to WS beta-transducin repeats protein [Tribolium castaneum]
Group
Gene OntologyGO:00055151.8e-29protein binding
KEGG pathway 
InterPro domain[8-296] IPR0110461.8e-29WD40 repeat-like-containing domain
[9-288] IPR0159432.2e-29WD40/YVTN repeat-like-containing domain
[170-204] IPR0197813.1e-06WD40 repeat, subgroup
Orthology groupMCL15796 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202735-TA
ATGGTACGATTCGACCATAGCCATGGTTCGATAATCCTATGGGATATAAGAGACTTGAATGTCAAAGATCACAAGAATATAAGAGTGAACATTCCGTTCGACCATGCTTCCCACATCGCATGGAGTCCGGACTCCAAAGCCTTCGTGATACACACGGTTCAGGAGAATCACATCCAAGTGTACAAAATAGAGAAGAAAAAAGATGGCACTATAGGAGCAGCTCATCCCATGATCACGTTCGATAAGGCCCATGAGGATGACGTGATAGGTTTTGGTATATCAAGCAATGGTAAGTTCATGATGTCATGTTCGTCGAAAACCGATATGATCATATGGGATCTCAAGGGGCAACAGCTCGAGAGGCTGGATACATACCTGATGCACACGCACACAGCCAAGATATCGCCGTGCGGTCGCTTCGTGGTCGCTACCGGTTTCTCACCGGACGTTAAAATTATGGAAGTGTGTTTCAAAAAAACGGGTGAATACAACCAAGTGACCAAAGCATACGAGCTAACCGGTCACACGTCCGGCGTTTATGACGTTGTATTCGCCGTGGATACCTCGCACATCGCAACCATATCCAAGGACGGCACGTGGAAGATATATCATACTAGAATTGAATACTCGCGCGGGGAATCCCCTCACGTGCTCAAGACCGGATCCTACACCCAGACAGCCAACCCTCCGAAGATAGCTCTATCACCCAACGCTGAGGTTCTAGCCGTCTCCAACGACTCCAACGTAGAGTTCTATGACACTTACACCGGCGAACTCTACGATACAGTGGAGAATCTCTACACCGGTTTGATAAACTACATGTTGTTCGACGCGGCTGGTAAGTATCTATTCGTTTGCGGAGATCGCGCTATAAGGATACTACATAACGTCTGCGGATACTACACTACCATCAACAGCTGTAGGAGACTACTCACGTCCAAACAGACGTTAGCGACACAGGAACGTCTCAATAACACTATACTAGAGTGTAAAAAAACTCTAGAGAAATTTGGTAAAAAATACTAA

Protein sequence:

>DPOGS202735-PA
MVRFDHSHGSIILWDIRDLNVKDHKNIRVNIPFDHASHIAWSPDSKAFVIHTVQENHIQVYKIEKKKDGTIGAAHPMITFDKAHEDDVIGFGISSNGKFMMSCSSKTDMIIWDLKGQQLERLDTYLMHTHTAKISPCGRFVVATGFSPDVKIMEVCFKKTGEYNQVTKAYELTGHTSGVYDVVFAVDTSHIATISKDGTWKIYHTRIEYSRGESPHVLKTGSYTQTANPPKIALSPNAEVLAVSNDSNVEFYDTYTGELYDTVENLYTGLINYMLFDAAGKYLFVCGDRAIRILHNVCGYYTTINSCRRLLTSKQTLATQERLNNTILECKKTLEKFGKKY-