Monarch geneset OGS2.0

DPOGS213241
TranscriptDPOGS213241-TA969 bp
ProteinDPOGS213241-PA322 aa
Genomic positionDPSCF300124 - 355874-362989
RNAseq coverage393x (Rank: top 31%)
Annotation
HeliconiusHMEL0030542e-15885.14% 
BombyxBGIBMGA009520-TA2e-11871.93% 
DrosophilaCG11975-PA2e-5133.53% 
EBI UniRef50UniRef50_Q7ZUX33e-5736.36%WD repeat domain phosphoinositide-interacting protein 4 n=37 Tax=Coelomata RepID=WIPI4_DANRE
NCBI RefSeqXP_968426.18e-6942.29%PREDICTED: similar to WD repeat domain 45 [Tribolium castaneum]
NCBI nr blastpgi|3838651828e-6841.67%PREDICTED: WD repeat domain phosphoinositide-interacting protein 4-like [Megachile rotundata]
NCBI nr blastxgi|3071701684e-6641.86%WD repeat domain phosphoinositide-interacting protein 4 [Camponotus floridanus]
Group
Gene OntologyGO:00055154e-26protein binding
KEGG pathway 
InterPro domain[3-298] IPR0110464e-26WD40 repeat-like-containing domain
[6-295] IPR0159435.3e-20WD40/YVTN repeat-like-containing domain
Orthology groupMCL25618 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213241-TA
ATGGCGCGCCGGAGGAGCAGCGGTATAACAAGTCTGGGGTTTAACCAGGACCAAGGCTGCTTCACATGCTGCCTAATATCGGGCCTGCGCGTTTACAATGTTGACCCTCTGGTTGAGAAAGCACATTACAGTAAGGAGGAACTCGGTGAAGTATCTCTCTGTGAGATGGTATTTCGCACTAACTGGTTACTAGTAGTTCGTGCAAGAAGACCATGCAGCCTTATGCTACTTGACGACCAGCAGAGAGCTTTTAGAGCCGAGGTCCTATTTAAATCACCAATACGAGCCTTGAAAGCTAGGAAGGATAAAGTAGCCGTAGTGTTGTCATCAACAGTACAAATCCTCTCTCTGCCGTCGTTGACTCGGGTTGCCTTACTGCGTACCCCGAGTGCTGGCCGTCCTCTCTGTGCGATAGCCACAGACCCCGGGGCAGCGCAATTAGTGGCTGCGCCCGCGCACAGGAAGGGGTCCCTTCAGATTTTGGACGTGTCCCGAGCTATAAAGAACGCGGCGTCAAGTTCACCGGCGGTGGTAAGTTGTCATCAGACAGATCTGGTCTGCATCAGTCTGTCTCCCAATGGAGCGAAGCTAGCCACCGCCAGCGAACGTGGAACCATCATAAGGCTGTGGGATACCAACACTAAACACATGCTGCACGAGCTACGACGAGGATCTGATTACGCTGATGTTTATTGTATCAACTTCAACTGGTCGGGTACATTGGTGTGCTGTGTTTCGGACAAGGGCACCCTACACGTGTGGCTCGCGCGAGGTAACTACACGCACGTGTGTGCGGCGCCTGCTACCTCTCAGAGAGCTCTGTGTGCCTTCAGTGACGACAGTAGCGCCATAGTCATCTGCGAGGACGGAACGTTTCACAAATTCACATTCTCTACTGAAGGCAGTTACCATCGTAACGACTTTGAATATTTTTTACAGGTCGGCGATGACGACGAGTTCTTGCATTGA

Protein sequence:

>DPOGS213241-PA
MARRRSSGITSLGFNQDQGCFTCCLISGLRVYNVDPLVEKAHYSKEELGEVSLCEMVFRTNWLLVVRARRPCSLMLLDDQQRAFRAEVLFKSPIRALKARKDKVAVVLSSTVQILSLPSLTRVALLRTPSAGRPLCAIATDPGAAQLVAAPAHRKGSLQILDVSRAIKNAASSSPAVVSCHQTDLVCISLSPNGAKLATASERGTIIRLWDTNTKHMLHELRRGSDYADVYCINFNWSGTLVCCVSDKGTLHVWLARGNYTHVCAAPATSQRALCAFSDDSSAIVICEDGTFHKFTFSTEGSYHRNDFEYFLQVGDDDEFLH-