Monarch geneset OGS2.0

DPOGS201129
TranscriptDPOGS201129-TA1278 bp
ProteinDPOGS201129-PA425 aa
Genomic positionDPSCF300065 - 663563-671718
RNAseq coverage1064x (Rank: top 12%)
Annotation
HeliconiusHMEL0024437e-18086.34% 
BombyxBGIBMGA007298-TA2e-3030.08% 
DrosophilaAtg18-PA1e-11155.21% 
EBI UniRef50UniRef50_Q7ZWU51e-11855.83%WD repeat domain phosphoinositide-interacting protein 2 n=37 Tax=Metazoa RepID=WIPI2_XENLA
NCBI RefSeqXP_966338.29e-12454.99%PREDICTED: similar to Autophagy-specific protein, putative isoform 1 [Tribolium castaneum]
NCBI nr blastpgi|3838617618e-12352.24%PREDICTED: WD repeat domain phosphoinositide-interacting protein 2-like [Megachile rotundata]
NCBI nr blastxgi|1892399087e-11654.99%PREDICTED: similar to Autophagy-specific protein, putative isoform 1 [Tribolium castaneum]
Group
Gene OntologyGO:00055151.8e-23protein binding
KEGG pathway 
InterPro domain[19-263] IPR0110461.8e-23WD40 repeat-like-containing domain
[315-348] IPR0159434.9e-20WD40/YVTN repeat-like-containing domain
Orthology groupMCL14942 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201129-TA
ATGAGTTTGGGAGGAGGTCATAGCACAGAGGGATCAAATGCAGGAGGAATATTTGTACAATTCAACCAGGACTGCACGTCGCTGGTGGCCGGCAGCAGTAGCGGCTACCACCTGTTCGCGCTCACTCCGGACGATGGTGTGGAGGAGATCTACGCCAGTCGCTCAGGACTCGACACGTGCTTCGTGGACCGTCTCTTCAGCAGCTCGCTGGTGGCCGTGGTCACAGTCTCCGCTCCGAGGAAGTTAATAGTATGTCACTACAAGAAAGGCACCGAGATATGTAACTACAGCTACAGCAACACTATACTCGCCGTCAAGCTGAACAGATCCAGGCTGATCGTGTGTCTGGAGGAGTCGCTGCACATCCACAACATCCGCGACATGAAGATCCTGCACACAATCCGGGACACGCCTCCCAACCCTCGAGGACTGTGCGCGCTGTCCCCTTGCGTCGAGAGATGTCTCGTGGCGTACCCCGGATCGAGCGCGGTGGGGGAGGTGCAGATCTTCGACGCCGTCCACCTGAACGCCAAGTGCGTGATTGGAGCCCACGACAGTCCCCTGGCCGCCCTGGCCTGGTCCATGTGCGGGAAACGCCTCGCCACCGCCTCCGAGCGCGGGACGGTGATTCGCGTATTCGCGGTGCCCGAGCGCACGCGACTCTACGAGTTCCGACGTGGCGTGAAGCGCTGCGTGTCCATAGCCTGCCTGGCCTTCAGCGCCTGCGGGGCGTACCTGGCGGCCACCTCCAACACGGAGACCGTGCACGTGTTCCGGCTGAGGGAGGGCGCACCGCCGCCGCCCGCAGAGGACGCGGCCGCGCCGCCCGACGGCTGGATGGACTGGCTGTCACAGGCGGTGTCGCGCGGCGTCACCTACCTGCCGCCGCAGTTCACGGACGTGCTGACTCAGGGCCGAGCGTTCGCCGCGGCTCGTCTCCCGCGGCCCGCGCGCCACGCCGTGGCCGCCGTCACGAGCTCGGCACGCGCCCTGCGGCTGGTGGTGGCCACGGCGGACGGGGACGTGTACGTCTTCGGTCTGGACGCGGCCGAGGGCGGCGAGTGTCCGCTGCTACGCACCCACCGCCTGCTGGAGCGCTCTCCCTCCGTCACCGAAGAGTCGGAGCGCGCCTTGGAGCTGGGCTCGGCGCTGGCCGGCTCGCCTCCGCGAGGTTCTTTCGATCTCCGCGACCACAAACACTTCCCGCCGATGAGAGCCCCCGCGACGCAGGCCTCCGCCCCCGCCGGGGACGTTCCGCCCTCCACAGGCGACAACTGA

Protein sequence:

>DPOGS201129-PA
MSLGGGHSTEGSNAGGIFVQFNQDCTSLVAGSSSGYHLFALTPDDGVEEIYASRSGLDTCFVDRLFSSSLVAVVTVSAPRKLIVCHYKKGTEICNYSYSNTILAVKLNRSRLIVCLEESLHIHNIRDMKILHTIRDTPPNPRGLCALSPCVERCLVAYPGSSAVGEVQIFDAVHLNAKCVIGAHDSPLAALAWSMCGKRLATASERGTVIRVFAVPERTRLYEFRRGVKRCVSIACLAFSACGAYLAATSNTETVHVFRLREGAPPPPAEDAAAPPDGWMDWLSQAVSRGVTYLPPQFTDVLTQGRAFAAARLPRPARHAVAAVTSSARALRLVVATADGDVYVFGLDAAEGGECPLLRTHRLLERSPSVTEESERALELGSALAGSPPRGSFDLRDHKHFPPMRAPATQASAPAGDVPPSTGDN-