Monarch geneset OGS2.0

DPOGS200871
TranscriptDPOGS200871-TA1353 bp
ProteinDPOGS200871-PA450 aa
Genomic positionDPSCF300071 + 619052-628299
RNAseq coverage532x (Rank: top 24%)
Annotation
HeliconiusHMEL0114751e-8569.80% 
BombyxBGIBMGA009862-TA9e-8981.59% 
DrosophilaCG34449-PA5e-13651.94% 
EBI UniRef50UniRef50_Q9W3457e-13451.94%CG34449, isoform A n=5 Tax=Endopterygota RepID=Q9W345_DROME
NCBI RefSeqXP_968940.22e-15969.95%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|1892424043e-15869.95%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|1892424041e-16568.35%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene OntologyGO:00082702e-24zinc ion binding
KEGG pathway 
InterPro domain[102-155] IPR0015942e-24Zinc finger, DHHC-type, palmitoyltransferase
Orthology groupMCL11682 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200871-TA
ATGCCCAAATGCGACCTTAAAACGAGATACATCCCCGCGACATTCGCCTGGACTTTACTTTTGGGCACAACGTCTCTGTTTTTTTATTTTCCTTGTCAGTATTACCTTCACAAACACCCATGGGTCCCAGCATACCAAGGTGTTATCACATTTTTTGTTTTGGCCAATTTTACACTGGCCACTTTCATGGACCCGGGAGTCATACCAAAAGGCAACACTCCACCTGACGAGGACAGGGAGGATGATTTCCGCGCTCCCCTGTACCGGAGCGTGGAGATCAATGGGATCACAGTCAGGATGAAGTGGTGCGTCACATGCAAGTTCTACAGACCACCTCGATGTAGTCACTGCTCCGTCTGCAACCACTGTATAGAGACGTTCGACCACCACTGCCCGTGGGTGAACAACTGTATCGGTCGTCGGAACTACCGCTTCTTCTTCTTCTTCCTCATCTCCCTCTCGATACACATGTTGAGTATATTCGGCCTCAGTCTCTATTACATCATGAACAACAACAAGACGTTGACGCAGGTCGAGCCTATTGTGTCAATGGTAATAATGGGTATAATCGCGCTCCTCGCGATTCCCATATTCGGTTTGACGGGCTTCCACATGGTGCTGGTGTCGCGGGGGCGCACCACCAACGAACAGGTCACCGGCAAGTTCACGGGGGGCTACAACCCCTTCTCGAAGGGCTGTTGGTATAACTGCTGTTATACGCAGTTCGGACCGCAGTATCCTAGCTTGGTCCGTCCCTCAAAGTACATATACAAGGGCGGCAAGAAACGTCGGGAGGGCACGGCTATATCGACGATAGCGTCGGATGCGCACGCGCACGCTCACGCACACACGCACGCGCAGGTAAAGACGTACACCGCGGACAATGGAGCCGCTCACCGGCCGCTCGGGGCGCACGCACACTACAACAAGCTGTCCCCTGGCCGCGAGGAACACGAGCCTGAGATGGAGGGGCCAGGCGCTTCTCAATCAGCGGACTGCGAGCCGACGCCTCCGCCCCTCCAACGACGGGGCTCCGCCGCTAACCTGTTCCCTCCTGAACATCATCTCCCACCGCGACCCATGCATTACCCGCGACTGAGTCCATTATCAAGGAATACCAGTCACGGCACCACCACGTCCGGCTACCGAGTGGTGATGGAGCCGATGCGTTTCGAGACCAACGGCTCGCGGCCGTCGCCCACCATGCAGCAACGCATCAAGGCGCTGGGCGTGCCCACACCGCTCGCGGTCAACAGTCCCGTCAGGAGTCGGCATGGAACCAGCATGATCCCACCGATGGCCGCCCCGAACCTTGAGGACCCCGACGCGAGCGCTCCACGGACCACGGTTTAA

Protein sequence:

>DPOGS200871-PA
MPKCDLKTRYIPATFAWTLLLGTTSLFFYFPCQYYLHKHPWVPAYQGVITFFVLANFTLATFMDPGVIPKGNTPPDEDREDDFRAPLYRSVEINGITVRMKWCVTCKFYRPPRCSHCSVCNHCIETFDHHCPWVNNCIGRRNYRFFFFFLISLSIHMLSIFGLSLYYIMNNNKTLTQVEPIVSMVIMGIIALLAIPIFGLTGFHMVLVSRGRTTNEQVTGKFTGGYNPFSKGCWYNCCYTQFGPQYPSLVRPSKYIYKGGKKRREGTAISTIASDAHAHAHAHTHAQVKTYTADNGAAHRPLGAHAHYNKLSPGREEHEPEMEGPGASQSADCEPTPPPLQRRGSAANLFPPEHHLPPRPMHYPRLSPLSRNTSHGTTTSGYRVVMEPMRFETNGSRPSPTMQQRIKALGVPTPLAVNSPVRSRHGTSMIPPMAAPNLEDPDASAPRTTV-