Monarch geneset OGS2.0

DPOGS206512
TranscriptDPOGS206512-TA1245 bp
ProteinDPOGS206512-PA414 aa
Genomic positionDPSCF300367 + 121650-123263
RNAseq coverage571x (Rank: top 22%)
Annotation
HeliconiusHMEL0066132e-8670.83% 
BombyxBGIBMGA012754-TA2e-4291.30% 
Drosophilaapp-PJ9e-11962.38% 
EBI UniRef50UniRef50_D6WMN72e-12061.61%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WMN7_TRICA
NCBI RefSeqXP_966647.26e-12251.84%PREDICTED: similar to CG5620 CG5620-PA [Tribolium castaneum]
NCBI nr blastpgi|1892377011e-12051.84%PREDICTED: similar to CG5620 CG5620-PA [Tribolium castaneum]
NCBI nr blastxgi|1892377013e-11553.18%PREDICTED: similar to CG5620 CG5620-PA [Tribolium castaneum]
Group
Gene OntologyGO:00082708.9e-26zinc ion binding
KEGG pathway 
InterPro domain[132-185] IPR0015948.9e-26Zinc finger, DHHC-type, palmitoyltransferase
Orthology groupMCL12954 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206512-TA
ATGGCGTCGCGGCGCGTGACGCGCAAGTGGGAGGTGTTCGCCGGACGGAACCGGTTCTGGTGCGACGGACGCCTCATGACGGCGCCACATCCCGGGGTGTTCGCGCTCACGCTGGCTCTCATCTGTGGCACGTGTGTGCTGCACTTTGCCTTCGACTGCCCCTTCCTGGCAGCGCGCGTGTCCGGCGCTGTGCCGGCGGCGGGCGCGGCGCTGTGTGGAGTGACGCTGGCGGCTCTGCTGCGCACGGCCTTGTCGGATCCGGGTATAATTCCGCGCGCAGCTCCGCACGAGGCCGCCGCGTTGGGAGCGCTGGAGGCGGCCGACGGAGCCGCTGGCCGCCCGCCGCCGCGCGCTCGGGAGGTGCTCGTGCGCGGACGGCCCGTCAAGCTCAAGTATTGTTTCACTTGCAAGATGTTTCGTCCGCCGCGCGCCTCGCACTGCTCGCTCTGCGACAATTGCGTCGACCGCTTCGACCACCATTGCCCCTGGGTCGGCAACTGTGTCGGGAAGCGCAACTATCGCTACTTCTACTTGTTCGTGGTGTCGCTGTCCTTCCTGGCGGTGTGGGTGTTCGCGTGTGCGGTGACTCACCTGGCGCTGCTGGCGCGAGGCGCTGGGCTGGCGGCAGCACTGCGGGCGACTCCCGCCTCCGCCGTCGCGGCCGCGGTGTGCTTCCTGTCAGTGTGGTCGGTGCTGGGGCTGGCCGGCTTTCACACCTACCTCGCCTCTACGGACCAGACAACTAACGAGGATATAAAGGGATCATTCTCCCGCCGCGGCTCGGGAGGTGCGGGCACGAACCCGTACTCGCGCGGTAACGCGTGCGCAAATTGCTGGCATGTTCTCTGCGGTCCCCTCGCGCCGAGTCTCATCGACCGGCGCGGTGTCTTGTCGAGCGACACACGCGACGACCTACCGCCGCGCTTCGCTCACATCATGTGCGTGCCCCCTGCCCCGGCCCCGGGACCCCCCGCAGTCCCCGCACCCCTAGCCACCTCGCACTCGCCCGCACCGACACACAGGAACGGAGCCCTCGGCGGGAGTTACACGAATCTGTTCGAAGGTGGCGACGCGACGCGCCACGCGTACACGAACCACAGCCTGGAGCCCGAGCCCGTGCCGCTGCAGGAGGTGGCGGTGGCGGCCGCGCTCAGCGCCTCGCGCCTGCGCCTGCTGCACGACACCACCATGATAGACGCGGCGCTCGACCTCGACGACCCCGTGGCGCCCGCCGCCGCGCTCTGA

Protein sequence:

>DPOGS206512-PA
MASRRVTRKWEVFAGRNRFWCDGRLMTAPHPGVFALTLALICGTCVLHFAFDCPFLAARVSGAVPAAGAALCGVTLAALLRTALSDPGIIPRAAPHEAAALGALEAADGAAGRPPPRAREVLVRGRPVKLKYCFTCKMFRPPRASHCSLCDNCVDRFDHHCPWVGNCVGKRNYRYFYLFVVSLSFLAVWVFACAVTHLALLARGAGLAAALRATPASAVAAAVCFLSVWSVLGLAGFHTYLASTDQTTNEDIKGSFSRRGSGGAGTNPYSRGNACANCWHVLCGPLAPSLIDRRGVLSSDTRDDLPPRFAHIMCVPPAPAPGPPAVPAPLATSHSPAPTHRNGALGGSYTNLFEGGDATRHAYTNHSLEPEPVPLQEVAVAAALSASRLRLLHDTTMIDAALDLDDPVAPAAAL-