Monarch geneset OGS2.0

DPOGS210245
TranscriptDPOGS210245-TA1170 bp
ProteinDPOGS210245-PA389 aa
Genomic positionDPSCF300196 + 602578-605764
RNAseq coverage171x (Rank: top 50%)
Annotation
HeliconiusHMEL0137754e-6156.60% 
BombyxBGIBMGA002378-TA8e-6151.00% 
DrosophilaCG5196-PA2e-10448.67% 
EBI UniRef50UniRef50_Q9VG803e-10248.67%CG5196, isoform A n=19 Tax=Diptera RepID=Q9VG80_DROME
NCBI RefSeqXP_967795.19e-11153.24%PREDICTED: similar to CG5196 CG5196-PA [Tribolium castaneum]
NCBI nr blastpgi|910763722e-10953.24%PREDICTED: similar to CG5196 CG5196-PA [Tribolium castaneum]
NCBI nr blastxgi|910763729e-11353.68%PREDICTED: similar to CG5196 CG5196-PA [Tribolium castaneum]
Group
Gene OntologyGO:00082704.1e-24zinc ion binding
KEGG pathway 
InterPro domain[87-140] IPR0015944.1e-24Zinc finger, DHHC-type, palmitoyltransferase
Orthology groupMCL11893 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210245-TA
ATGTCCTTCAAACGTTTGTGTCATTGGGGACCAATATGTGTATTGGGTATAATAAAACTGATAACATGGTCAATGGTGCATCTGATTGGTATGTGGTGGCCGCCTTACCTAACTCTCGGCGGGGCTCTGCACGCCGCTTTCTTCTTGAGCCTGGCAGCATCCACTCTATACTTTTTTATGCAGGCATTACTTGAGGGTCCGGGCTATGTTCCTCAGGGGTGGAAGCCTGCGGAGGAATGTGATGTCCAGTATCTCCAGTACTGCACAACATGCAAGGGATACAAAGCACCGAGATCACATCACTGTAGAAAATGTGGGCACTGCATAAAGAAGATGGATCACCACTGTCCGTGGATCAACTGCTGTGTGGGTCACAACAACCACGCCTACTTCACTCTGTTCCTCATATCCGCCGTACTGGGCTGTCTTCACGCCTCCATAGTGTTGTCTATCTGCCTGTACCACGCTATACACCGCGTGTGGTACTTGCAGTACGGCGATGGCACCGAGCCGCTGATTTACGTGACCTTGACCACGTTGCTGCTTTCCCTGCTGGCCACGGGCATGGCCGTGGGCGTTGTGCTGGCCGTGGGCGCGCTGCTCTACCTGCAGATGCGGAGTATTCTTCGGAACCGCACCACCATAGAAGACTGGATCGTGGACAAGGCGGCCTGCAGGCGAGACGAGCGCGGGCTGCCTCAGTTCCAGTTCCCTTACGACCTGGGCTGGAGGAGAAACCTGCGGCTCGTGTGGTCCGGGCACCAGTACGACGGAATACACTGGCCGGTCAGGGAGGGACACGGCCAGTATGACCTCACTATGGAGCAGAAGGCGCAGAAAGCGGACAAGGCCGTCCGCTCCCGTGTGTACGCGGCGGCCTCTACATACTCCGGCCGCTGGGTTCCCTTACTGCAGTACCCGAGAGCCGCCCTCGGCCCGCCCTGCAGTGACGAGCCTCGCCTGTCGCTGAGGGTAGGGGACCGGGTCAAAGTCACCAGGCATCGTCGGCACTGGTTGTTCGGGGAGAAGGTTCTGTCGGAGCAGGAGGCTGTGGGTGGTCTGCGAGGCGACCGCGGCTGGTTCCCGCGCTCCGCCGCCACCGGCCTCGACACGCACACATACACACACACGAACACACACACTGGCAAAAACAAACACAAACCGGACTAG

Protein sequence:

>DPOGS210245-PA
MSFKRLCHWGPICVLGIIKLITWSMVHLIGMWWPPYLTLGGALHAAFFLSLAASTLYFFMQALLEGPGYVPQGWKPAEECDVQYLQYCTTCKGYKAPRSHHCRKCGHCIKKMDHHCPWINCCVGHNNHAYFTLFLISAVLGCLHASIVLSICLYHAIHRVWYLQYGDGTEPLIYVTLTTLLLSLLATGMAVGVVLAVGALLYLQMRSILRNRTTIEDWIVDKAACRRDERGLPQFQFPYDLGWRRNLRLVWSGHQYDGIHWPVREGHGQYDLTMEQKAQKADKAVRSRVYAAASTYSGRWVPLLQYPRAALGPPCSDEPRLSLRVGDRVKVTRHRRHWLFGEKVLSEQEAVGGLRGDRGWFPRSAATGLDTHTYTHTNTHTGKNKHKPD-