Monarch geneset OGS2.0

DPOGS202015
Transcript: DPOGS202015-TA (1257 bp)
Protein: DPOGS202015-PA (418 aa)
Genomic position: DPSCF300053 - 988949-1016122
RNAseq coverage: 690x (Rank: top 19%)
Annotation
Heliconius       HMEL016759        0.0      94.27%
Bombyx           BGIBMGA012565-TA  2e-91    92.35%
Drosophila       qless-PA          2e-149   74.23%
EBI UniRef50     UniRef50_Q9V9Z3   3e-147   74.23%  CG31005 n=15 Tax=Endopterygota RepID=Q9V9Z3_DROME
NCBI RefSeq      XP_001602352.1    3e-164   67.81%  PREDICTED: similar to CG31005-PA [Nasonia vitripennis]
NCBI nr blastp   gi|358443028      0.0      94.13%  control protein HCTL026 [Heliconius erato]
NCBI nr blastx   gi|358443028      0.0      94.13%  control protein HCTL026 [Heliconius erato]
Group
Gene Ontology    GO:0008299           1.2e-52   isoprenoid biosynthetic process
KEGG pathway     nvi:100118369        9e-164
                 K12504 (PDSS1) maps-> Terpenoid backbone biosynthesis
InterPro domain  [94-418] IPR017446   2.2e-206  Polyprenyl synthetase-related
                 [92-416] IPR008949   7.5e-88   Terpenoid synthase
                 [129-377] IPR000092  1.2e-52   Polyprenyl synthetase
Orthology group  MCL14130             Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202015-TA
ATGGCGTGTTTAAGTTCGAGTAGGCTTTGTGCTCGTGTGCAGGATAAGTTAATGTGTCAATTTGGACAAAAAGTTCGAAGGCAAACTTTATTTGTGAAACTCAAACCTCCTGGACCATCACCATTGCGGATATTTGGAAAAACTTGTCAGAGAGTGACGACATGCGGACCAGCATGTGGCACTTCGGGTACAACTGTCACAGGTGCAGTTCGAAGGTTGGCATCCAGCATGCAAAGCACCAGCACTGGTTCTCTACCAGATTACGGTACACAGATAGTTGACCCATACCGTCTTCTGGAGGACGATCTCAACGGAATATACGAAGATATACGATCGGAACTAGAACGTAACACTAATCAGCCGGAGTTGAACACGATAGCTACATACTACTTCGATGGTCAGGGCAAGGCTCTGAGGCCGATGGTGGCCATATTGACAGCGAAAGCCATAAATTACCACGTATACGGAGAAAATAGTGCACTGCTGCCATCCCAGAGGCAGGTGGCTATGATCAGTGAGATGATCCACTCGGCGTCTCTCATACATGATGACGTCATCGACCAGAGCGACTTCCGGCGCGGGAAGCCGTCTGTCAATGTACTCTGGAATCATAAGAAGGTCGCTATGGCCGGTGATTTTATACTCGCAGTGGCATCCATGATGATAGCTCGCCTCCGCAGTGATGAAGTCACGCTTGTGCTCAGTCAGGTGGTGACAGACTTGGTCCAGGGAGAGTTCATGCAGCTCGGCAGTAAGGAGACTGAGAACGAACGTTTTGCGCACTACCTTACCAAGACGTATAGAAAAACCGCCTCGCTCTTTGCTAATTCAGTCAAAGCGGTGGCGCTGCTATCGGGCGCAGACGAGACCACCTGCGAGCTCGCGTTCCAGTACGGTCGTAATCTGGGACTGTCCTTCCAACTGGTCGACGATCTTTTGGACTTTGTGTCGTCGGCGCACGGCATGGGGAAACCGACCGCCGCAGATCTTAGACTAGGGCTGGCGACCGCGCCTGTACTGTTTGCTTGTGAAAAGTACCCGGAGCTGAATCCGATGATAATGAGGCGATTCCAAGACGCAGGGGATGTGGAGAAGGCTTTCGAGCTGGTTCATAAGTCGCGGGGCCTCGAACAGACTCGGTTCCTCGCTCGCAAGCACGGCCTGGAGGCGGCCCGGCTGGCCTCAGAGCTGGCGGACTCGCCTTACCAGAAGGCCTTAGTCGTGACCACTGACCTTGTACTCAATAGGATCAAATAG

Protein sequence:

>DPOGS202015-PA
MACLSSSRLCARVQDKLMCQFGQKVRRQTLFVKLKPPGPSPLRIFGKTCQRVTTCGPACGTSGTTVTGAVRRLASSMQSTSTGSLPDYGTQIVDPYRLLEDDLNGIYEDIRSELERNTNQPELNTIATYYFDGQGKALRPMVAILTAKAINYHVYGENSALLPSQRQVAMISEMIHSASLIHDDVIDQSDFRRGKPSVNVLWNHKKVAMAGDFILAVASMMIARLRSDEVTLVLSQVVTDLVQGEFMQLGSKETENERFAHYLTKTYRKTASLFANSVKAVALLSGADETTCELAFQYGRNLGLSFQLVDDLLDFVSSAHGMGKPTAADLRLGLATAPVLFACEKYPELNPMIMRRFQDAGDVEKAFELVHKSRGLEQTRFLARKHGLEAARLASELADSPYQKALVVTTDLVLNRIK-
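
The transcript and protein lengths listed above (1257 bp and 418 aa) are consistent with each other: 1257 / 3 = 419 codons, of which the last is a stop codon. The Python sketch below illustrates that check and translates the first and last few codons of DPOGS202015-TA. It assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly, and the function names `translate` and `expected_protein_length` are hypothetical helpers, not part of any database tooling. The sketch writes the stop codon as `*`, whereas the protein record above writes it as `-`.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1).
# Codons are enumerated with bases in the order T, C, A, G, which is the
# order the amino-acid string below is written in.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Codons containing characters other than A, C, G, T become 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


def expected_protein_length(transcript_bp: int) -> int:
    """Residue count for a CDS of the given length that ends in a stop codon."""
    codons, remainder = divmod(transcript_bp, 3)
    if remainder:
        raise ValueError("transcript length is not a multiple of 3")
    return codons - 1  # the final codon is the stop codon


if __name__ == "__main__":
    # First 8 codons of DPOGS202015-TA
    print(translate("ATGGCGTGTTTAAGTTCGAGTAGG"))  # MACLSSSR
    # Last 6 codons of DPOGS202015-TA, including the stop codon
    print(translate("CTCAATAGGATCAAATAG"))        # LNRIK*
    # Length check against the record: 1257 bp -> 418 aa
    print(expected_protein_length(1257))          # 418
```

Both translated fragments match the start (MACLSSSR) and end (LNRIK) of DPOGS202015-PA.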