Monarch geneset OGS2.0

DPOGS200060
TranscriptDPOGS200060-TA951 bp
ProteinDPOGS200060-PA316 aa
Genomic positionDPSCF300044 - 979319-983204
RNAseq coverage41x (Rank: top 72%)
Annotation
HeliconiusHMEL0161643e-8147.10% 
BombyxBGIBMGA002401-TA2e-7846.08% 
Drosophilaqm-PB1e-7944.73% 
EBI UniRef50UniRef50_O957497e-7046.76%Geranylgeranyl pyrophosphate synthase n=73 Tax=Eukaryota RepID=GGPPS_HUMAN
NCBI RefSeqXP_001605679.13e-8247.77%PREDICTED: similar to ENSANGP00000021639 [Nasonia vitripennis]
NCBI nr blastpgi|1565547595e-8147.77%PREDICTED: geranylgeranyl pyrophosphate synthase-like [Nasonia vitripennis]
NCBI nr blastxgi|1565547593e-7948.22%PREDICTED: geranylgeranyl pyrophosphate synthase-like [Nasonia vitripennis]
Group
Gene OntologyGO:00082993.7e-36isoprenoid biosynthetic process
KEGG pathwaynvi:1001220748e-82 
 K00804 (GGPS1)maps-> Terpenoid backbone biosynthesis
InterPro domain[2-307] IPR0174464e-92Polyprenyl synthetase-related
[7-307] IPR0089492.8e-55Terpenoid synthase
[21-278] IPR0000923.7e-36Polyprenyl synthetase
Orthology groupMCL23295 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200060-TA
ATGTCTGACTATGAAAAAAATGGACAAGAAGATTTGGAAAAAGAACTACTCGCACCTTTCACACACCTGCTTCAAGTATCAGGCAAGCGCTTCCGTAATAAAATCGTTTTGGCATTCAATCACTGGTTAAAAGTACCCGAAGATCAAGTACAGAGAGCAATGGACGTAACGACCACTTTGCACATTGGATCTTTACTACTTGACGATATCCAAGACAACTCTTTATCTCGGCGAGGACTTCCTGCTGCTCACTGTATTTATGGACTTCCCCTAACGTTAAATACTAGCATGCAAGTGGCTATGATATGCTTTCAAAAAACTATTCAGTTAACTCCTAGTGGAGAGGGTGGCTACATATACGTTAATCACTTGCATGATGCAATAGTCGGCCAAGGTTTTGATATTTACTGCCGAGACAACCTCATGTGTCCAACCGAGGCGGAGTACAAAAAAATGGTCGAAAGAAAAACTGGCGGTATGTTATTATTGGGAGTCAAATTGATACAACTATTTAGCGAAAATAAACAAAACTATGACGATTTTGTAAGACTTCTTGGATATTACTTTCAGCTGAGGGATGATTATTGCAATTTAAGACAGCAAGAGGCACTGGAAGAAGGGCCCGGGGGAGAAGACATACACGCATCCAAAGAAAATATTTTTTGTGAAGATATAACTGAGGGTAAATTCAGTTTACCTATTATACATGCAATGACGACAACGGAAGGTCCAACAATATTAAGGATATTGCGTCAGCGCACTCGTAATATGGAGTTGAAGAAGTATTGCCTTTCATTGATGGAGATATCTGGTAGTCTACAGTACACAAGGGACTTGCTGTCTGAGTTGGACCGAAAGGCAAGGAAAGAGCTCATACGTTTAGGTGGAAATCCTATGTTGGAAGACGTTTTAGATAGTCTACTGAGTTGGAAGGACGTGGAAGCAAGTTAA

Protein sequence:

>DPOGS200060-PA
MSDYEKNGQEDLEKELLAPFTHLLQVSGKRFRNKIVLAFNHWLKVPEDQVQRAMDVTTTLHIGSLLLDDIQDNSLSRRGLPAAHCIYGLPLTLNTSMQVAMICFQKTIQLTPSGEGGYIYVNHLHDAIVGQGFDIYCRDNLMCPTEAEYKKMVERKTGGMLLLGVKLIQLFSENKQNYDDFVRLLGYYFQLRDDYCNLRQQEALEEGPGGEDIHASKENIFCEDITEGKFSLPIIHAMTTTEGPTILRILRQRTRNMELKKYCLSLMEISGSLQYTRDLLSELDRKARKELIRLGGNPMLEDVLDSLLSWKDVEAS-