Monarch geneset OGS2.0

DPOGS209897
Transcript        DPOGS209897-TA  1482 bp
Protein           DPOGS209897-PA  493 aa
Genomic position  DPSCF300049 + 308088-315988
RNAseq coverage   99x (Rank: top 61%)
Annotation
Heliconius      HMEL016164        2e-69  45.67%
Bombyx          BGIBMGA002401-TA  7e-75  45.27%
Drosophila      qm-PB             3e-72  44.91%
EBI UniRef50    UniRef50_F2U151   3e-64  43.90%  Quemao protein n=1 Tax=Salpingoeca sp. ATCC 50818 RepID=F2U151_SALS5
NCBI RefSeq     XP_308860.3       3e-74  43.96%  AGAP006894-PA [Anopheles gambiae str. PEST]
NCBI nr blastp  gi|383855382      4e-73  47.92%  PREDICTED: geranylgeranyl pyrophosphate synthase-like [Megachile rotundata]
NCBI nr blastx  gi|156554759      7e-71  47.44%  PREDICTED: geranylgeranyl pyrophosphate synthase-like [Nasonia vitripennis]
Group
Gene Ontology    GO:0008299  7.3e-30  isoprenoid biosynthetic process
KEGG pathway     aga:AgaP_AGAP006894  8e-74
                 K00804 (GGPS1)  maps-> Terpenoid backbone biosynthesis
InterPro domain  [13-310]  IPR017446  5.2e-84  Polyprenyl synthetase-related
                 [13-289]  IPR008949  7.2e-49  Terpenoid synthase
                 [66-266]  IPR000092  7.3e-30  Polyprenyl synthetase
Orthology group  MCL23295  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209897-TA
ATGGATATAAATAATAAAGATGATAAACAATACCTGGAGAAGGAGATTCTTGCACCTTCTACACACCTGGCAGAAGTTAACGGAGTAATTCTTATTGGCAGAATAACGTGGTCTTTCAATTACTGGTTTAAAGTCCCAGATGATTTACTTCGAAAAGTAACAGACGCTTGGGTAAAATTGATCAATGGAGTTGTTTTGATCGATGATATCCAGGATAACTCTTTAGTCCGTCGAGGAAAACCAACAGCACATTTGGTCTACGGTGTTCCCTTGACCATCAACTCCGCTGTTCAAGTAATGGCGGAGTCAATGAAAATAGCACTCGAGTTAGCACCAGATCCTTCGAGAGTTAAGGGATTCGCGGATCAATTTCATGACATGTGGAAGGGACAAGGCACTGAGGTTTATTGGAAGCACAACTTCATTTGCCCCACTGAGGATCAGTACACAAAGATGACGCATTTGAAAACCGCTACAGTTATATTAGTTGGAGTCAGATTGTTACAAGTGGTCAGTGATCATGACAAAAACTATGACGATCTAGCGCGGCTCCTCGGACATTACTCTCAGCTTCGAGATGACTATTGTAATCTTCGTGCGAAGGAGTTAAACAGCAATGGCAGTTTCTGTGAAGATATATCAGAAGGAAAGTTCTCCTTGCCTATCATACACGCATCCAAAACGTCAGTGGGAAACGAAGTTTTACGTATCTTACGACAGCGTACCCGTGATATTGACTTGAAGAAATACTGTATGTCTTTGTTGGAGAAGGCGGGCAGCATACAGTATACGAGAAACAGACTCAGTGAACTTGACCGAGAAGCTCGGGCTGAGGTCGCCCGGTTAGGCGGAAACCCTCTACTGGAAAAATGTTTGGATGATCTCAGTAGCTGGAAGGATGACGTAATGAGAGTTAAGGGATTCGCGGATCAATTTCATGACATGTGGAAGGGACAAGGCACTGAGGTTTATTGGAAGCACAACTTCATTTGCCCCACTGAGGATCAGTACACAAAGATGACGCATTTGAAAACCGCTACAGTTATATTAGTTGGAGTCAGATTGTTACAAGTGGTCAGTGATCATGACAAAAACTATGACGATCTAGCGCGGCTCCTCGGACATTACTCTCAGCTTCGAGATGACTATTGTAATCTTCGTGCGAAGGAGTTAAACAGCAATGGCAGTTTCTGTGAAGATATATCAGAAGGAAAGTTCTCCTTGCCTATCATACACGCATCCAAAACGTCAGTGGGAATCGAAGTTTTACGTATCTTACGACAGCGTACCCGTGATATTGACCTGAAGAAATACTGTATGTCTTTGTTGGAGAAGGCGGGCAGCATACAGTATACGAGAAACAGACTCAGTGAACTTGACCGAGAAGCTCGGGCTGAGGTCGCCCGGTTAGGCGGAAACCCTCTACTGGAAAAATGTTTGGATGATCTCAGTAGCTGGAAGGATGACGTAATGGTCAGATAG

Protein sequence:

>DPOGS209897-PA
MDINNKDDKQYLEKEILAPSTHLAEVNGVILIGRITWSFNYWFKVPDDLLRKVTDAWVKLINGVVLIDDIQDNSLVRRGKPTAHLVYGVPLTINSAVQVMAESMKIALELAPDPSRVKGFADQFHDMWKGQGTEVYWKHNFICPTEDQYTKMTHLKTATVILVGVRLLQVVSDHDKNYDDLARLLGHYSQLRDDYCNLRAKELNSNGSFCEDISEGKFSLPIIHASKTSVGNEVLRILRQRTRDIDLKKYCMSLLEKAGSIQYTRNRLSELDREARAEVARLGGNPLLEKCLDDLSSWKDDVMRVKGFADQFHDMWKGQGTEVYWKHNFICPTEDQYTKMTHLKTATVILVGVRLLQVVSDHDKNYDDLARLLGHYSQLRDDYCNLRAKELNSNGSFCEDISEGKFSLPIIHASKTSVGIEVLRILRQRTRDIDLKKYCMSLLEKAGSIQYTRNRLSELDREARAEVARLGGNPLLEKCLDDLSSWKDDVMVR-