Monarch geneset OGS2.0

DPOGS200123
Transcript: DPOGS200123-TA, 1041 bp
Protein: DPOGS200123-PA, 346 aa
Genomic position: DPSCF300044 + 962681-969384
RNAseq coverage: 157x (Rank: top 52%)
Annotation
Heliconius: HMEL015484, 2e-132, 71.18%
Bombyx: BGIBMGA002401-TA, 3e-131, 68.37%
Drosophila: qm-PB, 1e-124, 64.42%
EBI UniRef50: UniRef50_A1IIW7, 1e-98, 52.66%, Geranylgeranyl diphosphate synthase-3-B-isoform n=6 Tax=Nasutitermes takasagoensis RepID=A1IIW7_9NEOP
NCBI RefSeq: XP_308860.3, 3e-130, 68.28%, AGAP006894-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|158286675, 5e-129, 68.28%, AGAP006894-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|158286675, 1e-123, 68.28%, AGAP006894-PA [Anopheles gambiae str. PEST]
Group:
Gene Ontology: GO:0008299, 4.4e-35, isoprenoid biosynthetic process
KEGG pathway: aga:AgaP_AGAP006894, 8e-130
    K00804 (GGPS1) maps-> Terpenoid backbone biosynthesis
InterPro domain: [13-347] IPR017446, 1.6e-130, Polyprenyl synthetase-related
    [11-192] IPR008949, 2.6e-48, Terpenoid synthase
    [22-192] IPR000092, 4.4e-35, Polyprenyl synthetase
Orthology group: MCL13331, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200123-TA
ATGTCGCAAATTTCTATAGAAAATGGTAATAAGGATCAGGACGAGAAACTGCTCATGCCTTTTACATACATCCAACAAGTACCTGGGAAGCAAGTTCGGGCTAAACTTACACTCGCCATCAATTACTGGTTGAAAGTCAGTGATAACAAACTCAAAGCCATTGGAGAAATTGTGCAAATGTTACACAACTCAAGCTTACTTTTGGATGATATACAAGATAATTCTATACTCCGTCGCGGGATACCAGTCGCGCATTCCATATATGGCATCGCTAGTACTATCAACGCTGCCAATTATGTGGTCACCATTGCTTTGGGAAAGACATTGCAGCTTGATCATCCTCTGGCAACAACAGTGTACACAGAACAGCTTCTTGAGCTGTATCGAGGACAAGGTATTGAGATATACTGGAGAGATAACTTCTATTGTCCCACTGAAGACGAATATAAGGAGATGACCATAAAAAAAACCGGTGGTCTCTTTATGTTGGCGATACGTCTCATGCAGTTGTTCAGTGAGAACAAATCAGATTTCACCAAACTGTCATCCATATTGGGACTGTATTTCCAGATCCAAACCGGTGGTCTCTTTATGTTGGCGATACGTCTCATGCAGTTGTTCAGTGAGAACAAATCAGATTTCACCAAACTGTCATCCATATTGGGACTGTATTTCCAGATCCGTGACGACTACTGTAATTTGTGTTTACAGGAGTACAGCGAGAATAAGAGCTACTGTGAGGATTTAACTGAGGGAAAATTCAGTTTCCCAATAATACACGCGATACACACACACAAGGAAGATAATCAAGTCCTTCACATCCTCCGTCAGCGGACACGTGATGTCGAGGTGAAGCGTTATTGCATCTCATTGTTGGAGAAGTTTGGCAGCTTCCAGTACACACGAGAACGCCTGGCCATGTTGGACCAGGAAGCTAGAGATGAGGTTCGACGCCTGGGTGGCAACCCTCACTTGGAAGAATTCTTAGACGACCTGTTGAGTTGGAGACGAGACAAGAAACTAAACAATTTTGAACAATAA

Protein sequence:

>DPOGS200123-PA
MSQISIENGNKDQDEKLLMPFTYIQQVPGKQVRAKLTLAINYWLKVSDNKLKAIGEIVQMLHNSSLLLDDIQDNSILRRGIPVAHSIYGIASTINAANYVVTIALGKTLQLDHPLATTVYTEQLLELYRGQGIEIYWRDNFYCPTEDEYKEMTIKKTGGLFMLAIRLMQLFSENKSDFTKLSSILGLYFQIQTGGLFMLAIRLMQLFSENKSDFTKLSSILGLYFQIRDDYCNLCLQEYSENKSYCEDLTEGKFSFPIIHAIHTHKEDNQVLHILRQRTRDVEVKRYCISLLEKFGSFQYTRERLAMLDQEARDEVRRLGGNPHLEEFLDDLLSWRRDKKLNNFEQ-