Monarch geneset OGS2.0

DPOGS208034
TranscriptDPOGS208034-TA1056 bp
ProteinDPOGS208034-PA351 aa
Genomic positionDPSCF300203 - 34970-40995
RNAseq coverage299x (Rank: top 37%)
Annotation
HeliconiusHMEL0040550.087.18% 
BombyxBGIBMGA001502-TA3e-13481.58% 
Drosophilabetaggt-I-PA2e-10050.29% 
EBI UniRef50UniRef50_A7SXB01e-10352.92%Predicted protein n=3 Tax=Metazoa RepID=A7SXB0_NEMVE
NCBI RefSeqNP_001177770.13e-17581.27%geranylgeranyltransferase type I beta subunit [Bombyx mori]
NCBI nr blastpgi|3000689696e-17481.27%geranylgeranyltransferase type I beta subunit [Bombyx mori]
NCBI nr blastxgi|3000689692e-17181.27%geranylgeranyltransferase type I beta subunit [Bombyx mori]
Group
Gene OntologyGO:00038242.5e-07catalytic activity
KEGG pathway 
InterPro domain[9-347] IPR0089304.6e-87Terpenoid cylases/protein prenyltransferase alpha-alpha toroid
[231-270] IPR0013302.5e-07Prenyltransferase/squalene oxidase
Orthology groupMCL14265 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208034-TA
ATGAATAACGAGGAAAATAACTGTTTGGCTCACAGACAGCACGTTAAATATTTTATGAGGTTTCTCAATGTTCTACCAGCTTCTCTATCATCTCACGACACGACCAGGGTAACCATAGCATATTTTTCAGTGGCCGGTCTAGATGTATTAGGTTCTATAACATCAATGACAATAGATATGCAGTCGAGGATAATAGAATGGATATACAGGCTACAAGTTGAACCAAATAAAGAAACGGGAGATATGACAGCTTGTGGTTTTCAAGGTTCTTCGACAATTAACATGCCCTTCGACTCGGAAAAGAGTCAATACAGATGCGGTCACCTGGCTATGACGTACACCGGCCTCTGTGTGCTGTTGACTTTAGGTGACGACCTGTCTAGAGTGAACAGGAGAGCCTTGGTTGAAGGCGTGAAAGCTTTACAGCGCGAGGAAGGTAATTTTTCAGCGACGCTATCTGGCTGTGAGTCAGATATGAGATTCGTGTATTGCGCCGCCTGTATCAGTTACATTCTGAACGATTGGTCGGGTTTTGACGTTAAACGTGCCACTGACTACATAATAGATTCCATAGGTTACGACTACGGTATCGCTCAGTGTCCAGAGCTCGAATCCCATGGCGGGACCACATTCTGCGCTCTGGCAACACTCAGTTTGACGAACCAATTGGATAAATTGACTATAGAACAAATAGAGGGCTTGAAGCGGTGGTTACTGTTTAGACAGATAGATGGTTTTCAAGGTCGCCCCAACAAACCCGTCGATACTTGCTATAGTTTTTGGGTAGGCGCTTCATTAAAGATCTTGGATGCCTTACATCTATCTAACTTTGAGAGCAACAAGAGTTACGTGTATGAGACTCAGGATTGCGTTGTCGGTGGATTCTCAAAATGGCCGGATACATGCACGGATCCGATGCATACATATCTCGGTTTGGCAGGACTTAGTCTAATAGGTGAGAGCGGGCTCCTTGAAATTATACCAACTTTAAATATAACGAAGAAGGCCCACGACCATATGAAATATTTACACCGAATGTGGGAGACCGAATCATAG

Protein sequence:

>DPOGS208034-PA
MNNEENNCLAHRQHVKYFMRFLNVLPASLSSHDTTRVTIAYFSVAGLDVLGSITSMTIDMQSRIIEWIYRLQVEPNKETGDMTACGFQGSSTINMPFDSEKSQYRCGHLAMTYTGLCVLLTLGDDLSRVNRRALVEGVKALQREEGNFSATLSGCESDMRFVYCAACISYILNDWSGFDVKRATDYIIDSIGYDYGIAQCPELESHGGTTFCALATLSLTNQLDKLTIEQIEGLKRWLLFRQIDGFQGRPNKPVDTCYSFWVGASLKILDALHLSNFESNKSYVYETQDCVVGGFSKWPDTCTDPMHTYLGLAGLSLIGESGLLEIIPTLNITKKAHDHMKYLHRMWETES-