Monarch geneset OGS2.0

DPOGS210639
TranscriptDPOGS210639-TA963 bp
ProteinDPOGS210639-PA320 aa
Genomic positionDPSCF300715 - 7625-9618
RNAseq coverage94x (Rank: top 62%)
Annotation
HeliconiusHMEL0225268e-7569.05% 
BombyxBGIBMGA011995-TA8e-8956.67% 
DrosophilaCoq2-PA6e-6947.65% 
EBI UniRef50UniRef50_Q7Q9Z41e-6746.65%AGAP004513-PA n=20 Tax=cellular organisms RepID=Q7Q9Z4_ANOGA
NCBI RefSeqXP_002073390.12e-7048.32%GK14102 [Drosophila willistoni]
NCBI nr blastpgi|1954525223e-6948.32%GK14102 [Drosophila willistoni]
NCBI nr blastxgi|1954525221e-6948.01%GK14102 [Drosophila willistoni]
Group
Gene OntologyGO:00046592.6e-84prenyltransferase activity
GO:00090582.6e-84biosynthetic process
GO:00160212.6e-84integral to membrane
KEGG pathwaydwi:Dwil_GK141025e-70 
 K06125 (COQ2)maps-> Ubiquinone and other terpenoid-quinone biosynthesis
InterPro domain[20-293] IPR0063702.6e-844-hydroxybenzoate polyprenyl transferase
[35-276] IPR0005373e-17UbiA prenyltransferase family
Orthology groupMCL12165 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210639-TA
ATGTTGCCGGAGGAGAAAATAGAGTCCGGGTTGTCATTACTCTACCAAGAGAAAATTATGCCGTACATACGTTTAGCGAGATGGGATCGTCCTATAGGTGTATACTTATTATACTGGCCGTGTGCTTGGTCGATATCTTTGGCGTCCCTATCGGGTACGGTGCCCCCAACAACTACAGCACAGACGGCCTTGTTGTTCCTAGTCGGTGCTGGCTTGATGAGAGGTGCTGGTTGTACCATCAACGATTTATGGGACCGGGATGTTGATGCCCAGAGTATGCGACTACAAACCCGTATGTATGTTTGTTGCAAAAATCACTTGACTCTTACCAATACCAGAAATATAATATATCTTTATATTGACATCAGCTCGTATTTATCCACTAACTGCCTCCCAATTCTAGTCATAATATATCCTCTGGCAAAACGTTTCACAAACTACCCACAATTATTCCTCGGTGCAACTTTCAACTGGGGTGCACTACTCGGCTATTCAGCTATATGTGGTTCCATGGACCTGTCTGTGTGTTTACCGCTGTATATATCGGCCATGGCGTGGACAGTTCTTTACGATACTATATATGCCCATCAGGACAAACAAGACGATGCAAGACTAGGCATTAAATCCACTGCCCTAACGTTCGGTGATCACACCAAACCGGCGCTAACAGCATCCCTGGCTGTCAGTTTATGCGGTCTGACCCTGGCCGGGGTTAATGCTGGACTGAATGGGTGGTATTATACAGGGCTCGGAGTGTATATGCTTCATGCTGGGAGGCAGATCTATACCCTCAACCCTGACAATCCAACAGACTGTGCGGACAAATTCAAATCAAATTCTATGGTCGGTCTCATTATATTGCTCGGGATCCTCGCTGGTGGCTACCAACAGTATCTTGATAATAGAGAGAAGCATAAAGGTACAGAGACTAAAGACGCGACCAGATCATGTGTCTTTGGTTGA

Protein sequence:

>DPOGS210639-PA
MLPEEKIESGLSLLYQEKIMPYIRLARWDRPIGVYLLYWPCAWSISLASLSGTVPPTTTAQTALLFLVGAGLMRGAGCTINDLWDRDVDAQSMRLQTRMYVCCKNHLTLTNTRNIIYLYIDISSYLSTNCLPILVIIYPLAKRFTNYPQLFLGATFNWGALLGYSAICGSMDLSVCLPLYISAMAWTVLYDTIYAHQDKQDDARLGIKSTALTFGDHTKPALTASLAVSLCGLTLAGVNAGLNGWYYTGLGVYMLHAGRQIYTLNPDNPTDCADKFKSNSMVGLIILLGILAGGYQQYLDNREKHKGTETKDATRSCVFG-