Monarch geneset OGS2.0

DPOGS207505
TranscriptDPOGS207505-TA1035 bp
ProteinDPOGS207505-PA344 aa
Genomic positionDPSCF300177 - 402589-405794
RNAseq coverage239x (Rank: top 43%)
Annotation
HeliconiusHMEL0179616e-14264.14% 
BombyxBGIBMGA001927-TA3e-11755.26% 
DrosophilaFpps-PA1e-7943.07% 
EBI UniRef50UniRef50_Q1XAB11e-12157.43%Farnesyl diphosphate synthase-like protein n=2 Tax=Choristoneura fumiferana RepID=Q1XAB1_CHOFU
NCBI RefSeqNP_001093302.17e-11454.68%farnesyl diphosphate synthase 3 [Bombyx mori]
NCBI nr blastpgi|630220774e-12157.43%farnesyl diphosphate synthase-like protein [Choristoneura fumiferana]
NCBI nr blastxgi|630220772e-11757.43%farnesyl diphosphate synthase-like protein [Choristoneura fumiferana]
Group
Gene OntologyGO:00082992.8e-57isoprenoid biosynthetic process
KEGG pathwaytca:6612721e-81 
 K00787 (FDPS)maps-> Terpenoid backbone biosynthesis
InterPro domain[3-343] IPR0089493.5e-91Terpenoid synthase
[34-305] IPR0000922.8e-57Polyprenyl synthetase
Orthology groupMCL10265 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207505-TA
ATGGAGAAAGAGAGGGAATTGTTTTTAGACGTCTTCCCAAGTATTATGGACACACTGGTAACAAAGACCAAGTTTTCTCATGTACCACATGTCGCTAATTGGACCAAGAAGGTACTTGAATACAATTTGCAAGGTGGTAAGAAGAATCGTGGACTTTCGGCTGCTCTTGCTTACGAGATGCTAGAAGATCCGGAGAAAATAACAGAGGAAAAGCTCAAAATCAGCAGAGTGTTGGGCTGGTGTGTTGAAATGCTGCAATCGTATCTGGTGGTTGTGGATGACATGATGGACGGCGCCACCAAACGCCGCGGCATTCCATGCTGGTATTGTTTACCTAACGTGGGACTTGGGGCTATTAACGACTCCATTTTGATATATTCAGCATTAAAAGAGATATTGGCTGCACATTGTAAACATTTGCCTCAATACGAGCTCATAGTAAATGAGTACGATGAGGCGTTGTTATACACAACTATGGGTCAACACTTGGACTTTGAAATGGCTCACAGGCACAAAAAGGACTACAGTCTGTTCACAACAGAGAAATATAACTCTATAGCGAAGTTTAAGACCGCGTACTATACGTACAAGCTTCCTGTTTGTTTGGGATTATTATTGGCAAATAAGACAGATCCCGAAACTCACAAGAGGGCTGAGAGTATATGTATTGATATAGGACTTTTGTTCCAAATGCAGGATGACTTCATAGACTGCTTTGGAGTTGAGACATTGACTGGTAAGATTGGTAACGACATCCAAGAAGGTAAGTGTTCTTGGCTGGCGGTCCAAGCACTACAACACTGCAGCCAAAAACAGAGGACTGTGTTCGTGTCGTGCTACGGCAGCCCAGAACCGGCCCATATAGAGAGAATTAAGAGATTATATGAAGAATTAGAACTGCCAGCATTATATCACAGGACTGAGAAACATTTGTATGACACGATCATCAAGAATCTAGAACAGTTACCGCCCGACAGCATGTCACCAAAGTTATTTGTTAAGCTGCTTGACCTTATTTATGATAGGAAGAACTGA

Protein sequence:

>DPOGS207505-PA
MEKERELFLDVFPSIMDTLVTKTKFSHVPHVANWTKKVLEYNLQGGKKNRGLSAALAYEMLEDPEKITEEKLKISRVLGWCVEMLQSYLVVVDDMMDGATKRRGIPCWYCLPNVGLGAINDSILIYSALKEILAAHCKHLPQYELIVNEYDEALLYTTMGQHLDFEMAHRHKKDYSLFTTEKYNSIAKFKTAYYTYKLPVCLGLLLANKTDPETHKRAESICIDIGLLFQMQDDFIDCFGVETLTGKIGNDIQEGKCSWLAVQALQHCSQKQRTVFVSCYGSPEPAHIERIKRLYEELELPALYHRTEKHLYDTIIKNLEQLPPDSMSPKLFVKLLDLIYDRKN-