Monarch geneset OGS2.0

DPOGS207504
Transcript: DPOGS207504-TA (1278 bp)
Protein: DPOGS207504-PA (425 aa)
Genomic position: DPSCF300177 - 409749-415670
RNAseq coverage: 1742x (Rank: top 7%)
Annotation
Source          Hit                 E-value   Identity   Description
Heliconius      HMEL022462          8e-146    70.17%
Bombyx          BGIBMGA001926-TA    0.0       80.80%
Drosophila      Fpps-PA             2e-112    47.90%
EBI UniRef50    UniRef50_Q1XAA9     0.0       80.33%     Farnesyl diphosphate synthase n=2 Tax=Noctuidae RepID=Q1XAA9_PSEUI
NCBI RefSeq     NP_001036889.1      0.0       80.80%     farnesyl pyrophosphate syntase [Bombyx mori]
NCBI nr blastp  gi|5678609          0.0       80.80%     dimethylallyltransferase [Agrotis ipsilon]
NCBI nr blastx  gi|63103161         0.0       82.24%     putative farnesyl diphosphate synthase [Choristoneura fumiferana]
Group
Gene Ontology    GO:0008299             1.3e-60   isoprenoid biosynthetic process
KEGG pathway     tca:661272             3e-132
                 K00787 (FDPS)          maps-> Terpenoid backbone biosynthesis
InterPro domain  [85-421] IPR008949     6.4e-91   Terpenoid synthase
                 [116-386] IPR000092    1.3e-60   Polyprenyl synthetase
Orthology group  MCL10265               Multiple-copy universal gene

Nucleotide sequence:

>DPOGS207504-TA
ATGTTTTCTACAAGGAAGAGTGTAGACAAGTTTTTCCAAGCTTACAAGAATGAACTTCGTCGTCAAATAAGCAAAACAACTAGTGTTACTAACTCAGATTCTATGGCACCAAGATTGGATCAGACTAATAAGGCTACCAACGAAGAAGCAGGACAGAAACGTTTGCTGAAGTTGCAGAAATATCACAGATACCTGTCCACCCTCACGTCTCAGCAAATGCCACTGGCCGCCCGTGGACTGGCGGTGTCCAAAGACCAGTCCCGGGAATTCATGGCTTGCTTTCCGGACATCGTGAGGGAGCTCACTGAAACTGGCAAGCATATTGACGTGCCGGAAGCCAGCAAGTGGTTGGCTAAGCTTCTTCAATACAATGTACCAAACGGCAAGAAGAACCGCGGGCTGGCGACTGTCTTAGCCTACAAGATGCTGGAGAAACCTGATAGATTGACCCCAGAAAATATCCACCTAGCTAACATCATGGGATGGTGTACGGAAATGTTCCACACACACCAACTCCTGCTGAACGACATAATGTCTGGTACTGAGATGCGCCGCGGCGTCCCGTGTTGGTATCGTCAAACAAACGTCGGTATGGCCGCCATCAACGACTCGTCATTAGTACAGTCGGCGATGTACTCGACGCTGAAAAGAAACTTCATCAACAAACCATATTATAAGAACGTACTGGAAATGTTCAACGAGATGCTGTTGAAATGTTCAATCGGTAATTACTTGGAGAAGCAAATAATGAAGACGGATAAGCCGGACCTAAGCTTATTCACAATGGAAAAATATGAAGCTATCACTAAATACAAAACGTCATACTACACCTTCCAAATGCCGGTCGGCTTAGCGCTTCTGATGGCGGGGGTCGACGATCCGGAGACACACAGACAAGCAAAAACTATTCTACTAGAAATGGGAGAATTCTTCCAAATTCAAGACGACTTCCTAGATTGTTTCGGCGAGCCGTCAGTTATAGGTAAAAATGGTACAGATATTCAAGACGGCAAATGCACCTGGTTAGCTGTGGTGGCGTTACAGAGGGCGACACCGGCGCAGAAGCAGTTAATGGAAGATCACTACGGCAGTTCGAATATAGAAGACGTTCAGAAGATAAAGGATCTATACGAGGAACTACAACTGCCTCACACTTACTCAGTATACGAAGAAGCCACATACGACCTCCTCAGAACACAGATACAGCAAGTCACTAGAGGGCTACCTCACGATTTGTTTTTCAAAATTCTCGATAATATCTTTAGGCAGAGCATTTGA

Protein sequence:

>DPOGS207504-PA
MFSTRKSVDKFFQAYKNELRRQISKTTSVTNSDSMAPRLDQTNKATNEEAGQKRLLKLQKYHRYLSTLTSQQMPLAARGLAVSKDQSREFMACFPDIVRELTETGKHIDVPEASKWLAKLLQYNVPNGKKNRGLATVLAYKMLEKPDRLTPENIHLANIMGWCTEMFHTHQLLLNDIMSGTEMRRGVPCWYRQTNVGMAAINDSSLVQSAMYSTLKRNFINKPYYKNVLEMFNEMLLKCSIGNYLEKQIMKTDKPDLSLFTMEKYEAITKYKTSYYTFQMPVGLALLMAGVDDPETHRQAKTILLEMGEFFQIQDDFLDCFGEPSVIGKNGTDIQDGKCTWLAVVALQRATPAQKQLMEDHYGSSNIEDVQKIKDLYEELQLPHTYSVYEEATYDLLRTQIQQVTRGLPHDLFFKILDNIFRQSI-
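
The lengths reported in this record (1278 bp transcript, 425 aa protein) can be cross-checked with simple arithmetic. The sketch below assumes the transcript is a complete coding sequence that includes the stop codon, and that the trailing "-" in the protein sequence marks that stop rather than a residue. The function names are hypothetical and not part of any MonarchBase tooling.

```python
def expected_protein_length(cds_bp: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by a coding sequence.

    Assumes cds_bp is the length of a complete, in-frame CDS.
    """
    if cds_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_bp // 3
    # The stop codon is counted in the CDS but encodes no residue.
    return codons - 1 if includes_stop else codons


def strip_stop_marker(protein: str) -> str:
    """Remove a trailing stop marker ('-' or '*') from a protein string."""
    return protein.rstrip("-*")


# Values taken from the record above.
TRANSCRIPT_BP = 1278
PROTEIN_AA = 425

# 1278 / 3 = 426 codons; minus one stop codon gives 425 residues.
assert expected_protein_length(TRANSCRIPT_BP) == PROTEIN_AA
```

Applying `strip_stop_marker` to the full DPOGS207504-PA sequence before taking its length should then agree with the 425 aa figure, provided the assumptions above hold.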