Monarch geneset OGS2.0

DPOGS206234
TranscriptDPOGS206234-TA1197 bp
ProteinDPOGS206234-PA398 aa
Genomic positionDPSCF300334 + 113282-116536
RNAseq coverage31x (Rank: top 75%)
Annotation
HeliconiusHMEL0112342e-17472.29% 
BombyxBGIBMGA009744-TA0.077.53% 
DrosophilaCG10585-PA1e-8039.11% 
EBI UniRef50UniRef50_Q9VP872e-7839.11%CG10585 n=28 Tax=Endopterygota RepID=Q9VP87_DROME
NCBI RefSeqXP_001866491.13e-8341.04%candidate tumor suppressor protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700620296e-8241.04%candidate tumor suppressor protein [Culex quinquefasciatus]
NCBI nr blastxgi|910841474e-7942.35%PREDICTED: similar to candidate tumor suppressor protein [Tribolium castaneum]
Group
Gene OntologyGO:00082991.6e-08isoprenoid biosynthetic process
KEGG pathwaytca:6586699e-83 
 K12505 (PDSS2)maps-> Terpenoid backbone biosynthesis
InterPro domain[30-396] IPR0174462.2e-99Polyprenyl synthetase-related
[49-398] IPR0089492.3e-33Terpenoid synthase
[125-311] IPR0000921.6e-08Polyprenyl synthetase
Orthology groupMCL25644 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206234-TA
ATGGCTCTACTGACTCGTGTGTTGAGGCGAGCGACGCCTTACCTCCACCACACCAGGCCGGAGTCCACAGTGGCCAGTTTCACCAAGGATGAGATCCTGATACTGCTGAGACCACCTTTCACGAACTGGAGCAACATCATCCGTGAGGCGGAGAAAGTTGTCGGCTATCCCACTTCCTTCATCAACCTGAGATGTCTCCTCAGCGACGAGTTCTCCAACCTGGCTCTGTACTTAAGGAAGCTGGTTGGTAGCAACCATCTCGTGATGCAAACAGCAAAGAACGTCCTCTACGGAGACACGAAGAACCTTCAACCGTGGGGTCTGGTCATCTCGCTGCTGTCGAAGTCGGTGAAGTCGTCCACGACTCCCGAGCACGTCTACAATCAGCAGCGCCAACTGGCGGAGCTGACAGAGATGATGAGAACAGGCCATCTCATCCACAGAGGCATCGTCAACGTTCCTTTTGCGAAGCGATCCAAAAGCACTGAATCGGCAATATTTGGTAACAAAATCGCGATACTCCTGGGCGACTTCCTCCTCGTGACTGCGAACTCGATGTTGGCGAACCTGAAGGACCCGGATGTCCTGTACATCGTGTCCACGGCGTTACGAGATCTATCAGAGAGCGAATTTTTCGGGGAACGAGACGAGCAGAACATGCCGCTGCCTGGTAAACCGAAGAAAACGATCGAGGAGTTGGATATATCATTCGATACGAGCACCATCGAGGCTAGCGATGTTCTGGGGAAGCCGAGGAAGGAGTGGACCACTAGAACGGTCTACAACGGCGTCAGTCTCCTGGGCAGGGGTTGCCAATCGGCCATGCTGATCGGCAAACAGAACAGAGACATTCAGAACTATGCTTATCACTTCGGCTGCCACGTGGGTTTAGCTTGGCAGGCTGCCACTGAACTCCAGAGACTGACATCCGAGAAGGGTCAGTTTTGCTTGGCCAGCGCTCCCGTACTCTTCGCTTTGGAGGGCAATCCGGATTTATATAAAATTATCGATCAAGCAAAAAACGATGTAAACGACGTCGACTACGAGGATCTCAAGTTTAACATATTGAAAACTGACGCGATCGACAAAACGAGAATGCTTTACAGAGATCACGCGAGCAAAGCCATGGCCTTCATAGATAACATCGGACGAAACGAATCCACTGAGATGATCAGGAAGCTGGTTTACACGTTTTAA

Protein sequence:

>DPOGS206234-PA
MALLTRVLRRATPYLHHTRPESTVASFTKDEILILLRPPFTNWSNIIREAEKVVGYPTSFINLRCLLSDEFSNLALYLRKLVGSNHLVMQTAKNVLYGDTKNLQPWGLVISLLSKSVKSSTTPEHVYNQQRQLAELTEMMRTGHLIHRGIVNVPFAKRSKSTESAIFGNKIAILLGDFLLVTANSMLANLKDPDVLYIVSTALRDLSESEFFGERDEQNMPLPGKPKKTIEELDISFDTSTIEASDVLGKPRKEWTTRTVYNGVSLLGRGCQSAMLIGKQNRDIQNYAYHFGCHVGLAWQAATELQRLTSEKGQFCLASAPVLFALEGNPDLYKIIDQAKNDVNDVDYEDLKFNILKTDAIDKTRMLYRDHASKAMAFIDNIGRNESTEMIRKLVYTF-