Monarch geneset OGS2.0

DPOGS212810
Transcript: DPOGS212810-TA, 1131 bp
Protein: DPOGS212810-PA, 376 aa
Genomic position: DPSCF300086 - 569645-571234
RNAseq coverage: 166x (Rank: top 51%)
Annotation
Heliconius: HMEL008172, 5e-170, 74.73%
Bombyx: BGIBMGA000752-TA, 1e-154, 67.29%
Drosophila: CG10585-PA, 5e-78, 40.31%
EBI UniRef50: UniRef50_Q9VP87, 7e-76, 40.31%, CG10585 n=28 Tax=Endopterygota RepID=Q9VP87_DROME
NCBI RefSeq: XP_970126.1, 1e-86, 45.11%, PREDICTED: similar to candidate tumor suppressor protein [Tribolium castaneum]
NCBI nr blastp: gi|91084147, 2e-85, 45.11%, PREDICTED: similar to candidate tumor suppressor protein [Tribolium castaneum]
NCBI nr blastx: gi|91084147, 2e-82, 44.41%, PREDICTED: similar to candidate tumor suppressor protein [Tribolium castaneum]
Group
Gene Ontology: GO:0008299, 5.2e-05, isoprenoid biosynthetic process
KEGG pathway: tca:658669, 3e-86
 K12505 (PDSS2) maps-> Terpenoid backbone biosynthesis
InterPro domain: [17-374] IPR017446, 7e-95, Polyprenyl synthetase-related
 [24-376] IPR008949, 5.8e-28, Terpenoid synthase
Orthology group: MCL22692, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212810-TA
ATGACCAACCTAACCCAGGAGGACTTTTCGTTTCATCCTCCTATGCCTCAATGGAGCAAAATAATTAGGGAAGCAGAGAAAATAATCGGATATCCGACCTCCTTCATGAATTTGAGGTGGTTATTAAGTGATGAGTTCGCCAATCTTGCTGTGCAATTGCGAAAAATTGTAGGTAGCAACCATCCCATTTTAAAAACGGCTAAAACTCTCCTTAATAATCAGCACAACAACCTTCAGCCATGGGGTCTCATAATATTGCTGTTGTCCAAAGCAACAAATGAATCCACCCATTCAGTACAAAATACAACACTTGTGGAGCACCAACGAATGTTAGCCGAGTTAACAGAAATGATACGCACTGGCCACTACATACACAGAGGTCTCCTCAACATACCCGTTGAAGAGCGAGACAAAACCACAGACACATCAATGTTTGCAAACAAAATTGCTATATTGATTGGTGATTATTTACTTGTTACAGCCAACGGAATGTTGGCTCGTTTGAAAAATCAAGATTTGTCATATTTAATATCTACAGCATTAAGAGATCTCAGTGAAGGTGAATTCTTTGGTCCTCGGGACTGTCAAAACCTGCCTCTACCTGGAAAACCTTCAGATGGGAATAGTATAAATTTCAAAATATCGAATGATACAGCACCGTTAAATAACTCAAATGTGTTAGGGTCACCCAAGGAAGAATGGATGATACGGACTTTGTATAATGGTGGGACTTTATTTGGACGGGGTTGTGAGGGTGCTGTATTGCTTGCTGGGAAGAGTGCGGCAGAGCAAAAGCAGGCCTACCTCTTTGGCTGCCACTTATGCTTAGCATGGCAAGCCGCAAGTGAATTACAAAAGTTTAGTTCAACAAGTAAAGAGCCAGTTTCTCTTGTGAGTGCCCCATTTTTATTTGCCATTAACGAAAATCCTGAATTATATGACATGGTTAAAGATGTACATAAAATACCAAGCTTAGATTTTGAGGATATCAAAGCCAAGGTCGCAGAAACTAATGCTATTGAGAGAACTAAATTGTTATATTCAGATAATGCATCTAAGGCAATCCGTTATATTAATATGTTTGGTAACAACGAGTCAGTAGATGCTATTAAAAGATTCATTGAGACTTGA

Protein sequence:

>DPOGS212810-PA
MTNLTQEDFSFHPPMPQWSKIIREAEKIIGYPTSFMNLRWLLSDEFANLAVQLRKIVGSNHPILKTAKTLLNNQHNNLQPWGLIILLLSKATNESTHSVQNTTLVEHQRMLAELTEMIRTGHYIHRGLLNIPVEERDKTTDTSMFANKIAILIGDYLLVTANGMLARLKNQDLSYLISTALRDLSEGEFFGPRDCQNLPLPGKPSDGNSINFKISNDTAPLNNSNVLGSPKEEWMIRTLYNGGTLFGRGCEGAVLLAGKSAAEQKQAYLFGCHLCLAWQAASELQKFSSTSKEPVSLVSAPFLFAINENPELYDMVKDVHKIPSLDFEDIKAKVAETNAIERTKLLYSDNASKAIRYINMFGNNESVDAIKRFIET-
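
Consistency check (illustrative sketch, not part of the original record): the reported lengths agree with each other, since 1131 bp is 377 codons, that is 376 amino acids plus one stop codon. The Python snippet below translates a coding sequence so that the transcript and protein entries can be compared. It assumes the standard genetic code (NCBI translation table 1); the record itself does not state which table was used. The names `CODON_TABLE`, `translate`, and `expected_protein_length` are hypothetical helpers introduced here. The function writes the stop codon as "*", whereas the protein entry above writes it as "-".

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (assumed: NCBI translation table 1), listed in
# TCAG order for the first, second, and third codon positions.
# "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, ignoring a trailing partial codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def expected_protein_length(cds_bp: int) -> int:
    """Number of amino acids for a CDS of cds_bp bases that ends in one stop codon."""
    return cds_bp // 3 - 1


# First seven codons and last three codons of DPOGS212810-TA, copied from above.
print(translate("ATGACCAACCTAACCCAGGAG"))
print(translate("GAGACTTGA"))
print(expected_protein_length(1131))
```

Passing the full DPOGS212810-TA sequence to `translate` and replacing the final "*" with "-" gives a string that can be compared directly with the DPOGS212810-PA entry.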