Monarch geneset OGS2.0

DPOGS212879
Transcript         DPOGS212879-TA    1236 bp
Protein            DPOGS212879-PA    411 aa
Genomic position   DPSCF300086 + 564865-568228
RNAseq coverage    14x (Rank: top 82%)
Annotation
Heliconius        HMEL008173         0.0     79.74%
Bombyx            BGIBMGA000823-TA   0.0     78.12%
Drosophila        CG10585-PA         1e-82   43.54%
EBI UniRef50      UniRef50_Q9VP87    2e-80   43.54%   CG10585 n=28 Tax=Endopterygota RepID=Q9VP87_DROME
NCBI RefSeq       XP_970126.1        2e-84   45.80%   PREDICTED: similar to candidate tumor suppressor protein [Tribolium castaneum]
NCBI nr blastp    gi|91084147        4e-83   45.80%   PREDICTED: similar to candidate tumor suppressor protein [Tribolium castaneum]
NCBI nr blastx    gi|91084147        1e-79   45.63%   PREDICTED: similar to candidate tumor suppressor protein [Tribolium castaneum]
Group
KEGG pathway      tca:658669         5e-84
                  K12505 (PDSS2) maps-> Terpenoid backbone biosynthesis
InterPro domain   [28-412] IPR017446   6.6e-91   Polyprenyl synthetase-related
                  [40-407] IPR008949   8e-26     Terpenoid synthase
Orthology group   MCL26782           Lepidoptera specific

Nucleotide sequence:

>DPOGS212879-TA
ATGTTCAAACCATTGAAATATATAAAATCAAATTGTTTTACGCGAATCACAGCCAATGCATCCGTTCGTCACAGCTCTTGCAAACGCGAGGAGCTCGACTGGCGTGATGTCATCTCGGAGGCAGAGCAAATCGTTGGCTATCCTACGTCTTTCCTGAACCTGAGGTGGCTGTTCAATGATGAGATCGCTAACACAGCTATACAGTTGAGGAAACTGGTCGGCACAAATCATCCGCTCTTGAAATCGGCGAAGAACCTACTCATTGGCTCCAAAAGCAACCTTCAATCAGTGGGACTTATAATTCTTTTGGCATCGAAAGCTGCCGGTATCGATGTCAGGAAGTACACGAGAGATCACTACGACTCGGGAGTCCTCCACGCACAGAGGGCGCTGGCTGAGATCGTGGAGATGAAGAGAACTGGCCATTTGATCCACAAGACCATGGCCAATTTACAGGAGAAGGAGAAACACGGGAAAAAGTATAAGGATTTGTTATATGGAAATAAAATTGTATTGCTTACAGGAGACTATTTGCTAGCAACGTGCCTCCAACATTTAGGCGGTTTGCACAACAATGAAGTGACCGAGCTTATCTCGACTGGTCTCCGGGATTTGGTTGAGGGAGATTTTCTCGGCGACCACGACGATGACCACAATCCCCTGCCCAGTAGACCAAGGGCCAGCAACGAAGTAAAGAGTCACTATGTCTGGGAAGAGGAGGATAACCTTGCTAAACTTGGTTCCAACGAATTTCTCGGTCAGGGGAAAGACGAGTGGCTTTTACGTACAATGCTAACATCTGGAAGTTTACTTGGAAAGGGCTGTCAGGGTGCTATGAAACTTGCTGGTTGGGGCAAGGATATGGAAAGGCAGGCTTATATTTTGGGAGGACACTTGGCTATTATCTGGCAATTGTATCTGGACGTGAAAGACTTCTTCACACATCCATATTCTTATTCGTTAGTTGGGGCTCCCGTAATTATAGCACTATGGGAGTATCCGACAATCTATAGCTATATTATAGAATCAAAATTAGAGAAAAAGCCTATAGAATACAAGCAATTATATTACGCTGTGAGGGCGACTAGATCTTTGGAGTATTTGACGATATTTCTTAATGAGGAAATAGAAGCTATAATGAGGAATAGTGACCAATTCCCTGTCAAGGACGCCCGTGCTGCAATACAGAAGATGGCTTGGACAGTACATAACGAAACACTACAATACATGGAGTAA

Protein sequence:

>DPOGS212879-PA
MFKPLKYIKSNCFTRITANASVRHSSCKREELDWRDVISEAEQIVGYPTSFLNLRWLFNDEIANTAIQLRKLVGTNHPLLKSAKNLLIGSKSNLQSVGLIILLASKAAGIDVRKYTRDHYDSGVLHAQRALAEIVEMKRTGHLIHKTMANLQEKEKHGKKYKDLLYGNKIVLLTGDYLLATCLQHLGGLHNNEVTELISTGLRDLVEGDFLGDHDDDHNPLPSRPRASNEVKSHYVWEEEDNLAKLGSNEFLGQGKDEWLLRTMLTSGSLLGKGCQGAMKLAGWGKDMERQAYILGGHLAIIWQLYLDVKDFFTHPYSYSLVGAPVIIALWEYPTIYSYIIESKLEKKPIEYKQLYYAVRATRSLEYLTIFLNEEIEAIMRNSDQFPVKDARAAIQKMAWTVHNETLQYME-
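
Length consistency check:

The record lists a 1236 bp transcript and a 411 aa protein. These agree if the transcript is a complete coding sequence: 1236 / 3 = 412 codons, and the last codon (TAA) is a stop, shown as the trailing "-" in the protein sequence. The sketch below is one possible way to run that check on any transcript/protein pair pasted in FASTA form. The function names (read_fasta, expected_protein_length, check_pair) are hypothetical helpers written for this illustration and are not part of any MonarchBase tool. It assumes a complete CDS that starts with ATG and ends with one of the three standard stop codons.

```python
# Standard stop codons (assumption: standard nuclear genetic code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def read_fasta(text):
    """Parse FASTA text into a dict mapping record ID to sequence."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The record ID is the first token after '>'.
            name = line[1:].split()[0]
            records[name] = ""
        elif name is not None:
            records[name] += line.upper()
    return records


def expected_protein_length(cds):
    """Return the residue count implied by a complete CDS, stop codon excluded."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    if not cds.startswith("ATG"):
        raise ValueError("CDS does not start with ATG")
    if cds[-3:] not in STOP_CODONS:
        raise ValueError("CDS does not end with a stop codon")
    return len(cds) // 3 - 1


def check_pair(cds, protein):
    """True when the protein length matches the CDS.

    A trailing '-' or '*' in the protein is treated as the stop marker
    and is not counted as a residue.
    """
    residues = protein.rstrip("-*")
    return len(residues) == expected_protein_length(cds)
```

To use it, paste the two FASTA records above into one string, call read_fasta on it, and pass the "DPOGS212879-TA" and "DPOGS212879-PA" entries to check_pair.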