Monarch geneset OGS2.0

DPOGS201495
TranscriptDPOGS201495-TA2226 bp
ProteinDPOGS201495-PA741 aa
Genomic positionDPSCF300006 + 470506-476755
RNAseq coverage103x (Rank: top 60%)
Annotation
HeliconiusHMEL0159670.059.22% 
BombyxBGIBMGA002682-TA0.047.96% 
Drosophila% 
EBI UniRef50UniRef50_E2BY466e-9433.82%Probable phosphoenolpyruvate synthase n=1 Tax=Harpegnathos saltator RepID=E2BY46_HARSA
NCBI RefSeqXP_002738291.12e-7030.33%PREDICTED: hypothetical protein, partial [Saccoglossus kowalevskii]
NCBI nr blastpgi|3071983832e-9333.82%Probable phosphoenolpyruvate synthase [Harpegnathos saltator]
NCBI nr blastxgi|3071983837e-9133.67%Probable phosphoenolpyruvate synthase [Harpegnathos saltator]
Group
Gene OntologyGO:00163109.6e-63phosphorylation
GO:00055249.6e-63ATP binding
GO:00163019.6e-63kinase activity
GO:00038242.8e-36catalytic activity
GO:00168741.5e-32ligase activity
KEGG pathwaybfo:BRAFLDRAFT_1210215e-56 
 K01007 (E2.7.9.2, ppsA)maps-> Reductive carboxylate cycle (CO2 fixation)
    Pyruvate metabolism
InterPro domain[432-740] IPR0021929.6e-63Pyruvate phosphate dikinase, PEP/pyruvate-binding
[427-610] IPR0138152.8e-36ATP-grasp fold, subdomain 1
[611-740] IPR0138161.5e-32ATP-grasp fold, subdomain 2
Orthology groupMCL17280 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201495-TA
ATGGACGTAGTACTCCAAGTCGGTATATTTGTTGTTGCTGTCATTGTTTATTTGATTTTCGTAAAAAAATATGTGAAAAATAAAGGACTTTATAGCCAAGAAGGATGGAATTATCCCTTTAAAGTATTATTGGCCTGGTTTGCAATTAAGAGATGGAAATCAAGTTTAAACTCTCTCCCTCTGAGTGAATTGCCCCATGTGGGTCAGAATCATGGTTGTGACGGTATTTCATTGAAAGCATCTGCCCCTGACGGTTCCACTGTATTTGTTGGAATCCGTAAAATATGCAGCAAGGAAAAGGTTGCTGAAGTTTCTGTAATTGTGAAATTGTCTGATGGAACAACTTTTAAACTACCTCAACATTCCGAAACGGCAGTCAGTGAGTGGGTGAATACTGCTGGCTGTTGGAATGCTGGAGGCCTAAAGATTCAAGAATTAGAACCTCAAAAAAGTTTAAGAATTCTTTTTAATGGTGTTCTTATTCGAGCCGAAGACAATAAGTCGTATCATGTTAAGATGAATTTATTATGGGCGTCCGCGACAAGTGTGCAACGCTATCCTGAGGACTGGAATGATTGCCTCGCTGCGAAGTGCTTGGCCTTGGAGCCTTTGATGAACGAGCAATGGCCGAACATGCTTAACAAATGTGGACAAGGTTCTGGTAGCTGGTTGCAGTTTGGAGCGATACAAGGTCGTTTCCAATCCTTCGATGTAAACGGTGAAATTCACACCAACGAGTACCTTCGCGTTCGTGGAGCTAGAGAGCGGATGTGGTCTTTGGGGAGTAAAGATCTAAGACGCATTGTGACTATAAATGTATGTACAAGGGATGGTACTGCGGTCCAAATTCGTGGGTTATCTTATCACAAGAATTTTACACAGTGTTTATACGGAAGCGTGAGATTACCAAATTTTCTTATGACAAGTGTAAAATCTTGCGACTTAGTATTATCTGAATTTTGTGAAACAGCTGATGAAATACCTAAAACTTTTACCATTCATGTTACCACATCCAATGGAAGGACATTAAAATTAATTCTGCGAATCAAAGATGGCGGCACACTTTATACAGGAGTACCTCATGAACAAAGTATCATATATCGCACTCTGGAAGTAGACATCAATGGGGAACATGGCACTGGCATTCTTGAATTAGGATACGAATGTTTGGATGGTATAATACCGTCCAACATAAAACCTTCCCCTGCATTGCGATGGTTAAGCGAAGATGAGGCGGGACCGGTCGGAAACTGTGTGTCTTTGGAATGTAGCAGCGCAGCTTGCATTCATTTCACTGGAGGAAAAGCTGCTTCACTGGCATTGTTAAGTTCTGTACAAAAAGAACAGGGATACAAAGTACCTCCAGGTTTTTGTATCACCACAAAAGCGTTATATAAACATTTGGAAGTTAATACATCAATTAAGGACGCTATTTTTGAAATCGAAGCATGTAATAAGGATTATGATGAGAACATTTTCAAAGAGAAATGCTCAAAAGCTATAGAACTGTTTCTTACAACTGAAGTAGTGGAAGATGTAAGGAAAGATATTTTGTCGTTTGTCCGCGACTTAAGAAGCAAATACGAAAGTGAAGAACTTTTCACACCGCTGCGATTTGCTGTGAGATCGTCGGGCGTAGGTGAAGATAGTGAGGCGTTATCAGCTGCCGGTCAGAATGAGACGGTCCTGGGTTGTGTTACTGATGATGATGTTATGCGGGGAGTGAAAAAATGTTGGGCATCAATGTTCGCTTACACTAGCGTATATTACAGAAGACAGAACGGCCAGCAGTGTTTTTCTCTGGGTGGAGTGGTGGTTCAGGCCCTGGTGAAGTCCCGTGCCGCCGGGGTCTTATTTACATCACATCCACCATATGGTGACGTGACACGGATATTACTCACAGCTAATTACGGACTAGGAGAGAGCGTTGTATCGGGATCTGTAGAACCCGACTCCATAGTTATAAGACGTGATCTAGATGACACACTGTCTATTCAAAAAATTGATCTCGGTTCCAAAATACAGAGAGTTGTAACCAATAGCAATGGAGTTTCGTTCGAAAATGTACCGGAATCAGATCGAAAGAAATCTTGTTTATCTGAAGACGAAATATTGAAACTGGCTAAAATAGCTGTGGCTCAAGAACGACTGTGGGGTGCTGGAAGAGATATTGAATGGGCCATTTTCGGGGATGACATATTCCTGCTTCAAGCTAGGCCCTATTGA

Protein sequence:

>DPOGS201495-PA
MDVVLQVGIFVVAVIVYLIFVKKYVKNKGLYSQEGWNYPFKVLLAWFAIKRWKSSLNSLPLSELPHVGQNHGCDGISLKASAPDGSTVFVGIRKICSKEKVAEVSVIVKLSDGTTFKLPQHSETAVSEWVNTAGCWNAGGLKIQELEPQKSLRILFNGVLIRAEDNKSYHVKMNLLWASATSVQRYPEDWNDCLAAKCLALEPLMNEQWPNMLNKCGQGSGSWLQFGAIQGRFQSFDVNGEIHTNEYLRVRGARERMWSLGSKDLRRIVTINVCTRDGTAVQIRGLSYHKNFTQCLYGSVRLPNFLMTSVKSCDLVLSEFCETADEIPKTFTIHVTTSNGRTLKLILRIKDGGTLYTGVPHEQSIIYRTLEVDINGEHGTGILELGYECLDGIIPSNIKPSPALRWLSEDEAGPVGNCVSLECSSAACIHFTGGKAASLALLSSVQKEQGYKVPPGFCITTKALYKHLEVNTSIKDAIFEIEACNKDYDENIFKEKCSKAIELFLTTEVVEDVRKDILSFVRDLRSKYESEELFTPLRFAVRSSGVGEDSEALSAAGQNETVLGCVTDDDVMRGVKKCWASMFAYTSVYYRRQNGQQCFSLGGVVVQALVKSRAAGVLFTSHPPYGDVTRILLTANYGLGESVVSGSVEPDSIVIRRDLDDTLSIQKIDLGSKIQRVVTNSNGVSFENVPESDRKKSCLSEDEILKLAKIAVAQERLWGAGRDIEWAIFGDDIFLLQARPY-