Monarch geneset OGS2.0

DPOGS201496
TranscriptDPOGS201496-TA1329 bp
ProteinDPOGS201496-PA442 aa
Genomic positionDPSCF300006 + 476810-480450
RNAseq coverage185x (Rank: top 49%)
Annotation
HeliconiusHMEL0159675e-16360.78% 
BombyxBGIBMGA002683-TA2e-11864.01% 
Drosophila% 
EBI UniRef50UniRef50_UPI0002246A386e-9144.30%UPI0002246A38 related cluster n=1 Tax=unknown RepID=UPI0002246A38
NCBI RefSeqXP_966733.22e-8644.09%PREDICTED: similar to T21C9.6 [Tribolium castaneum]
NCBI nr blastpgi|3454832972e-9044.30%PREDICTED: putative phosphoenolpyruvate synthase-like [Nasonia vitripennis]
NCBI nr blastxgi|3454832975e-8843.66%PREDICTED: putative phosphoenolpyruvate synthase-like [Nasonia vitripennis]
Group
Gene OntologyGO:00163106.9e-23phosphorylation
GO:00167726.9e-23transferase activity, transferring phosphorus-containing groups
KEGG pathwaycel:T21C9.67e-59 
 K01007 (E2.7.9.2, ppsA)maps-> Reductive carboxylate cycle (CO2 fixation)
    Pyruvate metabolism
InterPro domain[330-437] IPR0082796.9e-23PEP-utilising enzyme, mobile domain
Orthology groupMCL17280 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201496-TA
ATGGCTGACGATGAACTGACTACTTTTGCTAATACAGGAGAGCCGATAATTACCTCTAAATGGGTTATGAAAGAGACGACACGGTCAGTGAGTTCGTTGAATCTGAACGTAGAAACGAAGGATCCTCTTAAGTTTTTAGACGTCATAGCTGGAACTAAGGAGGATATGTTGAAGTTCAATATCGGGCACTGTACTACTACTTTAGCGTCTACTTTCAGCCAGTTTATAGCTATGACTGTGATGTTGGAAGGAAGATCCGATTTTACAAGCGAAGAATGCAACGAAGTTAGTACACTATTAAGTTCAGGCGATGCTGTGTCCGCGGAAGTACCGCATGCACTTGATAGATTAGCATTAAGTATTGAAGAATCTGGAATGATTGAAGAATTCCGTAAACAGAAACCTGAAGAGTCTTTAATGTGGTTGAAACAAAATCTGCCCAAAATTTACGATGATGTCTTATTATTTTTGGATCAACATGGTCATAGGGCTATTATGGAATTTGACTTAGCAACGAAACCATGGTCATTAGTGCCGGAGGACTTGATGAGAGTTCTAATGAATATGCGTCCGACTTCAAAAACTCAAACGTATAAAAGCACTGAAGAATTGATCGCTTCTTTAAAAACTCCAAAAAAAGCAAATACGAAGAAAGTTCTCCGATGGTTGTTGCCCCTTTGTCGTCGCACTGTCAGCCACCGCGAGGGTACAAAGGCACAAGTCATACTCGCTGTACACAAACTGAGATTAGCTGTGAGACGGCTCGCCACCATGATGTACCATTCTTGGATAATACCCGATGTAGAACTGGTGTTTTACTTCAGATTACATGAACTTAAGGAGTACATAATAACACGGGACCCTGGATTGTTGAGAAAAGCTGTTCAACGACAACAATATTATACAAAATGGTGTCAACTAAAGTTTTCGGAAATGAATAAGGGCTGGCCGGAACCCTTGAGAGTAGATGGTCCTCGTGTAACAAGTGGAGATGTTAAAATTTTTGCGACTTCGGTATGTCAAGGAGAGACTGTGGCAAGAGCGTGTGTCGTGAAAGATCTTTCAGAGATTTATCAACTACGACAAGGGGACATATTGATAACTCATTGCACAGATATAGGCTGGTCACCCTACTTCCCACTTTTGTCTGGGATTGTGACTGAACTTGGAGGACTTATTTCACATGGAGCGGTAATCGCCCGCGAATATGGACTACCGTGTGTCGTTGGCGCCACACATGCGACTGACATTTTCAATACCGGGGATATTGTCAGGTTGTCCGGAGACCAGGGATTTTTGGAAAGGGTTAAAGTTGATGCTTCATCTTGA

Protein sequence:

>DPOGS201496-PA
MADDELTTFANTGEPIITSKWVMKETTRSVSSLNLNVETKDPLKFLDVIAGTKEDMLKFNIGHCTTTLASTFSQFIAMTVMLEGRSDFTSEECNEVSTLLSSGDAVSAEVPHALDRLALSIEESGMIEEFRKQKPEESLMWLKQNLPKIYDDVLLFLDQHGHRAIMEFDLATKPWSLVPEDLMRVLMNMRPTSKTQTYKSTEELIASLKTPKKANTKKVLRWLLPLCRRTVSHREGTKAQVILAVHKLRLAVRRLATMMYHSWIIPDVELVFYFRLHELKEYIITRDPGLLRKAVQRQQYYTKWCQLKFSEMNKGWPEPLRVDGPRVTSGDVKIFATSVCQGETVARACVVKDLSEIYQLRQGDILITHCTDIGWSPYFPLLSGIVTELGGLISHGAVIAREYGLPCVVGATHATDIFNTGDIVRLSGDQGFLERVKVDASS-