Monarch geneset OGS2.0

DPOGS209713
Transcript          DPOGS209713-TA    1362 bp
Protein             DPOGS209713-PA    453 aa
Genomic position    DPSCF300105 - 368509-369949
RNAseq coverage     363x (Rank: top 33%)
Annotation
Heliconius          HMEL011346          0.0       86.70%
Bombyx              BGIBMGA008953-TA    0.0       80.67%
Drosophila          Gyk-PA              6e-08     22.46%
EBI UniRef50        UniRef50_D6X2T7     2e-161    57.93%    Putative uncharacterized protein n=4 Tax=Endopterygota RepID=D6X2T7_TRICA
NCBI RefSeq         XP_971652.2         4e-162    57.93%    PREDICTED: similar to carbohydrate kinase-like [Tribolium castaneum]
NCBI nr blastp      gi|270013843        6e-161    57.93%    hypothetical protein TcasGA2_TC012506 [Tribolium castaneum]
NCBI nr blastx      gi|189241031        5e-157    57.93%    PREDICTED: similar to carbohydrate kinase-like [Tribolium castaneum]
Group
Gene Ontology       GO:0016773       4.9e-159    phosphotransferase activity, alcohol group as acceptor
                    GO:0005975       4.9e-159    carbohydrate metabolic process
KEGG pathway        xtr:100145087    4e-113
                    K11214 (SHPK)    maps-> Carbon fixation in photosynthetic organisms
InterPro domain     [1-448]    IPR000577    4.9e-159    Carbohydrate kinase, FGGY
                    [1-250]    IPR018484    8e-27       Carbohydrate kinase, FGGY, N-terminal
Orthology group     MCL18344    Patchy

Nucleotide sequence:

>DPOGS209713-TA
ATGGATATTGGTACGACTTCGGTAAAAGTTTGCGTTTACGATCCACATACGAAGGAACTCGTAGCTAAACAGAGTAAGGATACGGCTGCGAACATTCCCAGTGATCTAGGGATCGAAGGGAACAAGCAGGACGTTCCGAAAATAGTGTCCGCTGTTCATTATTGTGTCTCACGTTTACCACGAGATGTTCTGAGACATGTTAAAAAAATTGGTGTTTGTGGACAAATGCATGGAGTTGTGCTGTGGAAAAATGGAGCTTGGGAAAAGGTAGAAAAGGAGGGTGCCTTTATAAGATATGAAGGTAACAGAGAAAATATGTCAGCGTTATATACATGGCAAGATACCAGATGTAAGCCAGAGTTTTTGGAAACATTACCAAAGCCCGATTCACATCTCCAATGTTATTCTGGTTATGGCTGTGCCACTCTTCTCTGGATGCTAAAGCATAAGCCTGAAAAACTAAAGAATTTCACGTATTCAGCTACTGTACAAGATTTCGTTGTGGCTATGCTTTGTGATCTTGATGTCCCAAAAACATCAGATCAGAACGCTGCCAGCTGGGGTTACTTCAACACAGAAAAGAATGAATGGAATATTGATATACTGAAATCTATTGATTTTCCTGTTAATCTTCTGCCAAAAGTGATAAGAAGTGGGGAAATAGCGGGTACATTAAGCTGTTGTTGGAATGGAATCCCAGAGGGTACTCCAGTGGGTGCAGCTATGGGTGACCTTCAATGTTCAATACTTGCCACACTTGAGAACGGACAAGATGCTGTTCTAAACATATCTACCTCAGCTCAACTCGCCTTTGTTGTTGACCAAATTAAAGATTTAGGTTGTACAACTATTGAGCACTTGCCATATTTTAATAATACATATCTAGTTGTTGCTGCATCATTAAATGGGGGTAATGTATTGGCTACTTTTGTCAAAATGTTACAACAATGGATGCTTGAATTTGGATTTCCAATACCACAATCTAAAGTATGGGAAAAACTAATTGCACTTGGCCTGGACGCATCTGACGGCACTGATATGAAAATAAGTCCACTCCTTTTAGGGGAAAGGCACTCACCAACAGCTAAAGCAGCTGTTCAAAATATTGATCTTTCAAATATTCAACTCGGTCATGTGTTTAGATCTCTCTGTAATAGTATTATTGATAATATCCATTGTATGATGCCAAAGAAAATTCTATTGGATGCAAATATAAAAAGGATAGTCGGCAATGGTTCTGGACTATCAAGAAATGCTGTATTACAAAGAGCTGTTGAGCATAATTACAGCTTGCCATTAGAATTTACTTCTGGAGGCGATGCAGCCCGAGGAGCAGCTGTAGCTGTCCAAAATGCATCATAA

Protein sequence:

>DPOGS209713-PA
MDIGTTSVKVCVYDPHTKELVAKQSKDTAANIPSDLGIEGNKQDVPKIVSAVHYCVSRLPRDVLRHVKKIGVCGQMHGVVLWKNGAWEKVEKEGAFIRYEGNRENMSALYTWQDTRCKPEFLETLPKPDSHLQCYSGYGCATLLWMLKHKPEKLKNFTYSATVQDFVVAMLCDLDVPKTSDQNAASWGYFNTEKNEWNIDILKSIDFPVNLLPKVIRSGEIAGTLSCCWNGIPEGTPVGAAMGDLQCSILATLENGQDAVLNISTSAQLAFVVDQIKDLGCTTIEHLPYFNNTYLVVAASLNGGNVLATFVKMLQQWMLEFGFPIPQSKVWEKLIALGLDASDGTDMKISPLLLGERHSPTAKAAVQNIDLSNIQLGHVFRSLCNSIIDNIHCMMPKKILLDANIKRIVGNGSGLSRNAVLQRAVEHNYSLPLEFTSGGDAARGAAVAVQNAS-
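
The transcript and protein records above can be checked against each other: 1362 bp is 454 codons, which is 453 amino acids plus one stop codon (shown as "-" at the end of the protein sequence). The following is a minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The function names translate and expected_protein_length are hypothetical helpers introduced here for illustration, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the first base varying slowest. "*" marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


def expected_protein_length(cds_bp: int) -> int:
    """Residue count for a CDS of cds_bp bases that ends in one stop codon."""
    return cds_bp // 3 - 1


# First seven codons and last three codons of DPOGS209713-TA, copied from
# the nucleotide record above.
start = translate("ATGGATATTGGTACGACTTCG")
end = translate("GCATCATAA")
length = expected_protein_length(1362)
```

Here start matches the first seven residues of DPOGS209713-PA, end gives the final two residues followed by the stop, and length agrees with the 453 aa listed for the protein. Passing the full nucleotide sequence to translate should reproduce the whole protein record, with "*" in place of the trailing "-".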