Monarch geneset OGS2.0

DPOGS215041
Transcript: DPOGS215041-TA; 1167 bp
Protein: DPOGS215041-PA; 388 aa
Genomic position: DPSCF300208 - 495822-502459
RNAseq coverage: 228x (Rank: top 44%)
Annotation
Heliconius: HMEL002013; 1e-98; 62.16%
Bombyx: BGIBMGA005670-TA; 5e-45; 65.52%
Drosophila: CG5288-PA; 7e-22; 24.50%
EBI UniRef50: UniRef50_E4X964; 3e-95; 50.55%; Whole genome shotgun assembly, reference scaffold set, scaffold scaffold_16 n=1 Tax=Oikopleura dioica RepID=E4X964_OIKDI
NCBI RefSeq: XP_001187174.1; 7e-106; 51.57%; PREDICTED: similar to galactokinase 1 [Strongylocentrotus purpuratus]
NCBI nr blastp: gi|260831607; 1e-105; 51.69%; hypothetical protein BRAFLDRAFT_90953 [Branchiostoma floridae]
NCBI nr blastx: gi|260831607; 7e-102; 51.70%; hypothetical protein BRAFLDRAFT_90953 [Branchiostoma floridae]
Group
Gene Ontology:
GO:0046835; 3.3e-140; carbohydrate phosphorylation
GO:0005524; 3.3e-140; ATP binding
GO:0006012; 3.3e-140; galactose metabolic process
GO:0004335; 3.3e-140; galactokinase activity
GO:0008152; 2.8e-39; metabolic process
GO:0016301; 2.8e-39; kinase activity
GO:0016773; 2.8e-39; phosphotransferase activity, alcohol group as acceptor
GO:0005737; 2.8e-39; cytoplasm
GO:0016310; 2.4e-10; phosphorylation
KEGG pathway: bfo:BRAFLDRAFT_90953; 2e-106
 K00849 (galK) maps -> Galactose metabolism
    Amino sugar and nucleotide sugar metabolism
InterPro domain:
[6-387] IPR000705; 3.3e-140; Galactokinase
[32-209] IPR014721; 1.1e-58; Ribosomal protein S5 domain 2-type fold, subgroup
[12-213] IPR020568; 1.5e-54; Ribosomal protein S5 domain 2-type fold
[34-58] IPR006206; 2.8e-39; Mevalonate/galactokinase
[18-65] IPR019539; 2.8e-19; Galactokinase galactose-binding domain
[288-368] IPR013750; 4.2e-15; GHMP kinase, C-terminal
[125-190] IPR006204; 2.4e-10; GHMP kinase
Orthology group: MCL15012 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215041-TA
ATGTCGGAAGCGGTCCCTAAAGGTGAAGAAGTTCTGCTTAAAGAGGCAGTCTCTAAGTTCGTCTCCACATACAATCGCAAGCCGCTAGCGGCAGCTGCTGCCCCTGGTAGGGTCAATCTCATTGGAGAGCACGTTGACTATTGTGAAGGATTCGTGCTACCTGTGGCGTTACCATTTTTCACTGTAGTGGTGGGTGCATACAATTCCACTGATGAGTGCATGGTGCTGTCAATCCTTGCCAGTGGACAAGAGGTCCAGACCAGCTTCTCTTCTACAGAATCATCTTCTTTGCAACCCGGGGAGCCAGGGTGGGCTAACTATGTAAAGGGTGTGCTGGCCAATTTTCCAGAAAAGGTGAAAGGTCTCGATGCGGTTATTGTATCAGACGTGCCAATGGGGTCCGGCGTCTCCAGCAGCGCTTCATTGGAAGTAGCATTCTTCACGTTCCTTGAGGACCTCACTAAGATCACGGTTGATCCAGTCAAAAAAGCTCAGTTGTGTCAGAAAGCTGAGCATGATTTTCCCGGAATGCCGTGCGGTATTATGGACCAGTTCATAGTGACTCTTGGAAAAAAAGATCACGCATTACTAATAGATTGCAGGTCATTGGAGTCCAAACAGGTGCCAATGAAGTGTTCAGACGTCGTGCTGTTGGTTGTGAATTCTAGTGTGAAGCATCAGCTAACCGGAAGCGAATACCCTCAGAGACGAGCGCAGTGTCAGCAAGCGGCTGATGAATTGGGGAAACCCTCTTTAAGGAGCGCCACCATTCAAGATCTTTCAAAACTGAAATGCGAGGAATTAGTTCTGAAACGTGCTAAGCATGTGGTCGAAGAGATCACCCGGACCGAGTTAGTCGCACAGCTTTTAGAGAGGAAAGATTATAAGGAGGTAGGGCGACTGTTCTATCAGTCCCACGAGTCCCTGAGCAAGCTGATGGAGGTTTCCTGTCCCGAGTTAGACCAACTGGTTGATATCATGAGGTCATCGGACGGAGTGTTCGGCGCCAGAATGACGGGCGGCGGCTTCGGGGGATGCGTCATAGCCTTAATAAAGAAGGAATGCTTGGCGTCTTTAAAGAGCAAGGTCCGGTCGGAGTACAAAGGTAACCCAGTGTTCTTTGAGTGCGAGCCGAGTGACGGAGCGAGAATATTAAAGATAGGATAA

Protein sequence:

>DPOGS215041-PA
MSEAVPKGEEVLLKEAVSKFVSTYNRKPLAAAAAPGRVNLIGEHVDYCEGFVLPVALPFFTVVVGAYNSTDECMVLSILASGQEVQTSFSSTESSSLQPGEPGWANYVKGVLANFPEKVKGLDAVIVSDVPMGSGVSSSASLEVAFFTFLEDLTKITVDPVKKAQLCQKAEHDFPGMPCGIMDQFIVTLGKKDHALLIDCRSLESKQVPMKCSDVVLLVVNSSVKHQLTGSEYPQRRAQCQQAADELGKPSLRSATIQDLSKLKCEELVLKRAKHVVEEITRTELVAQLLERKDYKEVGRLFYQSHESLSKLMEVSCPELDQLVDIMRSSDGVFGARMTGGGFGGCVIALIKKECLASLKSKVRSEYKGNPVFFECEPSDGARILKIG-
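
Consistency check (sketch):

The record lists a 1167 bp transcript and a 388 aa protein, and the nucleotide sequence starts with ATG and ends with TAA. The following Python sketch shows one way to check that the two sequences agree. It rests on assumptions that the record does not state: that the transcript is coding sequence from start to stop with no untranslated regions, that the standard genetic code applies, and that the trailing "-" in the protein sequence marks the stop codon. The function names `translate` and `expected_protein_length` are hypothetical helpers written for this illustration.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (assumption: standard code applies).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, mirroring the "-" that
    ends the protein sequence in this record (an assumption).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[cds[p:p + 3]] for p in range(0, len(cds), 3)]
    return "".join(residues).replace("*", stop_symbol)


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS that ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# First 24 and last 12 nucleotides of DPOGS215041-TA, copied from above.
print(translate("ATGTCGGAAGCGGTCCCTAAAGGT"))  # MSEAVPKG
print(translate("AAGATAGGATAA"))              # KIG-
print(expected_protein_length(1167))          # 388
```

Under these assumptions, 1167 bp gives 389 codons, one of which is the stop codon, which matches the 388 aa listed for DPOGS215041-PA. Passing the full DPOGS215041-TA sequence to `translate` and comparing the result with the DPOGS215041-PA sequence would extend the same check to every residue.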