Monarch geneset OGS2.0

DPOGS200075
TranscriptDPOGS200075-TA1584 bp
ProteinDPOGS200075-PA527 aa
Genomic positionDPSCF300044 - 333634-346714
RNAseq coverage1113x (Rank: top 11%)
Annotation
HeliconiusHMEL0052500.082.82% 
BombyxBGIBMGA004595-TA0.074.41% 
DrosophilaHex-A-PA8e-7937.72% 
EBI UniRef50UniRef50_E2C3M92e-12551.43%Hexokinase-2 n=11 Tax=Arthropoda RepID=E2C3M9_HARSA
NCBI RefSeqXP_392350.21e-12951.99%PREDICTED: similar to Hexokinase A CG3001-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3838658551e-12853.32%PREDICTED: hexokinase-1-like [Megachile rotundata]
NCBI nr blastxgi|3838658551e-12453.32%PREDICTED: hexokinase-1-like [Megachile rotundata]
Group
Gene OntologyGO:00055246.2e-127ATP binding
GO:00167736.2e-127phosphotransferase activity, alcohol group as acceptor
GO:00059756.2e-127carbohydrate metabolic process
KEGG pathwayisc:IscW_ISCW0123878e-87 
 K00844 (HK)maps-> Starch and sucrose metabolism
    Galactose metabolism
    Glycolysis / Gluconeogenesis
    Amino sugar and nucleotide sugar metabolism
    Fructose and mannose metabolism
    Type II diabetes mellitus
    Streptomycin biosynthesis
    Insulin signaling pathway
    Butirosin and neomycin biosynthesis
InterPro domain[114-524] IPR0013126.2e-127Hexokinase
[91-283] IPR0226724.8e-54Hexokinase, N-terminal
[290-521] IPR0226735.4e-53Hexokinase, C-terminal
Orthology groupMCL18831 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200075-TA
ATGTCATTGCCACTAACTTCACCAATTGTATCAATAAATGTTTATAAAATGAACTCTGGTGATAACCTTAAAATTGGAAAGATATACATCACAATGTCAACGGCTGTTGAGAATCTTAACATGGAGCCGCTGGTGAACGGTCACAATGGCCCACATTTGAATGGCACAAATGGTACAAATGGCACCACAGTTATCACAAACGGCACTAATGGAACCTGGGCCCAGCAATTCCTACCACCACAACTGATTTTGGACGACAGAGTCAGAGCTACCAAGATTGATAATCGTTTGTCCAAAATGATTCTGTGTGGTAACACGCTGAGGCGAGTTGGCGAAGTGTTTGCCAGAGAAATAGAGAACGGATTAAAGGAACGTCCATCCAGTCTGCAGATGGAGAACACGTACGTCCCTGAACTACCCGATGGAACTGAGGAAGGGGTTTTCTTGGCTTTAGATCTCGGAGGTACTAATTTTAGAGTTCTGCTGCTAGAACTTCGTGCTGGTAAACTTGTGAGAGAGGACGTTAAGCATTATCATATCAGTGACGTCTTGCGGCTCGGGCCCGGTGAGGATCTCTTTAATTTCCTTGCGGACAGTGTACTGGACTTTTTGACGTCGGAGAACATGGAAAACGACGTGCTTTCGCTCGGTTTCACGTTTTCGTTTCCGATGAAACAGCACTCGATTTCATCTGGTGAGTTGATAACGTGGACCAAGAGCTTCAACTGCGGTGGGATGCAGGGCGTGGACGTCGCTGCCCTGTTGCAGCGCTGTCTGCGAGACCGTGGACTGAGAGTGACGGTTCAAGTGTTGCTTAATGACACCACGGGTACACTTGTCGCTGGTGCTCACATGGATCCGGATGTTGCCATCGGTGTGATAATGGGCACTGGTTCAAACGGTTGTTATATGGAACAAGCGAAGAGAGTGCAACACTGGGAGGCGAAACACGACCGCGTGCAGGATGTGTGTGTTGACATCGAGTGGGGAGCCTTCGGAGATAACGGGTGCCTGTCCTTCCTGAGGACAGATTTCGACAAAGTCGTGGACGACAATTCTTTGCTCGCTACATCTTTCACTTTCGAAAAGTACATCGGTGGAAAATATATAGGGGACTTATTGTGTGCGGTTTTGAGTGGACTGGCACACGATCGTCTTTTCCCCGCACCACCAGCGCCCGGTTCATTGGCCTCGTCGGATCTTAGCATGTTTGAAGAAGAGAACGTGACAGGTTCGTGGTCTAACACAGCTAACACATTGACTGCGGCCTGCGGTGTGCGAATCTCGCGTGCTGATGCGTTAGTCGCCCAACACGCAGCACGAGTCATATCAAATCGTGCTGCGCAGCTCGTGTCTGTTTGTATAGCGACGCTGTTGCTTCGTATGAATCGCCCGCATGTGGGCGTGGCTGTTGATGGTTCAGTTTTCAAACGACACCCTCGTATCCGTGGACTGATGGAGCGCTACATTGAGTTGCTCGCCCCCCATCACAAGTTCACTCTTCTTGGAGCTGAAGATGGTAGTGGCAAAGGCAGTGCTTTGACGGCAGCTATCGCGGCCAGGGTCGCCGCTCGTTCACCCTAA

Protein sequence:

>DPOGS200075-PA
MSLPLTSPIVSINVYKMNSGDNLKIGKIYITMSTAVENLNMEPLVNGHNGPHLNGTNGTNGTTVITNGTNGTWAQQFLPPQLILDDRVRATKIDNRLSKMILCGNTLRRVGEVFAREIENGLKERPSSLQMENTYVPELPDGTEEGVFLALDLGGTNFRVLLLELRAGKLVREDVKHYHISDVLRLGPGEDLFNFLADSVLDFLTSENMENDVLSLGFTFSFPMKQHSISSGELITWTKSFNCGGMQGVDVAALLQRCLRDRGLRVTVQVLLNDTTGTLVAGAHMDPDVAIGVIMGTGSNGCYMEQAKRVQHWEAKHDRVQDVCVDIEWGAFGDNGCLSFLRTDFDKVVDDNSLLATSFTFEKYIGGKYIGDLLCAVLSGLAHDRLFPAPPAPGSLASSDLSMFEEENVTGSWSNTANTLTAACGVRISRADALVAQHAARVISNRAAQLVSVCIATLLLRMNRPHVGVAVDGSVFKRHPRIRGLMERYIELLAPHHKFTLLGAEDGSGKGSALTAAIAARVAARSP-