Monarch geneset OGS2.0

DPOGS204133
TranscriptDPOGS204133-TA1584 bp
ProteinDPOGS204133-PA527 aa
Genomic positionDPSCF300184 + 213121-219434
RNAseq coverage172x (Rank: top 50%)
Annotation
HeliconiusHMEL0109838e-15672.04% 
BombyxBGIBMGA013614-TA0.077.21% 
DrosophilaCG3534-PA8e-14046.89% 
EBI UniRef50UniRef50_Q9VEQ01e-13746.89%CG3534 n=18 Tax=Diptera RepID=Q9VEQ0_DROME
NCBI RefSeqXP_001604257.17e-15853.89%PREDICTED: hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|1565555561e-15653.89%PREDICTED: xylulose kinase-like isoform 1 [Nasonia vitripennis]
NCBI nr blastxgi|1565555563e-15053.76%PREDICTED: xylulose kinase-like isoform 1 [Nasonia vitripennis]
Group
Gene OntologyGO:00167737.9e-176phosphotransferase activity, alcohol group as acceptor
GO:00059757.9e-176carbohydrate metabolic process
KEGG pathwaynvi:1001206392e-157 
 K00854 (E2.7.1.17)maps-> Pentose and glucuronate interconversions
InterPro domain[1-523] IPR0005777.9e-176Carbohydrate kinase, FGGY
[296-477] IPR0184852e-18Carbohydrate kinase, FGGY, C-terminal
[134-286] IPR0184842.5e-16Carbohydrate kinase, FGGY, N-terminal
Orthology groupMCL12138 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204133-TA
ATGATTGATGAAACGCGAAAGAAAACTTTTTTAGGATTCGATTTTAGTACTCAGAGGCTTAAGGCAATAGTTATTGATGACAAATATGAAATACAACAAGAAGCAGATGTGGAATTTGATGTGGATCTGCCCGAATTTAGAACGGCAGGTGGAGTTCTCCGAGGAGAGCATGGGGAGGTTACAGCACCTCCTCTACTCTGGGTGAAAGCGCTGGATATGGTGTTAGATAGACTTGTTGTGGCAGGAGTCGACTTCTCAACCGTTGAGGCGCTATCAGGGGCTGCTCAGCAACATGGGTCTGTGTGGTGGGCTAAAGGAGCTGAAGCCAAACTGGCATCACTAACACCCGATGGATTCCTGCACACACAGCTAGCAACTGTATTTGTGTGCGACTCACCAGTATGGATGGACTCTAGCACCACTGCTGATTGTAAAGCTTTAGAGGAAGCTGTTGGTGGCCCAGAGGAATTGGCAAAGATCACAGGATCGAGAGCTTATGAACGTTTCACTGGACCTCAAATACGTAAAATGTACAAAAGTAAGCCGAGAGTGTACGGGGCCACTGAAAAAATATCTCTCATATCATCATTCGCCTGTTCCCTGTTTGTGGGAAAGATAGCTCCCATAGACCTCTCAGACGGTTCCGGAATGAACCTCCTTGACATTCACACCATGAAGTGGTGTGACAAGGCTTTGGAGGCTTGTGGAGATGAAACACTAAAGCAGAAGCTTGGCGAGCCTGTACCCTCGGCCTCAGTGGTTGGATCCATATCACCATACTTTGTTCAGAGATATGGGTTTAAACCTGACTGCAAGGTTGTCGCCTTTACAGGGGATAACTGCTCCGCCTTGGCTGGTCTCCGCCTGCGTTCCGGGTGGGTGGGTCTCAGCCTGGGTACAAGCGACACTCTGTTACTTGGGCTGGAGGAGCCCGGGGCTCCGGTGGCGGGCCACGTGTTGGTGGGACCTACCAGCGCTCCATACATGGCGTTACTGTGCTTCGCCAACGGCTCGTTGACTCGCCAGGCAGAGAGGGATCGTCTCTGCGGTCCGAGCTGGCAGGCCTTTGACGAGCTACTGAGAAGTACTGTCAGGGGTAACATGGGGTATATGGGTATATACTACAACACAGCGGAGATCCTCCCTCGAGCGCCGTCTCTACGTGTGGTACAGGACTCTGCTGGTCGTCCCTGTAACCCGGCGCCGCAGTTCGAGGCGCGCGCTCTACTAGAAGGACAGGCTCTCTCCGCTAGAGCACACTCTGAGGATGTAGGGCTCGCTCTAGAGCGTAGTTCCCGCATAGTGGCGACAGGTGGCGCCTCCGTCAACACCTCACTGTTGCAGATCTTCGCGGACGTGTTCAACACACCCGTATATGTACAGGATCAACACGCGAACGCCGCTCTTCTGGGGGCTGCCATGAGAGCCGCTGAGGTGTGGGCGCAAGAGACTAACACACAACTTAGTGGTTCTGAGGTGACCGTCTCTCCCGTGGCGAAGCCCTACCCTGACGCCGAGAAGATCTACTCTCCAATGTTACAGCGCTACAGGAAAATGCTCGAAGAACTTCCCAAGCTCAACTGA

Protein sequence:

>DPOGS204133-PA
MIDETRKKTFLGFDFSTQRLKAIVIDDKYEIQQEADVEFDVDLPEFRTAGGVLRGEHGEVTAPPLLWVKALDMVLDRLVVAGVDFSTVEALSGAAQQHGSVWWAKGAEAKLASLTPDGFLHTQLATVFVCDSPVWMDSSTTADCKALEEAVGGPEELAKITGSRAYERFTGPQIRKMYKSKPRVYGATEKISLISSFACSLFVGKIAPIDLSDGSGMNLLDIHTMKWCDKALEACGDETLKQKLGEPVPSASVVGSISPYFVQRYGFKPDCKVVAFTGDNCSALAGLRLRSGWVGLSLGTSDTLLLGLEEPGAPVAGHVLVGPTSAPYMALLCFANGSLTRQAERDRLCGPSWQAFDELLRSTVRGNMGYMGIYYNTAEILPRAPSLRVVQDSAGRPCNPAPQFEARALLEGQALSARAHSEDVGLALERSSRIVATGGASVNTSLLQIFADVFNTPVYVQDQHANAALLGAAMRAAEVWAQETNTQLSGSEVTVSPVAKPYPDAEKIYSPMLQRYRKMLEELPKLN-