Monarch geneset OGS2.0

DPOGS204601
Transcript: DPOGS204601-TA (1041 bp)
Protein: DPOGS204601-PA (346 aa)
Genomic position: DPSCF300418 - 28839-33844
RNAseq coverage: 182x (Rank: top 49%)
Annotation
Heliconius      HMEL003810        6e-166  77.68%
Bombyx          BGIBMGA001613-TA  2e-141  70.89%
Drosophila      CG9232-PA         8e-123  59.59%
EBI UniRef50    UniRef50_E2C5K9   7e-130  61.40%  Galactose-1-phosphate uridylyltransferase n=17 Tax=cellular organisms RepID=E2C5K9_HARSA
NCBI RefSeq     XP_001600507.1    3e-131  63.01%  PREDICTED: similar to ENSANGP00000017622 [Nasonia vitripennis]
NCBI nr blastp  gi|383861841      1e-133  63.98%  PREDICTED: galactose-1-phosphate uridylyltransferase-like [Megachile rotundata]
NCBI nr blastx  gi|383861841      2e-134  63.98%  PREDICTED: galactose-1-phosphate uridylyltransferase-like [Megachile rotundata]
Group
Gene Ontology
GO:0008108  4.6e-200  UDP-glucose:hexose-1-phosphate uridylyltransferase activity
GO:0008270  4.6e-200  zinc ion binding
GO:0006012  4.6e-200  galactose metabolic process
GO:0003824  4.5e-64   catalytic activity
KEGG pathway
nvi:100116573  9e-131
K00965 (E2.7.7.12) maps-> Galactose metabolism
                          Amino sugar and nucleotide sugar metabolism
InterPro domain
[1-347]    IPR001937  4.6e-200  Galactose-1-phosphate uridyl transferase, class I
[175-344]  IPR011151  4.5e-64   Histidine triad motif
[1-173]    IPR011146  3.9e-63   Histidine triad-like motif
[2-172]    IPR005849  9e-57     Galactose-1-phosphate uridyl transferase, N-terminal
[180-344]  IPR005850  7.9e-54   Galactose-1-phosphate uridyl transferase, C-terminal
Orthology group: MCL14321 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204601-TA
ATGGAATTCAATGTCACAGAACATCAGCACGTCCGCTACAATCCCTTAAAAGACCAGTGGGTTTTGGTCTCACCACATCGCTGCAAACGTCCCTGGAGCGGGCAGACGGAACCAGAACCGGAAGAACTCTCCGATGAGAAGAACCCATTGAAAGCTGGTGCAGTCAGAGCTAATGGACAGAAAAATCCGAATTACACTTCCACGTACGTGTTCCCAAACGACTTCCCGGCTCTCCTGGAACGTGTCCCGGAACCGCCGCCGTCAGAACACCCTCTGTTCCAGATGTCCCAGGCGAAGGGAACTTGCAGGGTGATGTGCTTTCATCCGGATTCCAAAATGACGATATCGCTGATGACCGTGGACGAAATACTGAGCGTCATCGAAGAATGGATACGACAAACCCAGGAGTTGGGTCGACGTTACACCTGGGTGCAGGTCTTTGAGAACAAGGGCTCCGTCATGGGCTGCTCCAACCCTCACCCCCACTGTCAGATATGGGCCTCCAGCTATTTACCCGACGAGGGTAAAATTAAGGACAGGTGTCAGAAAGAGTACTTCATCAAAAATGCTAGGCCGATGTTGATGGAGTACTTGGAGCAAGAGCTGATGAGGAAGGAACGTATAGTCCTCGAGAACCAGTCTTGGGTGACCCTCGTCCCGTACTGGGCTGTATGGCCGTACGAGACCTTACTTCTGCCGAAGCAGCACGTTCAGAGGATCACAGACCTGGACGAGGTTCAGAAGCAGGACCTGGCTATCATGATGAAAGAGCTGAACACCAAATATGATAACTTATTCCAATGCAACTTCCCCTACAGTATGGGCTGGCATGGGGCTCCCACGGGTCCATCCGCTAAACCCGGGGACTCCCCGCACTGGGTGTTCCACGGCATCTATCTACCGCCACTCCTGAGATCGGCTAGTGTCAAAAAATTCATGGTGGGCTACGAACTGCTCGCACAACCACAAAGAGATTTAACACCCGAGCAAGCAGCGGAAAAACTAAGAGGATGCAGTCTAGTACACTACAAATATGTGTAG

Protein sequence:

>DPOGS204601-PA
MEFNVTEHQHVRYNPLKDQWVLVSPHRCKRPWSGQTEPEPEELSDEKNPLKAGAVRANGQKNPNYTSTYVFPNDFPALLERVPEPPPSEHPLFQMSQAKGTCRVMCFHPDSKMTISLMTVDEILSVIEEWIRQTQELGRRYTWVQVFENKGSVMGCSNPHPHCQIWASSYLPDEGKIKDRCQKEYFIKNARPMLMEYLEQELMRKERIVLENQSWVTLVPYWAVWPYETLLLPKQHVQRITDLDEVQKQDLAIMMKELNTKYDNLFQCNFPYSMGWHGAPTGPSAKPGDSPHWVFHGIYLPPLLRSASVKKFMVGYELLAQPQRDLTPEQAAEKLRGCSLVHYKYV-
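
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the standard genetic code (NCBI translation table 1) and assumes that the trailing "-" in the protein sequence marks the stop codon. The names build_codon_table and translate are hypothetical helpers introduced only for illustration.

```python
# Sketch: translate a coding sequence with the standard genetic code
# (assumed here; the record does not state which table was used).

def build_codon_table():
    """Return a codon -> amino acid map for the standard code ('*' = stop)."""
    bases = "TCAG"
    amino = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
    codons = [a + b + c for a in bases for b in bases for c in bases]
    return dict(zip(codons, amino))


def translate(cds, stop_symbol="-"):
    """Translate a DNA coding sequence codon by codon.

    Stop codons are written as stop_symbol, mirroring the '-' that ends
    the protein sequence in this record (an assumption about its meaning).
    Trailing bases that do not form a full codon are ignored.
    """
    table = build_codon_table()
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = table[cds[i:i + 3]]
        protein.append(stop_symbol if residue == "*" else residue)
    return "".join(protein)


# First nine codons and last six codons of DPOGS204601-TA, copied from the record.
start = translate("ATGGAATTCAATGTCACAGAACATCAG")
end = translate("CACTACAAATATGTGTAG")
print(start, end)

# Length check: 1041 bp is 347 codons, i.e. 346 residues plus one stop codon.
residues = 1041 // 3 - 1
print(residues)
```

Passing the full nucleotide sequence to translate should, under the same assumptions, reproduce the DPOGS204601-PA sequence including its final "-".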