DPGLEAN06440 in OGS1.0

New model in OGS2.0DPOGS215219 
Genomic Positionscaffold6191:- 19978-24401
See gene structure
CDS Length1539
Paired RNAseq reads  2429
Single RNAseq reads  5960
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008725 (0.0)
Best Drosophila hit  UGP, isoform D (0.0)
Best Human hitUTP--glucose-1-phosphate uridylyltransferase isoform a (3e-172)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC009819 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC009819 [Tribolium castaneum] (0.0)
GeneOntology terms
  
GO:0003983 UTP:glucose-1-phosphate uridylyltransferase activity
GO:0008152 metabolic process
InterPro families
  
IPR002618 UTP--glucose-1-phosphate uridylyltransferase
IPR016267 UTP--glucose-1-phosphate uridylyltransferase, subgroup
Orthology groupMCL14653

Nucleotide sequence:

ATGGAGTTTGAAATGGATCATGTACAGTTTTGGATCCGAAGTCATCAACGTACGCCGTCT
GGGTCCCGGGACTTTAAAGAGGCGACCAAACGTGATGCTCTGGCTCGTCTGGAAGTGGAA
CTCGAGAGACTTCTGTCTACCGTCCCCGAGCCCAAACAGCCCCTAGTGGAGAAAGAGTTC
GCGGGATTCAAGAATCTATTCAGCAGGTTCCTCGCTGAACAGGGTCCGTCTGTTACGTGG
GAGAAAATCCAGAAGCTTCCAGAACATGCCGTCATCGATTACACCACGCTTCAGACTCCT
ACCACCGATAGCATTCACCACATGCTGGATAAGTTGGTGGTGGTGAAACTGAACGGTGGC
CTGGGAACCTCCATGGGCTGTAAAGGACCCAAGTCGGTCATCCAAGTCAGAAATGAACTG
ACCTTCCTCGATCTGACTGTGCAACAAATTGAGCATCTGAACAAGACGTACAAATGCAAC
GTTCCCCTGGTGCTCATGAACTCTTTCAACACTGACGAGGACACGCTCAAGGTCATCCGC
AAGTACCGCGGTCTGAAGCTCGATATCCACACCTTCAACCAGTCCTGCCATCCCAGGATC
AACAGGGAATCCTTACTGCCGCTGGCCAAAGACGCTGACGTACACTCGGATATCGAGGCT
TGGTACCCGCCTGGTCACGGAGACTTCTACGAATCTTTCTACAATTCTGGTCTTCTGAAT
AAATTTATTAAAGAGGGCAGGACGTACTGCTTCATCAGCAACATAGATAATTTGGGGGCG
AACGTCGATCTGAACATCCTCAACCTGTTGTTGAATCCGGACCAGAAGGAGCAATCGGAA
TTCGTCATGGAGGTCACCGATAAAACCAGAGCCGACGTCAAAGGTGGCACTCTCATACAG
TACGAGGATAAACTGCGTCTCCTGGAAATCGCTCAGGTGCCCAAAGAACACGTGGACGAC
TTCAAATCGGTGAGCCAGTTCAAATTCTTCAACACCAACAATCTTTGGGCGAAGCTGGAC
GCCATCAAGAGGGTCGTCGAACGAGGGTCCCTGAACATGGAGATAATCGTGAACAATAAG
AGTCTAGCTGACGGAGTGAACGTCATTCAACTGGAAACGGCCGTGGGCGCGGCCATGAAG
TGCTTCGAAGGCGGCATCGGTGTCAACGTCCCACGAAGCAGATTCCTGCCGGTCAAGAAG
ACCTCGGACCTGTTGTTGGTGATGTCGAATCTATACAGCCTGTCGCACGGGTCGCTGGTG
ATGTCGTCTCAGAGGATGTTCCCATCGACGCCTCTAGTGAAACTCGGTGACAACCACTTC
GCCAAGGTGAAGGAGTTCCTGAACAGGTTCGCTACGATCCCCGACCTCATCGAGCTCGAC
CACCTCACCGTCTCCGGAGACGTGACCTTCGGCCGCGGCGTGTCTTTGAAGGGCACTGTT
ATAATAATAGCCAACCACGGCGAGCGCATCGACATCCCCTCCGGGGCGCTGCTCGAGAAC
AAAATAGTCTCAGGAAATCTAAGGATATTGGACCATTAG

Protein sequence:

MEFEMDHVQFWIRSHQRTPSGSRDFKEATKRDALARLEVELERLLSTVPEPKQPLVEKEF
AGFKNLFSRFLAEQGPSVTWEKIQKLPEHAVIDYTTLQTPTTDSIHHMLDKLVVVKLNGG
LGTSMGCKGPKSVIQVRNELTFLDLTVQQIEHLNKTYKCNVPLVLMNSFNTDEDTLKVIR
KYRGLKLDIHTFNQSCHPRINRESLLPLAKDADVHSDIEAWYPPGHGDFYESFYNSGLLN
KFIKEGRTYCFISNIDNLGANVDLNILNLLLNPDQKEQSEFVMEVTDKTRADVKGGTLIQ
YEDKLRLLEIAQVPKEHVDDFKSVSQFKFFNTNNLWAKLDAIKRVVERGSLNMEIIVNNK
SLADGVNVIQLETAVGAAMKCFEGGIGVNVPRSRFLPVKKTSDLLLVMSNLYSLSHGSLV
MSSQRMFPSTPLVKLGDNHFAKVKEFLNRFATIPDLIELDHLTVSGDVTFGRGVSLKGTV
IIIANHGERIDIPSGALLENKIVSGNLRILDH