DPGLEAN17758 in OGS1.0

New model in OGS2.0: DPOGS210796
Genomic Position: scaffold1772:- 14639-24217
CDS Length: 3078
Paired RNAseq reads: 3219
Single RNAseq reads: 7020
Migratory profiles: Query via corresponding ESTs
Best Bombyx hit: BGIBMGA007111 (0.0)
Best Drosophila hit: suppressor of rudimentary, isoform C (0.0)
Best Human hit: dihydropyrimidine dehydrogenase [NADP+] isoform 1 (0.0)
Best NR hit (blastp): dihydropyrimidine dehydrogenase [Aedes aegypti] (0.0)
Best NR hit (blastx): dihydropyrimidine dehydrogenase [Aedes aegypti] (0.0)
Gene Ontology terms:
GO:0004159 dihydrouracil dehydrogenase (NAD+) activity
GO:0017113 dihydropyrimidine dehydrogenase (NADP+) activity
GO:0009055 electron carrier activity
GO:0005737 cytoplasm
GO:0055114 oxidation reduction
GO:0051536 iron-sulfur cluster binding
GO:0004158 dihydroorotate oxidase activity
GO:0006222 UMP biosynthetic process
GO:0006207 'de novo' pyrimidine base biosynthetic process
InterPro families:
IPR009051 Alpha-helical ferredoxin
IPR005720 Dihydroorotate dehydrogenase, class 1
IPR017900 4Fe-4S ferredoxin, iron-sulphur binding, conserved site
IPR001295 Dihydroorotate dehydrogenase, conserved site
IPR012135 Dihydroorotate dehydrogenase, class 1/ 2
IPR013027 FAD-dependent pyridine nucleotide-disulphide oxidoreductase
IPR012285 Fumarate reductase, C-terminal
IPR013785 Aldolase-type TIM barrel
IPR017896 4Fe-4S ferredoxin, iron-sulphur binding domain
Orthology group: MCL12239

Nucleotide sequence:

ATGTCGAAGGTATTACTGAGTCGAGATCTGCCGGATATTGAGAACCTTCTTAAGCTTAAC
CCGACTGTTAAGCCATATACCAACCTGGTGCCATCAGCACAGACGAAGAAGAATAAACAG
CACTGGAAAAGGAATGCCGATAGAAAATGCACTAGTTGTCCAAGTCTCGAGAGGAATTTC
GATGACATCAAACACACGACATTGTCTGAGCGTGGTGCTTTGAAAGAGGCAGCTCGTTGT
CTGAAATGTGCTGACGCGCCCTGTCAGAAGTCCTGCCCGACGCAGATCGACGTCAAAAGC
TTTATAACAAGCATAGCTAATAAGAACTATTATGGCGCCGCCAAAGAAATACTCTCAGAC
AACCCCTTGGGTCTCACTTGTGGAATGGTTTGCCCCACCAGTGACTTGTGCGTCGGCGGA
TGTAACTTACACGCGAGCGAAGAGGGCGCCATTAATATCGGTGGATTACAGCATTTCGCT
GTTGAAACATTCATGAAAATGGGTATCTCCCAGACATTGGATCCAAAAACGAAACCCCTG
CCTCGAGGTGATAAAAAAATTGCTCTTATAGGCGGCGGACCGGCAAGCTTGAGTTGTGCT
TGTTTTTTGGCAAGGATGGGTTATAAGAATATTACGGTGTTTGAGAAAGAAAAATATCTT
GGAGGACTAAGTTCATCAGAGATCCCACAGTACCGCCTTCCGTATGACGTGGTTCAATTC
GAGATCGATCTCATCAGGGACCTTGGAGTGAAGTTCGTCACAGGAAGGAAACTTTCAACC
AGTGACATAACTGTTAATGGACTTTTTAAAGATGGTTATGATGCTGTGTTTCTTGGCATC
GGTCTGCCCGAGCCAAAGAGCATTCCTATCTTCGAAAACCTAACACCAGAAATGGGATTT
TACACTAGCAAACAGTTTTTACCCCTGGTGTCCAGGGGCAGCAAAAGAGGCTTATGTAGC
TGTTCGTCGTTGCCGGTTCTGTCTGGTACTGTGTTGGTTCTTGGGGCGGGAGACACGGCC
TTCGACTGCGCCACATCAGCACTCCGCTGCGGCGCCAAGAAAGTGTTTGTCGTGTTCAGG
AAAGGCATCACTCACATAAGAGCTGTTCCCGAGGAGGTTGATCTGGCTAAGGAAGAAAAA
TGTGAGTTCATACCATTTATGTCACCGCGAGAAGTAATCGTCAGAAACGGAAAGATAACT
GCCTTAAAAATGTTCAGAACTGAACAATTAGATGATGGCGAATGGTTCGAAGACCCAGAT
CAAGTGTTGCAGCTGAAGGCCGACTTCATCATATCCGCGTTCGGATCTGGACTTTATGAT
GAAGACGTGAAACAGGCGATGGATGGCGTGAAATTAAATAGCTGGGGTCTTCCCGAAATA
GATAATACTTGTATGCAGAGTTCGAGCAATCCGAAGGTGTTTGTTGGCGGGGACTTGGCC
GGCGTCGCTGACACCACCGTGGAGTCCGTCAACGACGGAAAAACTGCCGCTTGGTACATG
CATTGCTACCTACAGGGCATACCATTCTCAGCGGCCATTGAGCTGCCGAAATTCCATACA
GACATCGATGAGGTGGATCTGTCAGTGGAGGTCTGCGGAATCCGGTTCGAGAATCCTTTC
GGCCTAGCGAGCGCTCCGCCCACCACCAGCTCCGCCATGATAAGGAGAGCCTTCACCCAG
GGCTGGGGCTTCGTGGTTACTAAGACCTTCGGACTTGATAAGGACATCGTTACGAACGTG
TCACCGCGTATAGTACGCGGTGTGACCTCGGGAGAGAACTACGGACCCGGCCAAGGATCC
TTCTTGAATATTGAGCTGATCTCCGAAAAATGTGAAGCTTACTGGTGTCAGAGTATCACG
GAGCTAAAACGGGACTTCCCCACTAAGGTGATAATAGCGTCTATAATGTGTTCATACAAC
GAGGACGACTGGACGGAGCTGGCGAGGAAAGCCGAGGCGAGCGGAGCGGACGCGCTGGAG
CTCAACCTGTCCTGCCCGCACGGCATGGGAGAGAGCGGGATGGGGCTAGCGTGCGGACAG
GACCCGGTACTGGTGAAAGGAATATCCCAGTGGGTGCGGAAGGCTATAAACATACCGTTT
TTCGTGAAGCTGACTCCCAACATCACAGACATAGTATCAATAGCGACGGCCGCTTATGAA
GGCGGCGCTAGTGGTGTGTCTGCCATCAACACCGTGTCCGGTCTGATGACAGTTAGGGCG
GATGCTACTCCGTGGCCTCAAGTCGGTCGCGAGAAAAGCACAACATACGGCGGTGTATCC
GGCAACGCGACGCGGCCGATGGGATTGCGAGCTGTGTCCGCCATCGCTAACAAGTTACCG
GGATTCCCCATACTCGGCATAGGAGGCATTGACTCCGCGGACTCAGCCTTGCAGTTCATG
CTGTGCGGCGCACCAGTCGTTCAGATCTGTAGTGCGGTTCAAAACCAGGACTTCACCGTT
GTTGAGGACTATGTGACAGGTCTTAAGGCGCTTTTATATTTACGATCGAGGGGTCTCCAA
GGCTGGACGGGACAATCGCCGCCCACCAACAAACATCAGAAAGGGAAACCGGTGCAGACG
ATATGTGATGAAAACGGCAAGGTGTTAGCACACTTCGGTCCTTATATTAAGAAACGCGAA
GGAGTACTTCACGAGCAACGTTTAAATACAAATATTTTGGCTGACAACACCACGGAATGT
CCACGGAGGAATGTCAACGGCCATAATAAGGTGCCAAGGATCAGAGACGTCGTCGGTGAA
GCGCTGAACAGGATCAGTTCATACAAAAAACTGGACAATACAAAACAAGTTGTAGCCTTG
ATAGACGATGATATGTGTATTAACTGTGGTAAATGTTACATGGCGTGCGCGGATTCAGGA
TACCAGGCTATTGAATTCGATGAACAAACGCACATACCTCGCGTCACTGAAGACTGTACC
GGATGTACTTTGTGCCTTTCCGTTTGTCCCATCATCGACTGTATATCAATGGTTCCCAAG
AAAATACCACACGTTATAAAACGGGGACTTCACTACGAGATACACCCTGTATCACCGTTG
GACGGAGTTTGTCAATAA

Protein sequence:

MSKVLLSRDLPDIENLLKLNPTVKPYTNLVPSAQTKKNKQHWKRNADRKCTSCPSLERNF
DDIKHTTLSERGALKEAARCLKCADAPCQKSCPTQIDVKSFITSIANKNYYGAAKEILSD
NPLGLTCGMVCPTSDLCVGGCNLHASEEGAINIGGLQHFAVETFMKMGISQTLDPKTKPL
PRGDKKIALIGGGPASLSCACFLARMGYKNITVFEKEKYLGGLSSSEIPQYRLPYDVVQF
EIDLIRDLGVKFVTGRKLSTSDITVNGLFKDGYDAVFLGIGLPEPKSIPIFENLTPEMGF
YTSKQFLPLVSRGSKRGLCSCSSLPVLSGTVLVLGAGDTAFDCATSALRCGAKKVFVVFR
KGITHIRAVPEEVDLAKEEKCEFIPFMSPREVIVRNGKITALKMFRTEQLDDGEWFEDPD
QVLQLKADFIISAFGSGLYDEDVKQAMDGVKLNSWGLPEIDNTCMQSSSNPKVFVGGDLA
GVADTTVESVNDGKTAAWYMHCYLQGIPFSAAIELPKFHTDIDEVDLSVEVCGIRFENPF
GLASAPPTTSSAMIRRAFTQGWGFVVTKTFGLDKDIVTNVSPRIVRGVTSGENYGPGQGS
FLNIELISEKCEAYWCQSITELKRDFPTKVIIASIMCSYNEDDWTELARKAEASGADALE
LNLSCPHGMGESGMGLACGQDPVLVKGISQWVRKAINIPFFVKLTPNITDIVSIATAAYE
GGASGVSAINTVSGLMTVRADATPWPQVGREKSTTYGGVSGNATRPMGLRAVSAIANKLP
GFPILGIGGIDSADSALQFMLCGAPVVQICSAVQNQDFTVVEDYVTGLKALLYLRSRGLQ
GWTGQSPPTNKHQKGKPVQTICDENGKVLAHFGPYIKKREGVLHEQRLNTNILADNTTEC
PRRNVNGHNKVPRIRDVVGEALNRISSYKKLDNTKQVVALIDDDMCINCGKCYMACADSG
YQAIEFDEQTHIPRVTEDCTGCTLCLSVCPIIDCISMVPKKIPHVIKRGLHYEIHPVSPL
DGVCQ
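
The fields of this record can be cross-checked against each other. A CDS of 3078 nt is 1026 codons, that is, 1025 residues plus the terminal TAA stop codon, which matches the length of the protein sequence above. The following is a minimal Python sketch of that check, assuming the standard genetic code. The `translate` helper and the variable names are illustrative and not part of the database record, and only the first and last codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


cds_length = 3078                        # "CDS Length" field of the record
expected_residues = cds_length // 3 - 1  # codons minus the stop codon

first_codons = "ATGTCGAAGGTATTACTGAGT"   # start of the nucleotide sequence
last_codons = "GACGGAGTTTGTCAATAA"       # final line of the nucleotide sequence

print(expected_residues)         # 1025
print(translate(first_codons))   # MSKVLLS
print(translate(last_codons))    # DGVCQ
```

Passing the full nucleotide sequence (with line breaks removed) to `translate` should reproduce the protein sequence listed above, under the same assumption about the genetic code.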