New model in OGS2.0 | DPOGS210796  |
---|---|
Genomic Position | scaffold1772:- 14639-24217 |
CDS Length | 3078 |
Paired RNAseq reads   | 3219 |
Single RNAseq reads   | 7020 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA007111 (0.0) |
Best Drosophila hit   | suppressor of rudimentary, isoform C (0.0) |
Best Human hit | dihydropyrimidine dehydrogenase [NADP+] isoform 1 (0.0) |
Best NR hit (blastp)   | dihydropyrimidine dehydrogenase [Aedes aegypti] (0.0) |
Best NR hit (blastx)   | dihydropyrimidine dehydrogenase [Aedes aegypti] (0.0) |
GeneOntology terms    | GO:0004159 dihydrouracil dehydrogenase (NAD+) activity; GO:0017113 dihydropyrimidine dehydrogenase (NADP+) activity; GO:0009055 electron carrier activity; GO:0005737 cytoplasm; GO:0055114 oxidation reduction; GO:0051536 iron-sulfur cluster binding; GO:0004158 dihydroorotate oxidase activity; GO:0006222 UMP biosynthetic process; GO:0006207 'de novo' pyrimidine base biosynthetic process |
InterPro families    | IPR009051 Alpha-helical ferredoxin; IPR005720 Dihydroorotate dehydrogenase, class 1; IPR017900 4Fe-4S ferredoxin, iron-sulphur binding, conserved site; IPR001295 Dihydroorotate dehydrogenase, conserved site; IPR012135 Dihydroorotate dehydrogenase, class 1/2; IPR013027 FAD-dependent pyridine nucleotide-disulphide oxidoreductase; IPR012285 Fumarate reductase, C-terminal; IPR013785 Aldolase-type TIM barrel; IPR017896 4Fe-4S ferredoxin, iron-sulphur binding domain |
Orthology group | MCL12239 |
Nucleotide sequence:
ATGTCGAAGGTATTACTGAGTCGAGATCTGCCGGATATTGAGAACCTTCTTAAGCTTAAC
CCGACTGTTAAGCCATATACCAACCTGGTGCCATCAGCACAGACGAAGAAGAATAAACAG
CACTGGAAAAGGAATGCCGATAGAAAATGCACTAGTTGTCCAAGTCTCGAGAGGAATTTC
GATGACATCAAACACACGACATTGTCTGAGCGTGGTGCTTTGAAAGAGGCAGCTCGTTGT
CTGAAATGTGCTGACGCGCCCTGTCAGAAGTCCTGCCCGACGCAGATCGACGTCAAAAGC
TTTATAACAAGCATAGCTAATAAGAACTATTATGGCGCCGCCAAAGAAATACTCTCAGAC
AACCCCTTGGGTCTCACTTGTGGAATGGTTTGCCCCACCAGTGACTTGTGCGTCGGCGGA
TGTAACTTACACGCGAGCGAAGAGGGCGCCATTAATATCGGTGGATTACAGCATTTCGCT
GTTGAAACATTCATGAAAATGGGTATCTCCCAGACATTGGATCCAAAAACGAAACCCCTG
CCTCGAGGTGATAAAAAAATTGCTCTTATAGGCGGCGGACCGGCAAGCTTGAGTTGTGCT
TGTTTTTTGGCAAGGATGGGTTATAAGAATATTACGGTGTTTGAGAAAGAAAAATATCTT
GGAGGACTAAGTTCATCAGAGATCCCACAGTACCGCCTTCCGTATGACGTGGTTCAATTC
GAGATCGATCTCATCAGGGACCTTGGAGTGAAGTTCGTCACAGGAAGGAAACTTTCAACC
AGTGACATAACTGTTAATGGACTTTTTAAAGATGGTTATGATGCTGTGTTTCTTGGCATC
GGTCTGCCCGAGCCAAAGAGCATTCCTATCTTCGAAAACCTAACACCAGAAATGGGATTT
TACACTAGCAAACAGTTTTTACCCCTGGTGTCCAGGGGCAGCAAAAGAGGCTTATGTAGC
TGTTCGTCGTTGCCGGTTCTGTCTGGTACTGTGTTGGTTCTTGGGGCGGGAGACACGGCC
TTCGACTGCGCCACATCAGCACTCCGCTGCGGCGCCAAGAAAGTGTTTGTCGTGTTCAGG
AAAGGCATCACTCACATAAGAGCTGTTCCCGAGGAGGTTGATCTGGCTAAGGAAGAAAAA
TGTGAGTTCATACCATTTATGTCACCGCGAGAAGTAATCGTCAGAAACGGAAAGATAACT
GCCTTAAAAATGTTCAGAACTGAACAATTAGATGATGGCGAATGGTTCGAAGACCCAGAT
CAAGTGTTGCAGCTGAAGGCCGACTTCATCATATCCGCGTTCGGATCTGGACTTTATGAT
GAAGACGTGAAACAGGCGATGGATGGCGTGAAATTAAATAGCTGGGGTCTTCCCGAAATA
GATAATACTTGTATGCAGAGTTCGAGCAATCCGAAGGTGTTTGTTGGCGGGGACTTGGCC
GGCGTCGCTGACACCACCGTGGAGTCCGTCAACGACGGAAAAACTGCCGCTTGGTACATG
CATTGCTACCTACAGGGCATACCATTCTCAGCGGCCATTGAGCTGCCGAAATTCCATACA
GACATCGATGAGGTGGATCTGTCAGTGGAGGTCTGCGGAATCCGGTTCGAGAATCCTTTC
GGCCTAGCGAGCGCTCCGCCCACCACCAGCTCCGCCATGATAAGGAGAGCCTTCACCCAG
GGCTGGGGCTTCGTGGTTACTAAGACCTTCGGACTTGATAAGGACATCGTTACGAACGTG
TCACCGCGTATAGTACGCGGTGTGACCTCGGGAGAGAACTACGGACCCGGCCAAGGATCC
TTCTTGAATATTGAGCTGATCTCCGAAAAATGTGAAGCTTACTGGTGTCAGAGTATCACG
GAGCTAAAACGGGACTTCCCCACTAAGGTGATAATAGCGTCTATAATGTGTTCATACAAC
GAGGACGACTGGACGGAGCTGGCGAGGAAAGCCGAGGCGAGCGGAGCGGACGCGCTGGAG
CTCAACCTGTCCTGCCCGCACGGCATGGGAGAGAGCGGGATGGGGCTAGCGTGCGGACAG
GACCCGGTACTGGTGAAAGGAATATCCCAGTGGGTGCGGAAGGCTATAAACATACCGTTT
TTCGTGAAGCTGACTCCCAACATCACAGACATAGTATCAATAGCGACGGCCGCTTATGAA
GGCGGCGCTAGTGGTGTGTCTGCCATCAACACCGTGTCCGGTCTGATGACAGTTAGGGCG
GATGCTACTCCGTGGCCTCAAGTCGGTCGCGAGAAAAGCACAACATACGGCGGTGTATCC
GGCAACGCGACGCGGCCGATGGGATTGCGAGCTGTGTCCGCCATCGCTAACAAGTTACCG
GGATTCCCCATACTCGGCATAGGAGGCATTGACTCCGCGGACTCAGCCTTGCAGTTCATG
CTGTGCGGCGCACCAGTCGTTCAGATCTGTAGTGCGGTTCAAAACCAGGACTTCACCGTT
GTTGAGGACTATGTGACAGGTCTTAAGGCGCTTTTATATTTACGATCGAGGGGTCTCCAA
GGCTGGACGGGACAATCGCCGCCCACCAACAAACATCAGAAAGGGAAACCGGTGCAGACG
ATATGTGATGAAAACGGCAAGGTGTTAGCACACTTCGGTCCTTATATTAAGAAACGCGAA
GGAGTACTTCACGAGCAACGTTTAAATACAAATATTTTGGCTGACAACACCACGGAATGT
CCACGGAGGAATGTCAACGGCCATAATAAGGTGCCAAGGATCAGAGACGTCGTCGGTGAA
GCGCTGAACAGGATCAGTTCATACAAAAAACTGGACAATACAAAACAAGTTGTAGCCTTG
ATAGACGATGATATGTGTATTAACTGTGGTAAATGTTACATGGCGTGCGCGGATTCAGGA
TACCAGGCTATTGAATTCGATGAACAAACGCACATACCTCGCGTCACTGAAGACTGTACC
GGATGTACTTTGTGCCTTTCCGTTTGTCCCATCATCGACTGTATATCAATGGTTCCCAAG
AAAATACCACACGTTATAAAACGGGGACTTCACTACGAGATACACCCTGTATCACCGTTG
GACGGAGTTTGTCAATAA
Protein sequence:
MSKVLLSRDLPDIENLLKLNPTVKPYTNLVPSAQTKKNKQHWKRNADRKCTSCPSLERNF
DDIKHTTLSERGALKEAARCLKCADAPCQKSCPTQIDVKSFITSIANKNYYGAAKEILSD
NPLGLTCGMVCPTSDLCVGGCNLHASEEGAINIGGLQHFAVETFMKMGISQTLDPKTKPL
PRGDKKIALIGGGPASLSCACFLARMGYKNITVFEKEKYLGGLSSSEIPQYRLPYDVVQF
EIDLIRDLGVKFVTGRKLSTSDITVNGLFKDGYDAVFLGIGLPEPKSIPIFENLTPEMGF
YTSKQFLPLVSRGSKRGLCSCSSLPVLSGTVLVLGAGDTAFDCATSALRCGAKKVFVVFR
KGITHIRAVPEEVDLAKEEKCEFIPFMSPREVIVRNGKITALKMFRTEQLDDGEWFEDPD
QVLQLKADFIISAFGSGLYDEDVKQAMDGVKLNSWGLPEIDNTCMQSSSNPKVFVGGDLA
GVADTTVESVNDGKTAAWYMHCYLQGIPFSAAIELPKFHTDIDEVDLSVEVCGIRFENPF
GLASAPPTTSSAMIRRAFTQGWGFVVTKTFGLDKDIVTNVSPRIVRGVTSGENYGPGQGS
FLNIELISEKCEAYWCQSITELKRDFPTKVIIASIMCSYNEDDWTELARKAEASGADALE
LNLSCPHGMGESGMGLACGQDPVLVKGISQWVRKAINIPFFVKLTPNITDIVSIATAAYE
GGASGVSAINTVSGLMTVRADATPWPQVGREKSTTYGGVSGNATRPMGLRAVSAIANKLP
GFPILGIGGIDSADSALQFMLCGAPVVQICSAVQNQDFTVVEDYVTGLKALLYLRSRGLQ
GWTGQSPPTNKHQKGKPVQTICDENGKVLAHFGPYIKKREGVLHEQRLNTNILADNTTEC
PRRNVNGHNKVPRIRDVVGEALNRISSYKKLDNTKQVVALIDDDMCINCGKCYMACADSG
YQAIEFDEQTHIPRVTEDCTGCTLCLSVCPIIDCISMVPKKIPHVIKRGLHYEIHPVSPL
DGVCQ
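
The CDS length, the first and last codons, and the protein length in this record can be cross-checked against each other. The sketch below is only an illustration: the function name `cds_is_consistent` is hypothetical, the values are copied by hand from the record, and the protein length assumes every full protein line holds 60 residues (17 full lines plus the final `DGVCQ`).

```python
# Hand-copied values from the record above (not parsed automatically).
CDS_LENGTH_NT = 3078              # "CDS Length" field
FIRST_CODON = "ATG"               # first three bases of the nucleotide sequence
LAST_CODON = "TAA"                # last three bases of the nucleotide sequence
PROTEIN_LENGTH_AA = 17 * 60 + 5   # assumption: 17 full lines of 60 residues + "DGVCQ"

# Stop codons of the standard genetic code.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_is_consistent(cds_len, protein_len, first_codon, last_codon):
    """Return True if a CDS that includes its stop codon matches the protein length."""
    whole_codons = cds_len % 3 == 0
    # One codon per residue, plus one terminal stop codon.
    length_matches = cds_len // 3 == protein_len + 1
    starts_with_atg = first_codon == "ATG"
    ends_with_stop = last_codon in STOP_CODONS
    return whole_codons and length_matches and starts_with_atg and ends_with_stop


if __name__ == "__main__":
    print(cds_is_consistent(CDS_LENGTH_NT, PROTEIN_LENGTH_AA, FIRST_CODON, LAST_CODON))
```

Under these assumptions, 3078 nucleotides correspond to 1026 codons, that is 1025 residues plus one stop codon.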