DPGLEAN17992 in OGS1.0

New model in OGS2.0  DPOGS203140
Genomic Position  scaffold1561:+ 14257-21865
CDS Length  1794
Paired RNAseq reads  522
Single RNAseq reads  1842
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA009187 (5e-37)
Best Drosophila hit  Mocs1, isoform C (1e-121)
Best Human hit  molybdenum cofactor biosynthesis protein 1 isoform 4 (1e-83)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC000610 [Tribolium castaneum] (6e-139)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC000610 [Tribolium castaneum] (8e-129)
GeneOntology terms
GO:0006777 Mo-molybdopterin cofactor biosynthetic process
GO:0019008 molybdopterin synthase complex
GO:0003824 catalytic activity
GO:0051539 4 iron, 4 sulfur cluster binding
GO:0046872 metal ion binding
InterPro families
IPR006638 Elongator protein 3/MiaB/NifB
IPR013483 Molybdenum cofactor biosynthesis protein A
IPR023045 Molybdenum cofactor biosynthesis C
IPR000385 MoaA/nifB/pqqE, iron-sulphur binding, conserved site
IPR013785 Aldolase-type TIM barrel
IPR002820 Molybdopterin cofactor biosynthesis C (MoaC) domain
IPR007197 Radical SAM
IPR010505 Molybdenum cofactor synthesis C-terminal
Orthology group  MCL14087

Nucleotide sequence:

ATGTTTTATCTTTTTTATCCCAAATTACTACAAAACACAAGTAGTTATCAAACATTTAGC
TTAAAACGTTTTATAAATTCTGATCTAAAACCTAACATAAAAGAAACTGCAAATTTATTA
CTTAGAAATGATGTTCCACCACTTGTTGACTTATATGGACGTAAACATGACTATTTACGA
ATCTCCCTAACTGAAAAATGCAATTTAAGGTGTCAATACTGCATGCCGGCCGAAGGTGTA
AAGCTTAGTCCTCGTGACAAAATATTGTCTAATGAAGAAGTTTTACGATTGGCAAGAGTG
TTTGCCGCTCTTGGTATAAATAAAATACGGTTAACTGGTGGCGAGCCTACATTGAGAAAA
GACCTTGTGAATATTGTCCAGGAATTGACCAACCTCCATGGGATAACGACAGTGGCAATG
ACAACTAATGGTATAGCCCTAACAAGGAAATTACCTTCGCTACAACGTGCTGGTCTCTCA
GCTCTGAATTTGTCTTTGGATTCGCTGAAACCAGAACGTTTTGAGCGCATGGCACGAAGA
CCGGGCTTACCCCATGTGCTGGCCAGCATGGATTTGGCGCTGCAGCTTGGCTTCAAATCC
GTTAAAATTAATACCGTGCTTATGAAAGGTTTTAATGATGACGAAATTTGCGATTTCATT
GAACTAACTCGCGACCGTGATATGGATATAAGGTTCATAGAGTTCATGCCTTTCTCTGGG
AACCGTTGGGAGAAAGGGGAACGTATGGTTAGTGAGAAAGATGCTATCTCTGCTGCAATC
GAGAGATACGGTGATCTGATACCGCTCCCCCCCACTCCGTGTCGCACTGCCACGCTGTGG
CAGGTGCCAGGTTATACTGGTCGGGTGGGTTTCATATCGTCCATGACTAAGCCGTTTTGT
TCAACTTGCAACCGTCTTCGACTCACAGCAGATGGTAATTTGAAGTGTGGATGCGAGAGT
CTTTATATTTTGCTGGTCCAAACAGACAAGCCCCTTAGGGAGACTAACCTCCGTGACGCC
ATACGAGCTGGTGTCAACGATGATGATATTGAGACCTTAGTCCGTAGCGCTCTTAGGAGG
AAGTTACCTCGACATGCTGCGGAGCCGCGGCGACTGCAGGCGCGGGCGTTCTGCACCAGT
GCCCCGCAGCAACCCTCCTCTCCTCCCCCCACCCGCCGCCGTCTGCCCCAGGAGACTGAA
GAGGACGGAGTCATGACGCACTTAGACAAGTCGGGACGTGCTCGCATGGTCGATGTAGGT
GAAAAACCGGTGACTGTTCGAACTGCGGAAGCGGAATGCTATCTTGTTGTTGGAGCTCGT
CTGCTACGTCTATTGCGCTCGTCTGGCGTACCGAAAGGAGACGCGCTGACTGTCGCTCAG
GTAGCTGGTACAATGGCGGCCAAGCGTACATCAGATCTGATACCCATGTGCCATCCGCTA
GCTTTGACCTTGGCTCGGGTCCGAGTGATGTTGCCAAAGGAGAGTGAGCGCGGGGGAGGT
GGCCGTGTACGAGTAACGTGCGAGGCGCGTGCAACAGCAAAAACTGGTGTGGAAATGGAA
GCGCTCACCGGATGTTCCATAGCGGCGCTAACTTTGTATGACATGTGCAAATCGGTGGAC
AAAAACATGCAGATTACTGATCTCAAGGTGATATCGAAGACTGGTGGCAAAAGTTCCTGG
GGTGATGTAGAAGAGAAAGAAGAGGAAGGTTTTGGACCTAAAGTACGCGAACATGATACG
TCGCCGCATGCACCTGGAGAGACATACGTCCCTACTAATTTGTTATACTTTTAG

Protein sequence:

MFYLFYPKLLQNTSSYQTFSLKRFINSDLKPNIKETANLLLRNDVPPLVDLYGRKHDYLR
ISLTEKCNLRCQYCMPAEGVKLSPRDKILSNEEVLRLARVFAALGINKIRLTGGEPTLRK
DLVNIVQELTNLHGITTVAMTTNGIALTRKLPSLQRAGLSALNLSLDSLKPERFERMARR
PGLPHVLASMDLALQLGFKSVKINTVLMKGFNDDEICDFIELTRDRDMDIRFIEFMPFSG
NRWEKGERMVSEKDAISAAIERYGDLIPLPPTPCRTATLWQVPGYTGRVGFISSMTKPFC
STCNRLRLTADGNLKCGCESLYILLVQTDKPLRETNLRDAIRAGVNDDDIETLVRSALRR
KLPRHAAEPRRLQARAFCTSAPQQPSSPPPTRRRLPQETEEDGVMTHLDKSGRARMVDVG
EKPVTVRTAEAECYLVVGARLLRLLRSSGVPKGDALTVAQVAGTMAAKRTSDLIPMCHPL
ALTLARVRVMLPKESERGGGGRVRVTCEARATAKTGVEMEALTGCSIAALTLYDMCKSVD
KNMQITDLKVISKTGGKSSWGDVEEKEEEGFGPKVREHDTSPHAPGETYVPTNLLYF
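
Consistency check (illustrative sketch):

The record lists a CDS length of 1794, which would correspond to 597 codons plus one stop codon. The Python sketch below shows one way such a record could be sanity-checked: translate the nucleotide sequence with the standard genetic code and compare it with the listed protein. The function names translate and check_cds are hypothetical helpers introduced here, not part of any database tooling, and the standard (nuclear) code table is assumed.

```python
from itertools import product

# Standard genetic code, codons ordered by first, second, third base over "TCAG".
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS string (whitespace and line breaks ignored) to protein.

    Stop codons are emitted as '*'. Hypothetical helper for illustration.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def check_cds(cds, protein, expected_length):
    """Return a dict of simple consistency checks for one gene record."""
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()
    translated = translate(cds)
    return {
        "length_matches": len(cds) == expected_length,
        "starts_with_atg": cds.startswith("ATG"),
        "ends_with_stop": translated.endswith("*"),
        # Compare after dropping the trailing stop symbol.
        "protein_matches": translated.rstrip("*") == protein,
    }


if __name__ == "__main__":
    # Toy input: the first six codons of the record followed by a stop codon.
    # Paste the full nucleotide and protein blocks to check the whole record.
    print(check_cds("ATGTTTTATCTTTTTTATTAG", "MFYLFY", 21))
```

To check the full record, pass the complete nucleotide block, the complete protein block, and 1794 as the expected length; line breaks in the pasted sequences are ignored.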