New model in OGS2.0 | DPOGS203140  |
---|---|
Genomic Position | scaffold1561:+ 14257-21865 |
See gene structure | |
CDS Length | 1794 |
Paired RNAseq reads   | 522 |
Single RNAseq reads   | 1842 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA009187 (5e-37) |
Best Drosophila hit   | Mocs1, isoform C (1e-121) |
Best Human hit | molybdenum cofactor biosynthesis protein 1 isoform 4 (1e-83) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC000610 [Tribolium castaneum] (6e-139) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC000610 [Tribolium castaneum] (8e-129) |
GeneOntology terms    | GO:0006777 Mo-molybdopterin cofactor biosynthetic process GO:0019008 molybdopterin synthase complex GO:0003824 catalytic activity GO:0051539 4 iron, 4 sulfur cluster binding GO:0046872 metal ion binding |
InterPro families    | IPR006638 Elongator protein 3/MiaB/NifB IPR013483 Molybdenum cofactor biosynthesis protein A IPR023045 Molybdenum cofactor biosynthesis C IPR000385 MoaA/nifB/pqqE, iron-sulphur binding, conserved site IPR013785 Aldolase-type TIM barrel IPR002820 Molybdopterin cofactor biosynthesis C (MoaC) domain IPR007197 Radical SAM IPR010505 Molybdenum cofactor synthesis C-terminal |
Orthology group | MCL14087 |
Nucleotide sequence:
ATGTTTTATCTTTTTTATCCCAAATTACTACAAAACACAAGTAGTTATCAAACATTTAGC
TTAAAACGTTTTATAAATTCTGATCTAAAACCTAACATAAAAGAAACTGCAAATTTATTA
CTTAGAAATGATGTTCCACCACTTGTTGACTTATATGGACGTAAACATGACTATTTACGA
ATCTCCCTAACTGAAAAATGCAATTTAAGGTGTCAATACTGCATGCCGGCCGAAGGTGTA
AAGCTTAGTCCTCGTGACAAAATATTGTCTAATGAAGAAGTTTTACGATTGGCAAGAGTG
TTTGCCGCTCTTGGTATAAATAAAATACGGTTAACTGGTGGCGAGCCTACATTGAGAAAA
GACCTTGTGAATATTGTCCAGGAATTGACCAACCTCCATGGGATAACGACAGTGGCAATG
ACAACTAATGGTATAGCCCTAACAAGGAAATTACCTTCGCTACAACGTGCTGGTCTCTCA
GCTCTGAATTTGTCTTTGGATTCGCTGAAACCAGAACGTTTTGAGCGCATGGCACGAAGA
CCGGGCTTACCCCATGTGCTGGCCAGCATGGATTTGGCGCTGCAGCTTGGCTTCAAATCC
GTTAAAATTAATACCGTGCTTATGAAAGGTTTTAATGATGACGAAATTTGCGATTTCATT
GAACTAACTCGCGACCGTGATATGGATATAAGGTTCATAGAGTTCATGCCTTTCTCTGGG
AACCGTTGGGAGAAAGGGGAACGTATGGTTAGTGAGAAAGATGCTATCTCTGCTGCAATC
GAGAGATACGGTGATCTGATACCGCTCCCCCCCACTCCGTGTCGCACTGCCACGCTGTGG
CAGGTGCCAGGTTATACTGGTCGGGTGGGTTTCATATCGTCCATGACTAAGCCGTTTTGT
TCAACTTGCAACCGTCTTCGACTCACAGCAGATGGTAATTTGAAGTGTGGATGCGAGAGT
CTTTATATTTTGCTGGTCCAAACAGACAAGCCCCTTAGGGAGACTAACCTCCGTGACGCC
ATACGAGCTGGTGTCAACGATGATGATATTGAGACCTTAGTCCGTAGCGCTCTTAGGAGG
AAGTTACCTCGACATGCTGCGGAGCCGCGGCGACTGCAGGCGCGGGCGTTCTGCACCAGT
GCCCCGCAGCAACCCTCCTCTCCTCCCCCCACCCGCCGCCGTCTGCCCCAGGAGACTGAA
GAGGACGGAGTCATGACGCACTTAGACAAGTCGGGACGTGCTCGCATGGTCGATGTAGGT
GAAAAACCGGTGACTGTTCGAACTGCGGAAGCGGAATGCTATCTTGTTGTTGGAGCTCGT
CTGCTACGTCTATTGCGCTCGTCTGGCGTACCGAAAGGAGACGCGCTGACTGTCGCTCAG
GTAGCTGGTACAATGGCGGCCAAGCGTACATCAGATCTGATACCCATGTGCCATCCGCTA
GCTTTGACCTTGGCTCGGGTCCGAGTGATGTTGCCAAAGGAGAGTGAGCGCGGGGGAGGT
GGCCGTGTACGAGTAACGTGCGAGGCGCGTGCAACAGCAAAAACTGGTGTGGAAATGGAA
GCGCTCACCGGATGTTCCATAGCGGCGCTAACTTTGTATGACATGTGCAAATCGGTGGAC
AAAAACATGCAGATTACTGATCTCAAGGTGATATCGAAGACTGGTGGCAAAAGTTCCTGG
GGTGATGTAGAAGAGAAAGAAGAGGAAGGTTTTGGACCTAAAGTACGCGAACATGATACG
TCGCCGCATGCACCTGGAGAGACATACGTCCCTACTAATTTGTTATACTTTTAG
Protein sequence:
MFYLFYPKLLQNTSSYQTFSLKRFINSDLKPNIKETANLLLRNDVPPLVDLYGRKHDYLR
ISLTEKCNLRCQYCMPAEGVKLSPRDKILSNEEVLRLARVFAALGINKIRLTGGEPTLRK
DLVNIVQELTNLHGITTVAMTTNGIALTRKLPSLQRAGLSALNLSLDSLKPERFERMARR
PGLPHVLASMDLALQLGFKSVKINTVLMKGFNDDEICDFIELTRDRDMDIRFIEFMPFSG
NRWEKGERMVSEKDAISAAIERYGDLIPLPPTPCRTATLWQVPGYTGRVGFISSMTKPFC
STCNRLRLTADGNLKCGCESLYILLVQTDKPLRETNLRDAIRAGVNDDDIETLVRSALRR
KLPRHAAEPRRLQARAFCTSAPQQPSSPPPTRRRLPQETEEDGVMTHLDKSGRARMVDVG
EKPVTVRTAEAECYLVVGARLLRLLRSSGVPKGDALTVAQVAGTMAAKRTSDLIPMCHPL
ALTLARVRVMLPKESERGGGGRVRVTCEARATAKTGVEMEALTGCSIAALTLYDMCKSVD
KNMQITDLKVISKTGGKSSWGDVEEKEEEGFGPKVREHDTSPHAPGETYVPTNLLYF