DPGLEAN14107 in OGS1.0

New model in OGS2.0  DPOGS215498
Genomic Position  scaffold2200:- 11179-25261
CDS Length  3297
Paired RNAseq reads  6410
Single RNAseq reads  15443
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA011222 (0.0)
Best Drosophila hit  CG11198, isoform D (0.0)
Best Human hit  acetyl-CoA carboxylase 1 isoform 1 (0.0)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC015613 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC015613 [Tribolium castaneum] (0.0)
Gene Ontology terms
GO:0003989 acetyl-CoA carboxylase activity
GO:0005524 ATP binding
GO:0006633 fatty acid biosynthetic process
GO:0009374 biotin binding
GO:0005811 lipid particle
InterPro families
IPR000089 Biotin/lipoyl attachment
IPR011761 ATP-grasp fold
IPR011764 Biotin carboxylation domain
IPR001882 Biotin-binding site
IPR005479 Carbamoyl-phosphate synthetase, large subunit, ATP-binding
IPR016185 PreATP-grasp-like fold
IPR011054 Rudiment single hybrid motif
IPR011053 Single hybrid motif
IPR005482 Biotin carboxylase, C-terminal
IPR013817 Pre-ATP-grasp fold
IPR013815 ATP-grasp fold, subdomain 1
IPR013816 ATP-grasp fold, subdomain 2
IPR013537 Acetyl-CoA carboxylase, central domain
IPR005481 Carbamoyl-phosphate synthase, large subunit, N-terminal
Orthology group  MCL10763
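
The Genomic Position field packs scaffold, strand and coordinates into one string. The following is a minimal sketch of how it could be split into parts, assuming that the character after the colon is the strand and that the coordinates are 1-based and inclusive; neither assumption is stated in the record, and the function name `parse_position` is hypothetical.

```python
import re

def parse_position(text):
    """Split a position string such as 'scaffold2200:- 11179-25261'.

    Assumptions (not confirmed by the record): the sign after the colon
    is the strand, and start/end are 1-based inclusive coordinates.
    """
    match = re.fullmatch(r"(\S+?):([+-])\s*(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    start, end = int(start), int(end)
    return {
        "scaffold": scaffold,
        "strand": strand,
        "start": start,
        "end": end,
        # Length of the genomic interval under the inclusive-coordinate assumption.
        "span": end - start + 1,
    }

position = parse_position("scaffold2200:- 11179-25261")
print(position["scaffold"], position["strand"], position["span"])
```

Under these assumptions the interval covers 14083 bp, considerably more than the CDS length of 3297, which would be consistent with a multi-exon gene model.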

Nucleotide sequence:

ATGTTGAAGAGGAGAAGCTCGAAGCGGTTCGTGCTTGGGGACACCTCGGTCGATGAAGGG
GAGTGGGCTGATGAGGCAACAGACGTGATGCCAGCGCACGTCGTGCAGAAGTTCCAGGTG
ACGATCAACTCGGAACCTGAGGGGAATGTGAAAAATGAAGCGCTGGAGGTGGATAAAAGG
CTACTCGGACCTCCCACGGCCAGCACCTTGCAGCCGTCGATGTCCCAGGGTACTGTGATC
CACAGTCAGCGTTTCCAAGAGAAGGATTTCACTGTTGCCACCCCTGAAGAATTTGTTCGA
CGTTTCCAAGGGACCAAGGCCATTAACAAGGTGCTGATAGCGAATAATGGCATTGGAGCT
GTGAAATGTATGCGTTCTATACGAAGGTGGTCCTATGAGATGTTCAAGAATGAGAGGGCT
GTGCGCTTCGTTGTTATGGTGACCCCCGAGGACCTGAAAGCGAATGCTGAATACATAAAG
ATGGCTGATCATTATGTCCCTGTGCCCGGAGGCTCAAACAACAACAACTATGCCAACGTC
GAACTCATAGTGGACATCGCTGTGCGGACCCAAGTACAGGCCGTATGGGCTGGTTGGGGT
CATGCTTCAGAGAATCCCAAGCTTCCGGAACTGCTCCATCGAGCTGAACTTAAAGCGGAG
TACAATACAAAGAAGATAAAGATATCATCGGAACTGTTCGCTAGAGGCTGTGTGTCTACA
CCGGAGCAAGGACTCCAAGCTGCTAACAAGATCGGTTTCCCCGTCATGATAAAGGCTTCT
GAGGGTGGAGGAGGTAAAGGTATCAGGAAGGTCGAGGTGCCTGATGACTTCAGCAACATG
TTCAGACAGGTCCAAGCGGAAGTGCCAGGTTCTCCGATATTCGTGATGAAGCTGGCGCGG
TCAGCTAGACATCTGGAAGTGCAGCTGTTGGCTGATCAGTACGGTAACGCTATATCTCTG
TTCGGGAGAGACTGTTCCATCCAGCGACGGCATCAGAAGATCATTGAGGAGGCACCGGCT
GCCGTCGCCAAGCCGGAAGTGTTCATTGAAATGGAGAAGGCGGCTGTCCGTCTCGCTAAG
ATGGTCGGCTACGTGAGTGCTGGCACCGTGGAGTATCTATATGAGCCGATAACGGGGGCT
TACTACTTCCTGGAGCTAAACCCTCGCCTGCAGGTGGAACACCCCTGTACGGAGATGGTA
GCGGACGTCAACCTACCAGCCGCGCAACTACAGATCGCCATGGGTCTCCCATTATACCAC
ATCAAGGACATCCGTCTCCTGTATGGGGAGTCGCCCTGGGGTATGACTCAGATAGACTTC
GACGAGCCCAAACAGAGACCCTCGCCCTGGGGACATGTCATCGCTGCCAGGATCACGTCG
GAGAATCCTGATGAAGGTTTCAAACCTTCCTCGGGTACGGTCCAAGAGCTCAACTTCCGG
TCGTCAAAGAACGTGTGGGGCTATTTCAGTGTGGCCGCATCCGGGGGCTTGCACGAGTTT
GCCGATTCACAGTTCGGGCACTGCTTCTCCTTCGGGGAGACCAGGGAACAGGCTAGAGAG
AACCTGGTGATAGCTCTCAAGGAGCTCAGCATCCGAGGAGATTTCAGAACCACGGTCGAA
TATCTGATCACATTGTTGGAGACGCCCGCCTTCCAGGACAACAACATTGATACTAGCTGG
CTGGACGCGCTCATCGCGGAGAGGGTCCAGTCGGAGAAGCCGGACGTGATGCTGGGCGTC
ATCTGCGGGTCCATCCTCATCGCCGACAACCTCATCACGGCACATCTGCTGGAGTTTAAG
AGCGCCCTGGAGAAGGGTCAGATCCAAGGGTCGAGTCAGCTGTCGAACACTGTAGAAGTG
GAGCTGATACACTCCGGTCACAAGTACAAAGTTCAAGCAACTAAGTCTGGCCCCACCTCG
TACTTCCTGGCTATGAACGGCAGCTTCAAGGAGCTTGAGGTTCATAAGCTAACTGACGGC
GGTACCTTACTGTCAATCGACGGAGCGTCATACACGACGTATTTAAAGGAGGAGGTTGAC
AAGTACAGGATAGTGATCGGCAACCAAACCGTGGTCTTCGAGAAGGAGAAGGATCCGTCC
AAGCTGAGAGCCCCCTCCGCTGGGAAGCTGGTCAACACGCTGGTTGAAGATGGAGGGCAT
GTGGACAAGGGTCAGCCTTACGCTGAGATAGAGGTGATGAAAATGGTGATGACTCTATCA
GCGCCAGAATCGGGCAAGGTCTTCTGGAATCTAAGACCTGGTGCTGTGCTGGATATGGGA
GCGTTGATGGGAACACTAGAGTTGGACGATCCGTCGCTAGTGACCACAGCCCAGCCCTAC
AAGGGCCAGTTCCCTCACGAGGACAACCCCAACCTGACGGAGAAACTAAACCACGCCCAC
AACAAATATAGATATATACTGGAGAACACACTTGCTGGTTACTGTCTGCCCGAGCCGTAC
AACACTCCTCGTCTCCGGGAGGTGGTGGAGAAATTCATGTCCAGTCTGAGAGACCCCTCG
CTGCCCTTACTGGAGTTACAGGAGGTGTTATCATCAACGTCCGGTCGTATCCCGGTCTCC
GTGGAGAAGAAGGTCCGCAAGCTGATGGCGCTCTACGAGAGCAACATCACGAGTGTGCTG
GCACAGTTCCCCAGCCAGCAGATAGCCAGCGTCATTGACCATCACGCGGCGTCCCTCGCG
AAGCGCGCTGACAGGGACGTGTTCTTCATGAGCACGCAGGCCTTGGTGGTGCTAGTGCAG
CGGTACAGGAACGGTATCCGCGGTAGGATGAAGGCGGCCGTTCATGATCTCTTGAAGCAG
TACTACCAGGTCGAGAGCAACTTCCAACTGGGTTCGTACGACAAATGTGTGATGGCGCTG
CGGGATCGTTACAAGGACGACATGCAGGCCGTGGCTAACATCATCTTCTCACACAACCAA
GTGGCTAAGAAGAATCTCCTGGTGACGATGTTGATAGACCACCTGTGGTCTAATGAACCA
GGTCTCACCGATGAACTGGCGGCCACCCTCAACGAACTGACCTCGTTACACCGAGCCGAA
CACAGTCGGGTAGCGCTGAGAGCCAGACAGGTGCTGATCGCCGCCCACCAGCCCGCCTAC
GAGTTGCGGCACAATCAAATGGAGTCTATCTTTTTGTCAGCGGTCGATATGTACGGACAC
GACTTCCATCCGGAGAACCTTCAGAAGCTGATCCTCTCGGAGACTTCGATATTCGACATC
CTTCACGATTTCTTCTATCACACGAACGCGGCGGTGAGTAGACCGAGTAGACCATAG

Protein sequence:

MLKRRSSKRFVLGDTSVDEGEWADEATDVMPAHVVQKFQVTINSEPEGNVKNEALEVDKR
LLGPPTASTLQPSMSQGTVIHSQRFQEKDFTVATPEEFVRRFQGTKAINKVLIANNGIGA
VKCMRSIRRWSYEMFKNERAVRFVVMVTPEDLKANAEYIKMADHYVPVPGGSNNNNYANV
ELIVDIAVRTQVQAVWAGWGHASENPKLPELLHRAELKAEYNTKKIKISSELFARGCVST
PEQGLQAANKIGFPVMIKASEGGGGKGIRKVEVPDDFSNMFRQVQAEVPGSPIFVMKLAR
SARHLEVQLLADQYGNAISLFGRDCSIQRRHQKIIEEAPAAVAKPEVFIEMEKAAVRLAK
MVGYVSAGTVEYLYEPITGAYYFLELNPRLQVEHPCTEMVADVNLPAAQLQIAMGLPLYH
IKDIRLLYGESPWGMTQIDFDEPKQRPSPWGHVIAARITSENPDEGFKPSSGTVQELNFR
SSKNVWGYFSVAASGGLHEFADSQFGHCFSFGETREQARENLVIALKELSIRGDFRTTVE
YLITLLETPAFQDNNIDTSWLDALIAERVQSEKPDVMLGVICGSILIADNLITAHLLEFK
SALEKGQIQGSSQLSNTVEVELIHSGHKYKVQATKSGPTSYFLAMNGSFKELEVHKLTDG
GTLLSIDGASYTTYLKEEVDKYRIVIGNQTVVFEKEKDPSKLRAPSAGKLVNTLVEDGGH
VDKGQPYAEIEVMKMVMTLSAPESGKVFWNLRPGAVLDMGALMGTLELDDPSLVTTAQPY
KGQFPHEDNPNLTEKLNHAHNKYRYILENTLAGYCLPEPYNTPRLREVVEKFMSSLRDPS
LPLLELQEVLSSTSGRIPVSVEKKVRKLMALYESNITSVLAQFPSQQIASVIDHHAASLA
KRADRDVFFMSTQALVVLVQRYRNGIRGRMKAAVHDLLKQYYQVESNFQLGSYDKCVMAL
RDRYKDDMQAVANIIFSHNQVAKKNLLVTMLIDHLWSNEPGLTDELAATLNELTSLHRAE
HSRVALRARQVLIAAHQPAYELRHNQMESIFLSAVDMYGHDFHPENLQKLILSETSIFDI
LHDFFYHTNAAVSRPSRP
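
The protein sequence can be checked against the nucleotide sequence by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1) and uses only the first and last lines of the nucleotide sequence above as input; `translate` is a hypothetical helper, not part of any database tooling. A CDS length of 3297 corresponds to 1099 codons, that is, 1098 residues plus the terminal stop codon.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First and last lines of the nucleotide sequence in this record.
first_line = "ATGTTGAAGAGGAGAAGCTCGAAGCGGTTCGTGCTTGGGGACACCTCGGTCGATGAAGGG"
last_line = "CTTCACGATTTCTTCTATCACACGAACGCGGCGGTGAGTAGACCGAGTAGACCATAG"

print(translate(first_line))  # MLKRRSSKRFVLGDTSVDEG
print(translate(last_line))   # LHDFFYHTNAAVSRPSRP*
```

Both fragments agree with the start and end of the protein sequence listed above; the trailing `*` is the TAG stop codon, which is not shown in the protein sequence.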