New model in OGS2.0 | DPOGS215498  |
---|---|
Genomic Position | scaffold2200:- 11179-25261 |
See gene structure | |
CDS Length | 3297 |
Paired RNAseq reads   | 6410 |
Single RNAseq reads   | 15443 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA011222 (0.0) |
Best Drosophila hit   | CG11198, isoform D (0.0) |
Best Human hit | acetyl-CoA carboxylase 1 isoform 1 (0.0) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC015613 [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC015613 [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0003989 acetyl-CoA carboxylase activity GO:0005524 ATP binding GO:0006633 fatty acid biosynthetic process GO:0009374 biotin binding GO:0005811 lipid particle |
InterPro families    | IPR000089 Biotin/lipoyl attachment IPR011761 ATP-grasp fold IPR011764 Biotin carboxylation domain IPR001882 Biotin-binding site IPR005479 Carbamoyl-phosphate synthetase, large subunit, ATP-binding IPR016185 PreATP-grasp-like fold IPR011054 Rudiment single hybrid motif IPR011053 Single hybrid motif IPR005482 Biotin carboxylase, C-terminal IPR013817 Pre-ATP-grasp fold IPR013815 ATP-grasp fold, subdomain 1 IPR013816 ATP-grasp fold, subdomain 2 IPR013537 Acetyl-CoA carboxylase, central domain IPR005481 Carbamoyl-phosphate synthase, large subunit, N-terminal |
Orthology group | MCL10763 |
Nucleotide sequence:
ATGTTGAAGAGGAGAAGCTCGAAGCGGTTCGTGCTTGGGGACACCTCGGTCGATGAAGGG
GAGTGGGCTGATGAGGCAACAGACGTGATGCCAGCGCACGTCGTGCAGAAGTTCCAGGTG
ACGATCAACTCGGAACCTGAGGGGAATGTGAAAAATGAAGCGCTGGAGGTGGATAAAAGG
CTACTCGGACCTCCCACGGCCAGCACCTTGCAGCCGTCGATGTCCCAGGGTACTGTGATC
CACAGTCAGCGTTTCCAAGAGAAGGATTTCACTGTTGCCACCCCTGAAGAATTTGTTCGA
CGTTTCCAAGGGACCAAGGCCATTAACAAGGTGCTGATAGCGAATAATGGCATTGGAGCT
GTGAAATGTATGCGTTCTATACGAAGGTGGTCCTATGAGATGTTCAAGAATGAGAGGGCT
GTGCGCTTCGTTGTTATGGTGACCCCCGAGGACCTGAAAGCGAATGCTGAATACATAAAG
ATGGCTGATCATTATGTCCCTGTGCCCGGAGGCTCAAACAACAACAACTATGCCAACGTC
GAACTCATAGTGGACATCGCTGTGCGGACCCAAGTACAGGCCGTATGGGCTGGTTGGGGT
CATGCTTCAGAGAATCCCAAGCTTCCGGAACTGCTCCATCGAGCTGAACTTAAAGCGGAG
TACAATACAAAGAAGATAAAGATATCATCGGAACTGTTCGCTAGAGGCTGTGTGTCTACA
CCGGAGCAAGGACTCCAAGCTGCTAACAAGATCGGTTTCCCCGTCATGATAAAGGCTTCT
GAGGGTGGAGGAGGTAAAGGTATCAGGAAGGTCGAGGTGCCTGATGACTTCAGCAACATG
TTCAGACAGGTCCAAGCGGAAGTGCCAGGTTCTCCGATATTCGTGATGAAGCTGGCGCGG
TCAGCTAGACATCTGGAAGTGCAGCTGTTGGCTGATCAGTACGGTAACGCTATATCTCTG
TTCGGGAGAGACTGTTCCATCCAGCGACGGCATCAGAAGATCATTGAGGAGGCACCGGCT
GCCGTCGCCAAGCCGGAAGTGTTCATTGAAATGGAGAAGGCGGCTGTCCGTCTCGCTAAG
ATGGTCGGCTACGTGAGTGCTGGCACCGTGGAGTATCTATATGAGCCGATAACGGGGGCT
TACTACTTCCTGGAGCTAAACCCTCGCCTGCAGGTGGAACACCCCTGTACGGAGATGGTA
GCGGACGTCAACCTACCAGCCGCGCAACTACAGATCGCCATGGGTCTCCCATTATACCAC
ATCAAGGACATCCGTCTCCTGTATGGGGAGTCGCCCTGGGGTATGACTCAGATAGACTTC
GACGAGCCCAAACAGAGACCCTCGCCCTGGGGACATGTCATCGCTGCCAGGATCACGTCG
GAGAATCCTGATGAAGGTTTCAAACCTTCCTCGGGTACGGTCCAAGAGCTCAACTTCCGG
TCGTCAAAGAACGTGTGGGGCTATTTCAGTGTGGCCGCATCCGGGGGCTTGCACGAGTTT
GCCGATTCACAGTTCGGGCACTGCTTCTCCTTCGGGGAGACCAGGGAACAGGCTAGAGAG
AACCTGGTGATAGCTCTCAAGGAGCTCAGCATCCGAGGAGATTTCAGAACCACGGTCGAA
TATCTGATCACATTGTTGGAGACGCCCGCCTTCCAGGACAACAACATTGATACTAGCTGG
CTGGACGCGCTCATCGCGGAGAGGGTCCAGTCGGAGAAGCCGGACGTGATGCTGGGCGTC
ATCTGCGGGTCCATCCTCATCGCCGACAACCTCATCACGGCACATCTGCTGGAGTTTAAG
AGCGCCCTGGAGAAGGGTCAGATCCAAGGGTCGAGTCAGCTGTCGAACACTGTAGAAGTG
GAGCTGATACACTCCGGTCACAAGTACAAAGTTCAAGCAACTAAGTCTGGCCCCACCTCG
TACTTCCTGGCTATGAACGGCAGCTTCAAGGAGCTTGAGGTTCATAAGCTAACTGACGGC
GGTACCTTACTGTCAATCGACGGAGCGTCATACACGACGTATTTAAAGGAGGAGGTTGAC
AAGTACAGGATAGTGATCGGCAACCAAACCGTGGTCTTCGAGAAGGAGAAGGATCCGTCC
AAGCTGAGAGCCCCCTCCGCTGGGAAGCTGGTCAACACGCTGGTTGAAGATGGAGGGCAT
GTGGACAAGGGTCAGCCTTACGCTGAGATAGAGGTGATGAAAATGGTGATGACTCTATCA
GCGCCAGAATCGGGCAAGGTCTTCTGGAATCTAAGACCTGGTGCTGTGCTGGATATGGGA
GCGTTGATGGGAACACTAGAGTTGGACGATCCGTCGCTAGTGACCACAGCCCAGCCCTAC
AAGGGCCAGTTCCCTCACGAGGACAACCCCAACCTGACGGAGAAACTAAACCACGCCCAC
AACAAATATAGATATATACTGGAGAACACACTTGCTGGTTACTGTCTGCCCGAGCCGTAC
AACACTCCTCGTCTCCGGGAGGTGGTGGAGAAATTCATGTCCAGTCTGAGAGACCCCTCG
CTGCCCTTACTGGAGTTACAGGAGGTGTTATCATCAACGTCCGGTCGTATCCCGGTCTCC
GTGGAGAAGAAGGTCCGCAAGCTGATGGCGCTCTACGAGAGCAACATCACGAGTGTGCTG
GCACAGTTCCCCAGCCAGCAGATAGCCAGCGTCATTGACCATCACGCGGCGTCCCTCGCG
AAGCGCGCTGACAGGGACGTGTTCTTCATGAGCACGCAGGCCTTGGTGGTGCTAGTGCAG
CGGTACAGGAACGGTATCCGCGGTAGGATGAAGGCGGCCGTTCATGATCTCTTGAAGCAG
TACTACCAGGTCGAGAGCAACTTCCAACTGGGTTCGTACGACAAATGTGTGATGGCGCTG
CGGGATCGTTACAAGGACGACATGCAGGCCGTGGCTAACATCATCTTCTCACACAACCAA
GTGGCTAAGAAGAATCTCCTGGTGACGATGTTGATAGACCACCTGTGGTCTAATGAACCA
GGTCTCACCGATGAACTGGCGGCCACCCTCAACGAACTGACCTCGTTACACCGAGCCGAA
CACAGTCGGGTAGCGCTGAGAGCCAGACAGGTGCTGATCGCCGCCCACCAGCCCGCCTAC
GAGTTGCGGCACAATCAAATGGAGTCTATCTTTTTGTCAGCGGTCGATATGTACGGACAC
GACTTCCATCCGGAGAACCTTCAGAAGCTGATCCTCTCGGAGACTTCGATATTCGACATC
CTTCACGATTTCTTCTATCACACGAACGCGGCGGTGAGTAGACCGAGTAGACCATAG
Protein sequence:
MLKRRSSKRFVLGDTSVDEGEWADEATDVMPAHVVQKFQVTINSEPEGNVKNEALEVDKR
LLGPPTASTLQPSMSQGTVIHSQRFQEKDFTVATPEEFVRRFQGTKAINKVLIANNGIGA
VKCMRSIRRWSYEMFKNERAVRFVVMVTPEDLKANAEYIKMADHYVPVPGGSNNNNYANV
ELIVDIAVRTQVQAVWAGWGHASENPKLPELLHRAELKAEYNTKKIKISSELFARGCVST
PEQGLQAANKIGFPVMIKASEGGGGKGIRKVEVPDDFSNMFRQVQAEVPGSPIFVMKLAR
SARHLEVQLLADQYGNAISLFGRDCSIQRRHQKIIEEAPAAVAKPEVFIEMEKAAVRLAK
MVGYVSAGTVEYLYEPITGAYYFLELNPRLQVEHPCTEMVADVNLPAAQLQIAMGLPLYH
IKDIRLLYGESPWGMTQIDFDEPKQRPSPWGHVIAARITSENPDEGFKPSSGTVQELNFR
SSKNVWGYFSVAASGGLHEFADSQFGHCFSFGETREQARENLVIALKELSIRGDFRTTVE
YLITLLETPAFQDNNIDTSWLDALIAERVQSEKPDVMLGVICGSILIADNLITAHLLEFK
SALEKGQIQGSSQLSNTVEVELIHSGHKYKVQATKSGPTSYFLAMNGSFKELEVHKLTDG
GTLLSIDGASYTTYLKEEVDKYRIVIGNQTVVFEKEKDPSKLRAPSAGKLVNTLVEDGGH
VDKGQPYAEIEVMKMVMTLSAPESGKVFWNLRPGAVLDMGALMGTLELDDPSLVTTAQPY
KGQFPHEDNPNLTEKLNHAHNKYRYILENTLAGYCLPEPYNTPRLREVVEKFMSSLRDPS
LPLLELQEVLSSTSGRIPVSVEKKVRKLMALYESNITSVLAQFPSQQIASVIDHHAASLA
KRADRDVFFMSTQALVVLVQRYRNGIRGRMKAAVHDLLKQYYQVESNFQLGSYDKCVMAL
RDRYKDDMQAVANIIFSHNQVAKKNLLVTMLIDHLWSNEPGLTDELAATLNELTSLHRAE
HSRVALRARQVLIAAHQPAYELRHNQMESIFLSAVDMYGHDFHPENLQKLILSETSIFDI
LHDFFYHTNAAVSRPSRP