DPGLEAN10893 in OGS1.0

New model in OGS2.0DPOGS206130 
Genomic Positionscaffold4:+ 210924-221185
See gene structure
CDS Length1920
Paired RNAseq reads  1289
Single RNAseq reads  3100
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000721 (0.0)
Best Drosophila hit  acetyl coenzyme A synthase, isoform A (0.0)
Best Human hitacetyl-coenzyme A synthetase, cytoplasmic isoform 1 (0.0)
Best NR hit (blastp)  AGAP006569-PA [Anopheles gambiae str. PEST] (0.0)
Best NR hit (blastx)  AGAP006569-PA [Anopheles gambiae str. PEST] (0.0)
GeneOntology terms


  
GO:0003987 acetate-CoA ligase activity
GO:0008152 metabolic process
GO:0005737 cytoplasm
GO:0016208 AMP binding
InterPro families

  
IPR020845 AMP-binding, conserved site
IPR011904 Acetate-CoA ligase
IPR000873 AMP-dependent synthetase/ligase
Orthology groupMCL12992

Nucleotide sequence:

ATGCATAAAAAATCCCTTGAAGACCCAGAGGCGTTTTGGTCGGAAATAGCCAAAGAATTT
CACTGGCAAACCCCCTGCCAGCCGGGAAAATTTCTTTCGTACAACTTCGATATCGACAAG
GGGGATATTTTTGTTAAATGGATGGAAGGGGCTACGACAAATGTTTGCTTTAACGTCCTA
GATCGTAACATAAGAAATGGTCATGGGGATAAAATCGCTTATTACTGGGAAGGTAACCAT
CCTGATGACTACAGTCGAATCACATACAAGAAGCTATTGGATTCTGTTTGTATGTTTGCG
AACGCTCTACGAGAGCTGGGGGTTCGCAAAGGAGACAGAGTAGCCATCTACATGCCCATG
ATTATGGAGACGGTGATTTGTATGTTGGGCTGTGCAAGAATCGGCGCAGTACATTCTGTC
GTATTCGCTGGGTTTTCCTCGGATTCACTAGCGGAGCGAATGTCCGACTGCAAGGCGAAG
GTCATTGTGACCTCTGACGGGGCGTGGAGAGGAGAAAAAAAGTTGTTCTTGAAGAATACT
TGCGACGAAGCTATCGAGAAAGCTAGAACTAAGCACAACCATGAAGTCAACTTGTGCATC
GTCGTATCCCATTTGGGGAGAGTGAAGCCGGGTGCAAGAATGAATGTTTTAAAAAAACCG
TACACTTGGAATGACAACGTGGATATATGGTGGCACGAGATCATGGAAGGTCAATCACCC
ATCTGCGCTCCCGAGTGGATGAATGCTGAGGACCCCTTGTTCATGCTATACACTAGCGGT
TCCACGGGCAAGCCGAAGGGCGTTCTACACACCATCGCTGGTTACATGCTCTACGCGGCG
ACAACCTTCCGATATGTATTCGATTATCGCGAGAAGGACATCTATTGGTGCACCGCTGAC
GTAGGCTGGATCACGGGGCACACTTACGTCGTGTACGCGCCCCTTGCGAATGCCGCTACG
TCGCTTATGTTCGAAGGTACACCTTTCTACCCAGATAACGATCGCTACTGGTTGTTGGTT
AAGAAGTACAAGGTTACTCAATTCTACACAGCACCCACCGCCATTAGAGCTCTCATGAAA
TTTGGCGACGAGCTCGTCACCAAGAACAATTTAAAAACTTTGCGTGTGTTGGGGAGCGTT
GGGGAGCCTATCAACCCGGAAGCGTGGTTGTGGTTCTACAACCTAGTTGGTAATAAACGT
TGCTCCATCGTGGATACTTTCTGGCAGACTGAAACCGGTGGCCACGTACTCACCGGCCTG
CCAGGCGCCTCGCCTATGAAGCCTGGAGCTGCTGGGTTTCCATTCTTCGGCGTGGAACCG
ACACTGCTTGACGAAAGCGGCAAAGTGATCGAAGGGCCCGGCGAGGGCTACCTGGTCTTC
TCGCGACCATGGCCAGGCATCATGAGGACCCTCTTCGGTGACCACGCTCGTTACCAAAAG
GTCTACTTCTCTAAATTCAAAGGATATTATTGCACAGGCGATGGCGCCAGGCGTGACGAG
GATGGGTTCCTGTGGGTGACGGGACGTATCGATGACATGCTGAATGTGTCCGGTCATCTG
CTGTCTACTTCCGAGGTGGAAGGTGTCCTCACTGAAGAGCCCTCCGTGTCCGAAGCTGCT
GTAGTCTCCAAGCCACATCCCGTCAAAGGCGAGTCTCTGTACTGCTTCGTCATCCTCAAC
GAGGGCGTCCAGTTCGGCCCCGAATTGGTGGACGCTCTGAAGAAACGCGTGAGGAATAGG
ATCGGAGCCTTTGCAGCTCCGGATGTCATTCAATACGCTCCCGGTCTGCCGAAAACCAGG
TCCGGGAAGATCATGAGGAGAATCCTAAGGAAGATAGCGCTCGGTGACACCGACATCGGC
GACACGTCGACTTTGGCCGACCCGTCCGTCGTCGACGAGCTCTTTAAATGCAGACCCTAG

Protein sequence:

MHKKSLEDPEAFWSEIAKEFHWQTPCQPGKFLSYNFDIDKGDIFVKWMEGATTNVCFNVL
DRNIRNGHGDKIAYYWEGNHPDDYSRITYKKLLDSVCMFANALRELGVRKGDRVAIYMPM
IMETVICMLGCARIGAVHSVVFAGFSSDSLAERMSDCKAKVIVTSDGAWRGEKKLFLKNT
CDEAIEKARTKHNHEVNLCIVVSHLGRVKPGARMNVLKKPYTWNDNVDIWWHEIMEGQSP
ICAPEWMNAEDPLFMLYTSGSTGKPKGVLHTIAGYMLYAATTFRYVFDYREKDIYWCTAD
VGWITGHTYVVYAPLANAATSLMFEGTPFYPDNDRYWLLVKKYKVTQFYTAPTAIRALMK
FGDELVTKNNLKTLRVLGSVGEPINPEAWLWFYNLVGNKRCSIVDTFWQTETGGHVLTGL
PGASPMKPGAAGFPFFGVEPTLLDESGKVIEGPGEGYLVFSRPWPGIMRTLFGDHARYQK
VYFSKFKGYYCTGDGARRDEDGFLWVTGRIDDMLNVSGHLLSTSEVEGVLTEEPSVSEAA
VVSKPHPVKGESLYCFVILNEGVQFGPELVDALKKRVRNRIGAFAAPDVIQYAPGLPKTR
SGKIMRRILRKIALGDTDIGDTSTLADPSVVDELFKCRP