New model in OGS2.0 | DPOGS206130  |
---|---|
Genomic Position | scaffold4:+ 210924-221185 |
CDS Length | 1920 |
Paired RNAseq reads   | 1289 |
Single RNAseq reads   | 3100 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA000721 (0.0) |
Best Drosophila hit   | acetyl coenzyme A synthase, isoform A (0.0) |
Best Human hit | acetyl-coenzyme A synthetase, cytoplasmic isoform 1 (0.0) |
Best NR hit (blastp)   | AGAP006569-PA [Anopheles gambiae str. PEST] (0.0) |
Best NR hit (blastx)   | AGAP006569-PA [Anopheles gambiae str. PEST] (0.0) |
GeneOntology terms    | GO:0003987 acetate-CoA ligase activity; GO:0008152 metabolic process; GO:0005737 cytoplasm; GO:0016208 AMP binding |
InterPro families    | IPR020845 AMP-binding, conserved site; IPR011904 Acetate-CoA ligase; IPR000873 AMP-dependent synthetase/ligase |
Orthology group | MCL12992 |
Nucleotide sequence:
ATGCATAAAAAATCCCTTGAAGACCCAGAGGCGTTTTGGTCGGAAATAGCCAAAGAATTT
CACTGGCAAACCCCCTGCCAGCCGGGAAAATTTCTTTCGTACAACTTCGATATCGACAAG
GGGGATATTTTTGTTAAATGGATGGAAGGGGCTACGACAAATGTTTGCTTTAACGTCCTA
GATCGTAACATAAGAAATGGTCATGGGGATAAAATCGCTTATTACTGGGAAGGTAACCAT
CCTGATGACTACAGTCGAATCACATACAAGAAGCTATTGGATTCTGTTTGTATGTTTGCG
AACGCTCTACGAGAGCTGGGGGTTCGCAAAGGAGACAGAGTAGCCATCTACATGCCCATG
ATTATGGAGACGGTGATTTGTATGTTGGGCTGTGCAAGAATCGGCGCAGTACATTCTGTC
GTATTCGCTGGGTTTTCCTCGGATTCACTAGCGGAGCGAATGTCCGACTGCAAGGCGAAG
GTCATTGTGACCTCTGACGGGGCGTGGAGAGGAGAAAAAAAGTTGTTCTTGAAGAATACT
TGCGACGAAGCTATCGAGAAAGCTAGAACTAAGCACAACCATGAAGTCAACTTGTGCATC
GTCGTATCCCATTTGGGGAGAGTGAAGCCGGGTGCAAGAATGAATGTTTTAAAAAAACCG
TACACTTGGAATGACAACGTGGATATATGGTGGCACGAGATCATGGAAGGTCAATCACCC
ATCTGCGCTCCCGAGTGGATGAATGCTGAGGACCCCTTGTTCATGCTATACACTAGCGGT
TCCACGGGCAAGCCGAAGGGCGTTCTACACACCATCGCTGGTTACATGCTCTACGCGGCG
ACAACCTTCCGATATGTATTCGATTATCGCGAGAAGGACATCTATTGGTGCACCGCTGAC
GTAGGCTGGATCACGGGGCACACTTACGTCGTGTACGCGCCCCTTGCGAATGCCGCTACG
TCGCTTATGTTCGAAGGTACACCTTTCTACCCAGATAACGATCGCTACTGGTTGTTGGTT
AAGAAGTACAAGGTTACTCAATTCTACACAGCACCCACCGCCATTAGAGCTCTCATGAAA
TTTGGCGACGAGCTCGTCACCAAGAACAATTTAAAAACTTTGCGTGTGTTGGGGAGCGTT
GGGGAGCCTATCAACCCGGAAGCGTGGTTGTGGTTCTACAACCTAGTTGGTAATAAACGT
TGCTCCATCGTGGATACTTTCTGGCAGACTGAAACCGGTGGCCACGTACTCACCGGCCTG
CCAGGCGCCTCGCCTATGAAGCCTGGAGCTGCTGGGTTTCCATTCTTCGGCGTGGAACCG
ACACTGCTTGACGAAAGCGGCAAAGTGATCGAAGGGCCCGGCGAGGGCTACCTGGTCTTC
TCGCGACCATGGCCAGGCATCATGAGGACCCTCTTCGGTGACCACGCTCGTTACCAAAAG
GTCTACTTCTCTAAATTCAAAGGATATTATTGCACAGGCGATGGCGCCAGGCGTGACGAG
GATGGGTTCCTGTGGGTGACGGGACGTATCGATGACATGCTGAATGTGTCCGGTCATCTG
CTGTCTACTTCCGAGGTGGAAGGTGTCCTCACTGAAGAGCCCTCCGTGTCCGAAGCTGCT
GTAGTCTCCAAGCCACATCCCGTCAAAGGCGAGTCTCTGTACTGCTTCGTCATCCTCAAC
GAGGGCGTCCAGTTCGGCCCCGAATTGGTGGACGCTCTGAAGAAACGCGTGAGGAATAGG
ATCGGAGCCTTTGCAGCTCCGGATGTCATTCAATACGCTCCCGGTCTGCCGAAAACCAGG
TCCGGGAAGATCATGAGGAGAATCCTAAGGAAGATAGCGCTCGGTGACACCGACATCGGC
GACACGTCGACTTTGGCCGACCCGTCCGTCGTCGACGAGCTCTTTAAATGCAGACCCTAG
Protein sequence:
MHKKSLEDPEAFWSEIAKEFHWQTPCQPGKFLSYNFDIDKGDIFVKWMEGATTNVCFNVL
DRNIRNGHGDKIAYYWEGNHPDDYSRITYKKLLDSVCMFANALRELGVRKGDRVAIYMPM
IMETVICMLGCARIGAVHSVVFAGFSSDSLAERMSDCKAKVIVTSDGAWRGEKKLFLKNT
CDEAIEKARTKHNHEVNLCIVVSHLGRVKPGARMNVLKKPYTWNDNVDIWWHEIMEGQSP
ICAPEWMNAEDPLFMLYTSGSTGKPKGVLHTIAGYMLYAATTFRYVFDYREKDIYWCTAD
VGWITGHTYVVYAPLANAATSLMFEGTPFYPDNDRYWLLVKKYKVTQFYTAPTAIRALMK
FGDELVTKNNLKTLRVLGSVGEPINPEAWLWFYNLVGNKRCSIVDTFWQTETGGHVLTGL
PGASPMKPGAAGFPFFGVEPTLLDESGKVIEGPGEGYLVFSRPWPGIMRTLFGDHARYQK
VYFSKFKGYYCTGDGARRDEDGFLWVTGRIDDMLNVSGHLLSTSEVEGVLTEEPSVSEAA
VVSKPHPVKGESLYCFVILNEGVQFGPELVDALKKRVRNRIGAFAAPDVIQYAPGLPKTR
SGKIMRRILRKIALGDTDIGDTSTLADPSVVDELFKCRP
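
Consistency check (illustrative sketch, not part of the original record): the CDS length of 1920 nt corresponds to 640 codons, that is, 639 residues plus the terminal stop codon, which matches the protein sequence above. The Python snippet below shows one way to reproduce that check and to translate a stretch of the nucleotide sequence. It assumes the standard genetic code (NCBI translation table 1); the names `translate` and `expected_protein_length` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order. Using this table is an assumption of the sketch.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS that ends with a single stop codon."""
    return cds_length // 3 - 1


if __name__ == "__main__":
    # First 60 nt of the nucleotide sequence listed in this record.
    first_line = "ATGCATAAAAAATCCCTTGAAGACCCAGAGGCGTTTTGGTCGGAAATAGCCAAAGAATTT"
    print(translate(first_line))          # MHKKSLEDPEAFWSEIAKEF
    print(expected_protein_length(1920))  # 639
```

To check the whole record, join the 60-nt lines of the nucleotide sequence into one string, pass it to `translate`, and compare the result (minus the trailing `*`) with the joined protein lines.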