New model in OGS2.0 | DPOGS212359  |
---|---|
Genomic Position | scaffold101:+ 16776-18216 |
CDS Length | 1128 |
Paired RNAseq reads   | 463 |
Single RNAseq reads   | 1149 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA012009 (4e-83) |
Best Drosophila hit   | CG17221, isoform B (1e-23) |
Best Human hit | reticulon-4-interacting protein 1, mitochondrial precursor (5e-16) |
Best NR hit (blastp)   | AGAP009178-PA [Anopheles gambiae str. PEST] (3e-66) |
Best NR hit (blastx)   | zinc binding dehydrogenase [Culex quinquefasciatus] (1e-41) |
GeneOntology terms    | GO:0008270 zinc ion binding; GO:0016491 oxidoreductase activity; GO:0055114 oxidation reduction; GO:0016319 mushroom body development |
InterPro families    | IPR013154 Alcohol dehydrogenase GroES-like; IPR016040 NAD(P)-binding domain; IPR011032 GroES-like; IPR020843 Polyketide synthase, enoylreductase; IPR002085 Alcohol dehydrogenase superfamily, zinc-containing |
Orthology group | MCL14816 |
Nucleotide sequence:
ATGCGAGCCTGGCGGGTGCACGCCTACAGCGCCGGAACCGAGGAGTTGCGGCTGGAGAGC
GCGCGCGTGCCGCCGCTGAGGGCTCCCGATCAGCTGCTTGTGCGAGTCCACACCGCCTCC
ATCAACCCACTGGACGTGGCCATGCTCGGCGGGTACGGTTCTCGGATACTGAACACGCTG
CGGACGCTGGACGGCACCGACCTCGAGTTCCCGCTAGTGCCAGGGAGGGACTTCGCCGGC
GAAGTCGTCGCAGCCGGTGCGAGTTGCCGGCTGCGGGTCGGCGACCGCGTGTGGGGTGTG
GTCCCGCCGCACAGGCCGGGCTCGCATGCGGAGTACGTGACGGTGCGCGAGCGCTGGACC
GGCCTTGCCCCGCTTGCTCTGTCCGACGAGGAGGCAGGCGGGGCGCTGTACGCGGCTCTG
AGCGCGTGCGCGGCGCTCCGGGTTGGAGGCCTTCCGCCAGGGAGACGCGCCCGCCGTCCG
CCGCGCGTGTTATTACTGGGACTGGGCGGGGTCGGACACGTGGCCCTTCAGCTGCTCGTG
GACGCTGGCGCCGAGGTGATCGTTGGCTGCTCTGCGGACCTGTGTGAGCGCGCGACCTCG
CTCGGTGCCGCGGCGGCGCTCGATCGGTCGGCGGCTGACTACGACCGCCTCCTCGAGGAG
TCCGGCCCGTACGAGGTGATCGTGGACTGTGCGGGAGTGGGTGGCGCGGAGGCCGGTTCG
CGGCGCTGGAGGTTCTCCCGGTTCGTGACCCTGAGCTCGCCGCTGCTCCGGCTTACGGAC
GCCCGCGGGCTGGTGGGCGGGGGATGTGCGGCGGCGGCCCAGCTAGTCGCCGATGGCCTG
TCCGCGGCCCGGAGCGCGCCCGCACCGTCCTCCTGCCCGCCGCACGTCCGCTGGGCCTTC
TTCGCTCCGTCCTCGGACGACATCGAGACGCTCCGTCGCCTCGCGGAGAGAGGCAGGCTG
TCGGTGTGTGTGGAGCGCGTGTTCCCCTGGTGGGAGGGTGTGGCGGCGTACGAGCGCGCG
GCTCGTGGCGGGGCGCGAGGGAAGCTCGTGCTGGACTTCACGCGCTCGCCACCCCCCGCT
CTCGCCGCTCCCCCCGCCCCCGCCGACCGCACAGTGTCGTCGCGTTAG
Protein sequence:
MRAWRVHAYSAGTEELRLESARVPPLRAPDQLLVRVHTASINPLDVAMLGGYGSRILNTL
RTLDGTDLEFPLVPGRDFAGEVVAAGASCRLRVGDRVWGVVPPHRPGSHAEYVTVRERWT
GLAPLALSDEEAGGALYAALSACAALRVGGLPPGRRARRPPRVLLLGLGGVGHVALQLLV
DAGAEVIVGCSADLCERATSLGAAAALDRSAADYDRLLEESGPYEVIVDCAGVGGAEAGS
RRWRFSRFVTLSSPLLRLTDARGLVGGGCAAAAQLVADGLSAARSAPAPSSCPPHVRWAF
FAPSSDDIETLRRLAERGRLSVCVERVFPWWEGVAAYERAARGGARGKLVLDFTRSPPPA
LAAPPAPADRTVSSR
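
The CDS length and the two sequences in this record can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `translate`, `expected_protein_length`, `FIRST_60_NT` and `LAST_48_NT` are illustrative and not part of the record; the two sequence constants are copied from the first and last lines of the nucleotide sequence above.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count implied by a CDS length that includes the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# First and last lines of the nucleotide sequence in this record.
FIRST_60_NT = "ATGCGAGCCTGGCGGGTGCACGCCTACAGCGCCGGAACCGAGGAGTTGCGGCTGGAGAGC"
LAST_48_NT = "CTCGCCGCTCCCCCCGCCCCCGCCGACCGCACAGTGTCGTCGCGTTAG"

if __name__ == "__main__":
    print(translate(FIRST_60_NT))         # MRAWRVHAYSAGTEELRLES
    print(translate(LAST_48_NT))          # LAAPPAPADRTVSSR*
    print(expected_protein_length(1128))  # 375
```

The translated ends agree with the start and end of the protein sequence, and the listed CDS length of 1128 corresponds to the 375 residues shown plus a terminal TAG stop codon.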