New model in OGS2.0 | DPOGS202918  |
---|---|
Genomic Position | scaffold273:+ 201384-206634 |
CDS Length | 1992 |
Paired RNAseq reads   | 158 |
Single RNAseq reads   | 334 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA004155 (2e-20) |
Best Drosophila hit   | ND |
Best Human hit | ND |
Best NR hit (blastp)   | Aldehyde Dehydrogenase [Truepera radiovictrix DSM 17093] (4e-09) |
Best NR hit (blastx)   | TPA: endonuclease-reverse transcriptase [Schistosoma mansoni] (1e-12) |
GeneOntology terms   | ND |
InterPro families    | IPR016161 Aldehyde/histidinol dehydrogenase; IPR016163 Aldehyde dehydrogenase, C-terminal |
Orthology group | MCL22683 |
Nucleotide sequence:
ATGTTACGGGTCCCTTCGGAGCATGTGGCCGATCCACTTCCACTTGCGACGTTTGATTTG
CAGGTCAATCTGTATCTCACCGCAGCGTTCCCACAGATTCACGTTGGAGATTTTCTCGGG
CCAGTAAATACCGAGAAAACGGCGAAGACACCATTTGACGAAGACCTGAGGCCGATGCGA
GATGTCTTCAGTAACTTCCACGTCTCGCATCCCTATAACAGCACGGTATTGACATTGGAC
TGGAATATTTTGAGTTTGACTCTCCTGCGGATTGATAATGTGTTACGACAGAAAACAGAC
CGTGTTGCTGGGACTTTATCGGCGGTGATTGAGAGAAACATGGAGGGTATTAAGTTCGAC
GAAGCAAATGTTAGCACAAAACTAGCGGAGGATTATTTGAAGAAAAATCCAATATTTAAA
TCAAAAGGATTCGCCGACAGTGACAGATTCACGTTAAACGATCACAAAAACACAATATAC
TATACTAATTATACGAGAAACCTCGGGGCTGTGGTGTTGACTAAAGATGTAAGTGACCTT
CCATTAAAATTGATGAACCTGGCCAAGAAAATTGAGGAGAATGTCGAACTTTTCTGCCAA
TTGGAACTGTTACTGCGAAAGATCCCCTTAGAAGATACAGAGCGCATAGATTTGAAACTC
ATGACACAAAACTTTATGTATCACAGTTCGTTAAAAAGCTCCAAGGAAGCGACGAGCACA
GCTCGCCAGATTACGTTCGAGTCGGCCACGCCGCTGGTGGCGCTGGGTTTCATCAGCCAG
GTGCTGATGGCTCACATGCCGAAGGTCGTCATTGAATGCACCAAGGAAACAGCTCCCGTC
ACCACCCTCTTCATTGAACTTTGTCACCAAGTCGGATTGGAGGGAAAGGTACTTCTAGCG
ATTTTACCTGAAGTTATAGAAGGCGGTTCCTCCAAAGTGACGATGGATTTTACGAAAATG
GGTGAATTTCTGAACGGCTGCGTGGGAGTTGTGAGCGGGAAAAGTGACATCGACTCCGCC
GTTGAGTACTTCTTGGACGCGTCCAGTCGATATCCCTGGGCGTTGAGGAAGATTTTTGTT
CAAGAAAATGCTGTGGAAAGATTCACCACCACCATGACCTGGAAGGAGGAGAAGCAGATG
GAAGTAACATCAGCCGGGAAAAGCAAAAGCGTTACGGACCGCGAATTATCATACTTCTTT
GGGGAAAAAACGTTTCTGATGAAGCCGGGTCGAGATAGCACTGAACATGACAATAATGTT
GTCATCCTTGAGGCTTTTAGGACGGTGAAGGAACTGATAGGTCTGCTTGCGAACGAGAAG
CCGTTCGCGCTCTCCATCTGGTGCAGTGATATTTCAGAGACGAATGAGCTCGCCCACAAC
GTAGACGCCAGCATCGTGTGGGTCAACGACTTCGCCAACTTCGAAGGACCACCTCGCTCT
TCGCAAGCCTTCTTTTCCCTCATCGATATTTACTTCAGTTCTCAGCATATCGAACAATTT
CCTGAAATGGCTGAATTGACGAAACTCAAAGAATCGTGGTTGGAACTCAGCGTTGAGCAA
AGAAGAGCTGTCGTAAGAGATGCATTGGCAAAAATAGATAGGAACCAGTCCAAGAAAATG
TTAGATGTCCTTGATGATATGACGACGGAAATGGAAAGTTTCGTTTACCTCACCAAGAAT
ATGATAGCGACTGGCATTGAACTACAACCGCAAGCGATGATGCCGAGCGCGATGTACGAC
CACGGACTGGAATCGTCGATAATGTCTTACATTATAAAGGGTGGTGCCATCCTGCTACAC
ATACCTCCGCGTGATCTGCCAGTAAAAAGCAAAGATATTTCTTACATGTTCTATGATAAC
TTACGGAACATGGCCGGTCCCGTACTATTTCTAGAGAAAGAGTATAAAATCGGTGACGTG
TCTGTTATAAGGCATCCAATTAAAAGATATAAAGTCATTTGGACCAATTTCGGAACGATA
TTCGCTAATTAA
Protein sequence:
MLRVPSEHVADPLPLATFDLQVNLYLTAAFPQIHVGDFLGPVNTEKTAKTPFDEDLRPMR
DVFSNFHVSHPYNSTVLTLDWNILSLTLLRIDNVLRQKTDRVAGTLSAVIERNMEGIKFD
EANVSTKLAEDYLKKNPIFKSKGFADSDRFTLNDHKNTIYYTNYTRNLGAVVLTKDVSDL
PLKLMNLAKKIEENVELFCQLELLLRKIPLEDTERIDLKLMTQNFMYHSSLKSSKEATST
ARQITFESATPLVALGFISQVLMAHMPKVVIECTKETAPVTTLFIELCHQVGLEGKVLLA
ILPEVIEGGSSKVTMDFTKMGEFLNGCVGVVSGKSDIDSAVEYFLDASSRYPWALRKIFV
QENAVERFTTTMTWKEEKQMEVTSAGKSKSVTDRELSYFFGEKTFLMKPGRDSTEHDNNV
VILEAFRTVKELIGLLANEKPFALSIWCSDISETNELAHNVDASIVWVNDFANFEGPPRS
SQAFFSLIDIYFSSQHIEQFPEMAELTKLKESWLELSVEQRRAVVRDALAKIDRNQSKKM
LDVLDDMTTEMESFVYLTKNMIATGIELQPQAMMPSAMYDHGLESSIMSYIIKGGAILLH
IPPRDLPVKSKDISYMFYDNLRNMAGPVLFLEKEYKIGDVSVIRHPIKRYKVIWTNFGTI
FAN
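
Consistency check (illustrative): the listed CDS length and genomic coordinates can be cross-checked with simple arithmetic. The sketch below is not part of the original record. The helper names are hypothetical, and it assumes that the CDS length counts the terminal stop codon (the nucleotide sequence above ends in TAA) and that the genomic coordinates are 1-based and inclusive.

```python
# Values copied from the record above; helper names are hypothetical.
CDS_LENGTH = 1992
GENOMIC_START, GENOMIC_END = 201384, 206634


def expected_protein_length(cds_length: int, includes_stop: bool = True) -> int:
    """Return the number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons; if includes_stop is True,
    the final codon is treated as a stop codon and not counted as a residue.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    return codons - 1 if includes_stop else codons


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


protein_length = expected_protein_length(CDS_LENGTH)
genomic_span = span_length(GENOMIC_START, GENOMIC_END)
print(protein_length, genomic_span)
```

Under these assumptions the CDS corresponds to 663 residues, and the genomic interval spans 5251 bases. The interval being longer than the CDS would be consistent with the model containing introns, although the record itself does not list the exon structure.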