New model in OGS2.0 | DPOGS215037  |
---|---|
Genomic Position | scaffold1244:+ 18636-20144 |
CDS Length | 1509 |
Paired RNAseq reads   | 507 |
Single RNAseq reads   | 1460 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA005665 (0.0) |
Best Drosophila hit   | trehalase, isoform F (7e-99) |
Best Human hit | trehalase precursor (2e-98) |
Best NR hit (blastp)   | putative Trehalase-1B [Heliconius melpomene] (9e-161) |
Best NR hit (blastx)   | putative Trehalase-1B [Heliconius melpomene] (2e-162) |
GeneOntology terms    | GO:0005886 plasma membrane; GO:0004555 alpha,alpha-trehalase activity |
InterPro families    | IPR008928 Six-hairpin glycosidase-like; IPR001661 Glycoside hydrolase, family 37; IPR018232 Glycoside hydrolase, family 37, conserved site |
Orthology group | MCL39983 |
Nucleotide sequence:
ATGAAAGAAACAAACAATAAACCTTCACGTGACCGTCTTATGAATTTCGTTGATCAAAAT
TTCGGAGAAGGCAATGAACTCGAGCATTGGATGCCGCCTGACTTCGATCCGAATCCACCA
ATCTTAGAACAAATATCTAATCCTAAGTTAAAAGCATTTGCTAAGAGTGTTATAGCGTTA
TGGGCCAAACTGGGACGAAAAGTACAATCGGATGTAATGCTAAACCCAGATCAGTATAGC
TTCTTGTACGTGCCGAACGGATTCATCGTACCGGGAGGTCGATTTAAGGAATTGTATTAC
TGGGACTCATTTTGGACGATACGCGGCCTCATGATCAGCAACATGACTAAAACAGCTAAA
GGTATGATAGAAAACCTGTTGCATTTAGTGAGGAAATATGGTTATATACCGAATGGGAGT
CGTGTGTATTACTTAGGGAGAAGTCAACCTCCTTTACTAGCAGCTATGGTAGCCAGCTAC
TACGAGGCAACAGGGGATCTAGCGTGGATAGAACAACATATTAGCACTGTGGAAAAAGAA
CTATACTATTGGTTAGACAAAAAGAAAGTGACCGTCGAAATCAATGGTAACAAGTTTATG
CTGCTGAGATATCTAGCGGATAAGAAAGATAGAGGTCCTCGTCCGGAATCATATTACGAA
GACTATACAAACTCCAGAATCTTCCCTAATGAAGACAGACGAGATGATTTTTATTTAGAA
ATCAAGAGCGCTGCCGAAAGTGGCTGGGACTTCTCATCACGCTGGTTCGTGTCAGCCGAC
GGTGCAATTGGAAACTTGACTGATGTTCACGCGACACGGATCCTACCCGTTGATTTGAAT
GCCATATTCGCTGGTGCCTTGCAAACCGTCGGCGATCTCCATGACATACTGAAACACAGG
AGAGAGGCGCAGAAATGGTGGAGCCTAGCCAGGTATTGGAGAAGCGCAATCGAGAATATC
ATGTATCATGAGGTAGATGGAGTGTGGTACGATTTTGACGCACAGACTGGCTCCCCTAGG
AAACATTTTTATCCCAGCTGCGCTACTCCGCTATGGGCTAAGGTTGTCAATGAAACGAAA
GCTGATAAATATGCTCTGCTGTTGGTTAATTATCTCAAATCAACCGGCGCCCTTAACTTC
CCGGGCGGCGTCCCATCCTCCATACTACATTCGGGTGAACAATGGGATTTTCCGAATGCA
TGGCCCCCGATGCAAAGTATTTTAATCGGAGCATTGGACACCAGTGGAAACGTAGAAGCA
CGGAAATTGGCGAAGGAGCTAGCTGGTGTATGGATACGATCAAATTACATAGGTTATAAC
AACTGGCAGAAAATGTTTGAGAAGTACAGTGCAGTGCATCCAGGGCACGAGGGTGGTGGT
GGGGAGTACGTGGTGCAGGACGGCTTCGGGTGGACCAATGGGGTTGTGTTAGAACTGTTA
CAGAGGTATGGAAAAGACCTCACGCTTCATGAACGACCGGGAGCCACGCCAACGGTTGCT
CTTATATAA
Protein sequence:
MKETNNKPSRDRLMNFVDQNFGEGNELEHWMPPDFDPNPPILEQISNPKLKAFAKSVIAL
WAKLGRKVQSDVMLNPDQYSFLYVPNGFIVPGGRFKELYYWDSFWTIRGLMISNMTKTAK
GMIENLLHLVRKYGYIPNGSRVYYLGRSQPPLLAAMVASYYEATGDLAWIEQHISTVEKE
LYYWLDKKKVTVEINGNKFMLLRYLADKKDRGPRPESYYEDYTNSRIFPNEDRRDDFYLE
IKSAAESGWDFSSRWFVSADGAIGNLTDVHATRILPVDLNAIFAGALQTVGDLHDILKHR
REAQKWWSLARYWRSAIENIMYHEVDGVWYDFDAQTGSPRKHFYPSCATPLWAKVVNETK
ADKYALLLVNYLKSTGALNFPGGVPSSILHSGEQWDFPNAWPPMQSILIGALDTSGNVEA
RKLAKELAGVWIRSNYIGYNNWQKMFEKYSAVHPGHEGGGGEYVVQDGFGWTNGVVLELL
QRYGKDLTLHERPGATPTVALI
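
Consistency check (illustrative sketch, not part of the original record): the fields above can be cross-checked against each other. The genomic span 18636-20144 covers 1509 bases, which matches the listed CDS length, and translating the nucleotide sequence with the standard genetic code should reproduce the protein sequence. The Python below assumes the standard nuclear code and uses a hypothetical helper named `translate`; only the first and last lines of the nucleotide sequence are pasted in for brevity.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the nuclear standard code applies to this gene model).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon.

    Hypothetical helper for illustration; whitespace and line breaks
    in the input are ignored.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon ends the translation
            break
        protein.append(residue)
    return "".join(protein)


# Genomic coordinates are inclusive, so the span length is end - start + 1.
span_length = 20144 - 18636 + 1

# First and last lines of the nucleotide sequence from this record.
first_line = "ATGAAAGAAACAAACAATAAACCTTCACGTGACCGTCTTATGAATTTCGTTGATCAAAAT"
last_line = "CTTATATAA"

print(span_length)            # 1509, equal to the listed CDS length
print(translate(first_line))  # MKETNNKPSRDRLMNFVDQN
print(translate(last_line))   # LI (the TAA stop codon is dropped)
```

Passing the full 1509-base sequence to `translate` should give a 502-residue protein, since 1509 / 3 = 503 codons and the last one is the stop codon.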