New model in OGS2.0 | DPOGS206511  |
---|---|
Genomic Position | scaffold1256:- 17280-22547 |
See gene structure | |
CDS Length | 2328 |
Paired RNAseq reads   | 9094 |
Single RNAseq reads   | 23116 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012753 (9e-18) |
Best Drosophila hit   | glycoprotein 93 (0.0) |
Best Human hit | endoplasmin precursor (0.0) |
Best NR hit (blastp)   | GJ10398 [Drosophila virilis] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to endoplasmin [Acyrthosiphon pisum] (0.0) |
GeneOntology terms    | GO:0006457 protein folding GO:0051082 unfolded protein binding GO:0006950 response to stress GO:0005524 ATP binding GO:0005811 lipid particle GO:0061031 endodermal digestive tract morphogenesis GO:0007494 midgut development |
InterPro families    | IPR015566 Molecular chaperone, heat shock protein, endoplasmin IPR001404 Heat shock protein Hsp90 IPR020575 Heat shock protein Hsp90, N-terminal IPR003594 ATPase-like, ATP-binding domain IPR020568 Ribosomal protein S5 domain 2-type fold IPR019805 Heat shock protein Hsp90, conserved site |
Orthology group | MCL12618 |
Nucleotide sequence:
ATGAAATACTCGTTCCTGTTAGCGCTAGGCGTCTTGCTCCTTTCAGGATGCATCCAGGCG
CAGGAAGCCGCGCCCAGCGTGGAGGAGGTGACGGTGGACGCTGACCTCGGCGCCTCGAGG
GAGGCATCGCGCACCGATGCGGAGGCTGTGCTGCGTGAGGAGGAGGCCATCTCCCCCGAC
ACGCTGAGTGTGGCTCAGCAGCGAGAGATGCATAAAAACGCACAGAACTATACCTTCCAA
ACGGAGGTGAACCGCATGATGAAGCTGATCATCAACTCGTTATACAGAAACAAGGAGATC
TTCCTCCGCGAGCTGATCTCCAACGGGTCGGATGCGCTGGACAAGATCCGACTGCTGTCT
CTTACACAGCGCGAGGTCTTGGACGTCAACCCTGATTTGAGCGTACGCATCAAGGCGGAG
CCAGACAAGCGACTCCTGCACATCATCGACTCGGGGGTGGGCATGACCAAGAACGACCTT
ATCACCAACCTCGGCACCATCGCCAAGTCGGGGACGGCAGACTTCCTGTCCAAGATGCAG
GATGTGGAGAAAGGCGGCGCGCAAGAGATGAACGACATGATCGGTCAGTTCGGAGTGGGC
TTCTACTCCGCTTTCCTGGTGGCGGACAAGGTCACGGTCGTCTCCAAACACGCGGACGAC
GACCAGCACGTGTGGGAGTCGGACGCCAACTCATTCAGCGTGGCGCTCGACCCGCGCGGG
AACACTCTGAAGAGAGGCACACACATCACCCTGCACATGAAGGAGGAGGCTGCGGACTAC
CTTCAGCCGGACACCATCCGAGCGCTCGTCAAGAAGTACTCGCAGTTCATCAACTTCCCC
ATACACCTGTGGGCCTCGCGGACCGAGACGGTCGAGGAGCCCGTGGACCAGGACGCCGAC
GCCGCGGACGACGCGGACGAGGACGCTAAGGTGGAGTCCGAGGACAAAGCCGACACCAAG
AAGACCGAGAAGACCATCTGGGACTGGGAGATCATGAACGACAACAAGCCCATCTGGACC
AGGAAGCCGGCTGAGGTGCTCGACGATGAGTACACGCAGTTCTACAAGAGTCTCACCAAG
GACACGGCGGCACCGCTCGCCAAAGCCCACTTCGTGGCGGAGGGCGAGGTGACGTTCCGC
GCGCTGTTGTTTGTTCCCCGCGTGCAGCCCGCGGAGTCCTTCAACCGCTACGGCACCAAG
ACGGATCACATCAAGCTGTACGTGCGCCGCGTCTTCATCACCGACGAGTTCAACGACCTC
ATGCCCAACTACCTCGCCTTCATACAGGGCATCGTGGACTCGGACGACCTGCCGCTGAAC
GTGAGCCGGGAGACCCTCCAGCAACATAAACTCATAAAGATCATCAAGAAGAAGCTCGTG
CGGAAAGCTCTCGACATGCTCAAGAAGATCCCCGACGACGAGTACGAGCACTTCTGGAAG
GAATACTCTACCAACATCAAGCTCGGCGTGATGGAGGACCCGTCCAACCGTTCGCGCCTG
GCCAAGCTGCTGCGGTTCCACTCGTCGCGCGGCTCCGACATGACCTTCCTGAGCGACTAC
GTGTCCCGCATGAAGCAGGGACAGTCCCACATCTACTACATCGCGGGCGCCAGTAGGGCC
GAGGTGGAGCGCTCGCCTTTCGCCGAGCGTCTGGTCCGCGCCGGCTACGAAGTGCTCTAC
CTCACGGAGGCCGTGGACGAGTACTGCCTGTCGTCTCTGCCCGAGTACGACGGGAAAAAG
TTCCAGAACATCGCCAAGGAGATCTTCGACCTCGACGAGGACGACCGGCAAAAGGAGCAG
CTGGAGGCGTACAAGAAGGAGTTCGAACCGCTCACCAAGTGGCTCGGGGACAAGCTGTCG
GCCTGGATCACCCGCGCGCAGGTGTCGCGGCGCCTGGCGCGCTCGCCCGCCGCACTAGCC
GCCACCGCCTTCGGATGGACAGGCAACATGGAGAGGCTGGCCATGTCCAACGCTCATCAG
AAGGCGGACGACGCGCAGCGGCGCCACCATCTCACCCAGAAGAAGACGCTGGAGATCAAC
CCGCGCCACCCCGTCGTGCGCGAGCTGCTGCGGCGCGTGAGGGACGACCCCGACGACCCG
CTCGCCCTGGACGCGGCGCGCACCATGCACCGCACGGCGGCGCTGCGCTCGGGCTACATG
CTGCAGGAGGGCCAGGCCGGGGAGTTCGCCGACCAGGTGGACGACATGCTGCAGCGGGCG
CTGGGCCTGGCGCCCTCCGCCGCGCTCGAAGACGACGACCTGGAGCCCGAGGCCGCCGCC
GACGACCACGAGCTGGACGCCGAAGACGACGCACACGAGGAGCTGTAA
Protein sequence:
MKYSFLLALGVLLLSGCIQAQEAAPSVEEVTVDADLGASREASRTDAEAVLREEEAISPD
TLSVAQQREMHKNAQNYTFQTEVNRMMKLIINSLYRNKEIFLRELISNGSDALDKIRLLS
LTQREVLDVNPDLSVRIKAEPDKRLLHIIDSGVGMTKNDLITNLGTIAKSGTADFLSKMQ
DVEKGGAQEMNDMIGQFGVGFYSAFLVADKVTVVSKHADDDQHVWESDANSFSVALDPRG
NTLKRGTHITLHMKEEAADYLQPDTIRALVKKYSQFINFPIHLWASRTETVEEPVDQDAD
AADDADEDAKVESEDKADTKKTEKTIWDWEIMNDNKPIWTRKPAEVLDDEYTQFYKSLTK
DTAAPLAKAHFVAEGEVTFRALLFVPRVQPAESFNRYGTKTDHIKLYVRRVFITDEFNDL
MPNYLAFIQGIVDSDDLPLNVSRETLQQHKLIKIIKKKLVRKALDMLKKIPDDEYEHFWK
EYSTNIKLGVMEDPSNRSRLAKLLRFHSSRGSDMTFLSDYVSRMKQGQSHIYYIAGASRA
EVERSPFAERLVRAGYEVLYLTEAVDEYCLSSLPEYDGKKFQNIAKEIFDLDEDDRQKEQ
LEAYKKEFEPLTKWLGDKLSAWITRAQVSRRLARSPAALAATAFGWTGNMERLAMSNAHQ
KADDAQRRHHLTQKKTLEINPRHPVVRELLRRVRDDPDDPLALDAARTMHRTAALRSGYM
LQEGQAGEFADQVDDMLQRALGLAPSAALEDDDLEPEAAADDHELDAEDDAHEEL