New model in OGS2.0 | DPOGS206327  |
---|---|
Genomic Position | scaffold1212:- 6860-18022 |
See gene structure | |
CDS Length | 1482 |
Paired RNAseq reads   | 3246 |
Single RNAseq reads   | 10652 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005270 (1e-52) |
Best Drosophila hit   | translocation protein 1, isoform A (4e-34) |
Best Human hit | translocation protein SEC62 (1e-25) |
Best NR hit (blastp)   | AGAP009788-PA [Anopheles gambiae str. PEST] (4e-68) |
Best NR hit (blastx)   | PREDICTED: similar to AGAP009788-PA [Tribolium castaneum] (2e-49) |
GeneOntology terms    | GO:0006614 SRP-dependent cotranslational protein targeting to membrane GO:0005784 Sec61 translocon complex GO:0008565 protein transporter activity GO:0016021 integral to membrane |
InterPro families   | IPR004728 Translocation protein Sec62 |
Orthology group | MCL12230 |
Nucleotide sequence:
ATGGCAGACAAAAGGAAAACTAAAAAACGAAAAGAGGAATTCGGTGAACCCTCCGAATCT
GAGAAGCCCACAAGCGAGGAATACGCCGTCGCTAAATGGCTAAAGGCGAACGTACCCACG
AAGAAGACAAAATTCCTTAATCATCACGTAGAATACTTCACAGGCACAAGAGCAGTGGAC
GCGTTATTGACATCAAAATGGGCGACTGGCAAGAACCCCATTTTCACTACCAGACACGAC
ATCACAGACTTCCTTCACCTAATGCTCTTACACAAACTCTTCCACAGAGCTAAAAAGGTG
CCAGTGACAGAGCAGGAGCTTAAAGGAAAATCGAGGAAGAAAGATGTCGAAAAGACGAGT
AAGAGTGGGGATGAACAAGACGAGAAGAATCAGAGTGCTTGTGAGGGAAAAGAAACCAAA
GACAAAGACGGTAAAGATAAGGACAAAGAGAAGAAAAAGAGGAAGATCCGTCTGGAGATG
CACATGGAGCAAGTTTTCTTAGACACTAATGACGCGTACGTCTGGCTCTACGACCCCATG
CCCTGGTACTATTGGCTATGCGGAGCATTACTGCTCGTGGGCACTGTAGGGGTCTGCATG
TTCCCGCTATGGCCGGCCACTGTTAGAGGAATTGAATTCAGAGGAATTCAGAGGGTGCCA
GTGACAGAGCAGGAGCTTAAAGGAAAATCGAGGAAGAAAGATGTCGAAAAGACTAGTAAG
AGTGGGGATGAACAAGACGAGAAGAATCAGAGTGCTTGTGAGGGAAAAGAAACCAAAGAC
AAAGACGGCAAAGAGAAGAAAAAGAGGAAGATCCGTCTGGAGATGCACATGGAGCAAGTT
TTCTTAGACACCAATGACGCGTACGTCTGGCTCTACGACCCCATGCCCTGGTACTATTGG
CTATGCGGAGCATTACTGCTCGTGGGCACTGTAGGGGTTTGCATGTTCCCGCTATGGCCG
GCCACTGTTAGGAAGGGTGTGTACTATCTGAGTATAGCGGCGGCTGGTTTCCTGGTACTG
ATCATAGCGTTGGCTGTACTCAGAGTGGTAGTGTTTTGTACTGTATGGGTCGCAACACTC
GCTAGACATCATCTCTGGTTACTTCCAAATCTGACTGAAGACGTTGGATTCTTTGCCTCA
TTCTGGCCGCTGTATAAGTACGAATATCGCGGTCCCGGTTCGGAGAGCGATAAATCGTCT
AAAAGCAAGAAGAAACGCAAGAAAGAAAAACATTCAGACGACGAGGAGGAAAAGACAGCT
CTCATGAAGGAATCGGAGGCAAAGGAAGTTAAAGAGAAAAAAGTAGTCTCTGAAACGGCC
GACACTACAGACAATAAGGAGGAACCACCGCAGCCTGAAACATCGGAACAGACGGACAAA
CTGTCGGAATCAGAATCTGAAAACAGCCAGAGGTCATCGACCGACAGAGACTTCGAGATG
ATAGATACAGCTGATGTAGACGAACATGCACACACACACTAA
Protein sequence:
MADKRKTKKRKEEFGEPSESEKPTSEEYAVAKWLKANVPTKKTKFLNHHVEYFTGTRAVD
ALLTSKWATGKNPIFTTRHDITDFLHLMLLHKLFHRAKKVPVTEQELKGKSRKKDVEKTS
KSGDEQDEKNQSACEGKETKDKDGKDKDKEKKKRKIRLEMHMEQVFLDTNDAYVWLYDPM
PWYYWLCGALLLVGTVGVCMFPLWPATVRGIEFRGIQRVPVTEQELKGKSRKKDVEKTSK
SGDEQDEKNQSACEGKETKDKDGKEKKKRKIRLEMHMEQVFLDTNDAYVWLYDPMPWYYW
LCGALLLVGTVGVCMFPLWPATVRKGVYYLSIAAAGFLVLIIALAVLRVVVFCTVWVATL
ARHHLWLLPNLTEDVGFFASFWPLYKYEYRGPGSESDKSSKSKKKRKKEKHSDDEEEKTA
LMKESEAKEVKEKKVVSETADTTDNKEEPPQPETSEQTDKLSESESENSQRSSTDRDFEM
IDTADVDEHAHTH