DPGLEAN18153 in OGS1.0

New model in OGS2.0DPOGS206327 
Genomic Positionscaffold1212:- 6860-18022
See gene structure
CDS Length1482
Paired RNAseq reads  3246
Single RNAseq reads  10652
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005270 (1e-52)
Best Drosophila hit  translocation protein 1, isoform A (4e-34)
Best Human hittranslocation protein SEC62 (1e-25)
Best NR hit (blastp)  AGAP009788-PA [Anopheles gambiae str. PEST] (4e-68)
Best NR hit (blastx)  PREDICTED: similar to AGAP009788-PA [Tribolium castaneum] (2e-49)
GeneOntology terms


  
GO:0006614 SRP-dependent cotranslational protein targeting to membrane
GO:0005784 Sec61 translocon complex
GO:0008565 protein transporter activity
GO:0016021 integral to membrane
InterPro families  IPR004728 Translocation protein Sec62
Orthology groupMCL12230

Nucleotide sequence:

ATGGCAGACAAAAGGAAAACTAAAAAACGAAAAGAGGAATTCGGTGAACCCTCCGAATCT
GAGAAGCCCACAAGCGAGGAATACGCCGTCGCTAAATGGCTAAAGGCGAACGTACCCACG
AAGAAGACAAAATTCCTTAATCATCACGTAGAATACTTCACAGGCACAAGAGCAGTGGAC
GCGTTATTGACATCAAAATGGGCGACTGGCAAGAACCCCATTTTCACTACCAGACACGAC
ATCACAGACTTCCTTCACCTAATGCTCTTACACAAACTCTTCCACAGAGCTAAAAAGGTG
CCAGTGACAGAGCAGGAGCTTAAAGGAAAATCGAGGAAGAAAGATGTCGAAAAGACGAGT
AAGAGTGGGGATGAACAAGACGAGAAGAATCAGAGTGCTTGTGAGGGAAAAGAAACCAAA
GACAAAGACGGTAAAGATAAGGACAAAGAGAAGAAAAAGAGGAAGATCCGTCTGGAGATG
CACATGGAGCAAGTTTTCTTAGACACTAATGACGCGTACGTCTGGCTCTACGACCCCATG
CCCTGGTACTATTGGCTATGCGGAGCATTACTGCTCGTGGGCACTGTAGGGGTCTGCATG
TTCCCGCTATGGCCGGCCACTGTTAGAGGAATTGAATTCAGAGGAATTCAGAGGGTGCCA
GTGACAGAGCAGGAGCTTAAAGGAAAATCGAGGAAGAAAGATGTCGAAAAGACTAGTAAG
AGTGGGGATGAACAAGACGAGAAGAATCAGAGTGCTTGTGAGGGAAAAGAAACCAAAGAC
AAAGACGGCAAAGAGAAGAAAAAGAGGAAGATCCGTCTGGAGATGCACATGGAGCAAGTT
TTCTTAGACACCAATGACGCGTACGTCTGGCTCTACGACCCCATGCCCTGGTACTATTGG
CTATGCGGAGCATTACTGCTCGTGGGCACTGTAGGGGTTTGCATGTTCCCGCTATGGCCG
GCCACTGTTAGGAAGGGTGTGTACTATCTGAGTATAGCGGCGGCTGGTTTCCTGGTACTG
ATCATAGCGTTGGCTGTACTCAGAGTGGTAGTGTTTTGTACTGTATGGGTCGCAACACTC
GCTAGACATCATCTCTGGTTACTTCCAAATCTGACTGAAGACGTTGGATTCTTTGCCTCA
TTCTGGCCGCTGTATAAGTACGAATATCGCGGTCCCGGTTCGGAGAGCGATAAATCGTCT
AAAAGCAAGAAGAAACGCAAGAAAGAAAAACATTCAGACGACGAGGAGGAAAAGACAGCT
CTCATGAAGGAATCGGAGGCAAAGGAAGTTAAAGAGAAAAAAGTAGTCTCTGAAACGGCC
GACACTACAGACAATAAGGAGGAACCACCGCAGCCTGAAACATCGGAACAGACGGACAAA
CTGTCGGAATCAGAATCTGAAAACAGCCAGAGGTCATCGACCGACAGAGACTTCGAGATG
ATAGATACAGCTGATGTAGACGAACATGCACACACACACTAA

Protein sequence:

MADKRKTKKRKEEFGEPSESEKPTSEEYAVAKWLKANVPTKKTKFLNHHVEYFTGTRAVD
ALLTSKWATGKNPIFTTRHDITDFLHLMLLHKLFHRAKKVPVTEQELKGKSRKKDVEKTS
KSGDEQDEKNQSACEGKETKDKDGKDKDKEKKKRKIRLEMHMEQVFLDTNDAYVWLYDPM
PWYYWLCGALLLVGTVGVCMFPLWPATVRGIEFRGIQRVPVTEQELKGKSRKKDVEKTSK
SGDEQDEKNQSACEGKETKDKDGKEKKKRKIRLEMHMEQVFLDTNDAYVWLYDPMPWYYW
LCGALLLVGTVGVCMFPLWPATVRKGVYYLSIAAAGFLVLIIALAVLRVVVFCTVWVATL
ARHHLWLLPNLTEDVGFFASFWPLYKYEYRGPGSESDKSSKSKKKRKKEKHSDDEEEKTA
LMKESEAKEVKEKKVVSETADTTDNKEEPPQPETSEQTDKLSESESENSQRSSTDRDFEM
IDTADVDEHAHTH