DPGLEAN01817 in OGS1.0

New model in OGS2.0DPOGS205878 
Genomic Positionscaffold1969:+ 29359-33652
See gene structure
CDS Length1836
Paired RNAseq reads  1050
Single RNAseq reads  3114
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000120 (0.0)
Best Drosophila hit  vacuolar protein sorting 26 (7e-143)
Best Human hitvacuolar protein sorting-associated protein 26B (6e-133)
Best NR hit (blastp)  PREDICTED: similar to vacuolar protein sorting 26, vps26 [Tribolium castaneum] (3e-158)
Best NR hit (blastx)  PREDICTED: similar to vacuolar protein sorting 26, vps26 [Tribolium castaneum] (5e-152)
GeneOntology terms



  
GO:0007040 lysosome organization
GO:0006605 protein targeting
GO:0006886 intracellular protein transport
GO:0007034 vacuolar transport
GO:0030904 retromer complex
InterPro families
  
IPR009069 MTCP1
IPR005377 Vacuolar protein sorting-associated protein 26
Orthology groupMCL12155

Nucleotide sequence:

ATGAGTTTCTTTGGGTTCGGACAAACCGCGGACATCGAAATTGTATTCGACGATGCTGAC
AAACGAAAAGTGGCCGAAGTTAAAACGGACGATGGTAAAAAAGAGAAACTGTTGCTTTAT
TATGATGGTGAAACTGTGTCGGGGAGGGTTAATGTGACGCTGCGGAAACCAGGATCGAAA
TTAGAGCACCAAGGTATCAAAGTTGAGCTTATCGGTCAGATAGAGTTGTTTTACGACAGA
GGAAATCATCACGAATTTATATCGTTGGTTAAAGAACTCGCTCGTCCCGGAGATCTATTG
CAGCACACCTCCTATCCGTTCGACTTTGCGAACGTTGAGAAACCCTATGAGGTGTACACA
GGAGCCAATGTCAGGTTAAGGTACTTTTTACGAGCCACAATAGTAAGACGTCTTACAGAC
ATCACTAAAGAGGTGGACATAGCCGTTCATACGTTATGCAGCTATCCCGATGTACTAAAC
TCTATAAAAATGGAAGTAGGCATCGAAGATTGTTTACACATAGAATTTGAGTACAACAAA
TCAAAATACCACCTGAAAGACGTTATAGTAGGTAAAATTTATTTCCTCCTCGTACGAATC
AAGATAAAACACATGGAGATATCTATTATAAAGAAAGAAACGACAGGTTCTGGACCTAAC
ACCTTCACAGAGAATGACACAGTCGCTAAATATGAAATAATGGACGGTGCACCAGTTAGA
GGTGAAAGTATTCCTATTAGAGTATTTTTGGCTGGCTACGATCTAACTCCTACTATGAGA
GACATAAACAACAAATTTTCGGTAAGATACTTTCTAAATCTTGTTCTAATGGACACAGAA
GATCGCCGTTATTTCAAACAACAGGAAGTTACTCTGTGGCGGAAAAGTGACAAATCACGA
CTTCCGCTACACAATCCGCATCATCCTCAGAACTTAACGAATTCGCAGCACTACCAAATG
GCTGTTTCCAGCGAAGAGAACTTGGCAAGAGGTATTTCCCCATCAATGCCACCAGAAAGT
GCATTACAGAGATCTATTTCACCTCCAATGCCAAATGTTGATAAACATAACGGCCCCTCA
CAAATGGAACAAGAAGAACCAGATGTATTACCAAACAAACTGTCAAGTACTCACATCGAG
AATGAGCCCGAACAGGTTGAACAAGAAAAGACGAGTGAAAAGCCCAAACTAGCCGAAAAG
CCACTAGATAAACCACAACTAGAGGAAGTCCAGGAAGTGAATAATTCAGAGCATATCAAA
GAAAAGCCACAGAATGTTAGCAAACCTCAAATATCAATAAAACCATCGGTTTCGGAAAAA
CCTATAGCCGAGAAAGTTGCCATCGCCGAGAAACCGTTATTGGCAGAGAAGCCCATACTA
GAAAAGCCCACTTTGGCCGAGAAGCCAGTCCTCTCACAGACGGAAAGTGTAGAAGCAGCT
ACGAAAAACAACTTCCAAGAGGTCCGCTGTGATCGCGTCTTCGAGGCTATGCGACAGTGT
TGTCTAAAACATAAACCCGTGTCATTAGTTTGCGAAGGTTACCGCTTGGAGCCGAGGGTT
TTCGCCCCCGTGACTGATCGACCAGCTAAGGATTTACAATGTCTGTGTTGGACAATAACT
ATTGATTACTTTGCCTACGTCTCAAGAGAAATGAAATCCCGTCCCATACAAGTGATATGG
ACACACATCGTCGAAGCAAAACACTCGGTCAAAGAAGTTGGATATATAGATATCCTCGGA
CGTTTCAGATCACTTTCACAGTGTTGGGGCTCGGTGTGTTCTTTTCTAAGCCAATTTATG
ACATATTCTTCCGCGAGAGATTCGTTCCCGATCTAA

Protein sequence:

MSFFGFGQTADIEIVFDDADKRKVAEVKTDDGKKEKLLLYYDGETVSGRVNVTLRKPGSK
LEHQGIKVELIGQIELFYDRGNHHEFISLVKELARPGDLLQHTSYPFDFANVEKPYEVYT
GANVRLRYFLRATIVRRLTDITKEVDIAVHTLCSYPDVLNSIKMEVGIEDCLHIEFEYNK
SKYHLKDVIVGKIYFLLVRIKIKHMEISIIKKETTGSGPNTFTENDTVAKYEIMDGAPVR
GESIPIRVFLAGYDLTPTMRDINNKFSVRYFLNLVLMDTEDRRYFKQQEVTLWRKSDKSR
LPLHNPHHPQNLTNSQHYQMAVSSEENLARGISPSMPPESALQRSISPPMPNVDKHNGPS
QMEQEEPDVLPNKLSSTHIENEPEQVEQEKTSEKPKLAEKPLDKPQLEEVQEVNNSEHIK
EKPQNVSKPQISIKPSVSEKPIAEKVAIAEKPLLAEKPILEKPTLAEKPVLSQTESVEAA
TKNNFQEVRCDRVFEAMRQCCLKHKPVSLVCEGYRLEPRVFAPVTDRPAKDLQCLCWTIT
IDYFAYVSREMKSRPIQVIWTHIVEAKHSVKEVGYIDILGRFRSLSQCWGSVCSFLSQFM
TYSSARDSFPI