DPGLEAN04160 in OGS1.0

New model in OGS2.0DPOGS211664 
Genomic Positionscaffold1662:+ 19730-25973
See gene structure
CDS Length2346
Paired RNAseq reads  541
Single RNAseq reads  1333
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001364 (0.0)
Best Drosophila hit  CG3338 (0.0)
Best Human hitvacuolar protein sorting-associated protein 53 homolog isoform 1 (3e-177)
Best NR hit (blastp)  PREDICTED: similar to CG3338 CG3338-PA [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to CG3338 CG3338-PA [Tribolium castaneum] (0.0)
GeneOntology terms



  
GO:0005768 endosome
GO:0010008 endosome membrane
GO:0005794 Golgi apparatus
GO:0016020 membrane
GO:0015031 protein transport
InterPro families  IPR007234 Vps53-like, N-terminal
Orthology groupMCL12183

Nucleotide sequence:

ATGGATTCCACTGAGGTATTAGAAGACAAGGATAGCGGTCCTGAGGTCCAGCTTAATTTA
CCCGGTTCTGTTATGAGCCGAATAGAAGAATTAATGGGAGGAACGGAACAATTCGATAGT
GAAGAATTTGACGCTGTGGCCTACATCAACCGTGTATTCCCTACTGAACAGTCGCTCTCA
GGAGTAGAGTCAGCAGCAGCTCGCTGCGAGTTTCGTCTAGCCAGTGTGCAGCATGACATT
CGAAGGCTGGTCCGAGCCCAGCATGATCAGCGAATGGCCGGACATGAAGCATTGCTCGAT
GCTCAGAAATGTATTGCTGAACTGGCTTTACAGGTCGCAGATATAAACAAAAAAGCTGAA
CGCAGTGAAAGCATGGTCCGCGAAATAACATCGGAGATCAAACAACTAGACTGCGCCAAG
TGGAACCTTACCGGAGCTATCACAGCCTTGAACCATCTCCATATGCTAGCTGGTGGAGCC
GCGTCTCTCAGGACCTTGGCTGATAATAGACGTTACAAGGAACTAGTGTTGCCACTGCAA
GCTATAATGGAGGTGCTAGATCAGCTTCAATGCTACCGGGATATTAAAGAGCTGAACGCC
TTGAGGGAGGAGGTGCTCGGCATCAGGACGATGCTGGCTCAGCAGATACTAGCCGACTTC
AAAGAAGCCCTCACTGGTGGTGGTAAGAGTGGCGTGTCTCCCCGGATGCTGGCGGAGGCG
TGTAGTGTGGTGGACGTGCTGGAACCTTCCGTCAGGAAAGAACTGCTCAAATGGTTTATA
GATATGCAGCTACAGGAGTACGAGCACCTGTTCTCTCCTGAGCAGGAACACGCGTGGGTG
TCTCACGTGGAGCGCCGCTACACCTGGCTGAAGAAACACTTGCTGCGGTTCGAGGAGACC
CTCGCGCTCACCTTCCCCCCGGCCTGGAGGATGAGCGAACGCCTGGCGCATCGGTTCTGT
AAGCTCACACACAAGGCTCTGAGTGATCTGCTACAGGCTCGGCGGAACGAGCTGGATGTC
AAACTGCTATTGTACGCCATACAGAAGACTTACAACTTTGAAGTGTTGCTGCACAAGAGA
TTTATAGGTACCGATGTCGGTGCTGATGCAGCGGACCTGTCCCCCGAGCATCAGCTGGTG
TTTGATGACGAAGAGACGGGTGCTGTCCTCCGTCAAGGCTCCCCGTGGGTCGGTCTGATA
GGATCGTGTTTCGAGTGCCACCTGTCGCTGTACATCACCAGCCTGGACGCCAACCTCCGA
GGGCTTATGGACAGGTTCATACAGGACGCCAAGAGTCCAGATAGCGTCACGAGCGCGGTG
GGCAGCGGCGCCGGAGCGGTGATGTCCTCGTGCGCTGACCTGTTCCTCTTCTACAAGAAG
TGTCTCGCCCAGTGCGCCACGCTCTCCACCGGGGAGCCCATGCTAGAGCTGTCGATGGTG
TTCTCGTCATATCTCCGCGAGTACGCCGGCAGCGTGTTGTCGGCCGCGCTGCCCAGGGCC
GCCCCCGCCCTGCCCGCCCTCGTCACCGGCCTCCACACCCTGCTCAGAGACGACGCTGTC
AGGTACACGAAGCAGGAGATCACTAAAATAACGAGCGTCATCACCACGTCCGAGTACTGT
CTGGAGACGACTGTACACCTGGAACAGAAATTAAAGGAGAAGATCTCACCCTCGCTGGTC
GAGAGGATAGACCTGGCGCCCGAGCAAGACCTGTTCCACAAGATGATCAGCAACTGCATC
CAGCTGTTGGTCCAAGACCTGGAAATGGCCTGCGAGCCGGCCCTCCAGGCGATGACTAAA
ATATCCTGGCTGCATTTCGACAACGTAGGCGACCAGAGCTCCTACGTCACACAGATCATC
ATGCACCTCAAGAACACGGTCCCCAACCTGCGGGACAACCTCGCGTCCTCGAGGAAGTAC
TTCACGCAGTTCTGCATCAGGTTCGCGAACTCCTTCATACCGAAGTTTATCCAGAACATA
TACAAGTGCAAGCCGATCTCGACCGTGGGCTCCGAGCAGTTACTCCTGGACACGCACATG
TTGAAGACGGCGCTGTTGGAGCTGCCGTCCATTGGGTCGGAGGTGAAGCGGCAGGCGCCC
ACCACCTACACCAAGGTCGTCATAAAGCTGATGACGAAAGCGGAAATGATACTGAAGCTG
GTGATGGCGCCGCTGGACGGTAACTTGGAAGGATTCGTCTCCCAGTTCGTCCAACTGCTG
CCGGAGAGTACCTTGGTGGAGTTCCACAAGGTCCTGGACATGAAGGGAGCCAAACTGACC
AAGACACAGCAGAGCTCTCTGGACGCTCTGTTCAAAGAGACCGCTAAGACTGTACAAAAT
AAATAA

Protein sequence:

MDSTEVLEDKDSGPEVQLNLPGSVMSRIEELMGGTEQFDSEEFDAVAYINRVFPTEQSLS
GVESAAARCEFRLASVQHDIRRLVRAQHDQRMAGHEALLDAQKCIAELALQVADINKKAE
RSESMVREITSEIKQLDCAKWNLTGAITALNHLHMLAGGAASLRTLADNRRYKELVLPLQ
AIMEVLDQLQCYRDIKELNALREEVLGIRTMLAQQILADFKEALTGGGKSGVSPRMLAEA
CSVVDVLEPSVRKELLKWFIDMQLQEYEHLFSPEQEHAWVSHVERRYTWLKKHLLRFEET
LALTFPPAWRMSERLAHRFCKLTHKALSDLLQARRNELDVKLLLYAIQKTYNFEVLLHKR
FIGTDVGADAADLSPEHQLVFDDEETGAVLRQGSPWVGLIGSCFECHLSLYITSLDANLR
GLMDRFIQDAKSPDSVTSAVGSGAGAVMSSCADLFLFYKKCLAQCATLSTGEPMLELSMV
FSSYLREYAGSVLSAALPRAAPALPALVTGLHTLLRDDAVRYTKQEITKITSVITTSEYC
LETTVHLEQKLKEKISPSLVERIDLAPEQDLFHKMISNCIQLLVQDLEMACEPALQAMTK
ISWLHFDNVGDQSSYVTQIIMHLKNTVPNLRDNLASSRKYFTQFCIRFANSFIPKFIQNI
YKCKPISTVGSEQLLLDTHMLKTALLELPSIGSEVKRQAPTTYTKVVIKLMTKAEMILKL
VMAPLDGNLEGFVSQFVQLLPESTLVEFHKVLDMKGAKLTKTQQSSLDALFKETAKTVQN
K