DPGLEAN15749 in OGS1.0

New model in OGS2.0  DPOGS208482
Genomic Position  scaffold28:+ 4453-7232
CDS Length  2652
Paired RNAseq reads  645
Single RNAseq reads  1714
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010625 (7e-48)
Best Drosophila hit  scattered (5e-169)
Best Human hit  vacuolar protein sorting-associated protein 54 isoform 2 (6e-63)
Best NR hit (blastp)  vacuolar protein sorting [Culex quinquefasciatus] (0.0)
Best NR hit (blastx)  vacuolar protein sorting [Culex quinquefasciatus] (0.0)
GeneOntology terms
GO:0007286 spermatid development
GO:0007291 sperm individualization
GO:0008270 zinc ion binding
GO:0005515 protein binding
GO:0006896 Golgi to vacuole transport
GO:0005794 Golgi apparatus
GO:0005739 mitochondrion
GO:0042147 retrograde transport, endosome to Golgi
InterPro families
IPR012501 Vps54-like
IPR019515 Vacuolar protein sorting-associated protein 54
Orthology group  MCL14749

Nucleotide sequence:

ATGGAGGACAAGAATACCACAAATAAGACGCCAGCATGGCAAAACTGTGTTCATTGCCCA
AACCTTCTGTTCAACTCTGCCACGGAATTCGAAAGGCACATACACGAAAAACACAGTATT
AAAGAGGGCTCCATCGTACTATGCCAATATGGGCAGAATGGCATTTGTTCAGCATTGCAA
TTTGGTGATTTACGAAAAGCAGGATTTAAGTACCACATTAGACAACACCACCTTTATAAA
AATTGTATAAATGATGACTGGACATTCTATTCCTCATCACAAAACTTGCCTGCAGTATTG
AATGATCCAAACAGAGGCAAGCAGAACAATTTTTTTACTAAAACATGGGGTGACAGCTTT
ACTGATAAGGTAGATATATATCCTAGTCCATATTTGCCAGTTATAACTCTGGCTCACTTT
GAATCATACTGTAAAAAAATATCAAAAAGATTTAAAAGGCACCGACAACTTAAAGAGACT
CTACCAGCTAAAAGTAAAACACCAGTAGTGGAAACAGTTTGTGAGATTCCTAGCATCTTT
TATGAGCAATCTTTAGCTCTTAATGACCCTCAAAACTTCAGTAAAGTTTTTCCTGGGTTA
TCAAACACACAGGACATAAATAGCACCATAAGGTCACTCCAAGAAACTTTGAGTACATAC
CTTGATATAATTGAAATGAAGATATCAAAACAGGTAGCGCAGAAATCTGATGCCTTTTTC
CATGCTATGCTTTCTCATGACACTATCATGGAACAGATGGCGAGAGCCGTTAGAACAGTT
CAAAATACAAGGAGGGATATAAAAGATGTTAAAGAGAATTTAACTGATAGTCCTTTAAAA
TTAATAAGCCTCACTAGAATCAACAAAAACCTTAACAATGTTCATGAACTCTTGAAGTTA
ATGGGAACAGTGCAGCAAACCCAACCTATGATTCAACTTCTTTTAAGTACTTCAGACTAT
GTTGCTGCTCTTGACTTAATAAGCTCTACACAAAGAGTTCTATCGACAAGATTATCGGGT
ATACAAGCCTTCAGACATTTATCACCACAGCTGACAGAGATGAAAAGGTTGATACACAAG
ATGTTAAGCAATGAATTTCTAAGATTTATTATTGCTGACATAAACAGACCTCTAAAAGAT
ACTGCTGATTTACCAGAAAGAGATAAAATTGTCTCCATTGTATCAGGTATACTCCGTTTG
AAAGAATTTGATTTTCTAGACATATTCAAAACAGAAGCAATGACTTCAATTCAAACGACT
ATAAAACAATGTGTCATAGAAATAATATCAGATAGAGATGGTAGTACAGAAATAGTATTA
AGGGGATCTAATACAGACACATGGTTGCTTTGTAATGAAGGTATCATCTTTCTTCAGAAA
GTCACTCCCAATATAGTTAATTTATTTAGAAGAATTTTATCTTTGTGTAATTTAATATTG
GATGTCAGTCAAATGAGTGACATCACAGGAAATGATAGTGAAGATATATGGACACAGGAT
GAGCTTTTATTAATTGAGGATAAAATTAAAAAACTTATTATCTCACTGTCTGACTACAGC
AACGAAAAATGTGCCAATCTCATTGTAACTAAGACGGATAGAGACTACGTTTTCACAGAT
TTAACACAACTATCAAAATTATCAAAGCTAATAGAAGACTTTTCAAAGGAGTGCGAAAAT
ATAACAGGACACTACAGTAATTCAATGAAATTGGCGTTAAGAAGTTTTGCTATGAAATAT
ATTCAAAATTTGCATTCTGATAGACGGGTACAACTGACCACAGCTCTTAACAGTGAGAGA
TGGAAAATTGCTGATGTACCATATGAGTTACAGAGTGTAATAAACAAAATATGTGAAATT
GGGGAAATACCTTCAACACTTAATTACGAAAGTGGCAAGGCTGATGGTAAATATTTAATT
ATAGATAAGGAGAGCTATGCTGTCGTTGCGACAGTGCAACTTTTGATAAAAATTCTACTT
GAGTACTGTGACGCAATAAAACAGTCTCCAGATATTGTTCAATATTTAGTTCATTGTATG
TTAGAATTGATGAGATTGTTTAATTCTCGATGTTGCCAGTTGGTTTTGGGTGCTGGAGCT
ATACAGAGTGCTGGATTAAAAACAATTTCTACATCAAATTTAGCTTTAGTGTCTAGATCA
CTTCAAGTAATACTTTGGCTTTTACCATTAATAAAAAAATTATTAGAAGAAAATAGCTCC
AAAGATTTGTCCCTCGGTGGATTTAACAGCATTGAGAGTGACATAATTGGTCATAAGAAG
GAAATTGAGAGCAAAATCTGTTTCATAGTGAGCAACATGTTGAGTTCTCAGTTAGTTGGC
TGGGAAGCTAAGCCTCCAGTACCTTCGCAGACATTCCGTAACATTTCTAAACACTTGGTC
AAACTGCATGAAGCTCTCATAGATATTTTACCTTTAGAACAAATCCGAAATATTTACATG
AAAGTACACGACAATTTTAAAGACAAATTACGAGAACAATTAAGCAAAATGAACATAGTT
GCGAACGGTAGCCCCCAGCACGGTGTTGTGACTTCTGAATTAACTTTTTATTTACAAACC
CTCAAAACATTAAGAGTGATCAATGAAAACGATCCTGAGGATAATATTTTATATGATATT
TGGTTACATTAA

Protein sequence:

MEDKNTTNKTPAWQNCVHCPNLLFNSATEFERHIHEKHSIKEGSIVLCQYGQNGICSALQ
FGDLRKAGFKYHIRQHHLYKNCINDDWTFYSSSQNLPAVLNDPNRGKQNNFFTKTWGDSF
TDKVDIYPSPYLPVITLAHFESYCKKISKRFKRHRQLKETLPAKSKTPVVETVCEIPSIF
YEQSLALNDPQNFSKVFPGLSNTQDINSTIRSLQETLSTYLDIIEMKISKQVAQKSDAFF
HAMLSHDTIMEQMARAVRTVQNTRRDIKDVKENLTDSPLKLISLTRINKNLNNVHELLKL
MGTVQQTQPMIQLLLSTSDYVAALDLISSTQRVLSTRLSGIQAFRHLSPQLTEMKRLIHK
MLSNEFLRFIIADINRPLKDTADLPERDKIVSIVSGILRLKEFDFLDIFKTEAMTSIQTT
IKQCVIEIISDRDGSTEIVLRGSNTDTWLLCNEGIIFLQKVTPNIVNLFRRILSLCNLIL
DVSQMSDITGNDSEDIWTQDELLLIEDKIKKLIISLSDYSNEKCANLIVTKTDRDYVFTD
LTQLSKLSKLIEDFSKECENITGHYSNSMKLALRSFAMKYIQNLHSDRRVQLTTALNSER
WKIADVPYELQSVINKICEIGEIPSTLNYESGKADGKYLIIDKESYAVVATVQLLIKILL
EYCDAIKQSPDIVQYLVHCMLELMRLFNSRCCQLVLGAGAIQSAGLKTISTSNLALVSRS
LQVILWLLPLIKKLLEENSSKDLSLGGFNSIESDIIGHKKEIESKICFIVSNMLSSQLVG
WEAKPPVPSQTFRNISKHLVKLHEALIDILPLEQIRNIYMKVHDNFKDKLREQLSKMNIV
ANGSPQHGVVTSELTFYLQTLKTLRVINENDPEDNILYDIWLH
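
Consistency check (illustrative sketch, not part of the database record): the figures in this record can be cross-checked with a little arithmetic. The snippet below assumes the genomic coordinates are 1-based and inclusive, and that the CDS length counts the terminal stop codon (the nucleotide sequence above ends in TAA). The function name `cds_summary` and its field names are hypothetical, chosen only for this example.

```python
def cds_summary(start: int, end: int, cds_len: int) -> dict:
    """Relate a genomic span to a CDS length.

    Assumes 1-based inclusive coordinates and a CDS length that
    includes the stop codon; both are assumptions about this record.
    """
    if cds_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    span = end - start + 1          # bases covered on the scaffold
    codons = cds_len // 3           # codons, including the stop codon
    return {
        "span": span,
        "non_coding_in_span": span - cds_len,  # e.g. intronic bases
        "codons": codons,
        "protein_len": codons - 1,  # residues, excluding the stop codon
    }


# Values taken from the record: scaffold28:+ 4453-7232, CDS Length 2652.
summary = cds_summary(4453, 7232, 2652)
print(summary)
```

Under these assumptions the span covers 2780 bases, 128 of which lie outside the coding sequence, and the expected protein length of 883 residues matches the protein sequence listed above.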