DPGLEAN12965 in OGS1.0

New model in OGS2.0DPOGS207859 
Genomic Positionscaffold1399:+ 22803-29408
See gene structure
CDS Length3135
Paired RNAseq reads  61
Single RNAseq reads  178
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009942 (1e-165)
Best Drosophila hit  CG1625, isoform B (8e-17)
Best Human hit5-azacytidine-induced protein 1 isoform b (1e-17)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC005914 [Tribolium castaneum] (9e-39)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC005914 [Tribolium castaneum] (7e-35)
GeneOntology terms







  
GO:0005856 cytoskeleton
GO:0030154 cell differentiation
GO:0005829 cytosol
GO:0003674 molecular_function
GO:0007275 multicellular organismal development
GO:0008150 biological_process
GO:0007283 spermatogenesis
GO:0005737 cytoplasm
GO:0005813 centrosome
InterPro families  ND
Orthology groupMCL28331

Nucleotide sequence:

ATGTCTAAAGAAAATAATAATTTAAGGCTTCTCGGCTCACCAGTAAATTTAACTTATAGA
AACAAGAGAAAAGAAGATAGGAAAAATTCAAGAAATCGTCCAAGATCTGCTCTACAAAAC
TCTAGTTCTCCGGATGTTCCAGAAAGATACAAGAGACCATTCTCAGCGGACACCAAGGAA
CGAGGCCAATCCACAAGATCCTTTTATAAAACTTTTAGCGCTGATTTATTACAGTCTTAC
AATAACTCCCCGTTGAGTGTGAAAGTGCTCCCTCCCACAGAAGACCTTTTGACCCATTCA
AATGTTCACATCACTAGAAACGAAAACAAAGAGATAAGCAGCAACGCATCTGATTACGGT
TCCGAAGACACGTTTATTAGTTTGGGAACAAAGATAAAAGCAAAAGCTCAAACTGTCTGC
AAGAATAGAAATACGAATCCAAAGAATTTCTTAAAATACAGGACTATTGCAAAGAAAGGA
CGGAAATCTATGGAGAATTTAAATGAGACGAATGACAATAATTACGGTTTAGAGATTACA
ATCAAGGAGAAATCTGGACCGCCGTCGCCTACGAGGAGTACCGATTTGTTCCCATTGAGG
CCCTCGTCTCCGTGTCGGAATAAGTCGTACGAATCCTATTTTTTAGCCCTGGAAGACGTT
AGGAACGGTGATGGTATTGTAGGCAGAGTATCGTTCGCACCGGCCGATAATAAAATTGAC
AAAAATCTTGACAATCCGCGCCGTTTGAGCCTGACGAGACAACAATTATCGTTAGTTGAA
GAGGAATCAGCTCAAGACATTGATAGCATACCATCTCAGAATACTGAAAACCTTTCACCA
AAAAACAAATATAACTGTAAGGAAATATCTAACAACGATATTAATTACGATAAGAACGAC
AAGGGAACTTTATTAGATAATAATAAAGATTTTAATAGTGCACAAAATTATAACGCATTT
TACGATGACATCCACTCAAATACACCAGACTCCTTCAACACTAGATTATCTAGTACAGGC
ATTCATACAGATTCTTCAAAAGATTCAGGGTACCCGGACAGTGTCAATAAAGAACATAGA
ACGCTGACGCAAAACTATTTACTTACACCTGCGCCTGATTCATATGACTTTAATAGCAAA
ACAAATCCGACTAACAATTCGGAATATTCCACAGATAATGATAAAGCTTTTAAAACAAAC
TTCAGCAATAAATGGTCCGAACCATGGAATCACAGCAGATTACTCTACAAAGATTTCTTT
TTAAAGAAAGAGACTCACGGCCCAGCCCCACAGAACACACCAACAAAAAATGACGTTCAT
ATCCCGGGGACTAACGAAAACATAGAGGGAGACACAGAAAAGTTGGAATATCCTACTTAC
CTCCTAAACAGTTCTACAAAAGCTTACACCTCAAAAGTTATTGAAGATTACAGGAAAGAA
CTAGAGGCTATAAACACTTTACACGAATTAACAGTTAAAGATATAAAAATAGACCCCATA
TCCCCTACTCCTCTTAGTATAGACGAAATGTTTGAGCAACATAGTAATCGTTTTAACGAT
GGTAAAACAGACTTGAGCGAAAACTCGCAAGAAAGCACTAATAACAGTGACAGTACCGAC
AAAAGCTCTCTAAACAACAAGAGAGATATATCCAAAGTGCCAACGAGAGAGCTGATACAG
AACTACTTCAAAGTCAAAAGTGATTGTACAAAGGAGTTTCCGAGAAATGCAAAAAAATAT
GACAAAAAACTCAACAATATAAATCCGAAATACGAAGAAAATTCCAGTTATAAACAGTAT
TGGAACAACAGGAATGCAAAGAATAGTGTAGAGAGAAGTAAAACTCCCGTTGATGTGAAG
ACCCTGAACAAGGCGGTGACAGGCAGAGCGCCGTCTAGTGCTAGAATCGAAAGTGTCCAA
AACGACAAAGATATTGAATCATGGATGTCTTTATCAGCTCCGTCACCGAGAATGCTAGAG
ACTGATAACGTCAAAGATAACACCGCAGAACCTCCAAAAGCAGTACCAGTTATTGATAAA
ATCGAGGAAAATGACGAAGCCGAATTTAATAACCAAAGGACAAAAACATCAGAAGCCCCA
AAGCCAAAGGAGCTCAACTCTAAATCTACTATCGTCGACATTTACTCGATGTTGAAGGAA
ATTGAAAGTTTTGGTGATAATCCTGTCACGAATATTGATGAACCGAGCGAACCGAAACCG
AAACAGGAAGATAGATGTTCAACACCCAAAGATAACTTTATGGAGATCTTTGAGTTTTTG
GAAAAGGTAGAACAAAGCGCGAACGATGCTCTATCAGTCGTCACCAACACAACACCGCAA
ACTATACCCAAACTTGAAGCTCTACTAAAGCTGCCACAAACGGAGTTAGCTCAGAGGCTT
GTAACGCTATCATTACAACTAGAGGAACGATCCTGTTGCATTGCCTTACTTCAAGAAAGT
CTCGCCAATCATAAGGAACAGATGATCAACAAAGTTAGCAACCTCGAGAAGCAATCACAT
CGGAACATAGCCAAGGTTAAACAAGAATGCGAAGAGACGATAAAGAGACATCAGAATTTT
ATTGATCAGCTAATAAACGACAAGAAGACACTGAACCATCGCATTGAGCAGTTGGTTGAC
GAACGTCGTACTCTTGAAGAGAGGTGGAAGAGATCCGCTCAGACATTGGAAGAACGATAC
AAACTTGAGCTGAGAAATCAACACGACAAGATGGCCGCCGCTCAGCAAGTCGCACGGCAG
CGGTGGGTGCGTCAAAAAGCTGAGAAAATTAAGGAGCTTACAGTCAAAGGTCTGGAAGGA
GAGTTACGAGAGATGGCAGAGAGACAACAAAAAGAGATATCGGACCTGAAAATGTTCCAC
GCGGAACAATGTGGGAGAATGAGCGCGAAACACGCAAATGACTTAGAGGAACTAAGGAGG
AGTTTAGAGGAAGAAAAGGAGAATGCCCTGATAAAAGAGCGACAACTAGAGCGTCAGTTA
CTTGAGTTGGAGCTGTCTCAGCAGGAACAGCGTGCTAGGCTGGCGGACGAGCTGAGGGCT
GAGGGGGAGAGACTGGAGGGAGAGAGGGCAGCCAGGGAGAGAGAACATAGAGAACAGATG
GGTGAGAGACATTAG

Protein sequence:

MSKENNNLRLLGSPVNLTYRNKRKEDRKNSRNRPRSALQNSSSPDVPERYKRPFSADTKE
RGQSTRSFYKTFSADLLQSYNNSPLSVKVLPPTEDLLTHSNVHITRNENKEISSNASDYG
SEDTFISLGTKIKAKAQTVCKNRNTNPKNFLKYRTIAKKGRKSMENLNETNDNNYGLEIT
IKEKSGPPSPTRSTDLFPLRPSSPCRNKSYESYFLALEDVRNGDGIVGRVSFAPADNKID
KNLDNPRRLSLTRQQLSLVEEESAQDIDSIPSQNTENLSPKNKYNCKEISNNDINYDKND
KGTLLDNNKDFNSAQNYNAFYDDIHSNTPDSFNTRLSSTGIHTDSSKDSGYPDSVNKEHR
TLTQNYLLTPAPDSYDFNSKTNPTNNSEYSTDNDKAFKTNFSNKWSEPWNHSRLLYKDFF
LKKETHGPAPQNTPTKNDVHIPGTNENIEGDTEKLEYPTYLLNSSTKAYTSKVIEDYRKE
LEAINTLHELTVKDIKIDPISPTPLSIDEMFEQHSNRFNDGKTDLSENSQESTNNSDSTD
KSSLNNKRDISKVPTRELIQNYFKVKSDCTKEFPRNAKKYDKKLNNINPKYEENSSYKQY
WNNRNAKNSVERSKTPVDVKTLNKAVTGRAPSSARIESVQNDKDIESWMSLSAPSPRMLE
TDNVKDNTAEPPKAVPVIDKIEENDEAEFNNQRTKTSEAPKPKELNSKSTIVDIYSMLKE
IESFGDNPVTNIDEPSEPKPKQEDRCSTPKDNFMEIFEFLEKVEQSANDALSVVTNTTPQ
TIPKLEALLKLPQTELAQRLVTLSLQLEERSCCIALLQESLANHKEQMINKVSNLEKQSH
RNIAKVKQECEETIKRHQNFIDQLINDKKTLNHRIEQLVDERRTLEERWKRSAQTLEERY
KLELRNQHDKMAAAQQVARQRWVRQKAEKIKELTVKGLEGELREMAERQQKEISDLKMFH
AEQCGRMSAKHANDLEELRRSLEEEKENALIKERQLERQLLELELSQQEQRARLADELRA
EGERLEGERAAREREHREQMGERH