DPGLEAN15598 in OGS1.0

New model in OGS2.0DPOGS207050 
Genomic Positionscaffold1:+ 1673672-1683367
See gene structure
CDS Length3483
Paired RNAseq reads  1385
Single RNAseq reads  3042
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012992 (0.0)
Best Drosophila hit  CG1845 (0.0)
Best Human hitperegrin isoform 2 (0.0)
Best NR hit (blastp)  PREDICTED: similar to AGAP007617-PA [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to AGAP007617-PA [Tribolium castaneum] (0.0)
GeneOntology terms


  
GO:0008270 zinc ion binding
GO:0005622 intracellular
GO:0005515 protein binding
GO:0033563 dorsal/ventral axon guidance
InterPro families







  
IPR001965 Zinc finger, PHD-type
IPR001487 Bromodomain
IPR000313 PWWP
IPR019787 Zinc finger, PHD-finger
IPR007087 Zinc finger, C2H2-type
IPR019542 Enhancer of polycomb-like, N-terminal
IPR019786 Zinc finger, PHD-type, conserved site
IPR011011 Zinc finger, FYVE/PHD-type
IPR013083 Zinc finger, RING/FYVE/PHD-type
Orthology groupMCL10973

Nucleotide sequence:

ATGGGTTTGGATTTTGATGTCTTTGAATTCTGCAAAAAATTGCGTCAGAACAGGCCTCCT
CCGTATCAGTGTCCACTAGAGAAATGTGATAAAGTATACAAGAGTTTATGTGGTTTGCAA
TATCACTTAGTAAACTATGACCATGACAATCCAACACCGGCGACACCTTCAATTGCAAGC
AGTCGCAAGAAAGGCAGAACTCGAGCTGCTGTGCCTACTGGAGATATCGCACTTCAAAGC
CCACCTAAGGAAGCTTTGACTTTTGCTGAGGCACAAAAAGTGGTGCAGTTTGAAGTTGAT
GGAAAAATTAGTAGAATACCAATTGACCAGCCACTGCCTATAGTTTCATTGGAAGAGTGG
GAGAGAAAAAATGCAGATTTAGAAAAGCCTATGCCATTTGTAGAGCCACCATCAGAGCCT
CATGTTAAATTACCAGAGGCTACATTCCGTCTAATACCAGATTATAATGCACGGGTATGT
GACGCACCACCTCGGCCAAATGCATACATACGTTTCATAGAGAAGTCGGCTGAGGAATTG
GATGGTGAAGTGGAATATGATGTTGATGAAGAAGACACAGCCTGGCTGGCCATTATTAAT
AAGAACAGAACCAAACAAGGTCTACCACCCGTCTCCGTAGATACCCTGGAGCTACTCATG
GATAGATTAGAAAAGGAATCATATTTTCAGGCTACACAGAACGGCCAGCAACCTGCGGCA
ACGGTGGACGAAGATGCTGTGTGCTGCATATGCATGGACGGGGAATGCCAGAACACCAAT
GTGATCCTGTTTTGTGATATGTGTAACCTGGCCGTCCACCAGGATTGTTATGGGGTGCCA
TATATACCAGAGGGACAATGGCTGTGCAGACGTTGTCTTCAATCACCATCACGACTTGTT
AACTGTGTGCTATGTCCCAACACTGGAGGAGCATTCAAGCAGACAGATCAGGGCACTTGG
GCGCACGTCGTCTGCGCCCTCTGGATACCAGAAGTTCGCTTTGCAAATACAGTGTTCTTG
GAACCTATTGATTCAATAGAGATGATTCCGGCTGCTCGTTGGAAGCTTCAGTGTATGGTG
TGCAAGCAGCGAGGTGCTGGTGCTTGCATACAATGTCACCGTAGCAACTGTTACAGCGCC
TTCCATGTCACATGCGCCCAGCAGGCCGGTTTGTATATGAAGATGGAAGCGGCCGGATCT
GGCCGTGATCCCAGTCAACCAGTTCAGGTGGCCAAAATGGCGTACTGTGATGCACACACA
CCAGCACATGTATTACAGGAGAGGAGAGCTTTGGAGTCGGAAGGTGAAAGTAAATCTTCA
GATTTGACTTCCATACGACAGAAAGGAAGGGAGAAGATAAAACAGGCTCGGAGAGTGTTA
GCGTTGAAGCGTACGTGGGCGCCGGTAGTGTTGGTGCCGACATTACCACCTGAACGTGTT
GCTGAGATTGCCCAACTGTCACACGGGACACCCGCGGCTAGAGCACAGCTGATGAAAAGA
CTTCTCGCTTACTGGACCCTCAAAAGACACAGCAGGAACGGGGTTCCACTCCTTAGGAGG
CTACAAAGCCTGACCAGCCATCACGGGAGCCGAGGTATCCAAGATGGCACTGTGAATGTA
CGAGAACTCTGCAATCAACTCAAGTACTGGCAGCGGATAAGACAAGATCTGGAGAGGGCT
AGATTGCTGTGTGAGTTGGTACGTAAACGCGAGCGTCTCAAGGCGGAATACACTCGTGTT
TGGGAACGCTGTGTTTTGCATACGCTCCGACCTGAACGTGCCATGCTGAGCAAGATGCTG
CGCATGATGAGACACGCTGACCACAGTGACGTGTTCACGGAGCCGGTCGACCCGCTAGAG
GTTCCAGATTACAGCACCGTCGTAAAGCATCCCATGGATTTAAGTACCATGGGCAAGAAA
TTGGACAGAGGCATTTATAAGACCATAGATGACGTAGAGGCAGATTTCCAACTAATGATA
GACAACTGCCTCACATATAATAAAAAGGATACAGTGTTTTACAAAGCTGGTGTCAAGATG
AGGGAGCAGTGTACGTCTATATTTCGTCAAGCACGTCGTGACGTCATAGAGGCGGGTCTG
GCGTCGCTGGCAGGGGAAGGGGACGCAGAGGAAACTTACACACCCGGGCGCACACACGCA
CAGAAACACACACAGTCAAGGCGCAGGAGTGTAAGAAACACAAGCAGCGACAGCGATCGT
ACAGCCGATACTCGAAGCGAGCGCGGCGTCTCCCTGGCACGTAGCGAGCGACGACACACC
AGCGCATTGAGAGACAGTGACGACGACATTAATCAGCGCGAGCCGTCGCCGGCTAAGAGC
AAGGTGAACCGCAATTGGTGGCGCGGCCGCGGTCGTGGTAGGGGCAGGAGGGGGAGGAGG
GGCCGCGGGAGAGGGGGACACGTGTCGCCTCGACCACTCAGAGACAACGATACTCCGACG
ACGGATTCAGAAGCTCCTATAGTTAAATCGAAGACTGTTGAGCGAACACAAAAATTGGTC
ACACCAGAAAAGTCACCAACTAAGCAACTGGAAAGTACTGGTCTAGGACTACTGGGTGGT
TTGAGAAAACCTACTTTGCTTGTGACGCCCTCAATAACCACCCCTCCGAAGAGTTTTGGT
TCTGATGCCACTTTGCCAACACTATCAGCCAGCTTGGGACACACAGAACCCTCGCCCAGG
AAGAAGGGTCGCGGTCGTCCACGGAAACAAGATAAAACAACAGATCTATTCAGAGGGGAC
TCGGAGGTCCTAGGAGGAGCATCGTTCCTTCAATACCGCGGTCCTCCCGGGGAAGTCGGC
TCGGATAGCGATTTGGCACTATCAAGGTCGTCAAGCAGTAGTTCAGCGTGGTCCCAGTCA
TGTTCCTCGTGCACACACTATGACGACGATAGATCTGGCGATGACAGTTCCAGCGAAGGT
TCTAGTTACAATGAAACGTTGGATTCGTTAGAAAGCCGCGGTCCGGAGCCCAGACGTCGC
GGCCGTCGGGCCGATGAGGGAGTCGACCGCACTCAGCCAGCAACACCCGTCAAGGGCCGT
GGTACAAGATCTTCTACGTCCAAGACTCCGGTGAAAGTTACCCAATCAGATGTTCTGCTT
GAGCCGTTGCAGTTAGTATGGGCAAAGTGCAGAGGCTATCCTTGGTATCCCGCACTAATA
ATAGATCCGAAGATGCCAAAAGGTTACATATACAACGGAGTTCCTCTACCAGTGCCGCCT
CAAGATGTACTGAACCTCAAGAAGAATTATGCTCACGAACCAGTATTGTACCTAGTTCTA
TTTTTCGACGTTAAACGAACGTGGCAATGGCTGCCTCCAAATAAATTGGAAATCTTGGGC
CTAGATAAGGAGATAGATGAAGCCAAACTGGTGGAGTCACGGAAACCGACCGACAGGAAG
GCTGTCAAGAAGGCTTATGGTGATGCAATGCAGTTCCGGAAGCAGGTTGACGGTGATAAA
TGA

Protein sequence:

MGLDFDVFEFCKKLRQNRPPPYQCPLEKCDKVYKSLCGLQYHLVNYDHDNPTPATPSIAS
SRKKGRTRAAVPTGDIALQSPPKEALTFAEAQKVVQFEVDGKISRIPIDQPLPIVSLEEW
ERKNADLEKPMPFVEPPSEPHVKLPEATFRLIPDYNARVCDAPPRPNAYIRFIEKSAEEL
DGEVEYDVDEEDTAWLAIINKNRTKQGLPPVSVDTLELLMDRLEKESYFQATQNGQQPAA
TVDEDAVCCICMDGECQNTNVILFCDMCNLAVHQDCYGVPYIPEGQWLCRRCLQSPSRLV
NCVLCPNTGGAFKQTDQGTWAHVVCALWIPEVRFANTVFLEPIDSIEMIPAARWKLQCMV
CKQRGAGACIQCHRSNCYSAFHVTCAQQAGLYMKMEAAGSGRDPSQPVQVAKMAYCDAHT
PAHVLQERRALESEGESKSSDLTSIRQKGREKIKQARRVLALKRTWAPVVLVPTLPPERV
AEIAQLSHGTPAARAQLMKRLLAYWTLKRHSRNGVPLLRRLQSLTSHHGSRGIQDGTVNV
RELCNQLKYWQRIRQDLERARLLCELVRKRERLKAEYTRVWERCVLHTLRPERAMLSKML
RMMRHADHSDVFTEPVDPLEVPDYSTVVKHPMDLSTMGKKLDRGIYKTIDDVEADFQLMI
DNCLTYNKKDTVFYKAGVKMREQCTSIFRQARRDVIEAGLASLAGEGDAEETYTPGRTHA
QKHTQSRRRSVRNTSSDSDRTADTRSERGVSLARSERRHTSALRDSDDDINQREPSPAKS
KVNRNWWRGRGRGRGRRGRRGRGRGGHVSPRPLRDNDTPTTDSEAPIVKSKTVERTQKLV
TPEKSPTKQLESTGLGLLGGLRKPTLLVTPSITTPPKSFGSDATLPTLSASLGHTEPSPR
KKGRGRPRKQDKTTDLFRGDSEVLGGASFLQYRGPPGEVGSDSDLALSRSSSSSSAWSQS
CSSCTHYDDDRSGDDSSSEGSSYNETLDSLESRGPEPRRRGRRADEGVDRTQPATPVKGR
GTRSSTSKTPVKVTQSDVLLEPLQLVWAKCRGYPWYPALIIDPKMPKGYIYNGVPLPVPP
QDVLNLKKNYAHEPVLYLVLFFDVKRTWQWLPPNKLEILGLDKEIDEAKLVESRKPTDRK
AVKKAYGDAMQFRKQVDGDK