DPGLEAN19203 in OGS1.0

New model in OGS2.0  DPOGS202651
Genomic Position  scaffold539:- 94772-109347
CDS Length  1749
Paired RNAseq reads  1278
Single RNAseq reads  3412
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA000849 (0.0)
Best Drosophila hit  prosap (2e-146)
Best Human hit  SH3 and multiple ankyrin repeat domains protein 2 isoform 1 (7e-96)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC007761 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC007761 [Tribolium castaneum] (1e-177)
GeneOntology terms  GO:0005515 protein binding
InterPro families
IPR002110 Ankyrin repeat
IPR001452 Src homology-3 domain
IPR011511 Variant SH3
IPR020683 Ankyrin repeat-containing domain
Orthology group  MCL14910

Nucleotide sequence:

ATGAATGATCGAAGCTCAGACAATGTTAATAACCAGTCTCGAGTAAGGCTTCGCTGTAGC
ACGCTAGTCACATTAGGAGGTCCGACCGATATATTAGGGGGTGCATTTGCCTACAAATAC
GTTGCTGATGAAATCAATCCACGACCCACTGACATACAGCCCGTCATGTCTATTGAGTTG
AAAGAAAGCTTTAATTATGGGCTGTTCTGTCCACCGGTCAACGGCAAAGCCGGCAAGTTC
CTCGATGAAGAACGACGCCTCGGAGATTATCCTTTTAATGGACCCGTCGGCTACCTCGAG
TTAAAATACAAACGCCGTGTATACAAGATGTTGAAGCTGGACGAGAAGACATTGAAGGCT
CTACACTCGCGAGCCAACCTGAGGCGTTTTCTCGAGCACGTGACACACGGACAGATAGAC
AAAATCACCAAATCATGCGCTAAAGGACTGGATCCCAACTTTCACTGTCAGGACACCGGC
GAGACACCATTAACAATAGCCGCTGGCCTAAAATCCCCGGGAAAAGTGTTAATCGCTCTT
GTGAACGGCGGAGCTCTTCTGGATTACCGAACTAAAGATGGCAGCACCGCCATGCATAGA
GCTGTCGAAAAGAATTCCCTTGAGGCTGTGAAGACTTTATTGGAGCTCGGCGCCTCCCCG
AACTATAAAGATGGAAAGGGATTAACGCCTTTATACTTGTCCGTAACGAACAAGACGGAC
CCCTTGCTCTGTGAGACACTGTTACATGACCACGCCACCATCGGTGCTACGGACTTACAA
GGCTGGCTTGAAGTGCATCAGGCGTGTCGTAACGGCCTGGTCCAACACCTGGACCACCTT
CTCTTCTACGGAGCGGACATGAACGGTCGTAACGCGTCCGGCAACACGCCGCTCCACGTG
TGCGCTGTAAACGCCCAGGACTCCTGTGCGAGACAACTACTGTTCAGGGGCTGCGATAAA
GAGGCTCTCAACTTCGCCAACCAGACACCCTATCAGGTAGCAGTAATAGCCGGAAATTTA
GAATTGGCCGAGGTCATAAAAAACTACAAGTCGGACGAAGTCGTTCCGTTTCGGGGCCCG
CCGCGGTATAACCCGAAGCGTCGCTCGGCGTGGGGCGGGTGGTGGGCGGACCGCGCCGGC
GACCGGACGTCGCTGGCCTCCGTGCCCTCCGAGCTGGAGGCGCTCCTAAGGGCGGCTACA
CACTCGCCGGCCAGCGAGCGCTTCTCGTCCGCTTCGTCGAGCATCAGCGACGCCAGCCAT
CCCAGCCACGAGGACGACGCCAGCATCCTCACAGATAAGAGCGCGGACACGAGCGACATC
ACGGACTCTAGCGGCGTGGGGACCAGCACCTCGGACACCATGTGCTCGCTGCAGACCGCG
GCCACGGTCGTCTGCCTACAGCCCTACGAGCCCACACACCACGGACATCTGCGGCTCAAC
CAGGGTGACATCATAGAAGTGACTGGCGCGACAGACGATGGTCTGCTGGAAGGCTCGGTC
CGCGGCTCGACTGGCTCGGGGCTGTTCCCCGCCAGCTGCGTCCAGGAAGTACGACTCAGG
CAGAACGCACACCTGCATCAGGTGTTGTCCTCGGGTCCCATCCACCACTCGCGCGTCACG
GGGAGGAGGGAGATGGCGCTCAGCAAAACATACAGCGCGACCGCGCCGAGGATCAAGAAG
ACGTACGTATACCGCAGGCTGCATGCACCGGCGCTGCCGAGACTGCAGGTTTATCGGCGA
CCCACCTGA

Protein sequence:

MNDRSSDNVNNQSRVRLRCSTLVTLGGPTDILGGAFAYKYVADEINPRPTDIQPVMSIEL
KESFNYGLFCPPVNGKAGKFLDEERRLGDYPFNGPVGYLELKYKRRVYKMLKLDEKTLKA
LHSRANLRRFLEHVTHGQIDKITKSCAKGLDPNFHCQDTGETPLTIAAGLKSPGKVLIAL
VNGGALLDYRTKDGSTAMHRAVEKNSLEAVKTLLELGASPNYKDGKGLTPLYLSVTNKTD
PLLCETLLHDHATIGATDLQGWLEVHQACRNGLVQHLDHLLFYGADMNGRNASGNTPLHV
CAVNAQDSCARQLLFRGCDKEALNFANQTPYQVAVIAGNLELAEVIKNYKSDEVVPFRGP
PRYNPKRRSAWGGWWADRAGDRTSLASVPSELEALLRAATHSPASERFSSASSSISDASH
PSHEDDASILTDKSADTSDITDSSGVGTSTSDTMCSLQTAATVVCLQPYEPTHHGHLRLN
QGDIIEVTGATDDGLLEGSVRGSTGSGLFPASCVQEVRLRQNAHLHQVLSSGPIHHSRVT
GRREMALSKTYSATAPRIKKTYVYRRLHAPALPRLQVYRRPT
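
Consistency check (illustrative sketch):

The CDS length of 1749 nt corresponds to 583 codons, that is, the 582 residues of the protein sequence plus one stop codon. The sketch below shows one way such a record could be checked by translating the nucleotide sequence and comparing it with the listed protein. It assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    # Remove line breaks and spaces so the wrapped sequence can be pasted as is.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGAATGATCGAAGCTCAGACAATGTTAATAACCAGTCTCGAGTAAGGCTTCGCTGTAGC"
last_line = "CCCACCTGA"

print(translate(first_line))  # start of the protein, beginning with M
print(translate(last_line))   # final residues followed by the stop codon
```

Passing the full nucleotide sequence to `translate` and dropping the trailing `*` should reproduce the protein sequence above if the record is internally consistent.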