DPGLEAN03236 in OGS1.0

New model in OGS2.0: DPOGS213589
Genomic Position: scaffold66:+ 118436-128638
CDS Length: 3399
Paired RNAseq reads: 1332
Single RNAseq reads: 3099
Migratory profiles: Query via corresponding ESTs
Best Bombyx hit: BGIBMGA011657 (2e-21)
Best Drosophila hit: ND
Best Human hit: hypothetical protein LOC57614 (4e-66)
Best NR hit (blastp): hypothetical protein TcasGA2_TC003006 [Tribolium castaneum] (4e-94)
Best NR hit (blastx): hypothetical protein TcasGA2_TC003006 [Tribolium castaneum] (2e-84)
GeneOntology terms:
  GO:0005488 binding
  GO:0003674 molecular_function
  GO:0005575 cellular_component
  GO:0008150 biological_process
InterPro families:
  IPR016024 Armadillo-type fold
  IPR021133 HEAT, type 2
  IPR006594 LisH dimerisation motif
  IPR011989 Armadillo-like helical
Orthology group: MCL17622

Nucleotide sequence:

ATGAATCCGTATAAAGATGTAGAAGAGAGTTCTTTCGTCGCGGCACCACATCTTACATAT
GAAGATATCGCTACTAAGTTGCTTAAGGATAACTTATTTTTAACTGCCCTAGAACTTCAT
ACGGAACTTGTGGAAAGTGGAAAAGAGTTACCTCAGCTGAGAGAATTCTTTTCAAATCCT
GGAAATTTCGAACAACATGTTTCACGCGCGTCAGAAATGGGAACTATTAATCGGACTCCA
AGTTTAGCGACATTGGATTCGCTGGATACAGCAAGGTATTCTGAGGACGGAGGTGGCGAT
CGAGCTGGTAGTGGTTGTGATGTTGCTGTTTTAGAATTTGAACTTAGAAAAGCTAGAGAA
ACAATAAACTCTTTACGAGCTAACCTAACACAGTTTGCTGATGAATCACCTCTTGACAAG
AACAATTCTGAAATCGATAGCCAAAGGACACTTAAACCACATGAGAAAAAAGCTCTCAAT
TTTCTTATAAATGAATATCTTCTTCTTCACAATTATAAACTAACGAGCATTACATTTTCT
GATGAAAATCCAGATCAGGAATTTGAGGATTGGGATGATGTAGGTTTAAATATTCCCCGA
CCTGCAAACCTTATGTCTCTATTCTGGGGAAGTACACGCAGCTTAAGTGTACCGAAGACA
GATGTTGCAACATACACAGATTTCTCATGCATAGACAGTGAATGCCAAACGGATTTGGAT
GAAAATGTGTGTGTGAGTTGTCAGACTTTAGACCATGACACAGATTGGAGCCATGAGGTG
GAAGAAATAGAACTATTGAAACAAAAAATAATAGCTCTCGAAACGGAAAAATTAAATTTC
CAAAAACTATATGATGCTGCCATTGTTAGTCTCAACACGCTAACAAGTCCGATGTCCGAG
AGTAAAACACTGGAATTACAAATACCAGATGAAAAAATTAATTCGAATTTAAAAGATCAC
TTAGAAGATATAAAACCTATTACGTGCATGGCTTTGGAAAACCATTATGGCAGCGGTAAT
TCAACTCACAGTGCTACGCCTGAACAATTTGAGATGATCTATGGTGATAAGAATAATTGC
ATGTCAAAGAAGGACGGTTCAAACACCAGTTCGTTTGAGCCGGCTGTATTGGACAGTGTC
CGAGGGTCGCCGAGACGAGCCAGTGTTACCACATTGGATGAAACTCTCAGCATCAACGAT
GCTGGCGAATGGACCAGGGTTCACTATGAATATAATACTGTAGACAATAACAAAGAAATG
TGGATCGACAGTGGTATACCGAGTGCCTTGAAGGATATGATAATGGGTTGGTGTAACGAG
GCGTTAGGTACAAACGGTCCAATAAACAATGATTTACTTTTGGATCTCGTTAATAGTGAG
AAGACAATCACACTGTCAGGATTATTACCACTTGTCGCGGACACGCTGCCGAGGGTGCTC
CCCCATACCTTAGTGTCTCGTCGCGGCGAGGCCGCAGCCCTAGTAGCTGGTGCAGCAGCC
CTACTATCTCCAGGTGACGCCAGGCGATCCAGACTCCTGCATACACTTCTCACACTATAC
AAGAAACCTGATCCCGAGGACGCGAAGATCATATGCGAAGCAACACGTCTCGTGGTGAAG
TGGGGCGGTAGTGGGGAGGTTCTGTCCTCTATAGCTGAGCTGCTAGGTTCAAGGTCGTCC
GAACGAAGAGTTCTAGCCAGTCAGATCTGCTTGGCAATAGCGCCTTATGTGCCGATAGAG
CTGTGCACCTCACTGCTACTGAGTTTGGTGATGCTGATGAGCGAGTCCAGTGAAATAGAA
GTGAGAAACATCGGGCTGAGAGCAGCCGTCCTCATATGTCCAGTGGCGGAACACAAGTAT
GGACAGTTGGAGGATTGCATGTTTAACTTCCTGAGAGATAAAGACGAGAAGATAGTTAAG
GATACAGTGAACGTGTTTGTACCTGTCCTAGCGAGGAGTGCCATCATATCTGGTAAATTC
TCAACGGATTTATTCAGCAAGGTGCTAGCGAATTTGAACAAATCTGGCTCAGACAATGAA
CGGAGGACAATGATCATGTATTTGGAGGTTCTGCAGTCGTTGGCATCCTCCGAACTGGTG
TACGTGACCAATGTACAACTGGTCAGGGATGTGAATTGTGATATTGTCATGTCCGAAGTA
CCGTTGAGTGATCAAATAGATGTATATAACATGAGTGATTCTGATAGAGTGTTGTGTGTG
GCAATGAACCGCCTGCTGAAGGAAGATCCCCATACCAGATGGGCCGAACTAAATTGGTTC
ATTGATGTTACAAAACAAATTTTAGATATAGGAATTAAATACAAAGCGTTGAACCATCCG
GCGGTGTATGAAACACTTATAACATTATTCCATACATATGTTGATAAGTTCGGACATGAC
TTCACGGCTGCTGTACTCAGCGGCGTGTTTACTGAGATTATATTGGATTTAGAGAATAAA
TTAGAGAAATTACACGCCATTAGTGTGGACAGTATGGTCGTTGTGGGCATATACCTGGCG
ACTGTTTTGATTGAAGTAGAAAGTGTCGATCAACAAGCAGAGTTCTTACAGAAATGGACT
ATGTATAGCAGTATAAGGGGTTTGCCACCTAAAATATTATCGATTCCCTTGAAATGGTTG
TCTCAACAGAGGCCGAGCACACTCAACACCTACATACATCATTTACGAGAATTTGCAGCG
AGCAGTTGTGACTCATCAAGCGGCAGTACTATACGGATGTTCATAGCTAACCTTATAACG
GAGTTTGTAAACACGACGGATGTCAACGAGGACTGTATCCACAACCAACTGTTGCCGGCC
GTCATCGCACTGCTCTATGATGATGACGTATCTGTCCGCGAGGCGGCGATAACGTCGTGG
GGCAGCGTTTCAAGGTCGTGTGTGAGTCGAGGGTTGTCCTGCTCAAAAAACTGCTGGCCG
GCCTTCGAGGAGGTCGTACGGAGACCGCTGGTCGGCAGGGAGTTAGCGCGCGCAGCCGAG
GCACTCGCTATGCTATTACTACCAATAGACGGACAAACAGTTTGCGAAAAAGCGGTGTCG
GTGCTGTGCTCGTTGTCTATGAGCGTGTCTATGGTCGATAGTGAAGTGATGTCGTCTCTG
GCACCGGCGCTCCAGTTAGCGTCGCATCACTGTCCCCAGCACCCAGCACTACTACCAGCT
CTCAGGAAATTAGAGGAAATCGTCCAATCACCTTCTATGGCCCAATACAAGCCAGCAATA
GAAGCCCTTCTCCACGTGGCCGGTACTGAGGCTGTAGACAGTTCCCCTCGGTCCAGCAAC
CTTCATACAGCGCAGGAAGTTGGCAGAAGAGTCACTCAGATCTTCCAGCAATCCAAAACC
AACATAAACCTTCCAAATATATTTAGAAAGAAAACTTAG

Protein sequence:

MNPYKDVEESSFVAAPHLTYEDIATKLLKDNLFLTALELHTELVESGKELPQLREFFSNP
GNFEQHVSRASEMGTINRTPSLATLDSLDTARYSEDGGGDRAGSGCDVAVLEFELRKARE
TINSLRANLTQFADESPLDKNNSEIDSQRTLKPHEKKALNFLINEYLLLHNYKLTSITFS
DENPDQEFEDWDDVGLNIPRPANLMSLFWGSTRSLSVPKTDVATYTDFSCIDSECQTDLD
ENVCVSCQTLDHDTDWSHEVEEIELLKQKIIALETEKLNFQKLYDAAIVSLNTLTSPMSE
SKTLELQIPDEKINSNLKDHLEDIKPITCMALENHYGSGNSTHSATPEQFEMIYGDKNNC
MSKKDGSNTSSFEPAVLDSVRGSPRRASVTTLDETLSINDAGEWTRVHYEYNTVDNNKEM
WIDSGIPSALKDMIMGWCNEALGTNGPINNDLLLDLVNSEKTITLSGLLPLVADTLPRVL
PHTLVSRRGEAAALVAGAAALLSPGDARRSRLLHTLLTLYKKPDPEDAKIICEATRLVVK
WGGSGEVLSSIAELLGSRSSERRVLASQICLAIAPYVPIELCTSLLLSLVMLMSESSEIE
VRNIGLRAAVLICPVAEHKYGQLEDCMFNFLRDKDEKIVKDTVNVFVPVLARSAIISGKF
STDLFSKVLANLNKSGSDNERRTMIMYLEVLQSLASSELVYVTNVQLVRDVNCDIVMSEV
PLSDQIDVYNMSDSDRVLCVAMNRLLKEDPHTRWAELNWFIDVTKQILDIGIKYKALNHP
AVYETLITLFHTYVDKFGHDFTAAVLSGVFTEIILDLENKLEKLHAISVDSMVVVGIYLA
TVLIEVESVDQQAEFLQKWTMYSSIRGLPPKILSIPLKWLSQQRPSTLNTYIHHLREFAA
SSCDSSSGSTIRMFIANLITEFVNTTDVNEDCIHNQLLPAVIALLYDDDVSVREAAITSW
GSVSRSCVSRGLSCSKNCWPAFEEVVRRPLVGRELARAAEALAMLLLPIDGQTVCEKAVS
VLCSLSMSVSMVDSEVMSSLAPALQLASHHCPQHPALLPALRKLEEIVQSPSMAQYKPAI
EALLHVAGTEAVDSSPRSSNLHTAQEVGRRVTQIFQQSKTNINLPNIFRKKT
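
Consistency check (illustrative):

The CDS length of 3399 nt and the protein sequence above can be cross-checked by translating the nucleotide sequence. The sketch below is one way to do that, not the method used to build this record. It assumes the standard genetic code and that the CDS includes the terminal stop codon, as the final TAG codon suggests. The names `translate` and `expected_protein_length` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0; stop codons are written as '*'."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted directly.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length):
    """Residue count for a CDS that ends with one stop codon (assumption)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# First six codons and last three codons of the nucleotide sequence above.
print(translate("ATGAATCCGTATAAAGAT"))  # MNPYKD
print(translate("AAAACTTAG"))           # KT*
print(expected_protein_length(3399))    # 1132
```

Pasting the full nucleotide sequence into `translate` and removing the trailing `*` gives a string that can be compared with the protein sequence, which has 1132 residues.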