DPGLEAN04942 in OGS1.0

New model in OGS2.0DPOGS210728 
Genomic Positionscaffold314:+ 9747-20868
See gene structure
CDS Length3609
Paired RNAseq reads  984
Single RNAseq reads  2443
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006258 (4e-89)
Best Drosophila hit  CG16908 (3e-26)
Best Human hitTEL2-interacting protein 1 homolog (1e-36)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC001200 [Tribolium castaneum] (3e-82)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC001200 [Tribolium castaneum] (4e-66)
GeneOntology terms


  
GO:0005488 binding
GO:0003674 molecular_function
GO:0005575 cellular_component
GO:0008150 biological_process
InterPro families
  
IPR016024 Armadillo-type fold
IPR011989 Armadillo-like helical
Orthology groupMCL12695

Nucleotide sequence:

ATGAACGCACACTTGAAAGAGGCATTTACACGCATAAAGCCAATATGCGACATGGTGATG
GTAAATCCATCGCCGGAGCATATAACAGCGTTTGCAGCACTGGTAGGAGAGCTAAAGATG
GAAATAATCCAAGAGCTGCAGCAGTATATGCTGTTTCCGTTTATCACACACTTACAGTCC
AAAGAAATGGAAACAAAATATGAAATGCAAGGACTGTTAATAGACAGCATGCGGGAAGTG
CTACAGAGGGTGTGTGTAACCAGTTTTGAGATGTGCATGAAGATTGAAATGGGCCTTTTG
AGTCTCGTGTTTGAAAAACCAAAGCCGGGAATGATTGCGGATGTTCCCGAGGAACTCAAA
CTAAGTGTGATGCAGTGCTTGACGGTGTTGATGTTGCATATCGATCAGCCGACGAGGGTG
AAGATGCTGGAGAACCAAGTGCCATTGTTGGCTCAAGCTGTGTTTGTGTCGGTGCACCTC
GCAAAGTTAGAGAAACTAAGGAGCTTGAGGCTAGCTGCCATAACGTGTCTGTGTGCGCAC
ACGGGCTGCCACCCTCAGCAGACGGACGCGCGTGGTCTAGTTCCAGACCCCGCTATAGAG
ACTGCTGTCTTAGGGCTGCTAGCATGCATCCTGCCGGGCGTGCTGGCTGCGCTGCAGGAC
GTTGCCATGAGTGACAACCCTGGACATGCAGTTGTTGTTGCCGCACTGAATGCAACCCAT
CGCGTGCTGTGCCTCACGATGCACAACAAACACTTGGCGACCAAAGAAAACATCACGGCC
GATGACTTCGTGGCCATGTTAGCCGAAAAGGCCAAGCCCTCCAATGATGTAAGCAAGGAA
AGCATCAGAGATAGTCGGCTAGGTGCGATCATAGCGGCAAGAGATTATCACTCGCTAGCC
GAGTTCATACGATCTGGTAGGGACCAAACGGATCCTCCCAGGAGGGAGAGAAAGATTCCC
AAACAAAAAAAGAATAGAAATCAGAATGCCAACCAAAAAAACGTTAATTACATGGAAAGG
CTCGGCAGCTATGAAAGAACGCCGAATACTTCACAGAATGCAGCGAAGGCACCCCGCGAT
ATTCCCAAGCGGACATCCGAATGGTACACGATGGCCGGGGACAAACTGGCCATAGTGATC
AAGAGTCTCATACCGCTGGTCTCTCACGAACACTTCAAAGTGAGGAAGGAGCTCGCCATA
CTCTGCTATAGGATCATATCAGAGTGTAGTGCCACCATGCAGCCTTCTCTGCCGATGTCT
TTGGATGTACTGATCTCCCTCTGCCATGACTCCTACCAGGAGGTCTCGGACTACTGTGAT
GCAGCACTGAAGGCACAGTTTTCGAACCCCGAGAGAGAAACTATGGACAGTCTCTGTGAG
AACTTCTTCGCCACCATCAACTGCTTGCCGAGAATTATGAACAATATTGACGAGAACCGC
AAGCTGTCCGCCCTGAACCTGATAGCCGGCTACCTCACCATCCTGTGTGACGGCGGCCGC
CCCCAGCGCCTGACGAGTCTCTTGACAGCTAGCGACGGCTTCGACCGCGTGTGCGACGCG
CTAATCGCTACGGCTGACATGTACACCGACATGTCTCTCCTCGCCAGACCGGCCGGAGGC
GACATCACAGGGATGTCCTCGTGTGAGTTGTCCGGTCCGTGTCCCTGGCGCCGTCTCCGT
CACCTGTCTCCGGCTGGAAGTCAACAGCTGCAAGGCCTGCTGGCGTCGCTGGGCGCGGCG
GAGTGCGCGGAGCTACTGCTGGACAGATTTCTGGAACTCTTCCACGAGCGACGCTCCTGC
GACCTCATATACATCATCAACTATCTTGGATCAGGTCCAGATTCAAACCCGGATCTAGCT
CGTCGCATCATCAGCGTGTATATAACCGAGGACGTGTGGTACCAGCCGCTGGAGGTGCAG
AGCGGGGAGAAGCCGCTCTCAGCGGACGAGACCCTCGACGAGAGCATTTACAACCCACGG
TCCTGGACCAGGGACACCGTACCCGGTCTGTTCGAGGGCGCCATAGAGACTCGCTTCACC
GACATCAGCTCGACCCTACCGCGGGTCCGCCTGGAGCCCAACACGTGCGTGACCCTGGGA
CACGCGCGGAGGAACCTCACCAGGAGCTGCCTCCTCACTGAGGGGCTCGGCCTCATGGCG
CTGCGCCTGGGCAGGGACTACCAGCAGTACCTACTGAAGACACTGTGTCTCTTGTTGGAG
AGAGTCGGCAGCCGGTACCCTCCGCTCCGCTCATCGGGTCTCATCGCCCTGCAGCAAGTG
GCCGCGGCCACCGAGGCCAGTGACGTCACAGACCTCATCGGCAGGAACGCCGACTACTTC
ACCAGCCAGGTCACCGGCCGGCTTAAGAAGGCCTGGAACACGCAGTCGGCCCTACAGATA
CTGTCAGTGGTTATGGAGTACAGTGACGTCACTATACTGGACTACTTGTACGGGATAGTG
GAGGACGCCTGGAACACGCAGTCGGCCCTACAGATACTGTCAGTGGTTATGGAGTACAGT
GACGTCACTATACTGGACTACTTGTACGGGATAGTGGAGGACGTGTTAGTGCAGAGCTGC
GACCAGTACTACGAGAAGAATTTGTATTCATATTTACAAGTGTTCCTGACATTCATAAAC
TGCATACGGAAATGGTTTCTGATCCAAGACACGGTCAAGAGGAAGGAGAGTCACGGGCTG
GAGATAGACGTGCTGAAGGACGTCATAGAGTTCGCCAACAACAAGGAGGAAGTAGAGAGG
TTGTTGAACACGAAGGAGTTCGAGGAGGAGACGGGCAGGAGTGTGGAGGAGATGTACCAG
GAGGACCTGCAGCGGAAGGAGGAGGACCTCCTGGACTACGACGACACCGTGACGCAGGAG
GCCGTACCTCTGCCACAACACGTGCGGGTCACGATCACTATACTGAAGCGCTGCGTACAC
TTCGTGTCGTACAAGAGTCGAGACGAGGCGCTGGTGGCGATGGAGGCGCTGTGGCGAGGC
CTGGAGCTGCTGCGAGGACACGACGACGAGCTTCTGCCGCTGGTGCACGCGCTGTGGGAG
CCGTTGGCCGCCCGCCTGCAGGCCGAGCCCGTGCTGGCGCGAGCGGCGCTGCGGGTGTTG
GCGCTGGTGGCCGACCTGGCCGGGGACTTCGTCCGGGAGAGAGTCGTCAAGGACGTGCTC
CCCCGCGTGTGTTCGGTGCTCCGCTCGTGTTCTCGCCGCAGCGTGTTGGCGGACGCGGGG
TCGTCGTACCGCCTCACGGCGTCGTACTCCCTTCAGCGCTCCGCCCTCGAAGCGCTGGGT
CCGCTCGCCGCTAACGTGGGGCTCCGGGGCATCGCGCTCCTCGACGCCCTGAAGGCCGGG
GCGCTCTACCTTCACAAAAACCAACCCAAACCCCTACAGCTGTTGGCGGTGAAGTTCTTC
AAGGACATGCTGGAGTACGACTACGGCAGCTCGTGGCAGTTCCTCCGCCGCCTCGCCAAC
AACAAGCAGCCGCTGACGCCGCCCGCCAACCGGTTCCTGCACCTGGAGCCCGTCGTGGGC
TCGCCCTACGAGTGTACGGACCCGCACTACGACAACAACATCAAGCTCATCTTTTACGTA
CACAAGTGA

Protein sequence:

MNAHLKEAFTRIKPICDMVMVNPSPEHITAFAALVGELKMEIIQELQQYMLFPFITHLQS
KEMETKYEMQGLLIDSMREVLQRVCVTSFEMCMKIEMGLLSLVFEKPKPGMIADVPEELK
LSVMQCLTVLMLHIDQPTRVKMLENQVPLLAQAVFVSVHLAKLEKLRSLRLAAITCLCAH
TGCHPQQTDARGLVPDPAIETAVLGLLACILPGVLAALQDVAMSDNPGHAVVVAALNATH
RVLCLTMHNKHLATKENITADDFVAMLAEKAKPSNDVSKESIRDSRLGAIIAARDYHSLA
EFIRSGRDQTDPPRRERKIPKQKKNRNQNANQKNVNYMERLGSYERTPNTSQNAAKAPRD
IPKRTSEWYTMAGDKLAIVIKSLIPLVSHEHFKVRKELAILCYRIISECSATMQPSLPMS
LDVLISLCHDSYQEVSDYCDAALKAQFSNPERETMDSLCENFFATINCLPRIMNNIDENR
KLSALNLIAGYLTILCDGGRPQRLTSLLTASDGFDRVCDALIATADMYTDMSLLARPAGG
DITGMSSCELSGPCPWRRLRHLSPAGSQQLQGLLASLGAAECAELLLDRFLELFHERRSC
DLIYIINYLGSGPDSNPDLARRIISVYITEDVWYQPLEVQSGEKPLSADETLDESIYNPR
SWTRDTVPGLFEGAIETRFTDISSTLPRVRLEPNTCVTLGHARRNLTRSCLLTEGLGLMA
LRLGRDYQQYLLKTLCLLLERVGSRYPPLRSSGLIALQQVAAATEASDVTDLIGRNADYF
TSQVTGRLKKAWNTQSALQILSVVMEYSDVTILDYLYGIVEDAWNTQSALQILSVVMEYS
DVTILDYLYGIVEDVLVQSCDQYYEKNLYSYLQVFLTFINCIRKWFLIQDTVKRKESHGL
EIDVLKDVIEFANNKEEVERLLNTKEFEEETGRSVEEMYQEDLQRKEEDLLDYDDTVTQE
AVPLPQHVRVTITILKRCVHFVSYKSRDEALVAMEALWRGLELLRGHDDELLPLVHALWE
PLAARLQAEPVLARAALRVLALVADLAGDFVRERVVKDVLPRVCSVLRSCSRRSVLADAG
SSYRLTASYSLQRSALEALGPLAANVGLRGIALLDALKAGALYLHKNQPKPLQLLAVKFF
KDMLEYDYGSSWQFLRRLANNKQPLTPPANRFLHLEPVVGSPYECTDPHYDNNIKLIFYV
HK