DPGLEAN04929 in OGS1.0

New model in OGS2.0DPOGS200959 
Genomic Positionscaffold548:+ 22643-29282
See gene structure
CDS Length2145
Paired RNAseq reads  1274
Single RNAseq reads  3060
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010165 (0.0)
Best Drosophila hit  CG2747, isoform A (3e-137)
Best Human hitHEAT repeat-containing protein 5A (4e-112)
Best NR hit (blastp)  PREDICTED: similar to CG2747 CG2747-PB [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC004631 [Tribolium castaneum] (2e-169)
GeneOntology terms  GO:0005488 binding
InterPro families
  
IPR011989 Armadillo-like helical
IPR016024 Armadillo-type fold
Orthology groupMCL10889

Nucleotide sequence:

ATGTCGAAAGGAACCCGTCAGGAGGCCATACAAATTAATGTTTACACAGCGCTGTTGTTG
GCGCTGAGGACGCTGGGAGAGCTGAAGAGCTCACTGGGACAGGACGCTGTCAAGAATACC
GCTACCGAACTCATCATAGCAGGGTTGTCAGCGAGCAGCACATCGGTTCGTGCTGCTGCA
GCGTCATGCGCCGGAAGGCTCTGTGGGTGTATCACCGAGTCTGAGACGCAGGCACTGAGC
GAACGCGTGATAGCCCTCGCTAGGACGTCTCCTAATAGATCGGTCCGTGCAGCCGCTGCA
GCCGCCGTCGGAACCGTGCAGCGGGCGAGAGGTGCTGCGTCAAGCGCTGCAGCGCTGCCA
GTGCTAAGAGCCCTGGCCCAAGATACGGCTGCGGTGGAAGTGCAGGTGTGGTCGCTTCAC
GCGCTGTCCGTGTTAATAGACGCCTCGGGTCCGATGTTCCGGAGTCATGTTGAGTCAACT
CTGAACCTGGTTCTGAAACTATTGTTCAGCGCTCCGCCCAGCCAGGACGACCTGCACAGA
AGTATCGGCAGACTCCTCGCAGCTCTCATCACAGTTGTTGGAGGCCTCGATAAATATATA
AACTTTTTAGCATGCGGGCACAGTGCTGTGGGTCGCTTCACGTGCGCGTGCGCAGCGCTG
CGGGAGGGGGGCGGTGCTGGCAGCGCTGCCGACGCCATCGGAGGCCTGCAACAACTACAT
CTGTTCGCCCCTGGACACGCTAACCTACGGACTCTCGTGCCGCAGCTGTGTCGCGATTTA
TCCAATCCCGAGCTGTCTGTACGCCGAGCAGCACTGTGCTGTTTGCGGCAGCTGTCCCAG
AAGGATGCCGCTGATGTGTGCAAGTACGCGCTACTCGCTAAGGACCACGTGCCCACTAAA
CCTTACTGTGGGGTTGTGATAACAGACACAGGTCTGCCAGGTGCATTGTTTGCATTCCTT
GATATGGAGCGTGACGAAATGGCGATATCATACGCCAAGGATACGTTGACCTGTTGCCTG
TTGGCCGCCGCCGCCAATAAGTCGGTCAAGGATTGGCTACTTTTGGCCAAACGAGTGCTA
ACTGTTAAATTGGAGGATAGCAACAACCCTGATACGGAGCTGGAAGACGACGATGATCAG
GCGGAGTTCCACGCTGAGAGTGAACAGGCGACGCACCCGGCAGTGCAGCCGCGGTGGCCT
ACAAGGGTGTTTGCAATGGAATGTATCCAGAAGGTAATGGGGGCGTGCGAGGCTACTGGA
GAGAGCGCACACTTTGATCTCGTGAAGGCAAAAGAGAAATTACAAGAGGATCCGAACGCT
GACTACCTCTCGTTGCATCTCTCCGACCTCGTGCGGATGTCGTTCGTTGGGGCGACTGGT
GAATCTGATGCTCTCAGGCTGTGCGGACTGAACACACTGCAACTCATCATACAACAGTAC
GCGAGGGCACCGGAACCGGATTTCCCCGGACATCTGCTTCTGGAACAGTACCAGGCACAG
GTGGGCGCGGCGGTGCGGCCCGCGTTCGCCGGAGACACGGCGTCTCACGTGACGGCGGCG
GCCTGCGACGTCTGCTCCGCGTGGATCGGCTGCGGCGTCGCCAGAGACATTAACGACCTG
CGCAGGGTGCACCAACTGCTGGTGTCAAGTCTGGATAAATTAAATAAAAAAGGCAACACC
ACACTTATTTACAACGAGAGTATGGCAACTTTGGAGAAATTGTCCATTCTTAAAGCATGG
GCTGAGGTGTATATAGTGGCCATGGTCAGCAACGACAGCGCTCCGGGAAGCTACGTCAAA
CAGCTGGATACTAAGCCGGTGAATAACAAGGCGGAGATAGCGAAGTGGCGGAACAGAGTG
CACGAGAGTAACAATCAAACAGACAACACGGCCCAGGAGACTGAGGACGACGATTACGGA
GAGTTTGAGTCGAAGGGGGAGAGCTTGCTCAAACTGGTTGAACCGGAACTGGAGAGCTTA
GGGGAGAATTGGCTCGCTGCCTTGAAGGACCATGCGCTACTGAGTCTACCACCGGAGTTC
GCGTCCCAACTGCCGCATGGTGGAGGTGCCTTCTACTCCATGGAGACGGCGGAGGCGTCC
CGGGCTCACTACGGCCGCGCCTGGCCCGCGCTTCTGCTGGTATAG

Protein sequence:

MSKGTRQEAIQINVYTALLLALRTLGELKSSLGQDAVKNTATELIIAGLSASSTSVRAAA
ASCAGRLCGCITESETQALSERVIALARTSPNRSVRAAAAAAVGTVQRARGAASSAAALP
VLRALAQDTAAVEVQVWSLHALSVLIDASGPMFRSHVESTLNLVLKLLFSAPPSQDDLHR
SIGRLLAALITVVGGLDKYINFLACGHSAVGRFTCACAALREGGGAGSAADAIGGLQQLH
LFAPGHANLRTLVPQLCRDLSNPELSVRRAALCCLRQLSQKDAADVCKYALLAKDHVPTK
PYCGVVITDTGLPGALFAFLDMERDEMAISYAKDTLTCCLLAAAANKSVKDWLLLAKRVL
TVKLEDSNNPDTELEDDDDQAEFHAESEQATHPAVQPRWPTRVFAMECIQKVMGACEATG
ESAHFDLVKAKEKLQEDPNADYLSLHLSDLVRMSFVGATGESDALRLCGLNTLQLIIQQY
ARAPEPDFPGHLLLEQYQAQVGAAVRPAFAGDTASHVTAAACDVCSAWIGCGVARDINDL
RRVHQLLVSSLDKLNKKGNTTLIYNESMATLEKLSILKAWAEVYIVAMVSNDSAPGSYVK
QLDTKPVNNKAEIAKWRNRVHESNNQTDNTAQETEDDDYGEFESKGESLLKLVEPELESL
GENWLAALKDHALLSLPPEFASQLPHGGGAFYSMETAEASRAHYGRAWPALLLV