DPGLEAN21701 in OGS1.0

Genomic Positionscaffold3770:- 7193-14905
See gene structure
CDS Length1812
Paired RNAseq reads  866
Single RNAseq reads  2405
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011132 (6e-10)
Best Drosophila hit  CG10011 (8e-98)
Best Human hitankyrin repeat domain-containing protein 50 isoform 2 (3e-92)
Best NR hit (blastp)  ankyrin repeat-containing protein, putative [Pediculus humanus corporis] (4e-153)
Best NR hit (blastx)  PREDICTED: similar to CG10011 CG10011-PA [Tribolium castaneum] (6e-139)
GeneOntology terms
  
GO:0005515 protein binding
GO:0007165 signal transduction
InterPro families
  
IPR020683 Ankyrin repeat-containing domain
IPR002110 Ankyrin repeat
Orthology groupMCL12717

Nucleotide sequence:

GCGGAAGCGGATATAGAGGCCTCAGACGCGGCGGGTCGCACGGCGCTGTGGGCGGCCGCG
TCTGCGGGCCACGCCGACACCGTCAGACTGCTACTGTTCTGGGGGGCCTGTGTCGACACC
ATGGACGACGAGGGACGGACTGTGCTCAGCACAGCCGCTGCACAGGGTAACGTGGAGGTT
GTCCGCCAGCTGCTAGACCGGGGGCTAGACGAACACCACCGGGACAACTCCGGCTGGACG
CCGCTACACTACGCCGCCTTCGAAGGTCATATAGAGGTCTGCGAAGCGCTTCTGGAAGCG
GGGGCGAAGGTCGACGAGGCCGACAACGACGGCAAGGGACCTCTCATGCTGGCGGCGCAG
GAGGGACACACCAGGCTCCTGGAACTGCTCGTAGACACCTGGGCCGCCCCGGTCGACCAA
CGCGCGCACGACGGCAAGACGGCGCTGCGCCTGGCGGCGCTGGAGGGGCACTTCGAGGCA
GTAGCCGCGCTGCACTGCCGCGGGGCGGACGTGGACGCGCTGGACGCGGACCGGCGGAGC
ACGCTATATGTACTGGCCTTGGACAACAGACTGGCGATGGCCAGGCAGCTGCTGGCGTGC
GGGGCCAGCGTACACTCCAGTGACACTGAGGGTCGGACTCCTCTCCACGTGTCCGCCTGG
CAGGGGCACACTGAGATGGTCAATCTGTTGATAAAAGTCGGCGGGGCGTCCGTGGACGGC
CGGGATCGCTGCTCACGCACGGCGTTGCACGCGGCGGCTTGGCGCGGCCGGGCCGGGGTG
TTGCGGACCCTGCTGGAACACGGAGCGGACCCCGCGGCCGTGTGCACCCAGGGAGCTACG
CCGTTGGGGATAGCTGCACAGGAGGGTCACGAGGAGTGCGTGCTGTGGCTACTCCAGCTC
GGGGCTGATCCGTTACAAGCTGACCACTGTGGGATAGCTGCACAGGAGGGTCACGAGGAG
TGCGTGCTGTGGCTACTCCAGCTCGGGGCTGATCCGTTACAAGCTGACCACTGTGGTCGT
ACGCCTGCTAAAGTAGCCTGGAGAGCTGGACATGCGAACATCTGCCGGCTTCTGGAGCGC
TGGACCGCGCCCTCCGCACCTCCAGCACCTCCCGTCACACATCACGAGGACAAGCGACCA
GCCTCCCCGGAGTACAAACGCCGTAGTATCCACAGCTCCAACTCCACAAAATCATCGTCC
AACATGACCGGCGGCTCCAACAGGTCACACGACGAGGACGATAAGGGTTCCCTCTCTTTC
GCCCAGCAGGTGGCGCGCTGTGGACGAGCGAGACGGGAGATAGAGAGAGACGAACCGATA
CCAGAGCACCAAGTGCTGGAACAGGACTCCAAGCTCAGGAGTTATATAGCGAATGAGAGG
GACAGCGAGCTACATGGATATGCGAGGGAGAGAGACAGGAGACGGGAACAGAGACACGGC
ACCACCAGCCCGCTGTACGCCTCGCCGCCCAGGAGCCCCAGCGAACCACGGAGCCCCGAC
CCGCCTGCTGGTTCCCAGCCAGCCAGTCTAACGAGCGCCCCGGCACTGACGGACAACCAC
TTCAACAGAGACACGCACATGAGGATCATCCTGGGCAGAGACAAGCACGCGGAGAAACAT
GACGGTAAGAATAAGAGGAATGGCATCGTCACCAACCCGGCGATGCGTCTGGTCGCTAAC
GTTAGGAACGGTCTGGACAGCGCAGCAGCTAACATTCGCCGGACGGGGGTCGCGTTAGCA
GCCAGCGCCAGTTCCTCCAACCCAGCAGTCAAGACCAACGCGTTCCAGTGGAGGAAGGAG
ACTCCGCTCTAG

Protein sequence:

AEADIEASDAAGRTALWAAASAGHADTVRLLLFWGACVDTMDDEGRTVLSTAAAQGNVEV
VRQLLDRGLDEHHRDNSGWTPLHYAAFEGHIEVCEALLEAGAKVDEADNDGKGPLMLAAQ
EGHTRLLELLVDTWAAPVDQRAHDGKTALRLAALEGHFEAVAALHCRGADVDALDADRRS
TLYVLALDNRLAMARQLLACGASVHSSDTEGRTPLHVSAWQGHTEMVNLLIKVGGASVDG
RDRCSRTALHAAAWRGRAGVLRTLLEHGADPAAVCTQGATPLGIAAQEGHEECVLWLLQL
GADPLQADHCGIAAQEGHEECVLWLLQLGADPLQADHCGRTPAKVAWRAGHANICRLLER
WTAPSAPPAPPVTHHEDKRPASPEYKRRSIHSSNSTKSSSNMTGGSNRSHDEDDKGSLSF
AQQVARCGRARREIERDEPIPEHQVLEQDSKLRSYIANERDSELHGYARERDRRREQRHG
TTSPLYASPPRSPSEPRSPDPPAGSQPASLTSAPALTDNHFNRDTHMRIILGRDKHAEKH
DGKNKRNGIVTNPAMRLVANVRNGLDSAAANIRRTGVALAASASSSNPAVKTNAFQWRKE
TPL