DPGLEAN15658 in OGS1.0

Genomic Positionscaffold1:+ 2729701-2743051
See gene structure
CDS Length1509
Paired RNAseq reads  137
Single RNAseq reads  454
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA013068 (3e-54)
Best Drosophila hit  ND
Best Human hitND
Best NR hit (blastp)  hypothetical conserved protein [Glossina morsitans morsitans] (5e-11)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC009316 [Tribolium castaneum] (2e-11)
GeneOntology terms  ND
InterPro families
  
IPR006578 MADF domain
IPR004210 BESS motif
Orthology groupMCL31105

Nucleotide sequence:

ATGAAAATATTCCAACAGACTGTCATCTGCAGTATAGCTCCAAAGGCTCTACCTGCTCTA
ACTGTATATTTTGGTGAGAGAACCTCAAATAAGCGTACAGAGTATGAGATGGATTCTGAT
TCGCTTTATCCGGGCTCAAGGTCCCCTAGGCCGGGGGCGGTTCTCTTCCTGCCTCCCTGC
ACTCTCCCTCACCCCTCACCCCTCGCCCCTCACCCCTCAGACCCGCACAGCTCCACCTCG
ACCGGGAGGGCTCCGGCCCGGCCGGGCGAGCGAAAATTCGATATCGGATTAAGTTATATG
AAATGCATAGCAGTGAATAGATGCTCGGGTTTTGAAGCGAGTAATAAAACACGAGCAGCA
CAACTCCGGGCGGCAGCAGACGGCAGGCAGCAGACAGCCGCTATCCGCATTGGCGCATTG
GGCCGAATAAAAAAGAGTTGCTTCAGAGCAATAAGTGCCAGGGAACGGAGCCAGCTATTT
ACATGGCCCCACATGAACTTTTATGAAGACCGTCCAGGTCAAAGTCCGAGTGAACAGACG
TCCCGGAGGAAGGGAAGGGAACTCCTCGAGGAAATCAACGGACGCCCAGCAACTGGTCCC
GACACTACGACTTATCTCAGCCACGGGTCTAACTTGTTTCAAGACAATGGAGTCATGTTT
GCGACAGTCGCCGGTATGCGGCGCGGCGCGGGAGCGGCGCGGGGCGGGGCGACAGACGGT
CGCTCACCGCCGCGGCAGTCGCACGCAGACCCGCGCCGCGCCGCGACATCTTACTCAGGC
TGTCTTGGGGAGAGCAGTTCCACTATGATCAAAACGACCCTGTACCAGCTGAATCCTGTC
CGCCTGATAGAGGAGATCAAGAAGCGGCCAGGCCTGTACCGGACGGACCAGCCGGCGGAC
AGGGAGGAGAAGCTGCAGCTGTGGAAGGAGGTCGGAGCTTCCATCTACGATGACTGGGAC
ACCTTCAACAAGGCGACGGCCTACGACAGAGTTCTCCAGTTGCAGCGCAAGTGGCGCTCC
CTCCGCGACGCTTACAACCGGGAGCTTCGAGCCCGGAGAGCAGCCCCGCGCGGGAACAGG
CGCGTCTACATATACTTCAAACGAATGAGCTTCCTGGGAGGCTTCGACGGAGACGTTAGC
AACGACGAGGATCGCGATGGCAACCAAGTGATATTCAGCAACCAGCCGACGGAGGACCCT
TTGTTTGGGGAAGTCAGCAAGAAGAGGAAGAGGCGGAAGAGAAGAAAGTGTAGCTCGGAC
AGCGAGCACGAACCCAAGGAGCTGGAGATGCAGGTGTTCCCGGTCGAGATGGCTGACGAG
GGGGACAGTGACAAGTTGTTCCTACTGTCTTTCCTGACGGAGATGAAGCAGCTGCCGGCG
AACATCAAGATGTGGGCGAGGGCACAGATCGCCAACGTGATGCAGGAGGCCGTCAGCAGT
CAGTACGGGAACACCACACCCGGGGACAGAGTGCATGCCATCAAACCCAGGAGAGAGAGC
TCCGACTGA

Protein sequence:

MKIFQQTVICSIAPKALPALTVYFGERTSNKRTEYEMDSDSLYPGSRSPRPGAVLFLPPC
TLPHPSPLAPHPSDPHSSTSTGRAPARPGERKFDIGLSYMKCIAVNRCSGFEASNKTRAA
QLRAAADGRQQTAAIRIGALGRIKKSCFRAISARERSQLFTWPHMNFYEDRPGQSPSEQT
SRRKGRELLEEINGRPATGPDTTTYLSHGSNLFQDNGVMFATVAGMRRGAGAARGGATDG
RSPPRQSHADPRRAATSYSGCLGESSSTMIKTTLYQLNPVRLIEEIKKRPGLYRTDQPAD
REEKLQLWKEVGASIYDDWDTFNKATAYDRVLQLQRKWRSLRDAYNRELRARRAAPRGNR
RVYIYFKRMSFLGGFDGDVSNDEDRDGNQVIFSNQPTEDPLFGEVSKKRKRRKRRKCSSD
SEHEPKELEMQVFPVEMADEGDSDKLFLLSFLTEMKQLPANIKMWARAQIANVMQEAVSS
QYGNTTPGDRVHAIKPRRESSD