DPGLEAN12112 in OGS1.0

New model in OGS2.0  DPOGS214516
Genomic Position  scaffold422:+ 40144-47267
CDS Length  1680
Paired RNAseq reads  138
Single RNAseq reads  392
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  ND
Best Drosophila hit  sine oculis-binding protein (1e-45)
Best Human hit  sine oculis-binding protein homolog (2e-22)
Best NR hit (blastp)  PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum] (2e-58)
Best NR hit (blastx)  PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum] (3e-59)
GeneOntology terms  GO:0048749 compound eye development
InterPro families  ND
Orthology group  MCL19521
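
The genomic position and CDS length fields above can be compared with a few lines of Python. This is a minimal sketch, not part of the record: the function name `parse_position` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re


def parse_position(text):
    """Split a 'scaffold:strand start-end' string into its four parts."""
    match = re.fullmatch(r"\s*([^:\s]+):([+-])\s*(\d+)-(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)


# Values copied from the record's fields.
scaffold, strand, start, end = parse_position("scaffold422:+ 40144-47267")
cds_length = 1680

# Assumption: coordinates are 1-based and inclusive.
genomic_span = end - start + 1

print(scaffold, strand, genomic_span, genomic_span - cds_length)
```

Under that assumption the gene model spans more genomic sequence than the CDS covers, and the difference would correspond to non-coding sequence such as introns within the model.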

Nucleotide sequence:

ATGCAGGAACTCAGTGCGAATCCCGAAAGCCTCAATCGAGTCTCCAAGCGGGTAGGTCTT
AAAATGAACAGGGATAAGTCAAAACTCATGTCTAATGTCCATGTGGCATCTACCTCTATA
ACGGATGAGAACTCGTTACCTGAAGTTGATAACGCGTATGATTTTGCGGAGACAGCGATG
AATGAGCTGCTCGGCTGGTATGGCTATGAGCGTTTGGAGCTTCGGAGATGGGCAGCTTCC
AGAGCCAGGGACAGTCTTGAACAGGAACAAAAAGGAAAAGCAGATGAATGTTCGTGGTGC
AACAAGAGTGTCGCCAGTGAGAGTGGCGCCTTACAGCAAGCTGGGGCGTTGTTCTGTTCG
GAGCTGTGCTTCAGTCAGTCACGGCGAGCCAACTTCAAACGCGCCAAGACATGCGATTGG
TGTCGTCACGTGAGACACACTGTCGCCTATGTGGATTTCCAGGACGGAGCCACTCAGCTT
CAGTTCTGCTCAGACAAATGTCTGAACCAATACAAGATGCACATCTTCTGTCGCGAGACA
CAGGCCCACCTCGACCTCAACCCGCACTTAGTGAACGCGGCGTCGTCGTCGAACCTCATC
ACGCCAGAACTATGGCTCAAGAATTGCAGGAGCAGATCTATATCACCCACATCAGAAGGA
TCTGGAACACCGAATGACAAAAACGACGACACATGCCAAAGAAAATCCCCACTCCCACTC
ATAACTATAGCGCCACCAGCCAAGTTAATGAATCCGAAACCACCAGAAGACAGGCCGGTG
CAGAAGTCTCCCGAGACCAAGAAGGATCTGAGAACCAAAATGAATTTACGTAAACGTAGA
ACGTCCAAGTGCTCGACGGTGACGTCACAAACTGTCCGGCAACGAAGTATAACACCAAAA
ACCCAAGACTTCAGGATGCTGAGTCCCTCGATGGACGGTTCGTCGCCCGCGTCCTGCGTC
ACGAGCAACAATCCCGTACACTCTCCACCACACATGAACCAACCGATCCCACCACCGTTT
CCGAATCCCATGTTCGGGATGCCGCCGCCGGTCTTCATGGACAGCAACCACATGAATGAC
CACAGAAATCCCATGTTTCAACCGAGAGTGAATTTCATGCCACCGCCTGGCATGCACCAA
GAGAGACCGAGATTATTCCCACCGTTGAATTTCCACCAGCCAATAAACCAGGCTCCGCCA
GTGACGGTACTGGTTCCTTACCCCGTCGTCATACCCGTCCCGATACCCATCCCGATACCT
CTACCCCTGAGCTCGTTCATTCAAGCCCATTGCACCAACAAAGTCAAAACTGAAGTCAAC
ACCGACGACGCCGAGGGTCCTTTAGACTTCACGATGAACCCCGCCAAGAAAAACGAATGC
AACCAACCAGAGGTCCACGAAGAGATCGATCCGGCGGCCACGGAACAAGCTAGTCAACAA
ATAAATAATCATAACGAGAGAGTGGACGACGACTCCCAAAATAATACAGAGACCAACCCC
GAACAGACGCTGCCGAAGTTCAAGATAACGCGATTGGGTAACAAGATGGCGAAAATCGTT
TCAAAAACGAGAGAGAACGCCGAGTCGTCCAGACCTTTGAGGAAAAGACGGAGACTCGTG
GAAGTCGCTACCGACGAAGAGACGCTCATTCCTAAAACAAGGAAAATTGTACAAGTTTAA

Protein sequence:

MQELSANPESLNRVSKRVGLKMNRDKSKLMSNVHVASTSITDENSLPEVDNAYDFAETAM
NELLGWYGYERLELRRWAASRARDSLEQEQKGKADECSWCNKSVASESGALQQAGALFCS
ELCFSQSRRANFKRAKTCDWCRHVRHTVAYVDFQDGATQLQFCSDKCLNQYKMHIFCRET
QAHLDLNPHLVNAASSSNLITPELWLKNCRSRSISPTSEGSGTPNDKNDDTCQRKSPLPL
ITIAPPAKLMNPKPPEDRPVQKSPETKKDLRTKMNLRKRRTSKCSTVTSQTVRQRSITPK
TQDFRMLSPSMDGSSPASCVTSNNPVHSPPHMNQPIPPPFPNPMFGMPPPVFMDSNHMND
HRNPMFQPRVNFMPPPGMHQERPRLFPPLNFHQPINQAPPVTVLVPYPVVIPVPIPIPIP
LPLSSFIQAHCTNKVKTEVNTDDAEGPLDFTMNPAKKNECNQPEVHEEIDPAATEQASQQ
INNHNERVDDDSQNNTETNPEQTLPKFKITRLGNKMAKIVSKTRENAESSRPLRKRRRLV
EVATDEETLIPKTRKIVQV
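
The nucleotide and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code and uses only the first and last 60-base lines of the nucleotide sequence as input; `translate` is a hypothetical helper, not a tool supplied with the record.

```python
# Standard genetic code, laid out in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGCAGGAACTCAGTGCGAATCCCGAAAGCCTCAATCGAGTCTCCAAGCGGGTAGGTCTT"
last_line = "GAAGTCGCTACCGACGAAGAGACGCTCATTCCTAAAACAAGGAAAATTGTACAAGTTTAA"

print(translate(first_line))  # MQELSANPESLNRVSKRVGL
print(translate(last_line))   # EVATDEETLIPKTRKIVQV*

# A 1680-base CDS holds 560 codons: 559 residues plus the stop codon.
expected_protein_length = 1680 // 3 - 1
print(expected_protein_length)  # 559
```

The two translated fragments match the start and the end of the protein sequence, and the 559-residue expectation agrees with a protein listed as nine full 60-residue lines plus a final line of 19 residues.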