New model in OGS2.0 | DPOGS214516  |
---|---|
Genomic Position | scaffold422:+ 40144-47267 |
CDS Length | 1680 |
Paired RNAseq reads   | 138 |
Single RNAseq reads   | 392 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | ND |
Best Drosophila hit   | sine oculis-binding protein (1e-45) |
Best Human hit | sine oculis-binding protein homolog (2e-22) |
Best NR hit (blastp)   | PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum] (2e-58) |
Best NR hit (blastx)   | PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum] (3e-59) |
GeneOntology terms   | GO:0048749 compound eye development |
InterPro families   | ND |
Orthology group | MCL19521 |

Nucleotide sequence:
ATGCAGGAACTCAGTGCGAATCCCGAAAGCCTCAATCGAGTCTCCAAGCGGGTAGGTCTT
AAAATGAACAGGGATAAGTCAAAACTCATGTCTAATGTCCATGTGGCATCTACCTCTATA
ACGGATGAGAACTCGTTACCTGAAGTTGATAACGCGTATGATTTTGCGGAGACAGCGATG
AATGAGCTGCTCGGCTGGTATGGCTATGAGCGTTTGGAGCTTCGGAGATGGGCAGCTTCC
AGAGCCAGGGACAGTCTTGAACAGGAACAAAAAGGAAAAGCAGATGAATGTTCGTGGTGC
AACAAGAGTGTCGCCAGTGAGAGTGGCGCCTTACAGCAAGCTGGGGCGTTGTTCTGTTCG
GAGCTGTGCTTCAGTCAGTCACGGCGAGCCAACTTCAAACGCGCCAAGACATGCGATTGG
TGTCGTCACGTGAGACACACTGTCGCCTATGTGGATTTCCAGGACGGAGCCACTCAGCTT
CAGTTCTGCTCAGACAAATGTCTGAACCAATACAAGATGCACATCTTCTGTCGCGAGACA
CAGGCCCACCTCGACCTCAACCCGCACTTAGTGAACGCGGCGTCGTCGTCGAACCTCATC
ACGCCAGAACTATGGCTCAAGAATTGCAGGAGCAGATCTATATCACCCACATCAGAAGGA
TCTGGAACACCGAATGACAAAAACGACGACACATGCCAAAGAAAATCCCCACTCCCACTC
ATAACTATAGCGCCACCAGCCAAGTTAATGAATCCGAAACCACCAGAAGACAGGCCGGTG
CAGAAGTCTCCCGAGACCAAGAAGGATCTGAGAACCAAAATGAATTTACGTAAACGTAGA
ACGTCCAAGTGCTCGACGGTGACGTCACAAACTGTCCGGCAACGAAGTATAACACCAAAA
ACCCAAGACTTCAGGATGCTGAGTCCCTCGATGGACGGTTCGTCGCCCGCGTCCTGCGTC
ACGAGCAACAATCCCGTACACTCTCCACCACACATGAACCAACCGATCCCACCACCGTTT
CCGAATCCCATGTTCGGGATGCCGCCGCCGGTCTTCATGGACAGCAACCACATGAATGAC
CACAGAAATCCCATGTTTCAACCGAGAGTGAATTTCATGCCACCGCCTGGCATGCACCAA
GAGAGACCGAGATTATTCCCACCGTTGAATTTCCACCAGCCAATAAACCAGGCTCCGCCA
GTGACGGTACTGGTTCCTTACCCCGTCGTCATACCCGTCCCGATACCCATCCCGATACCT
CTACCCCTGAGCTCGTTCATTCAAGCCCATTGCACCAACAAAGTCAAAACTGAAGTCAAC
ACCGACGACGCCGAGGGTCCTTTAGACTTCACGATGAACCCCGCCAAGAAAAACGAATGC
AACCAACCAGAGGTCCACGAAGAGATCGATCCGGCGGCCACGGAACAAGCTAGTCAACAA
ATAAATAATCATAACGAGAGAGTGGACGACGACTCCCAAAATAATACAGAGACCAACCCC
GAACAGACGCTGCCGAAGTTCAAGATAACGCGATTGGGTAACAAGATGGCGAAAATCGTT
TCAAAAACGAGAGAGAACGCCGAGTCGTCCAGACCTTTGAGGAAAAGACGGAGACTCGTG
GAAGTCGCTACCGACGAAGAGACGCTCATTCCTAAAACAAGGAAAATTGTACAAGTTTAA

Protein sequence:
MQELSANPESLNRVSKRVGLKMNRDKSKLMSNVHVASTSITDENSLPEVDNAYDFAETAM
NELLGWYGYERLELRRWAASRARDSLEQEQKGKADECSWCNKSVASESGALQQAGALFCS
ELCFSQSRRANFKRAKTCDWCRHVRHTVAYVDFQDGATQLQFCSDKCLNQYKMHIFCRET
QAHLDLNPHLVNAASSSNLITPELWLKNCRSRSISPTSEGSGTPNDKNDDTCQRKSPLPL
ITIAPPAKLMNPKPPEDRPVQKSPETKKDLRTKMNLRKRRTSKCSTVTSQTVRQRSITPK
TQDFRMLSPSMDGSSPASCVTSNNPVHSPPHMNQPIPPPFPNPMFGMPPPVFMDSNHMND
HRNPMFQPRVNFMPPPGMHQERPRLFPPLNFHQPINQAPPVTVLVPYPVVIPVPIPIPIP
LPLSSFIQAHCTNKVKTEVNTDDAEGPLDFTMNPAKKNECNQPEVHEEIDPAATEQASQQ
INNHNERVDDDSQNNTETNPEQTLPKFKITRLGNKMAKIVSKTRENAESSRPLRKRRRLV
EVATDEETLIPKTRKIVQV