DPGLEAN18496 in OGS1.0

New model in OGS2.0DPOGS210693 
Genomic Positionscaffold946:+ 37613-44793
See gene structure
CDS Length3177
Paired RNAseq reads  5349
Single RNAseq reads  14305
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006308 (3e-11)
Best Drosophila hit  chaoptic (0.0)
Best Human hitinsulin-like growth factor-binding protein complex acid labile subunit isoform 2 precursor (1e-22)
Best NR hit (blastp)  cell surface protein chaoptin [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to CG1744-PA [Nasonia vitripennis] (0.0)
GeneOntology terms



  
GO:0005886 plasma membrane
GO:0007156 homophilic cell adhesion
GO:0042051 compound eye photoreceptor development
GO:0005515 protein binding
GO:0042052 rhabdomere development
InterPro families

  
IPR003591 Leucine-rich repeat, typical subtype
IPR000483 Cysteine-rich flanking region, C-terminal domain
IPR001611 Leucine-rich repeat
Orthology groupMCL15085

Nucleotide sequence:

ATGATACTTGTTGCATACACTACCAAACAAAATATTAACAAAACCAAAACTTTGTATGTT
AATCAAATCTTAGTATGTCATATCAATTTAAAAGAAGATGCGTTCCGAAATGCAAAGATT
AAAACATTGTCTCTACGAGACTGCGGCGTCACGGACCTATCTCCAGCATCATTCGCCGGC
CTTGAAAACACTCTTCAGTCCCTAGATTTATCAGAAAACAACCTAACAATGATATCCAAA
TTCATGTTAAACAAGTTAGACTCGTTGCGGTTCTTGAACCTAAGAGAAAATAAGGTGGAC
ACGAATTTACTAGCAACGAATAATCCATCAGAGTACTCGACGCCATCGATAAATAACTTT
CAGTATAAGTTGTTCTTCTTGGACATCAGTGGCTCATCGTCTCTTGAAATTAGCTTGCAA
GATGTGAGAAGAATGCGTTCCCTCAGATATTTGTCTGTGAGTAAATTGATAAGACGCAGC
ATATCTGCAGAAGATTTCCTAGAATTCGGCGTGGAATTGGAGGATCTAAAAATAATCGGA
AGCACTATCAATCGAATCGAAGCGAGCGCCTTTCAACATGTACGGACTATAAAGTCCTTA
GATCTGTCTGAAAATAACATTGACTTTATAGACCCATTCGCGTTTGCGGAGTTACATAGC
TTGACATCATTGAAATTAGCCAATGGATTAGCAGATTCCGTAAAAATATTGCCATTTGAA
CCTTTGAAAGCACTTATAGAATTGCAGGATTTGGACGTTAGTAATAACAAATTGAGGAAC
GTCCCAGACACATCTTTCCACTTCTTATATAAGTTAAAAACATTGAACCTTCAAGATAAT
CTCATTGATCACTTCTCTAAAGGAACATTACAGAGTGACATACATCGACAGCTAGAAAGC
GTTTCTCTATCGCTAAACCAAATGCAACGAATTGATCAACATACATTTGTCAATTTACGA
GAATTACAGGAAATTTTAATCGAAGATAATCTAATAGAAACAGTGCATAGACGTTCCTTC
ACAAGTTTAGATAACCTGAAGGTGATTCGATTAAGAGGAAATATTATTACTGAAATTAGT
GAAGAGGCATTCCAAAATCTACCGGCCTTGAAAGAGTTAGATATATCATTTAATCAATTG
GAGACGTTCAAGTTTTCGATATTTGATCAAGTTGGATCTGCGACGGCCTTGAAAGTAAAC
GTGTCATACAACAGAATAGTTTCATTGACTGATTCAAATGCTGTCAATTTCTTCTCTTCA
AACTTTTATCCTCCACCTAAAGCGCAAAGATTAGTTTCTGAGGATCCCAGTCCTCTGCGT
ATAGAAAGAGGACTTGGCACGGTATCAGTGAATATAAGAGTTTTGGACTTCTCACACAAT
AACATTTCATACATCGCGCCATACTACTTCAGACACGCGGACCTGACGTTATCCGAGTTG
CACCTCTCCCACAATATGATCCGTAATATAACACGAGAAGTGTTTGGGTCGATGCTAATG
TTGCAATACTTGGATTTATCGCATAACCAAATATTCCACATGGAGTATGACTGTTTTAAG
AAAGTTAAAAGATTGCAAATAATAGACTTGTCCCATAATCACCTGTTCGATACACCGGTG
GAAGTGTTCCACGAGATGCAGGGACTTACTACAGTGGATCTTTCGGACAACAACATCAAA
AACTTAGCAGATAATCTCATCATATCTCCAGCTTTAGAGAGGCTAGACCTATCTGACAAT
GATTTGTCACGAATACCAACGAATTGTTTATCTCCGGCTGCTGCTATTAATCTAGTAGAA
CTAGATTTGAGCGGGAACAACATACCCGCTGTAGCTATTGCTGACTTAGTCCAAAGATAT
AGGCACGACGACTGGCCCGAGGAACCGGACTACAGTGACGAATACATGTACCACACGGCT
AGGCGCGACCACGCCAGAGTGTTCCATCAAAAAAAACAATACCCGCAGAACATATTGTTT
AAGTCGCTTGCGTGGTTGGATTTGTCTGACAATCACTTGGTGAGAGTTGAAAGCGGTTCT
TTTGCTGCTTTACCAAAACTCCGATGGTTGGATTTAAGTATGAATATGCCCTTTAACAAC
AATGACCGCGGAAGCAGTTTATTTAAAGGTTTAGAAAGAAGATTATCTCATTTGGGACTA
AAGAATGTTAGTCTCACAAATATCCCATCAATGCCGTTGCCGAAGTTAAAAAGCCTAGAC
CTATCATACAACAACTTTCCCTCCATTCCGACCGACATGACGGCAAACTTGACTCGTCTC
AGAGCTTTGGATTTGTCTTATAATGATTTGACTAATGTTCCCGTAGCGACTCACTCCCTC
AGCGAACTTCGTTGGTTGTCTCTATCTGGGAATCCAATCACTGCCCTTATGAACACTAGC
ATGTACGGCGTGTCTCCGAGACTAGAATATTTAGACGTAACTCACCTAAAATTGAGTATA
CTAGAGGCCGGGGCGTTCAGCAAAATGTACGGATTACGCACTCTTAAAATATCTGTTAAT
GGAAATATAAGAGACTTCAATATTCCAAAGATATTGACACACAATGACGCATTGAAGAAT
TTGTATTTACATATAGACAATTCTCAAATCGATCTTGGCAAGGAGATGATTGGAGAACTT
CCTCCCAAGCTAAATAACATTACTATTGTTGGTAAAGCTTTGAAATTTTTGTCACAGAAT
CTGCTAGGTGGTGTTACATCTGAAACTTTGACTCTGACCATTTATAATACCAGCCTTGAG
GAAGTAGAAAGTGAAGTTTTTTGGAGACCAGGCCATGTAAAGAATCTAACCCTAGATTTG
AGGCATAATAATATAGCTAGGGTTCCCAATCCAGCGAGACATGAATGGCCGGGAGTACCA
AATTCTTTATTCCTTCACGACATATTTTTGTCTGGAAATCCTTTATACTGTGATTGTCGC
ATCGGTTGGGTTCAAGCGTGGGATCGCAAACGAAGACAATATTTGTGCGAGAGTCCCTCT
AGTTGTGTTGCTGTACGAGACGATCTCAGATTTGCGAAATGTCCTTCCCATTATAACAGG
ACTTTCAGTGACGTCATCGCGAAAGATTTAGACTGCACTTGGAGTAAAGGATTCCTGAAC
TTACCAAACTTATACATAATTACGGCAATATCTATCATGACATGCCTCTACATTTGA

Protein sequence:

MILVAYTTKQNINKTKTLYVNQILVCHINLKEDAFRNAKIKTLSLRDCGVTDLSPASFAG
LENTLQSLDLSENNLTMISKFMLNKLDSLRFLNLRENKVDTNLLATNNPSEYSTPSINNF
QYKLFFLDISGSSSLEISLQDVRRMRSLRYLSVSKLIRRSISAEDFLEFGVELEDLKIIG
STINRIEASAFQHVRTIKSLDLSENNIDFIDPFAFAELHSLTSLKLANGLADSVKILPFE
PLKALIELQDLDVSNNKLRNVPDTSFHFLYKLKTLNLQDNLIDHFSKGTLQSDIHRQLES
VSLSLNQMQRIDQHTFVNLRELQEILIEDNLIETVHRRSFTSLDNLKVIRLRGNIITEIS
EEAFQNLPALKELDISFNQLETFKFSIFDQVGSATALKVNVSYNRIVSLTDSNAVNFFSS
NFYPPPKAQRLVSEDPSPLRIERGLGTVSVNIRVLDFSHNNISYIAPYYFRHADLTLSEL
HLSHNMIRNITREVFGSMLMLQYLDLSHNQIFHMEYDCFKKVKRLQIIDLSHNHLFDTPV
EVFHEMQGLTTVDLSDNNIKNLADNLIISPALERLDLSDNDLSRIPTNCLSPAAAINLVE
LDLSGNNIPAVAIADLVQRYRHDDWPEEPDYSDEYMYHTARRDHARVFHQKKQYPQNILF
KSLAWLDLSDNHLVRVESGSFAALPKLRWLDLSMNMPFNNNDRGSSLFKGLERRLSHLGL
KNVSLTNIPSMPLPKLKSLDLSYNNFPSIPTDMTANLTRLRALDLSYNDLTNVPVATHSL
SELRWLSLSGNPITALMNTSMYGVSPRLEYLDVTHLKLSILEAGAFSKMYGLRTLKISVN
GNIRDFNIPKILTHNDALKNLYLHIDNSQIDLGKEMIGELPPKLNNITIVGKALKFLSQN
LLGGVTSETLTLTIYNTSLEEVESEVFWRPGHVKNLTLDLRHNNIARVPNPARHEWPGVP
NSLFLHDIFLSGNPLYCDCRIGWVQAWDRKRRQYLCESPSSCVAVRDDLRFAKCPSHYNR
TFSDVIAKDLDCTWSKGFLNLPNLYIITAISIMTCLYI