DPGLEAN17109 in OGS1.0

New model in OGS2.0DPOGS214264 
Genomic Positionscaffold5451:- 185-7274
See gene structure
CDS Length3054
Paired RNAseq reads  108
Single RNAseq reads  263
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005982 (4e-07)
Best Drosophila hit  eyes shut (7e-112)
Best Human hitneurogenic locus notch homolog protein 1 preproprotein (2e-48)
Best NR hit (blastp)  crumbs [Aedes aegypti] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC010461 [Tribolium castaneum] (0.0)
GeneOntology terms



  
GO:0005886 plasma membrane
GO:0005509 calcium ion binding
GO:0042052 rhabdomere development
GO:0005615 extracellular space
GO:0010378 temperature compensation of the circadian clock
InterPro families










  
IPR000152 EGF-type aspartate/asparagine hydroxylation site
IPR013032 EGF-like region, conserved site
IPR018097 EGF-like calcium-binding, conserved site
IPR001791 Laminin G domain
IPR000742 Epidermal growth factor-like, type 3
IPR008985 Concanavalin A-like lectin/glucanase
IPR013320 Concanavalin A-like lectin/glucanase, subgroup
IPR006209 EGF
IPR012680 Laminin G, subdomain 2
IPR013111 EGF, extracellular
IPR006210 Epidermal growth factor-like
IPR001881 EGF-like calcium-binding
Orthology groupMCL11308

Nucleotide sequence:

ATGTTTAAACTACTTAACAGAGGACATTTAAAGAGTACGTGGTGGTTACTAATTGTAATA
CCTATCGCATCCGCCGGCTTTGCCTGCCTTAATAATCCTTGCGTACATGGGATATGTATT
GATGATATTAATAGTTTTAGCACATATTTATGTTATTGTATTGATGGATATACGGGTGTT
CAATGCCAAACAAATTGGGACGAATGCTGGTCAAACCCTTGCCAAAACGGTGGCACTTGC
ATAGATGGTGTGGCATCTTACAACTGTTCTTGCCCGGATGGCTTCATTGGCGATAATTGT
GAGACAAATTACAACGAATGCGATTCAAACCCCTGTTACAACAATGGCACATGCATTGAC
ATGACTAACGAGTACGTATGTCACTGCATCCCCGGCTTCTCTGGAGATCACTGTGAGTTA
GATGTAGCAGTGTGCAACTCGACGGGGGAGGTCAGGTGTCACAATGGAGGCGAGTGCATC
GAGGGTCCCGGGTTCAAATTTTATTGCAAATGTGCCGCAGGATGGACTGGACACAAATGT
GAAGACCAGATCGATGAATGCGAGTCGAATCCATGCAGGAATGGTGGCATATGCATCGAC
GCTCATGCTGATTACATGTGCGCGTGCACATACGGTTTCACTGGTAAAAGCTGCGAGGTA
GCGATAGAGTTCTGCTCTCAGGATTCCTGCAGCGAGAAGGCGCTGTGTGTTCTGGAAGAC
GTGTTGCGAGTCTGCTACTGTGTGCCCGACTATCACGGGGAACGATGTGAACTACAGTAC
GACGAGTGCGCACTAGGACCAAGATGCCTGAACGGCGGGACGTGTATAGATGGCGTTGAC
AATTTCACATGCTCATGTCCACCAAGACTTACAGGACTTCTCTGCGAGTGTCTTATTTTG
GATGACGGAAATTATGATTGTGAATATATTCGTCCAACTCTTCTACCAGACCATAGTACA
GCCACTTCTTCCTTTACTGAAACCATAATAATAGACACAAGTACCATGGAATCGAAGTAT
AATTCAAGCACTACCACCACCAGCCTCAGTACTATCGATAGCGGCACCAGTATTGATGTC
ATTACTACGGACATGGCAATTTATACGAAACTAGATAATGTAACGGACATACCAATAACC
GCTTCCAGTACAGACACAATAGTAACAGAAAATCTAACAACAAGCACAGAAATATCTGAT
ATGACAACTGATTCATTGACTTCAACATCAATTTCAACATCTCGATCAACAACGAAAGAA
GATTCGGTTACTGAAATAGTTACGATTCTGATAGAAACGAAAGGTACTATAGGAGCTGAT
GATTCTAAGACAGAAATAACAACAGAATGTAGTGGATCGTGTCCAAAAGGAAATTTTTCC
ACTAGTGATTTACCACCAACCACTATAACTTCAATAGAAACAACTGAAGGAATCACTACT
TCCACAGAAATTACTAAACAAACAGCAACAGACACAACAGTTCAGACAACCGTTGATCTT
AAAGAAACAACAAAGCAAATGACGTCAGATACCACAGAATATACTCATCAAGCACAAGAC
ATGACAACAGAAAGAATGTTCACCGACAGTCCTGTTGAAACAACAGAACTTGCAACCGAA
TTAACCCATCCAATGACCGAAATCGAAACAAGTACAGGTTATAATCAAATTTCAACAGCA
CACTCAGATTGTACCGACGTCATCTGTAACAATCACGGCAGTTGTATAAACACTCTTCAT
GGCGTTAGATGTCACTGTTTGTTCAATTACGAAGGAAGATTTTGTGAGAGTAAAATTATT
GTTAACTCGGCCGCCTTTGATGGCACTTCTTATATAGCGCATCATATAAAAAATTCTACC
AGCATATCTATAGCATTCAAAGCCAAAACTCTAATCCTTGACGGGCAAGTTATGTACGTG
GATATAGCTAAGGGCGCTTACATGAAATTGTATATGAATTCTGGCTTGTTGAGATTCGAA
TTCTCCTGTGGCTATCAGACAATGCTGCTAAGCGAACTTAAAACTCACCTTAACAAAGGA
TATATTATGAAAATTGAAACAAGATTAGATATATTCTTACCGGAAAATCATTGCAACGGA
ACCCTGAGACTTAACGACACTGTGGCTATGAGCGGCGGCCAGTTTGCAAATATTAGCTCT
CCCGAGTATAATTCGATTCTCTATTTCGGGAACATACCTAATGCTAATAGAAATAACTCT
AATGAGAAGTCTTTTATTGGATGTATCAAGGACTTAATTATAAACGACGAGAGACGTGAA
ATATTTAGCGACGCTTACGAAGCGTCTGAGGTGAGGGAATGCTCTTCTTTGTCTTGTTTG
TCGTCGCCGTGTGTGAACGGTGGTACTTGCAATGATGACGACGATACATACTCCTGTGCT
TGTGCCAATGGTTGGACCGGCGCCACTTGCAACGACTCCGTCTGCGACCACAACCCTTGT
CAGTCCGGTGGAAGTTGTGTCCATCACCCCGGGAGCGGATTCCTGTGCCTCTGTCCATAT
GGCAGGCACGGCATATTCTGTGAATATAACGTGGAAATAACACGTCCGTCTTTATCGCCT
ATATCCCCTGGAAGGTCTTCCTACGTCTTGTATCCGATGTCACAGTCCGCAGCGAATTCT
GATCGGTTTGAAATGCGTTTGCGTTTTCAAACGTCGGACATGGATCAGATAGCGTTGCTC
GCGTTCGTTGGACAAAGAGGAAGACACGATGCCAGGAGTCAACATTTAGCTTTGACCTTT
GTGAAGGGTTACGTTATGCTGACGTGGAACATGGGCGCTGGACCCCGACGTATTTTCACG
TCCCGTCCTCTGGGTCCACGGCGCGGGGGACACACGGTGCGGGTCTGGAGACGAGGAAGA
ACAGCCGGCCTCGTGGTCGACGGGCGATACAATGTATCAGGGAACGCACCCGCCCACACC
AACAACATGACTTTACTACCATACATCTATATTGGTGAGATGAAACATTCACAGATGGAA
CTAAGAATATCCGAACCAACGGATATCGAATATGTTACATCTGGTGAATGTTAG

Protein sequence:

MFKLLNRGHLKSTWWLLIVIPIASAGFACLNNPCVHGICIDDINSFSTYLCYCIDGYTGV
QCQTNWDECWSNPCQNGGTCIDGVASYNCSCPDGFIGDNCETNYNECDSNPCYNNGTCID
MTNEYVCHCIPGFSGDHCELDVAVCNSTGEVRCHNGGECIEGPGFKFYCKCAAGWTGHKC
EDQIDECESNPCRNGGICIDAHADYMCACTYGFTGKSCEVAIEFCSQDSCSEKALCVLED
VLRVCYCVPDYHGERCELQYDECALGPRCLNGGTCIDGVDNFTCSCPPRLTGLLCECLIL
DDGNYDCEYIRPTLLPDHSTATSSFTETIIIDTSTMESKYNSSTTTTSLSTIDSGTSIDV
ITTDMAIYTKLDNVTDIPITASSTDTIVTENLTTSTEISDMTTDSLTSTSISTSRSTTKE
DSVTEIVTILIETKGTIGADDSKTEITTECSGSCPKGNFSTSDLPPTTITSIETTEGITT
STEITKQTATDTTVQTTVDLKETTKQMTSDTTEYTHQAQDMTTERMFTDSPVETTELATE
LTHPMTEIETSTGYNQISTAHSDCTDVICNNHGSCINTLHGVRCHCLFNYEGRFCESKII
VNSAAFDGTSYIAHHIKNSTSISIAFKAKTLILDGQVMYVDIAKGAYMKLYMNSGLLRFE
FSCGYQTMLLSELKTHLNKGYIMKIETRLDIFLPENHCNGTLRLNDTVAMSGGQFANISS
PEYNSILYFGNIPNANRNNSNEKSFIGCIKDLIINDERREIFSDAYEASEVRECSSLSCL
SSPCVNGGTCNDDDDTYSCACANGWTGATCNDSVCDHNPCQSGGSCVHHPGSGFLCLCPY
GRHGIFCEYNVEITRPSLSPISPGRSSYVLYPMSQSAANSDRFEMRLRFQTSDMDQIALL
AFVGQRGRHDARSQHLALTFVKGYVMLTWNMGAGPRRIFTSRPLGPRRGGHTVRVWRRGR
TAGLVVDGRYNVSGNAPAHTNNMTLLPYIYIGEMKHSQMELRISEPTDIEYVTSGEC