New model in OGS2.0 | DPOGS214264  |
---|---|
Genomic Position | scaffold5451:- 185-7274 |
See gene structure | |
CDS Length | 3054 |
Paired RNAseq reads   | 108 |
Single RNAseq reads   | 263 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005982 (4e-07) |
Best Drosophila hit   | eyes shut (7e-112) |
Best Human hit | neurogenic locus notch homolog protein 1 preproprotein (2e-48) |
Best NR hit (blastp)   | crumbs [Aedes aegypti] (0.0) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC010461 [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0005886 plasma membrane GO:0005509 calcium ion binding GO:0042052 rhabdomere development GO:0005615 extracellular space GO:0010378 temperature compensation of the circadian clock |
InterPro families    | IPR000152 EGF-type aspartate/asparagine hydroxylation site IPR013032 EGF-like region, conserved site IPR018097 EGF-like calcium-binding, conserved site IPR001791 Laminin G domain IPR000742 Epidermal growth factor-like, type 3 IPR008985 Concanavalin A-like lectin/glucanase IPR013320 Concanavalin A-like lectin/glucanase, subgroup IPR006209 EGF IPR012680 Laminin G, subdomain 2 IPR013111 EGF, extracellular IPR006210 Epidermal growth factor-like IPR001881 EGF-like calcium-binding |
Orthology group | MCL11308 |
Nucleotide sequence:
ATGTTTAAACTACTTAACAGAGGACATTTAAAGAGTACGTGGTGGTTACTAATTGTAATA
CCTATCGCATCCGCCGGCTTTGCCTGCCTTAATAATCCTTGCGTACATGGGATATGTATT
GATGATATTAATAGTTTTAGCACATATTTATGTTATTGTATTGATGGATATACGGGTGTT
CAATGCCAAACAAATTGGGACGAATGCTGGTCAAACCCTTGCCAAAACGGTGGCACTTGC
ATAGATGGTGTGGCATCTTACAACTGTTCTTGCCCGGATGGCTTCATTGGCGATAATTGT
GAGACAAATTACAACGAATGCGATTCAAACCCCTGTTACAACAATGGCACATGCATTGAC
ATGACTAACGAGTACGTATGTCACTGCATCCCCGGCTTCTCTGGAGATCACTGTGAGTTA
GATGTAGCAGTGTGCAACTCGACGGGGGAGGTCAGGTGTCACAATGGAGGCGAGTGCATC
GAGGGTCCCGGGTTCAAATTTTATTGCAAATGTGCCGCAGGATGGACTGGACACAAATGT
GAAGACCAGATCGATGAATGCGAGTCGAATCCATGCAGGAATGGTGGCATATGCATCGAC
GCTCATGCTGATTACATGTGCGCGTGCACATACGGTTTCACTGGTAAAAGCTGCGAGGTA
GCGATAGAGTTCTGCTCTCAGGATTCCTGCAGCGAGAAGGCGCTGTGTGTTCTGGAAGAC
GTGTTGCGAGTCTGCTACTGTGTGCCCGACTATCACGGGGAACGATGTGAACTACAGTAC
GACGAGTGCGCACTAGGACCAAGATGCCTGAACGGCGGGACGTGTATAGATGGCGTTGAC
AATTTCACATGCTCATGTCCACCAAGACTTACAGGACTTCTCTGCGAGTGTCTTATTTTG
GATGACGGAAATTATGATTGTGAATATATTCGTCCAACTCTTCTACCAGACCATAGTACA
GCCACTTCTTCCTTTACTGAAACCATAATAATAGACACAAGTACCATGGAATCGAAGTAT
AATTCAAGCACTACCACCACCAGCCTCAGTACTATCGATAGCGGCACCAGTATTGATGTC
ATTACTACGGACATGGCAATTTATACGAAACTAGATAATGTAACGGACATACCAATAACC
GCTTCCAGTACAGACACAATAGTAACAGAAAATCTAACAACAAGCACAGAAATATCTGAT
ATGACAACTGATTCATTGACTTCAACATCAATTTCAACATCTCGATCAACAACGAAAGAA
GATTCGGTTACTGAAATAGTTACGATTCTGATAGAAACGAAAGGTACTATAGGAGCTGAT
GATTCTAAGACAGAAATAACAACAGAATGTAGTGGATCGTGTCCAAAAGGAAATTTTTCC
ACTAGTGATTTACCACCAACCACTATAACTTCAATAGAAACAACTGAAGGAATCACTACT
TCCACAGAAATTACTAAACAAACAGCAACAGACACAACAGTTCAGACAACCGTTGATCTT
AAAGAAACAACAAAGCAAATGACGTCAGATACCACAGAATATACTCATCAAGCACAAGAC
ATGACAACAGAAAGAATGTTCACCGACAGTCCTGTTGAAACAACAGAACTTGCAACCGAA
TTAACCCATCCAATGACCGAAATCGAAACAAGTACAGGTTATAATCAAATTTCAACAGCA
CACTCAGATTGTACCGACGTCATCTGTAACAATCACGGCAGTTGTATAAACACTCTTCAT
GGCGTTAGATGTCACTGTTTGTTCAATTACGAAGGAAGATTTTGTGAGAGTAAAATTATT
GTTAACTCGGCCGCCTTTGATGGCACTTCTTATATAGCGCATCATATAAAAAATTCTACC
AGCATATCTATAGCATTCAAAGCCAAAACTCTAATCCTTGACGGGCAAGTTATGTACGTG
GATATAGCTAAGGGCGCTTACATGAAATTGTATATGAATTCTGGCTTGTTGAGATTCGAA
TTCTCCTGTGGCTATCAGACAATGCTGCTAAGCGAACTTAAAACTCACCTTAACAAAGGA
TATATTATGAAAATTGAAACAAGATTAGATATATTCTTACCGGAAAATCATTGCAACGGA
ACCCTGAGACTTAACGACACTGTGGCTATGAGCGGCGGCCAGTTTGCAAATATTAGCTCT
CCCGAGTATAATTCGATTCTCTATTTCGGGAACATACCTAATGCTAATAGAAATAACTCT
AATGAGAAGTCTTTTATTGGATGTATCAAGGACTTAATTATAAACGACGAGAGACGTGAA
ATATTTAGCGACGCTTACGAAGCGTCTGAGGTGAGGGAATGCTCTTCTTTGTCTTGTTTG
TCGTCGCCGTGTGTGAACGGTGGTACTTGCAATGATGACGACGATACATACTCCTGTGCT
TGTGCCAATGGTTGGACCGGCGCCACTTGCAACGACTCCGTCTGCGACCACAACCCTTGT
CAGTCCGGTGGAAGTTGTGTCCATCACCCCGGGAGCGGATTCCTGTGCCTCTGTCCATAT
GGCAGGCACGGCATATTCTGTGAATATAACGTGGAAATAACACGTCCGTCTTTATCGCCT
ATATCCCCTGGAAGGTCTTCCTACGTCTTGTATCCGATGTCACAGTCCGCAGCGAATTCT
GATCGGTTTGAAATGCGTTTGCGTTTTCAAACGTCGGACATGGATCAGATAGCGTTGCTC
GCGTTCGTTGGACAAAGAGGAAGACACGATGCCAGGAGTCAACATTTAGCTTTGACCTTT
GTGAAGGGTTACGTTATGCTGACGTGGAACATGGGCGCTGGACCCCGACGTATTTTCACG
TCCCGTCCTCTGGGTCCACGGCGCGGGGGACACACGGTGCGGGTCTGGAGACGAGGAAGA
ACAGCCGGCCTCGTGGTCGACGGGCGATACAATGTATCAGGGAACGCACCCGCCCACACC
AACAACATGACTTTACTACCATACATCTATATTGGTGAGATGAAACATTCACAGATGGAA
CTAAGAATATCCGAACCAACGGATATCGAATATGTTACATCTGGTGAATGTTAG
Protein sequence:
MFKLLNRGHLKSTWWLLIVIPIASAGFACLNNPCVHGICIDDINSFSTYLCYCIDGYTGV
QCQTNWDECWSNPCQNGGTCIDGVASYNCSCPDGFIGDNCETNYNECDSNPCYNNGTCID
MTNEYVCHCIPGFSGDHCELDVAVCNSTGEVRCHNGGECIEGPGFKFYCKCAAGWTGHKC
EDQIDECESNPCRNGGICIDAHADYMCACTYGFTGKSCEVAIEFCSQDSCSEKALCVLED
VLRVCYCVPDYHGERCELQYDECALGPRCLNGGTCIDGVDNFTCSCPPRLTGLLCECLIL
DDGNYDCEYIRPTLLPDHSTATSSFTETIIIDTSTMESKYNSSTTTTSLSTIDSGTSIDV
ITTDMAIYTKLDNVTDIPITASSTDTIVTENLTTSTEISDMTTDSLTSTSISTSRSTTKE
DSVTEIVTILIETKGTIGADDSKTEITTECSGSCPKGNFSTSDLPPTTITSIETTEGITT
STEITKQTATDTTVQTTVDLKETTKQMTSDTTEYTHQAQDMTTERMFTDSPVETTELATE
LTHPMTEIETSTGYNQISTAHSDCTDVICNNHGSCINTLHGVRCHCLFNYEGRFCESKII
VNSAAFDGTSYIAHHIKNSTSISIAFKAKTLILDGQVMYVDIAKGAYMKLYMNSGLLRFE
FSCGYQTMLLSELKTHLNKGYIMKIETRLDIFLPENHCNGTLRLNDTVAMSGGQFANISS
PEYNSILYFGNIPNANRNNSNEKSFIGCIKDLIINDERREIFSDAYEASEVRECSSLSCL
SSPCVNGGTCNDDDDTYSCACANGWTGATCNDSVCDHNPCQSGGSCVHHPGSGFLCLCPY
GRHGIFCEYNVEITRPSLSPISPGRSSYVLYPMSQSAANSDRFEMRLRFQTSDMDQIALL
AFVGQRGRHDARSQHLALTFVKGYVMLTWNMGAGPRRIFTSRPLGPRRGGHTVRVWRRGR
TAGLVVDGRYNVSGNAPAHTNNMTLLPYIYIGEMKHSQMELRISEPTDIEYVTSGEC