DPGLEAN18615 in OGS1.0

New model in OGS2.0  DPOGS203951
Genomic Position  scaffold2:+ 16661-27523
CDS Length  1869
Paired RNAseq reads  97
Single RNAseq reads  263
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA000480 (2e-139)
Best Drosophila hit  scabrous, isoform A (6e-86)
Best Human hit  angiopoietin-4 precursor (1e-26)
Best NR hit (blastp)  PREDICTED: similar to scabrous protein [Tribolium castaneum] (1e-113)
Best NR hit (blastx)  PREDICTED: similar to scabrous protein [Tribolium castaneum] (7e-112)
Gene Ontology terms
GO:0048749 compound eye development
GO:0007399 nervous system development
GO:0008407 bristle morphogenesis
GO:0005577 fibrinogen complex
GO:0005576 extracellular region
GO:0046331 lateral inhibition
GO:0045468 regulation of R8 cell spacing in compound eye
GO:0007460 R8 cell fate commitment
GO:0004871 signal transducer activity
GO:0016321 female meiosis chromosome segregation
GO:0016318 ommatidial rotation
GO:0005102 receptor binding
GO:0007165 signal transduction
GO:0008587 imaginal disc-derived wing margin morphogenesis
InterPro families
IPR002181 Fibrinogen, alpha/beta/gamma chain, C-terminal globular
IPR014716 Fibrinogen, alpha/beta/gamma chain, C-terminal globular, subdomain 1
IPR014715 Fibrinogen, alpha/beta/gamma chain, C-terminal globular, subdomain 2
Orthology group  MCL16339

Nucleotide sequence:

ATGGAGTTCGTTAAACTTTGGGCGCTAATTATTTGTTTGTGTTCGGTGAACGCTCGCGAA
ATTGACATTAAAAATGAATTGCTATCCCTGACCGAACAATTCAAGGCTCTGAAGACCGTG
CATCTGGCTGATGTTTCCCGCCTCAAAGAAGAGATTAAGGAACTCAAGAAGCACGCTGCT
AATACGTTCACTGAGAACTACACTCGCAATGAACAAGCAACTCTGCAATGGGCGAAGAGT
TCCATGAGGGAACTTCGAATTGAAATGCGTGAACTAAGTCAAAGTATTAACAGCTCGGTA
CTGCTGCGACAATTGCAAAACATCCGCAACGAGTTAAAACGGGCATTGTCTGAGAACACG
GACCTGGCTCAGTTAGCTCGTACTCAGGAGGCGCGAGTGGACAAATTGGATAGCGAAGTC
GGCCGGCTCAAATACGACAGCCAAGAAATAAGAGGCATGGTCGCTGAGATACGCAGTCAA
GTCGCGAAACTCTCAAAGGAGATTAAATTGAAGACACTCAATGAAGATAGCTTCAATGAT
GTCCTAGAGCCATACGAAAAACATGCTTCAGACTCGCATCCTAAGCATGGACACAAAATA
CGCCACAACAAAATGGTGCACGCCCAAATATCCCGTTTGGCCCGCAGCCAAAATCAGTTG
GATGAATACCAACAGCACTTGCAGACTCAACTCCTCGATGTGCTTCGTCGCCTCGACCGC
ATTGAAGAAGCTAACTGGAATCTGGTTTCAACTCGAGTAGATTACCTTGCAACTGAAACT
AACACAATCAAAAATGAACTGAACAATGTAACCCAACGAGTGGCCGACTTTGATAAAGTC
CATGCCTCTATGCTTGAACTGCGCGAGGACGTTGAAAGCATTGAGAACAAAGCTGACAAA
ACAATTCCCGAGTTTAGAAAAGAGATATCTAAACTGGATCTTAGCTTCGCCCAGCTCAAC
GCTCAATCTTCTTATCTAAAAGAGGACCAAGAGAATCTCCGTCAATCTGTTAAGGCCATC
GCAGTCAGCGTGAGCAACACCATTGATCGTGCCGAAATGGATCGTCTCGTTATCAAAGCT
CTCAATGACTCTGTGATCGGTCTCGAAAATATAAGCAAGCAACACTACTACCGCCTTAAC
GATCACATTCTCAAGAGTGAAGCCAATAAGACAACAATGATTAGTCAATATATTCCGCTC
CCTGAACTTATTGATGAAGTTAAAGAGCTTCAACCCCTGGAACGTGAGTATGAAAATCTG
GTTGTTCAATTACCTAAGGATTGCTCAAGCGTGACTGGACCTGACCAAGTTTATTTAATA
AACCCTGGCCATTCTCCGATTGAGACCTTTTGTACCAATGGAAGTACCCTTATTCAACGA
CGTTACAACGGATCCGTAGAATTTAATAGGAAATTTGCTCAATACGTGCAAGGTTTTGGT
AACGCAGCCTCCGAATTTTGGCTTGGCTTGGAATCGATGCACCAATTGACCGCTGATAAC
TGCTCTTCTATGAGGATCGAGATGACCGATATTTATGGAAGCTCTTGGCATGCTGAATAT
GATCATTTCTCCGTTGGAAGCGCTGATACTGGATATGTTTTGACTGTGAGCGGTTTCAGA
GGCAATGCTAGTGACGCTTTTGAGTACCAAAACCATATGGAATTTTCTGCCATCGACCAC
GACAGAGACATCTCGAATACTCATTGCGCTGCCAACTATGAAGGAGGTTGGTGGTTCTCT
CATTGCCAGCACGTGAATATCAATGGCAAGTACACTCTTGGTTTGACCTGGTTTGACTCT
CTAAGGAATGAGTGGATAGCGGTTGCAACCAGTGAGATGCGCCTATTCCGTAACAAACGC
TGTACTTAA

Protein sequence:

MEFVKLWALIICLCSVNAREIDIKNELLSLTEQFKALKTVHLADVSRLKEEIKELKKHAA
NTFTENYTRNEQATLQWAKSSMRELRIEMRELSQSINSSVLLRQLQNIRNELKRALSENT
DLAQLARTQEARVDKLDSEVGRLKYDSQEIRGMVAEIRSQVAKLSKEIKLKTLNEDSFND
VLEPYEKHASDSHPKHGHKIRHNKMVHAQISRLARSQNQLDEYQQHLQTQLLDVLRRLDR
IEEANWNLVSTRVDYLATETNTIKNELNNVTQRVADFDKVHASMLELREDVESIENKADK
TIPEFRKEISKLDLSFAQLNAQSSYLKEDQENLRQSVKAIAVSVSNTIDRAEMDRLVIKA
LNDSVIGLENISKQHYYRLNDHILKSEANKTTMISQYIPLPELIDEVKELQPLEREYENL
VVQLPKDCSSVTGPDQVYLINPGHSPIETFCTNGSTLIQRRYNGSVEFNRKFAQYVQGFG
NAASEFWLGLESMHQLTADNCSSMRIEMTDIYGSSWHAEYDHFSVGSADTGYVLTVSGFR
GNASDAFEYQNHMEFSAIDHDRDISNTHCAANYEGGWWFSHCQHVNINGKYTLGLTWFDS
LRNEWIAVATSEMRLFRNKRCT
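
Consistency check (illustrative):

The CDS length, nucleotide sequence and protein sequence in this record can be cross-checked against each other. The sketch below is a minimal example, assuming the standard genetic code applies to this nuclear gene; the function name `translate` and the variable names are hypothetical and are not part of the database. It translates the first 60 bases and the final 9 bases of the nucleotide sequence above, and checks that the listed CDS length of 1869 equals three bases per residue for the 622-residue protein plus one stop codon.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":  # stop codon: end of the open reading frame
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the nucleotide sequence in this record.
first_60_nt = "ATGGAGTTCGTTAAACTTTGGGCGCTAATTATTTGTTTGTGTTCGGTGAACGCTCGCGAA"
last_9_nt = "TGTACTTAA"

print(translate(first_60_nt))  # MEFVKLWALIICLCSVNARE
print(translate(last_9_nt))    # CT

# Length bookkeeping: 622 residues plus one stop codon, three bases each.
cds_length = 1869
protein_length = 622
print(cds_length == 3 * (protein_length + 1))  # True
```

The two translated fragments match the start (MEFVKLWALIICLCSVNARE) and the end (CT) of the protein sequence listed above. Applying `translate` to the full concatenated nucleotide sequence would extend the check to the whole record.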