DPGLEAN09600 in OGS1.0

New model in OGS2.0  DPOGS213998
Genomic Position  scaffold3075:+ 23601-48671
CDS Length  3600
Paired RNAseq reads  57
Single RNAseq reads  155
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA013795 (1e-79)
Best Drosophila hit  Dscam3, isoform B (7e-119)
Best Human hit  Down syndrome cell adhesion molecule-like protein 1 (7e-66)
Best NR hit (blastp)  GK11449 [Drosophila willistoni] (1e-154)
Best NR hit (blastx)  GK11449 [Drosophila willistoni] (2e-145)
GeneOntology terms
GO:0007155 cell adhesion
GO:0005886 plasma membrane
InterPro families
IPR003961 Fibronectin, type III
IPR013098 Immunoglobulin I-set
IPR013151 Immunoglobulin
IPR007110 Immunoglobulin-like
IPR013783 Immunoglobulin-like fold
IPR008957 Fibronectin type III domain
IPR003599 Immunoglobulin subtype
IPR003598 Immunoglobulin subtype 2
Orthology group  MCL10049
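
The Genomic Position field packs scaffold, strand, and coordinates into one string. A minimal sketch of splitting it apart follows; the function name `parse_position` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_position(text):
    """Split a position string such as 'scaffold3075:+ 23601-48671'.

    Assumption: coordinates are 1-based and inclusive, so the
    span is end - start + 1. The record itself does not say.
    """
    match = re.fullmatch(r"\s*([^\s:]+):([+-])\s*(\d+)-(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand = match.group(1), match.group(2)
    start, end = int(match.group(3)), int(match.group(4))
    return {
        "scaffold": scaffold,
        "strand": strand,
        "start": start,
        "end": end,
        "span": end - start + 1,
    }

pos = parse_position("scaffold3075:+ 23601-48671")
print(pos["scaffold"], pos["strand"], pos["span"])
```

Under that assumption the locus spans 25071 positions on the scaffold, considerably more than the 3600-nucleotide CDS, which would be consistent with a multi-exon gene model.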

Nucleotide sequence:

ATGGATGCGACAGTTTGGATATGTCTCGCATATTACATATTTCTGTTCGGCGGCACGACT
GATGCTAAAAAGGAAAAATGGCCCTCCGAATATTTTTCAAATTCATCATCAATTTACAGT
AATAAGGATTTCGGCACGCCCAGGGAAAAAAACAGAAATCATAACGGATATCTTGGCATT
TACAGAAGAAAAAGAACCCTACCAATGATGATATCTCATCACTTTAGCGAAACCACAGTA
ACACCTGGTGTTGATGTAAATCTTCAGTGTGTTGTTAACTCTCCTCATCCAGCAAGATTT
GTTTGGGAACGAGACGGCGTTGTCATTTTATCTAATACTGATTCAAGGTATTCCATAACA
CAAACAATGACAACGGATGGTGTGTCTACACAACTTAATATTTCCCACGTTAGAGTTGAC
GATGGAGGTCGGTTTGCTTGTGTGGCTCATCTTGGTGAATCCACAGTGTTCCATGAAGAT
AGGGTCAATGTTTACGGTCCACCTTACATCCGGACGTTACCACCCTTCAAAGTTCAAAGC
GGTCAAAGTGTCACACTCAGATGCCCTTACTATGGATATCCAATACGAGAAATAACTTGG
GAGTACAAGGGAAAAGAAATTATTCCAGAAACAACTCAAACTAGATACAAACGCTTCATT
AATGATACCGCTAATATAGAAATTTTCGGCCGAAAACCAAAATTAAGAAGTAAAAGAGAC
GCAACAGTGACGAATGGCGTTCTTAGTGTTAATAGAGTTTCTAAAAGCGACAATGGACTT
TTTGCGTGCATAGTAAGAAGTCCTTCAGGTGAAATGGCAAAACGCTCTTTTGATCTGCAA
GTAGTGGAAGCACCTCAACTGGAAGAAATACTACTGGCCCCTGATTTACAGGAAGGACAA
ATTGTGCAAATACATTGCAACCTCAAAATAGTGTCAGAGACTTGGCGACCAGTTTTGGAA
GTGGCTGGTGGAGGGGTTTTAAGTTTATCGAATGGGTCACTCATATTTGATTCTGTTGCT
TTATCTGATGCTGGAATTTATACGTGTCATGTAGAAAATGGGGTTGGGGAAGCATTGAGC
AAAACTATTTGGATATCAGTAAATAAACCGGTAACGTTTGATATAGTATCCAGAAATTTA
ACAGCAAAATTAGGTCAACATGTTACCATCGAATGTCAAGCAAAAGGAGACGATCCAATT
CGGATCATGTGGACGAGGAATGGGAAACCAATTAATCCTCTTACACAAAGATTAAAAATA
TCGGAAGCTAAATCAGACGATGGTATGACGAGTTTGCTTGAGTTGATACAATCAGAGACA
GGCGACGCCGCTTTATATCAATGTAAAGCTGGAAACCCCTTTGGAGCTGACGTTTATAGT
GTTTATTTAAGTATACTAGAACCTCCGTCACCACCAACGGATCTTACGGTGGACTCTGTA
ACAAGCCGATCTGTTAAACTCTCATGGAGGGATATGACCCGTTCCTTAACGCAATACTAT
AGTGTTCAGGTCACGAATAGTGATAGACTAATGTGGAGTACCGCAAGGACCATCAATGTA
ACCAGGTTAGGCGATAATCAGCATAGTGTGGATATCACTGGTCTGCAGCCTGCCACACGG
TACGCAGCTCGGACGGCGGCGGGGCGATCCTCTGACATTAGTGCTTATTGTGCTCCCGTG
AGATTCACAACTTCAGAGGAAGCGCCTTCATCTCCTCCACTAAACATACAAGTATCGCAA
ACAACGTCTCCGGGAGAATTACGCGTCAAATGGTTACCCCCTCCAGCTGATACCCTGCAT
GGAGTAATCTTAGGGTACAGGGTAAAAGCTGTACCGCAGGAAGCGCCTTCATCTCCTCCA
CTAAACATACAAGTATCGCAAACAACGTCTCCGGGAGAATTACGCGTCAAATGGTTACCC
CCTCCAGCTGATACCCTGCATGGAGTAATCTTAGGGTACAGGGTAAAAGCTGTACCGCAG
GAAGATACCGGTGGGTATTCGGTATCGATTGCGGCGTTTAACAGCGCCGGGAATGGACCC
TTCTCCATACCGCTATTTCAAGATACAAGAGAGGGAGCCCCCGAAGAAGGTCCAACCTCA
GTCGAATGTGGAGGTGTGACGTCATCAGCACTTCGAGTTAGCTGGCAGCCCATACCTGTT
CACAGACAAGCCGGCTCTCTCGTTGGCTATTCCGTGTTATTCGCGGCTCAAGGTCGTCCA
TGGCAAAACGCAACATCTATAGTCACGGAAATGCGTTTACAAGGACTCCATAAGTTTACT
AACTACACCGTCAAAGTTGCGGGATACTCTAACTATGGAATAGGACCCTTTTCCTTCCCC
ATTGTGTGTTCTACGCTACAGGATGTTCCGGATGCTCCGTCTGAGATCAAGCTTCTCGTG
AGTTCAGCTAATTCCCTGCTGGTGAGCTGGAAACCGCCGCGGCCGAATGGAAGACTTCTG
CATTACACCGTGTACTCTAAATTGACCGCCAGCAATGATGGCCCGCAAATCCATCGTGTG
GATATAGAGTCAAACATAGATGCTTACGAACAAACTCAAAGTTTGGAACTCAAAGGTTTG
GTAGAGGGCAGACAGTATGACGTGTGGGTGAGTGCTAGTACAGCTGTCGGCGAAGGTCCG
GAAAGCAGACGGGTCAGTAACGCACCATCACAAAGAGTGGTAGCTGGTGTTTGGTCTCTT
GGGGGAAGAGTGTCAGTTCGTGTACACTCAGCGTTACCACTGGCATGTCGGAGCGTAGGC
TCTCCACCACCACGTACTGTGTGGTACCACAATCATAATATCATTACACACCATCCGCGA
TTCACGCGAAATAAAGATGACAGCTTACTTATTAAGAGCATAGATCAATCGCTCAGTGGC
AACTACACTTGTCTTGCAAAGAACCTATATGGATCAGACTCAGTCGTATACTCAGTGAGA
GTCCTACCACTACCTGATCCACCATCTTTGAGAGCGACACCTTATAAGGACTCAATAGTA
GTTGAATGGGATGAGATAAAAATTTCTAACGAATCTGGTTTTGGCGTTAGCTACAATCTT
ACCTGGCGTGAAGAAGATGGTCCTTGGCAAGAAGCTTGGCCCACAACACGGTTGCCAAAT
TCCCAACAGCAGCTTCCAGGTGTCCAACAGCATGCCCTGACTGGTTTGAAGTGCGGCACT
AAGTACTCCATTCGAGTTACCGCTACAGATAGCGTTGGCACATCCGCACCAGCTCATGTT
GATGTTACTACTTTGGGTGGAGCACCAGTCTCACCATTATCGACTGACTGGCTGTGGAGC
AACGCTACTCACATTTACATACAGCTGAGTGGTTGGGACGACGGTGGTTGTGACGTCACA
AAGTGGGACGTTGACTATCGAGCTCTCGGTACAAGTTTCTGGCACCGAGCTGATAATTTA
GCTACTTTAGATCCCAACCTTGGTTGGGGCTACAATTACGCGCGATTACCTACATCTTAC
GCGCTCGGCTCCCTTACCCCTGGGACGTGGTACCAAGTCCGCGTGACTGCATACAATGAT
GCTGGAACAGCCGCTACAGTATACACGTACGCTACCAAGACGGAAGATGGTGGTAAGTAG

Protein sequence:

MDATVWICLAYYIFLFGGTTDAKKEKWPSEYFSNSSSIYSNKDFGTPREKNRNHNGYLGI
YRRKRTLPMMISHHFSETTVTPGVDVNLQCVVNSPHPARFVWERDGVVILSNTDSRYSIT
QTMTTDGVSTQLNISHVRVDDGGRFACVAHLGESTVFHEDRVNVYGPPYIRTLPPFKVQS
GQSVTLRCPYYGYPIREITWEYKGKEIIPETTQTRYKRFINDTANIEIFGRKPKLRSKRD
ATVTNGVLSVNRVSKSDNGLFACIVRSPSGEMAKRSFDLQVVEAPQLEEILLAPDLQEGQ
IVQIHCNLKIVSETWRPVLEVAGGGVLSLSNGSLIFDSVALSDAGIYTCHVENGVGEALS
KTIWISVNKPVTFDIVSRNLTAKLGQHVTIECQAKGDDPIRIMWTRNGKPINPLTQRLKI
SEAKSDDGMTSLLELIQSETGDAALYQCKAGNPFGADVYSVYLSILEPPSPPTDLTVDSV
TSRSVKLSWRDMTRSLTQYYSVQVTNSDRLMWSTARTINVTRLGDNQHSVDITGLQPATR
YAARTAAGRSSDISAYCAPVRFTTSEEAPSSPPLNIQVSQTTSPGELRVKWLPPPADTLH
GVILGYRVKAVPQEAPSSPPLNIQVSQTTSPGELRVKWLPPPADTLHGVILGYRVKAVPQ
EDTGGYSVSIAAFNSAGNGPFSIPLFQDTREGAPEEGPTSVECGGVTSSALRVSWQPIPV
HRQAGSLVGYSVLFAAQGRPWQNATSIVTEMRLQGLHKFTNYTVKVAGYSNYGIGPFSFP
IVCSTLQDVPDAPSEIKLLVSSANSLLVSWKPPRPNGRLLHYTVYSKLTASNDGPQIHRV
DIESNIDAYEQTQSLELKGLVEGRQYDVWVSASTAVGEGPESRRVSNAPSQRVVAGVWSL
GGRVSVRVHSALPLACRSVGSPPPRTVWYHNHNIITHHPRFTRNKDDSLLIKSIDQSLSG
NYTCLAKNLYGSDSVVYSVRVLPLPDPPSLRATPYKDSIVVEWDEIKISNESGFGVSYNL
TWREEDGPWQEAWPTTRLPNSQQQLPGVQQHALTGLKCGTKYSIRVTATDSVGTSAPAHV
DVTTLGGAPVSPLSTDWLWSNATHIYIQLSGWDDGGCDVTKWDVDYRALGTSFWHRADNL
ATLDPNLGWGYNYARLPTSYALGSLTPGTWYQVRVTAYNDAGTAATVYTYATKTEDGGK
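
The nucleotide and protein sequences above can be cross-checked against each other. The sketch below translates a CDS with the standard genetic code and is applied to the first and last lines of the nucleotide sequence; the helper name `translate` is hypothetical, and the use of the standard code (rather than an alternative table) is an assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0; stop codons are rendered as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 60 nucleotides and last 12 nucleotides of the record's CDS.
first_line = "ATGGATGCGACAGTTTGGATATGTCTCGCATATTACATATTTCTGTTCGGCGGCACGACT"
last_codons = "GGTGGTAAGTAG"

print(translate(first_line))   # start of the protein sequence
print(translate(last_codons))  # end of the protein, then the stop codon

# A 3600-nt CDS holds 1200 codons: 1199 residues plus one stop.
print(3600 // 3 - 1)
```

Running the same function over the full 3600-nucleotide sequence and stripping the trailing `*` should reproduce the listed protein if the record is internally consistent.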