New model in OGS2.0 | DPOGS213998 |
---|---|
Genomic Position | scaffold3075:+ 23601-48671 |
CDS Length (bp) | 3600 |
Paired RNAseq reads | 57 |
Single RNAseq reads | 155 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA013795 (1e-79) |
Best Drosophila hit | Dscam3, isoform B (7e-119) |
Best Human hit | Down syndrome cell adhesion molecule-like protein 1 (7e-66) |
Best NR hit (blastp) | GK11449 [Drosophila willistoni] (1e-154) |
Best NR hit (blastx) | GK11449 [Drosophila willistoni] (2e-145) |
GeneOntology terms | GO:0007155 cell adhesion; GO:0005886 plasma membrane |
InterPro families | IPR003961 Fibronectin, type III; IPR013098 Immunoglobulin I-set; IPR013151 Immunoglobulin; IPR007110 Immunoglobulin-like; IPR013783 Immunoglobulin-like fold; IPR008957 Fibronectin type III domain; IPR003599 Immunoglobulin subtype; IPR003598 Immunoglobulin subtype 2 |
Orthology group | MCL10049 |
Nucleotide sequence:
ATGGATGCGACAGTTTGGATATGTCTCGCATATTACATATTTCTGTTCGGCGGCACGACT
GATGCTAAAAAGGAAAAATGGCCCTCCGAATATTTTTCAAATTCATCATCAATTTACAGT
AATAAGGATTTCGGCACGCCCAGGGAAAAAAACAGAAATCATAACGGATATCTTGGCATT
TACAGAAGAAAAAGAACCCTACCAATGATGATATCTCATCACTTTAGCGAAACCACAGTA
ACACCTGGTGTTGATGTAAATCTTCAGTGTGTTGTTAACTCTCCTCATCCAGCAAGATTT
GTTTGGGAACGAGACGGCGTTGTCATTTTATCTAATACTGATTCAAGGTATTCCATAACA
CAAACAATGACAACGGATGGTGTGTCTACACAACTTAATATTTCCCACGTTAGAGTTGAC
GATGGAGGTCGGTTTGCTTGTGTGGCTCATCTTGGTGAATCCACAGTGTTCCATGAAGAT
AGGGTCAATGTTTACGGTCCACCTTACATCCGGACGTTACCACCCTTCAAAGTTCAAAGC
GGTCAAAGTGTCACACTCAGATGCCCTTACTATGGATATCCAATACGAGAAATAACTTGG
GAGTACAAGGGAAAAGAAATTATTCCAGAAACAACTCAAACTAGATACAAACGCTTCATT
AATGATACCGCTAATATAGAAATTTTCGGCCGAAAACCAAAATTAAGAAGTAAAAGAGAC
GCAACAGTGACGAATGGCGTTCTTAGTGTTAATAGAGTTTCTAAAAGCGACAATGGACTT
TTTGCGTGCATAGTAAGAAGTCCTTCAGGTGAAATGGCAAAACGCTCTTTTGATCTGCAA
GTAGTGGAAGCACCTCAACTGGAAGAAATACTACTGGCCCCTGATTTACAGGAAGGACAA
ATTGTGCAAATACATTGCAACCTCAAAATAGTGTCAGAGACTTGGCGACCAGTTTTGGAA
GTGGCTGGTGGAGGGGTTTTAAGTTTATCGAATGGGTCACTCATATTTGATTCTGTTGCT
TTATCTGATGCTGGAATTTATACGTGTCATGTAGAAAATGGGGTTGGGGAAGCATTGAGC
AAAACTATTTGGATATCAGTAAATAAACCGGTAACGTTTGATATAGTATCCAGAAATTTA
ACAGCAAAATTAGGTCAACATGTTACCATCGAATGTCAAGCAAAAGGAGACGATCCAATT
CGGATCATGTGGACGAGGAATGGGAAACCAATTAATCCTCTTACACAAAGATTAAAAATA
TCGGAAGCTAAATCAGACGATGGTATGACGAGTTTGCTTGAGTTGATACAATCAGAGACA
GGCGACGCCGCTTTATATCAATGTAAAGCTGGAAACCCCTTTGGAGCTGACGTTTATAGT
GTTTATTTAAGTATACTAGAACCTCCGTCACCACCAACGGATCTTACGGTGGACTCTGTA
ACAAGCCGATCTGTTAAACTCTCATGGAGGGATATGACCCGTTCCTTAACGCAATACTAT
AGTGTTCAGGTCACGAATAGTGATAGACTAATGTGGAGTACCGCAAGGACCATCAATGTA
ACCAGGTTAGGCGATAATCAGCATAGTGTGGATATCACTGGTCTGCAGCCTGCCACACGG
TACGCAGCTCGGACGGCGGCGGGGCGATCCTCTGACATTAGTGCTTATTGTGCTCCCGTG
AGATTCACAACTTCAGAGGAAGCGCCTTCATCTCCTCCACTAAACATACAAGTATCGCAA
ACAACGTCTCCGGGAGAATTACGCGTCAAATGGTTACCCCCTCCAGCTGATACCCTGCAT
GGAGTAATCTTAGGGTACAGGGTAAAAGCTGTACCGCAGGAAGCGCCTTCATCTCCTCCA
CTAAACATACAAGTATCGCAAACAACGTCTCCGGGAGAATTACGCGTCAAATGGTTACCC
CCTCCAGCTGATACCCTGCATGGAGTAATCTTAGGGTACAGGGTAAAAGCTGTACCGCAG
GAAGATACCGGTGGGTATTCGGTATCGATTGCGGCGTTTAACAGCGCCGGGAATGGACCC
TTCTCCATACCGCTATTTCAAGATACAAGAGAGGGAGCCCCCGAAGAAGGTCCAACCTCA
GTCGAATGTGGAGGTGTGACGTCATCAGCACTTCGAGTTAGCTGGCAGCCCATACCTGTT
CACAGACAAGCCGGCTCTCTCGTTGGCTATTCCGTGTTATTCGCGGCTCAAGGTCGTCCA
TGGCAAAACGCAACATCTATAGTCACGGAAATGCGTTTACAAGGACTCCATAAGTTTACT
AACTACACCGTCAAAGTTGCGGGATACTCTAACTATGGAATAGGACCCTTTTCCTTCCCC
ATTGTGTGTTCTACGCTACAGGATGTTCCGGATGCTCCGTCTGAGATCAAGCTTCTCGTG
AGTTCAGCTAATTCCCTGCTGGTGAGCTGGAAACCGCCGCGGCCGAATGGAAGACTTCTG
CATTACACCGTGTACTCTAAATTGACCGCCAGCAATGATGGCCCGCAAATCCATCGTGTG
GATATAGAGTCAAACATAGATGCTTACGAACAAACTCAAAGTTTGGAACTCAAAGGTTTG
GTAGAGGGCAGACAGTATGACGTGTGGGTGAGTGCTAGTACAGCTGTCGGCGAAGGTCCG
GAAAGCAGACGGGTCAGTAACGCACCATCACAAAGAGTGGTAGCTGGTGTTTGGTCTCTT
GGGGGAAGAGTGTCAGTTCGTGTACACTCAGCGTTACCACTGGCATGTCGGAGCGTAGGC
TCTCCACCACCACGTACTGTGTGGTACCACAATCATAATATCATTACACACCATCCGCGA
TTCACGCGAAATAAAGATGACAGCTTACTTATTAAGAGCATAGATCAATCGCTCAGTGGC
AACTACACTTGTCTTGCAAAGAACCTATATGGATCAGACTCAGTCGTATACTCAGTGAGA
GTCCTACCACTACCTGATCCACCATCTTTGAGAGCGACACCTTATAAGGACTCAATAGTA
GTTGAATGGGATGAGATAAAAATTTCTAACGAATCTGGTTTTGGCGTTAGCTACAATCTT
ACCTGGCGTGAAGAAGATGGTCCTTGGCAAGAAGCTTGGCCCACAACACGGTTGCCAAAT
TCCCAACAGCAGCTTCCAGGTGTCCAACAGCATGCCCTGACTGGTTTGAAGTGCGGCACT
AAGTACTCCATTCGAGTTACCGCTACAGATAGCGTTGGCACATCCGCACCAGCTCATGTT
GATGTTACTACTTTGGGTGGAGCACCAGTCTCACCATTATCGACTGACTGGCTGTGGAGC
AACGCTACTCACATTTACATACAGCTGAGTGGTTGGGACGACGGTGGTTGTGACGTCACA
AAGTGGGACGTTGACTATCGAGCTCTCGGTACAAGTTTCTGGCACCGAGCTGATAATTTA
GCTACTTTAGATCCCAACCTTGGTTGGGGCTACAATTACGCGCGATTACCTACATCTTAC
GCGCTCGGCTCCCTTACCCCTGGGACGTGGTACCAAGTCCGCGTGACTGCATACAATGAT
GCTGGAACAGCCGCTACAGTATACACGTACGCTACCAAGACGGAAGATGGTGGTAAGTAG
Protein sequence:
MDATVWICLAYYIFLFGGTTDAKKEKWPSEYFSNSSSIYSNKDFGTPREKNRNHNGYLGI
YRRKRTLPMMISHHFSETTVTPGVDVNLQCVVNSPHPARFVWERDGVVILSNTDSRYSIT
QTMTTDGVSTQLNISHVRVDDGGRFACVAHLGESTVFHEDRVNVYGPPYIRTLPPFKVQS
GQSVTLRCPYYGYPIREITWEYKGKEIIPETTQTRYKRFINDTANIEIFGRKPKLRSKRD
ATVTNGVLSVNRVSKSDNGLFACIVRSPSGEMAKRSFDLQVVEAPQLEEILLAPDLQEGQ
IVQIHCNLKIVSETWRPVLEVAGGGVLSLSNGSLIFDSVALSDAGIYTCHVENGVGEALS
KTIWISVNKPVTFDIVSRNLTAKLGQHVTIECQAKGDDPIRIMWTRNGKPINPLTQRLKI
SEAKSDDGMTSLLELIQSETGDAALYQCKAGNPFGADVYSVYLSILEPPSPPTDLTVDSV
TSRSVKLSWRDMTRSLTQYYSVQVTNSDRLMWSTARTINVTRLGDNQHSVDITGLQPATR
YAARTAAGRSSDISAYCAPVRFTTSEEAPSSPPLNIQVSQTTSPGELRVKWLPPPADTLH
GVILGYRVKAVPQEAPSSPPLNIQVSQTTSPGELRVKWLPPPADTLHGVILGYRVKAVPQ
EDTGGYSVSIAAFNSAGNGPFSIPLFQDTREGAPEEGPTSVECGGVTSSALRVSWQPIPV
HRQAGSLVGYSVLFAAQGRPWQNATSIVTEMRLQGLHKFTNYTVKVAGYSNYGIGPFSFP
IVCSTLQDVPDAPSEIKLLVSSANSLLVSWKPPRPNGRLLHYTVYSKLTASNDGPQIHRV
DIESNIDAYEQTQSLELKGLVEGRQYDVWVSASTAVGEGPESRRVSNAPSQRVVAGVWSL
GGRVSVRVHSALPLACRSVGSPPPRTVWYHNHNIITHHPRFTRNKDDSLLIKSIDQSLSG
NYTCLAKNLYGSDSVVYSVRVLPLPDPPSLRATPYKDSIVVEWDEIKISNESGFGVSYNL
TWREEDGPWQEAWPTTRLPNSQQQLPGVQQHALTGLKCGTKYSIRVTATDSVGTSAPAHV
DVTTLGGAPVSPLSTDWLWSNATHIYIQLSGWDDGGCDVTKWDVDYRALGTSFWHRADNL
ATLDPNLGWGYNYARLPTSYALGSLTPGTWYQVRVTAYNDAGTAATVYTYATKTEDGGK
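
The record lists a CDS length alongside the nucleotide and protein sequences, so the three can be cross-checked against each other. The following is a minimal sketch of such a check, not part of the original record: the function name `check_cds` and the choice of checks (length divisible by 3, ATG start, terminal stop codon, protein length equal to codon count minus one) are assumptions made for illustration, and the stop-codon set assumes the standard genetic code.

```python
# Hypothetical helper (not from the source record): basic consistency checks
# between a coding sequence and its reported protein translation.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code assumed


def check_cds(cds: str, protein: str) -> list[str]:
    """Return a list of problems found; an empty list means the pair is consistent."""
    # Strip line breaks and whitespace so wrapped sequence blocks can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()

    problems = []
    if len(cds) % 3 != 0:
        problems.append("CDS length is not a multiple of 3")
    if not cds.startswith("ATG"):
        problems.append("CDS does not start with ATG")
    if cds[-3:] not in STOP_CODONS:
        problems.append("CDS does not end with a stop codon")

    # One codon per residue, plus one terminal stop codon that is not translated.
    expected = len(cds) // 3 - 1
    if len(protein) != expected:
        problems.append(
            f"protein has {len(protein)} residues, expected {expected}"
        )
    return problems


if __name__ == "__main__":
    # Toy input only; paste the full sequences from the record to check them.
    print(check_cds("ATGGATGCGTAG", "MDA"))
```

Under these assumptions, a 3600 bp CDS corresponds to 1200 codons, which implies a 1199-residue protein followed by a stop codon.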