New model in OGS2.0 | DPOGS201950  |
---|---|
Genomic Position | scaffold1376:- 6444-45149 |
See gene structure | |
CDS Length | 3378 |
Paired RNAseq reads   | 1979 |
Single RNAseq reads   | 4490 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000164 (4e-17) |
Best Drosophila hit   | sticks and stones, isoform A (0.0) |
Best Human hit | nephrin precursor (2e-90) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC003361 [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC003361 [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0007155 cell adhesion GO:0005886 plasma membrane GO:0007520 myoblast fusion GO:0007523 larval visceral muscle development GO:0007157 heterophilic cell-cell adhesion GO:0007514 garland cell differentiation |
InterPro families    | IPR013783 Immunoglobulin-like fold IPR007110 Immunoglobulin-like IPR003961 Fibronectin, type III IPR008957 Fibronectin type III domain IPR003599 Immunoglobulin subtype IPR003598 Immunoglobulin subtype 2 IPR013098 Immunoglobulin I-set IPR013162 CD80-like, immunoglobulin C2-set |
Orthology group | MCL11063 |
Nucleotide sequence:
ATGCGCAGCACGGTACAGCTCAGTGTACTATATCCACCTGGAGCACCATATATTGAAGGT
TATGCAGAGGGCGAAACTGTTAGAAGAGGACAAAGTCTCGAGCTGGTTTGCCGGAGTAGG
GGAGGGAATCCGCCAGCACAGTTGATATGGTACAAGAACGGCGAGCAAATACGAATGGCA
TACAGGACAAGCGGCAGGATGTCCGAAAACGTGTTGTCTTTCAAGGCGGATGCGTCTGAT
AACAAAGCGAGATACACTTGTGAGGCTAAAAACATCATGATAAGCAACACGCTGAAGGCC
GAAATAGATCTCACTGTGCTATTTGCACCATCCCACGTGACGATATCTGGCCCTTCAGAG
GCGAGGGTCGGTGATCCAGTACCCTTAAGTTGCAGCACGGCCCCTTCAAACCCAGCTGCT
GATATCAAATGGTTGGTCTTAGGTAAACATCACAGGGAAGCAAGCAACAGAACCGTCATA
TCTCCCGATGGTGGTTGGATCACCACATCTAATATCACAGTGGTGGTGGAGCCACATCGG
CGGTCGATCGTCGTGGTATGCCACGGCATTAACGGACAACTGACTGAGAACGTGGTCGCC
ACACACACCATCAATGTACTATATCCACCTTCAGCTCCAATGATAACTGGTTACATTCCC
GGGACGACTCTCTCAGCTGGGACGGTTCAAAAGCTGTCCTGTATATCCACTGGTGGAAAT
CCGTTGGCTACCTTGACCTGGTTTAAGAATGACAAAAAGATACATTCAATAACTAAAACC
ACGGACAAGTCGGTGTCGGCTGAGATATCAATACTGACAAACGTGACTGACAACCAAGCG
CAGTATAGATGCGAGGCGACAAACAGCGCCACAGAGATACCGCTCTTTGAAACTGTCACT
CTGAATGTACATTTCGCACCCGAGACTGTAAAAGTTAGAGCATCACCCGCCGAGCTAACT
CCTGGTATAGAAGGCACACTGTACTGCGACGCCGCTTCTAGCAATCCACCCGCGACGCTA
TCCTGGTGGAGGGATGGGATACCAGTTCAAGGCCTGCCGATGCAGTTAAAGAAGGGTCTC
CACGGTGGTACCGTCTCTACCGTAGAGTTAAAGCTGAATATCACCAAGGAGCTAAATGGC
GCCGTTTATACCTGCCAGGCTTCAAACGACGCTCTACAAAGGAGCGTCCACGACGCTTTG
ACTCTTAAAGTATTCTATCCGCCGATATTCGACGACACGCCCCTCTCGATTGTGGGGGTT
GAAAACGACCCGTTGGTTGTGATGCTGCGAGCTGACGGGAACCCCTCCAGCATCACGTAC
ACCTGGACCAAGGACGGTCTCCCGGTCACACAAGCTTCATACAGCAGTGCCAACGATCGT
ATCGTCTCGTCGGGCGGGACTTTGAATATGACTCGTGTGTCACGACACGATGCTGGGACT
TACTCCTGTGAAGCTCTGAACGCTTATGGCAGCGCTCGGATTAACATAACAGTTAATGTG
CACTATCCAGCCGATATTAAATCTGTCTGGCAGACAGGTATTGTGGATCCTAATGACAAC
GCTGTACTGGCGTGTACGGCCAGCGGGAACCCTTTAACATCGGATCACATAAAATGGGAG
CGCAAAAACTATGACATGTCGACGAAATTAGTGACATTTGAATCTAAGAACCAAACAAGC
TATCTAACGATAGAGAGAGCGGCGAGAGAGGACGTCGGCTCGTTTGAATGCGTGGTGAAC
AACGGCATCGGCGGCGAGACTCGCCACGAAGTGATGTTGGTTGTCAAATTCAAACCTGAA
ATGAACACCTCGCCGACACTCGCCAAATCCGCGTCCAATGTCGGCCAAGTCGGGCGGCTA
ACTTGCAAATGCAAGTCCGCCCCGGCACCTAACTTCACGTGGTCGAAGGGCGGCGTTAAG
CTCCCCGTGAACACGTCTACGAAGTACTTCGCTGAGTATCACAGGAACGACCAGATCACA
TACACCTCTGTGTTATTAATAAACGACATAAGCACGTCGGATTACGGCGCATACGAGTGC
GGGGCGAGGAACGACCTCGGCTTCGGGTCAGTCTCCGTAAAATTGGATGTCACCGGTCCG
CCAGACCCGGTGTCGTCTATAGTTGTAACCAACGTCACCCACGACACCATAACCCTAGAG
TGGGTGCCAGGCTTCGACGGCGGACTGACCTCCTGGTTCAGAGTTCGTTATCGCAAACCC
CACGACTCTACATACACGTATCACGACGTAACCCCCAACACGACCCACTATACTGTGTCG
GGGTTGGAGCGACACACGGACTACGTACTGTCCGTCATGGCGGTCAACGGAATGGGGGAG
AGTCGCTACAGACCTGATGATACGAAGGCTACGACCCTCACTTCATCAGAAGTTGGTGAA
CTGAACGTAGTTTCTACGGAGCATGTAGAAACGGCCGACGTGTCTAAATCAGTAGTTTTA
TACGTGTGCGTGACAGTCGCTGTGTTAGTTATTATAAACGCTGTTTTAGTTGCTTGTTTT
GTATTGAAACGACGCTCCAAGCGCTCCAAAGAGCAAGCTGGGCAGTCGTCAAAATCAACG
CCCATAGAAATGTACGCTCCGTCATCTTACAACGACACTATGGGCGAAACACTAAGCTCA
GTGTCAGAGAAATCTGAAACGTATTCACAAGATGAAGCGCCCCCCGTCCCTGACGTACCC
AGCATGCCGAGACACATGATGAACCAGTCGGATTCTTATCTCCTGGATGAGAACCTGGTC
CCTCCCCCCTTAGACTACCCTCCGCCGAACTACGTGTATGACGAACACGCGAGGACTCTA
CCACATCCACACAGACTACGAGAGGTCCGGGGACACAGCACCCTCGGGCGGACGGCCGGT
AAACAAGCGTACGTACCGACGCCGAGTCCGATGCCACCATTAGACGGCTCCTACTACAAT
ATGGCGTCCGATAGATACCTGTCTTACCCACCACTCATTGGAGAATATTTACAACAGCAA
GCCGGTAGAACTCCGACTCCGCCACAGCAATACTCTAGAGATAATCATCTGAGTCCACCG
AACTGTGGAATCGATGGTGAACGAGCTGTCCCCCCTGATGTGACTGTTCTTCACCCCCCA
GTATGTACACAGCAATTTCCGTTAAACCCATCTCTATCTGTGAAGCAACCGCAGTCCATA
CTAAAAGATCCGTCGAGGCATAAATATAGTAACCAATACGGCAGTCCCATATCCTCTAGT
TCGCCTCAGAACCAAAGTCAAATATTGACAGTTCAGAATTTAACGGATGTACCACAGTAC
GGTACCATAAAGAAAGACAAGAAACAAAACGTCACTATAGACGAATCATTCAACAAACAA
CAAACGCACGTAGTTTAA
Protein sequence:
MRSTVQLSVLYPPGAPYIEGYAEGETVRRGQSLELVCRSRGGNPPAQLIWYKNGEQIRMA
YRTSGRMSENVLSFKADASDNKARYTCEAKNIMISNTLKAEIDLTVLFAPSHVTISGPSE
ARVGDPVPLSCSTAPSNPAADIKWLVLGKHHREASNRTVISPDGGWITTSNITVVVEPHR
RSIVVVCHGINGQLTENVVATHTINVLYPPSAPMITGYIPGTTLSAGTVQKLSCISTGGN
PLATLTWFKNDKKIHSITKTTDKSVSAEISILTNVTDNQAQYRCEATNSATEIPLFETVT
LNVHFAPETVKVRASPAELTPGIEGTLYCDAASSNPPATLSWWRDGIPVQGLPMQLKKGL
HGGTVSTVELKLNITKELNGAVYTCQASNDALQRSVHDALTLKVFYPPIFDDTPLSIVGV
ENDPLVVMLRADGNPSSITYTWTKDGLPVTQASYSSANDRIVSSGGTLNMTRVSRHDAGT
YSCEALNAYGSARINITVNVHYPADIKSVWQTGIVDPNDNAVLACTASGNPLTSDHIKWE
RKNYDMSTKLVTFESKNQTSYLTIERAAREDVGSFECVVNNGIGGETRHEVMLVVKFKPE
MNTSPTLAKSASNVGQVGRLTCKCKSAPAPNFTWSKGGVKLPVNTSTKYFAEYHRNDQIT
YTSVLLINDISTSDYGAYECGARNDLGFGSVSVKLDVTGPPDPVSSIVVTNVTHDTITLE
WVPGFDGGLTSWFRVRYRKPHDSTYTYHDVTPNTTHYTVSGLERHTDYVLSVMAVNGMGE
SRYRPDDTKATTLTSSEVGELNVVSTEHVETADVSKSVVLYVCVTVAVLVIINAVLVACF
VLKRRSKRSKEQAGQSSKSTPIEMYAPSSYNDTMGETLSSVSEKSETYSQDEAPPVPDVP
SMPRHMMNQSDSYLLDENLVPPPLDYPPPNYVYDEHARTLPHPHRLREVRGHSTLGRTAG
KQAYVPTPSPMPPLDGSYYNMASDRYLSYPPLIGEYLQQQAGRTPTPPQQYSRDNHLSPP
NCGIDGERAVPPDVTVLHPPVCTQQFPLNPSLSVKQPQSILKDPSRHKYSNQYGSPISSS
SPQNQSQILTVQNLTDVPQYGTIKKDKKQNVTIDESFNKQQTHVV