New model in OGS2.0 | DPOGS203993  |
---|---|
Genomic Position | scaffold2:+ 1210525-1230550 |
See gene structure | |
CDS Length | 3360 |
Paired RNAseq reads   | 1174 |
Single RNAseq reads   | 2905 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002138 (0.0) |
Best Drosophila hit   | furrowed, isoform A (0.0) |
Best Human hit | CUB and sushi domain-containing protein 3 isoform 3 (5e-77) |
Best NR hit (blastp)   | furrowed, putative [Pediculus humanus corporis] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to C-type lectin, selectin-like (AGAP000929-PA) [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0007423 sensory organ development GO:0007476 imaginal disc-derived wing morphogenesis GO:0007155 cell adhesion GO:0005488 binding |
InterPro families    | IPR000436 Sushi/SCR/CCP IPR006585 Fucolectin tachylectin-4 pentraxin-1 IPR001304 C-type lectin IPR008979 Galactose-binding domain-like IPR016187 C-type lectin fold IPR016060 Complement control module IPR000421 Coagulation factor 5/8 type, C-terminal IPR016186 C-type lectin-like IPR018378 C-type lectin, conserved site |
Orthology group | MCL10713 |
Nucleotide sequence:
ATGTATTATTATCTATATCTGTACATTATATGTATTGAGTTTGTTTTGTTATGTTTCATT
GGGTTTTTTGTTTTAGGCGACAGACGGTGTGGCCATCCAGCTGTGCCGCCGAATGCAAAG
GTTTCTTTGGCTTCAGACACTGACATAGTACCGGGAACAGTGGCCACCTATGAATGCGAT
GACGGCTACGAGCTTTTCGGTGCACATCAAAGAGAATGTACATTAAGGGGTGACTGGACT
TCCGAACCGCCATTTTGTGGAACCAACGTTGCCTTCAGAAAACCAGCAAATCAATCAACC
ACTGTACGTGGTGGAAGCGCTAGCAACGGCAATGATGGTGAAAAGACCACCGAACATGAC
GGCAAAAGATGCACAGAAACACAAAGAGAGGCTTCACCCTGGTGGCAGGTCGACCTGCTA
CGTCACTACGCCGTCAAAGTGGTTAGAGTCACTACTAGGGGCTGTTGCGGTCACCAACCG
CTTCAAGATCTGGAGATCAGAGTAGGCAACAGCAGCAGTGATTTACAAAGAAACCCACTA
TGTGCTTGGTTTCCTGGCACCATTGACGAAGGTATAACGAAGACGTTCACTTGCGCCCGC
CCCCTCATAGGTCAGCACGTTTTCCTCCAGCTGGTTGGAGTAGAGAGCTCCCTTTCTTTA
TGTGAAGTAGAAGTATTCACCACAGAAGAGTTCTCAAACGATCGATGCGCCCCGATTGGA
GCCTCAGCAGATATTGAGTTGGCCGCTTTTTCTCGAAACTGTTACGAATTTAATGGAGCT
AAGGGAGCATCTTTTGAGGAAGCGAGAAAACAATGTCAGGAACACGGAGGAGACTTAATA
CATGGGTTTCAGGGCGCTACGTCAAGTTACCTTATTCAGGAGTTAGAAAGGCGCAGGCCA
AATTTAAAGACTCTGTTGGTCTGGATCGGAGCTCAAAAGGAACCTGGTCTTACTTCCAGA
ACTTGGAAATGGGTTGACGGAGAAACTGTTACAAAGCCAACGTGGGGAAAGGACCAGCCG
AATAATTATAACGGCGAACAAAACTGCGTGGTGCTTGATGGTGGGCGCTCCTGGCTGTGG
AACGACGTCGGCTGCAACCTGGACTACCTACACTGGATCTGCCAGTACCTGCCCCCTACT
TGTGGAAGTCCAGATAAGCTGTTGAACACGACTATTGAAGATAATGACTACCATGTAGGG
TCTTCGATAAGGTATAAATGTCCACAAGGTCACATGCTGATTGGAGATAAAACTAGAGAA
TGCAAAAAAGATGGATTCTGGTCTGGAGCAGCACCAAGTTGTAAATACATTAACTGCGGT
GGATTGACTCCAATTCAAGATGGTAGCGTCGATTTAGTGGATGGGCACACCACTTACGGA
GCGAAAGCTATTTATTCGTGTAAAGAGAATTACACGCTAGTGGGTAACGCTGAACGAATG
TGTAAGGATCAAGGAATTTGGGACGGAGAGGCACCCAAGTGTCTGTTTGACTGGTGTCCG
GAGCCACCACCAGTCTCAGGAGCTACGGTCACCACTAGTGGTCACAAAGCAGGATCGTTG
GCCACCTACACTTGCCAGAATGGATTTATACTTTTTGGTTCACCAAGTATCACTTGTAAT
CTCGGTGGAACATGGGGCGGCACCCCACCTTCATGTAAATATGTAGACTGCGGCACTCCA
GCACAAGTCCATAAAGGGTCCTTCAGACTATTGAACGGCACGACTACGTACGGCTCGATA
GCCCAATTCACCTGCGAACCCGATTACTGGTTGGCGGGAGCTGAAGTACTCACATGCTAT
CGTGATGGCAAATGGTCACATGATATACCTTCCTGTGAATTGATAAGTTGTTCGGACCCG
GAGGTGCCGACTGGTGGCTATATGGAAGCATACGATTACAACGTCCACTCAACCATTGAC
TTTCATTGTGAGAAGGGACATAAACTCATTGGTGAACCAAGTCTCACGTGTCAGCCTGAC
GGAGAGTGGTCAGGAGAATCGCCTAAGTGTGAATATGTGGACTGCGGTAAATTGCCACCT
CTGCCTTACGGTTCGGCAGAACTTTTAAATGGTACTACGCATTTAGGAAGTATCATCCAA
TACTCATGTACTACCAACTACAGACTGGTCGGACCAGTAAGAAGGATATGTACTGAAGAT
TTCCAATGGAGTGATTCATCACCAAGATGTGAAGAAATAAGATGTCCAGAGCCAATCGTA
GCGGAAAACAGCATCGTATCCGTAACTGGTAACGATCGCATGCACGGACGCACGCTCATT
CGTACACGATCAAGCACTCAGGGCAATACGTACAGAATCGGTGCCTTGGTAAAATACCGC
TGTGAGCGTGGGTACAAGGTGGTGGGCGAGAGTCTATCAACTTGTGAAGATAATGGACAA
TGGAGTGGTGTCAGACCTAAATGTCAATACGTTGACTGTGGAAATCCGGGTCGCATACAA
AATGGCAAAGTCACATTGGCTACAAACGCGACGTACTATGGGGCAGCAGCCTTGTACGAA
TGCGACGAACATTGGCAACTAGATGGTGTCTCAAGGCGATTGTGTCAAGATAACGAAACC
TGGAGTTCTGAAGCACCTGTATGTAAAGAAATAACCTGTGTGGATCCTTCAATCCAAATA
AAGGGCAGTATTGGTTTATTGGTTGTGACGTCAACTCTCAGCATTGGAGGCGAAGCACAC
TACCGCTGTGAACGGGGATACAGCCTAAAAGGAAATGAAACTAGAACTTGTCTACCGAAA
GGACAGTGGGCCGGAGCACCTCCTGTTTGCATACCGATAGACTGCAAGTCACCCGGCACT
GTAGACAACGGCAGAGTGATTATTTCAAATAGTTCGACAATCTTCGGCAGCTCTATAGAG
TATCATTGCTTACCGCAATATCAGCGAGTTGGACCATTCCTTCGCAAATGTTTAGACGAT
GGCAAATGGTCAGGAGAAGAACCCAAGTGCGAATTGATCACGAACGAAGCTGCTGAAAAT
GGCGCTCTACCACTCAGTGTTGGAGTTGGTTGCGGTATCGTTCTATTTTTGCTTATGTTG
CTCGGAGTCATCTATTTAAGACTACGTAAAGCAACGCCAGTCAAGAACACTGAAAATATA
GAAGGAGCTGAACGGAAAGAAGACCAAAACGCAGCCGTAATGAGCTACGCAACCCTCCAC
GATACTAACGGACGGCATATTTACGACCACGTAACGGACAATCTGTACGATTCACCGTAC
GGCGAGAGTTTGGCCGAGAACTCCGCGTACGGTAGACGCAGTGACACCGAATCCGCATAC
GAACCAGAACCCACCGGCCCCAACGCTGTAGTCACCATCAATGGAGTGGCCGTTCGTTGA
Protein sequence:
MYYYLYLYIICIEFVLLCFIGFFVLGDRRCGHPAVPPNAKVSLASDTDIVPGTVATYECD
DGYELFGAHQRECTLRGDWTSEPPFCGTNVAFRKPANQSTTVRGGSASNGNDGEKTTEHD
GKRCTETQREASPWWQVDLLRHYAVKVVRVTTRGCCGHQPLQDLEIRVGNSSSDLQRNPL
CAWFPGTIDEGITKTFTCARPLIGQHVFLQLVGVESSLSLCEVEVFTTEEFSNDRCAPIG
ASADIELAAFSRNCYEFNGAKGASFEEARKQCQEHGGDLIHGFQGATSSYLIQELERRRP
NLKTLLVWIGAQKEPGLTSRTWKWVDGETVTKPTWGKDQPNNYNGEQNCVVLDGGRSWLW
NDVGCNLDYLHWICQYLPPTCGSPDKLLNTTIEDNDYHVGSSIRYKCPQGHMLIGDKTRE
CKKDGFWSGAAPSCKYINCGGLTPIQDGSVDLVDGHTTYGAKAIYSCKENYTLVGNAERM
CKDQGIWDGEAPKCLFDWCPEPPPVSGATVTTSGHKAGSLATYTCQNGFILFGSPSITCN
LGGTWGGTPPSCKYVDCGTPAQVHKGSFRLLNGTTTYGSIAQFTCEPDYWLAGAEVLTCY
RDGKWSHDIPSCELISCSDPEVPTGGYMEAYDYNVHSTIDFHCEKGHKLIGEPSLTCQPD
GEWSGESPKCEYVDCGKLPPLPYGSAELLNGTTHLGSIIQYSCTTNYRLVGPVRRICTED
FQWSDSSPRCEEIRCPEPIVAENSIVSVTGNDRMHGRTLIRTRSSTQGNTYRIGALVKYR
CERGYKVVGESLSTCEDNGQWSGVRPKCQYVDCGNPGRIQNGKVTLATNATYYGAAALYE
CDEHWQLDGVSRRLCQDNETWSSEAPVCKEITCVDPSIQIKGSIGLLVVTSTLSIGGEAH
YRCERGYSLKGNETRTCLPKGQWAGAPPVCIPIDCKSPGTVDNGRVIISNSSTIFGSSIE
YHCLPQYQRVGPFLRKCLDDGKWSGEEPKCELITNEAAENGALPLSVGVGCGIVLFLLML
LGVIYLRLRKATPVKNTENIEGAERKEDQNAAVMSYATLHDTNGRHIYDHVTDNLYDSPY
GESLAENSAYGRRSDTESAYEPEPTGPNAVVTINGVAVR