New model in OGS2.0 | DPOGS208967  |
---|---|
Genomic Position | scaffold53:+ 215257-220036 |
See gene structure | |
CDS Length | 1647 |
Paired RNAseq reads   | 83 |
Single RNAseq reads   | 215 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002440 (6e-76) |
Best Drosophila hit   | CG8483 (2e-20) |
Best Human hit | cysteine-rich secretory protein LCCL domain-containing 2 precursor (2e-21) |
Best NR hit (blastp)   | PREDICTED: similar to sol i 3 antigen [Nasonia vitripennis] (2e-27) |
Best NR hit (blastx)   | PREDICTED: similar to sol i 3 antigen [Nasonia vitripennis] (8e-27) |
GeneOntology terms    | GO:0005576 extracellular region GO:0008201 heparin binding GO:0031012 extracellular matrix GO:0005539 glycosaminoglycan binding GO:0030198 extracellular matrix organization |
InterPro families    | IPR014044 CAP domain IPR001283 Allergen V5/Tpx-1-related IPR002413 Ves allergen |
Orthology group | MCL20689 |
Nucleotide sequence:
ATGGTTGTATTAATAAAACCATTTTTTATATATATTTTTATTCCATTGTTGTATCATTTG
AATTATGGTGAAAGTAAAAATTACTGCAATTCAGATTTTTGCATCGACTCTAAAGAGCCT
ATTCTTTGCAATTTGCACAATAATGGACCGAGTAAAGATTGTTCACATTATGAAAAACTA
TTAAAAACAGAAAAAGACAAGCAAGATATTTTAAATAAAATTAATCAACGAAGAAATAAA
GTCGCATCGGGTGACATACGTTCTCTGCCCCCTGCTGCAAACATGCTTAAAATGGAATGG
AACGAACAATTGGAAATTTCTGCACAAAGATGGGCCGATCAATGTTTTAATAGCAGTGTT
ATTGAAAGGAGAGATGTCTGTACAAATTTAGTAAATGAGACAGTTGGTCAAAATGTAGCA
ACTATATACGGCGAAGCGCCTGGCTTAACTATTTCAAGTTTGGTTGATATTTGGTATATG
GAGCTTTTGAATATGAATAGCTCTTTAGTGTCGCGATACAGACCGTCATCTCTGACACGT
TTATCGGAATACGATAATTTCACTCAATTGGTGTGGGCCGAAACGAACAGGGTCGGATGT
GCTGCTGTGAAATTTAAGGAAATGGAAAGAAACGAAACCGTGTATAGATTGGTGTGTAAC
TTTGCACCAAGTGGCAATCGATTCGGGGATTCTGTGTACAATGAAGGACAATCATGTTCC
AGATGTCCCTCTAACCTAAGTTGCGACAGTCAATTCAGAAATCTGTGCTGTATCATAAAG
AATAATACATCAGAACAAGTGATTTATGACGACCCAACATCAAGTATTTTCAGAGATTTG
TTTAGAACAGAGAATAAAATATATTATGAAAGCTCCACGAAAACTATTAATGCATTCAAA
AGCACTTCAACTCCAGATAATATTACCGAAGAAGATGCTCAGTTTGATTTTTTATCTGAT
TTATTTGAAATAACAAAGGCACAGTTACTCACTGAAAGAACTACCTTAAGATGTAAAGAT
TATTTGGCTGTCGACGATTTTATAGAATTGTTGAAAAGTAAATTGACTAATGACCAAATA
CTGAAAAATTTACTGGCTACGTCTGCACAGACTAATAGCCCAGAATCTACAATCACTGAC
CGCACGATGGCAGCCATGATTAATCAAATTTATAGTACTAAGACGACAGCCACGACGACA
AAAACAACTGTAAATGATTATATTAACTCTACGCTTTTGGTTGATCTTGTTGAAGCCGTT
ATATTCCGTCACAGCAATCGATTTTCAACGATTGATGACGTAAAGTCTTTTACACAATAT
GAAGCGACTGATATAAGCGTCGTCAAAGTACAAGCGCAGACAGGTGAGGTAAAAAGTAAC
AAAGAATTTACAGGACATTTTTTTTTCCCTGAAGAAGAAGAATCAGTGGAGACAAATACG
GAAGACGAAAACACTGAGTCATACTATGATTCTGATAAATTGGCAGTTTCTGATATATCA
ATGGAAATCGAGGATTTGAAAAGGGACAGGATCACAAAAGATTTCATAGAAGAAATTTTG
GATTCAGAACTTGCGACGGAACTTATTGTAAAAGATGCAAGTTTACCGGACATAAATACT
GGTAACACTCTGGTTATTAATGAGTAA
Protein sequence:
MVVLIKPFFIYIFIPLLYHLNYGESKNYCNSDFCIDSKEPILCNLHNNGPSKDCSHYEKL
LKTEKDKQDILNKINQRRNKVASGDIRSLPPAANMLKMEWNEQLEISAQRWADQCFNSSV
IERRDVCTNLVNETVGQNVATIYGEAPGLTISSLVDIWYMELLNMNSSLVSRYRPSSLTR
LSEYDNFTQLVWAETNRVGCAAVKFKEMERNETVYRLVCNFAPSGNRFGDSVYNEGQSCS
RCPSNLSCDSQFRNLCCIIKNNTSEQVIYDDPTSSIFRDLFRTENKIYYESSTKTINAFK
STSTPDNITEEDAQFDFLSDLFEITKAQLLTERTTLRCKDYLAVDDFIELLKSKLTNDQI
LKNLLATSAQTNSPESTITDRTMAAMINQIYSTKTTATTTKTTVNDYINSTLLVDLVEAV
IFRHSNRFSTIDDVKSFTQYEATDISVVKVQAQTGEVKSNKEFTGHFFFPEEEESVETNT
EDENTESYYDSDKLAVSDISMEIEDLKRDRITKDFIEEILDSELATELIVKDASLPDINT
GNTLVINE