DPGLEAN03545 in OGS1.0

New model in OGS2.0  DPOGS208967
Genomic Position  scaffold53:+ 215257-220036
CDS Length  1647
Paired RNAseq reads  83
Single RNAseq reads  215
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA002440 (6e-76)
Best Drosophila hit  CG8483 (2e-20)
Best Human hit  cysteine-rich secretory protein LCCL domain-containing 2 precursor (2e-21)
Best NR hit (blastp)  PREDICTED: similar to sol i 3 antigen [Nasonia vitripennis] (2e-27)
Best NR hit (blastx)  PREDICTED: similar to sol i 3 antigen [Nasonia vitripennis] (8e-27)
Gene Ontology terms
GO:0005576 extracellular region
GO:0008201 heparin binding
GO:0031012 extracellular matrix
GO:0005539 glycosaminoglycan binding
GO:0030198 extracellular matrix organization
InterPro families
IPR014044 CAP domain
IPR001283 Allergen V5/Tpx-1-related
IPR002413 Ves allergen
Orthology group  MCL20689

Nucleotide sequence:

ATGGTTGTATTAATAAAACCATTTTTTATATATATTTTTATTCCATTGTTGTATCATTTG
AATTATGGTGAAAGTAAAAATTACTGCAATTCAGATTTTTGCATCGACTCTAAAGAGCCT
ATTCTTTGCAATTTGCACAATAATGGACCGAGTAAAGATTGTTCACATTATGAAAAACTA
TTAAAAACAGAAAAAGACAAGCAAGATATTTTAAATAAAATTAATCAACGAAGAAATAAA
GTCGCATCGGGTGACATACGTTCTCTGCCCCCTGCTGCAAACATGCTTAAAATGGAATGG
AACGAACAATTGGAAATTTCTGCACAAAGATGGGCCGATCAATGTTTTAATAGCAGTGTT
ATTGAAAGGAGAGATGTCTGTACAAATTTAGTAAATGAGACAGTTGGTCAAAATGTAGCA
ACTATATACGGCGAAGCGCCTGGCTTAACTATTTCAAGTTTGGTTGATATTTGGTATATG
GAGCTTTTGAATATGAATAGCTCTTTAGTGTCGCGATACAGACCGTCATCTCTGACACGT
TTATCGGAATACGATAATTTCACTCAATTGGTGTGGGCCGAAACGAACAGGGTCGGATGT
GCTGCTGTGAAATTTAAGGAAATGGAAAGAAACGAAACCGTGTATAGATTGGTGTGTAAC
TTTGCACCAAGTGGCAATCGATTCGGGGATTCTGTGTACAATGAAGGACAATCATGTTCC
AGATGTCCCTCTAACCTAAGTTGCGACAGTCAATTCAGAAATCTGTGCTGTATCATAAAG
AATAATACATCAGAACAAGTGATTTATGACGACCCAACATCAAGTATTTTCAGAGATTTG
TTTAGAACAGAGAATAAAATATATTATGAAAGCTCCACGAAAACTATTAATGCATTCAAA
AGCACTTCAACTCCAGATAATATTACCGAAGAAGATGCTCAGTTTGATTTTTTATCTGAT
TTATTTGAAATAACAAAGGCACAGTTACTCACTGAAAGAACTACCTTAAGATGTAAAGAT
TATTTGGCTGTCGACGATTTTATAGAATTGTTGAAAAGTAAATTGACTAATGACCAAATA
CTGAAAAATTTACTGGCTACGTCTGCACAGACTAATAGCCCAGAATCTACAATCACTGAC
CGCACGATGGCAGCCATGATTAATCAAATTTATAGTACTAAGACGACAGCCACGACGACA
AAAACAACTGTAAATGATTATATTAACTCTACGCTTTTGGTTGATCTTGTTGAAGCCGTT
ATATTCCGTCACAGCAATCGATTTTCAACGATTGATGACGTAAAGTCTTTTACACAATAT
GAAGCGACTGATATAAGCGTCGTCAAAGTACAAGCGCAGACAGGTGAGGTAAAAAGTAAC
AAAGAATTTACAGGACATTTTTTTTTCCCTGAAGAAGAAGAATCAGTGGAGACAAATACG
GAAGACGAAAACACTGAGTCATACTATGATTCTGATAAATTGGCAGTTTCTGATATATCA
ATGGAAATCGAGGATTTGAAAAGGGACAGGATCACAAAAGATTTCATAGAAGAAATTTTG
GATTCAGAACTTGCGACGGAACTTATTGTAAAAGATGCAAGTTTACCGGACATAAATACT
GGTAACACTCTGGTTATTAATGAGTAA

Protein sequence:

MVVLIKPFFIYIFIPLLYHLNYGESKNYCNSDFCIDSKEPILCNLHNNGPSKDCSHYEKL
LKTEKDKQDILNKINQRRNKVASGDIRSLPPAANMLKMEWNEQLEISAQRWADQCFNSSV
IERRDVCTNLVNETVGQNVATIYGEAPGLTISSLVDIWYMELLNMNSSLVSRYRPSSLTR
LSEYDNFTQLVWAETNRVGCAAVKFKEMERNETVYRLVCNFAPSGNRFGDSVYNEGQSCS
RCPSNLSCDSQFRNLCCIIKNNTSEQVIYDDPTSSIFRDLFRTENKIYYESSTKTINAFK
STSTPDNITEEDAQFDFLSDLFEITKAQLLTERTTLRCKDYLAVDDFIELLKSKLTNDQI
LKNLLATSAQTNSPESTITDRTMAAMINQIYSTKTTATTTKTTVNDYINSTLLVDLVEAV
IFRHSNRFSTIDDVKSFTQYEATDISVVKVQAQTGEVKSNKEFTGHFFFPEEEESVETNT
EDENTESYYDSDKLAVSDISMEIEDLKRDRITKDFIEEILDSELATELIVKDASLPDINT
GNTLVINE
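
The record lists a CDS of 1647 nucleotides and a protein of 548 residues, which agrees with 549 codons including the terminal stop. A minimal Python sketch for checking this kind of consistency is given below. It assumes the standard genetic code (NCBI translation table 1), and the helper name `translate` is hypothetical and not part of the database. Only the first and last lines of the nucleotide sequence above are used as inputs.

```python
from itertools import product

# Standard genetic code (NCBI table 1), listed in TCAG order.
# Assumption: this record uses the standard nuclear code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, dropping one trailing stop codon if present."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks and lower case
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGTTGTATTAATAAAACCATTTTTTATATATATTTTTATTCCATTGTTGTATCATTTG"
last_line = "GGTAACACTCTGGTTATTAATGAGTAA"

print(translate(first_line))  # MVVLIKPFFIYIFIPLLYHL
print(translate(last_line))   # GNTLVINE
```

Passing the full nucleotide sequence to the same function should reproduce the protein sequence listed above. Note that the genomic span (215257-220036, 4780 bp) is longer than the CDS, so the genomic region cannot be translated directly without the gene structure.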