DPGLEAN00373 in OGS1.0

New model in OGS2.0  DPOGS209630
Genomic Position  scaffold154:- 342091-371229
CDS Length  12186
Paired RNAseq reads  66251
Single RNAseq reads  150082
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006693 (9e-08)
Best Drosophila hit  hemolectin (0.0)
Best Human hit  SCO-spondin precursor (4e-131)
Best NR hit (blastp)  PREDICTED: similar to Hemolectin CG7002-PA [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to Hemolectin CG7002-PA [Tribolium castaneum] (0.0)
GeneOntology terms
GO:0005529 sugar binding
GO:0042803 protein homodimerization activity
GO:0007599 hemostasis
GO:0005576 extracellular region
GO:0035006 melanization defense response
GO:0042060 wound healing
GO:0007155 cell adhesion
GO:0008061 chitin binding
GO:0006030 chitin metabolic process
GO:0042381 hemolymph coagulation
InterPro families
IPR006207 Cystine knot, C-terminal
IPR000421 Coagulation factor 5/8 type, C-terminal
IPR000742 Epidermal growth factor-like, type 3
IPR001007 von Willebrand factor, type C
IPR002557 Chitin binding domain
IPR001846 von Willebrand factor, type D domain
IPR008979 Galactose-binding domain-like
IPR002919 Protease inhibitor I8, cysteine-rich trypsin inhibitor-like
IPR002172 Low-density lipoprotein (LDL) receptor class A repeat
IPR014853 Conserved-cysteine-rich domain
IPR006210 Epidermal growth factor-like
IPR006552 VWC out
IPR013032 EGF-like region, conserved site
Orthology group  MCL10206

Nucleotide sequence:

ATGAATTTAATTTTTCGTTTCTTTATTTTATTATTAGGAATTACTATCTGTCAAGGTGGC
TACGGCGCACACTACGACGGACAAGATGCTCAATCGGATGTACGCGCGTCGCCACATGCT
CCAGCATATATTAATAATATGAGATTTGGACCAACCGGCTACCCGAGAAGATATACAGGC
ACCAAAACTGGATACGGTGGCACAAAAACAGGATATGCTGGCACGAAAACGGGATATACG
GGAACAAAAACCGGATATGCCGGCAAAAAGACAGGCTACGCTGGCACTAAAACTGGATAC
GGTGGCACTAAGTCTGGTTATGCTGAAACAAGATCTTGGCATTCTTCTGGCTCCGCGCCG
ACCTACAATGAGTTCTATGGACGGGCCAACGAGCCTTACAGCGCAAAATGTCAGGTGGAA
TGCAAAAATAATGGGATCTGCATCGACACCAACACTTGTCAATGTCCACCGGGTTTTCAC
GGACAGTACTGTGAGTTCGAGAAGAAACCGTGTCTGATGTTTCCACCACTGCCCATGAAT
TCACAGAGGAAATGCTCACAGGATTACTGCACAATTACTTGCGCGGAAGGCCATAGGTTC
ATAGATGGGACGACGGTAGCAAATATGCAATGCATGAATGGTCAGTGGCAGCCCACTCGT
GCTGACCTTTCATCAATACCTGACTGTCAGCCAGAATGTGATCCTCCGTGCTTGAATGGA
GGTGTTTGTTTGTCCGTCAATACGTGTCAATGTCCCGCAGACTACAGAGGTCCTCAATGC
CAATATGCTGCCAGTGCTTGTGACGTTCGTAAACTGGCATTCAACGGCGGCTACAACTGC
TTCGGTGACAGCGAGAAATTTTCCTGCAAACTTAGCTGCCCGTCAGGAGCATCTCTCAGT
TCTTCTAACACTGATGAGTACACCTGTGATGTCATCGTTATCACTCCATCAAACTACAAG
AGCACCCTCCCCCCTTATGCGCTACACTCATCAAACCACACCGACATCAGTCAGTCGGAG
CATTCAACAGAAGCTAACAAGTATGGCATAAAGAAGCCGGTTACCATTGTTGTACAAGAC
TTTACTCCAAAGAGTGGCACTTGTATAACCTGGGCAGGTGTTCACTATAAGACCTTCGAT
GGAAAGATATATAGCTTCCAATCTCCTTGCCAACATGTGTTACTGCGTGACTCCGTAGAA
CATAAGTTTACAGTCGCCGTGAGACACCCTGAATGCGAACACGGCGACTGTTCGTCTGAA
CTTACTGTCTACTTGCAAGAAAAGATGTACACGTTCGCTGTCTCTGATGACGGATCCGTC
TTGTTTCGCACCAGCAAACGTTTGATGCCGATCCCGGCAGCACTGCCCGGCATCCGCGTG
TCCATGCCTTCTGACCAACTCATTATCAACCTGGACCTTGGACTTACTCTCAAATGGGAC
ACTAAAAACTCGGTGGTCGCCGAAGCTTCAGTTTTGCTATGGAACAAAACCGAAGGTCTG
TGTGGAACGCTGGACGGGAATCCGGAAAACGATTTGACCACAAAAGAGAAAACTATAGCA
TTGACTAAATCTGTTATGATAGCTTCTTGGGAACTCAACAAAATTGGAGACACTTGCGAC
AGCAGTCCAACTGAGACCAGCCAGTGTTTTTCTAAAACAGACGCGGACATGAAGAGCGCT
CTACAGTTCTGTACCAAAATATTCACCAAGGATAAGTTCAGAAAGTGTTCTAAGGCAATG
GACGTTTCACAATTATTGGAAGCTTGTCAATGGGATTACTGCGCATGTCTCACTAGCCTT
ACTCCGGAAGAGTGTGCGTGTCGTACAGTGTCAGTATACGCCAAAGAGTGTTTGCGACAC
GGCGTCGAGGAAATGCGCTCCTGGAGGGACTCCGACACATGCCCGATGAGGTGCCTCGAA
GGGAAAGTCTACAAGTCCTGCGGTCCGGAAGTCCAGGCCAGCTGTGCATTCCCTACGGCG
AGCAACTCGTCTTGTGTGGAGGGCTGTTTCTGTCCGGAGGGTGAGCTACTGGAGGGCGGG
CGATGCGTGCAAAAATCTGAGTGTCCCTGTAGAGTGCGCAACCAGAGTTTCCCCCCTGGA
ACTGTTATGCCCAAGAAATGTAATACCTGTACTTGCGAGGCGGGCCAGTGGACTTGCACG
TCGGCAGCGTGCGGAGCTCGGTGCGGTGCCGTGGGAGACCCTCATTACACCACCTTCGAT
GGACTGAGATATGATTTCATGGGCCACTGCACGTACACCATGCTCAAAACGGACAACCTC
ACCATTGACGTCGAAAATGTAGCCTGCTCGGGCGCCATCACCGAGGCCATGAACCTCGCT
CCTTACAAGGGAGACGGCAAGCCATCCTGTACGAAAGCCGTCAACTTAATGTACAACGGC
GCCACCATACACCTCAAGCAGGGCGGATTCATTCTCGTCAACGGCAAGGAAGTCGACTCC
TTACCTGTCAGCGTTGGTGATATTAGGATACGAGCTGCCTCATCTCTGTTCCTCATCGTT
CAACTACCTATCAAAGTTGATCTCTGGTGGGACGGCAATACTCGAGTGTTCGTGGACGTA
CCACCGTCCTTCAAAGACAGTACTAAGGGTCTATGTGGAACATTCAATTTGAATCAAAAA
GATGACTTCCTGACTCCCGAGGGAGACGTCGAGCAGACAGCTCTGGCCTTCGCAAACAAG
TGGAAGACCAGGGAGTTTTGTGACGACGTCTCCACCAAAGAGCCGGAGCACCCGTGCAAG
GCCAACATGCAGAACAAGGACACCGCCGAGACCTATTGTAGCAAATTGAAGAGTAAAATA
TTTGAAGCTTGTCACTGGTACGTGGACGTGCAGCCTTACTACGAAGACTGTTTGTACGAC
ATGTGCGCGTGCGCCGGGGACGTGTCGCGCTGCCTGTGCCCCATCATCGGGGACTACGCT
GACGCGTGCGCTCGCAGTGCGGTCATGGTGCAATGGAGATACCACGTCAAGGAGTGCGAG
TTGCAATGTACGGGAGGTCAGGAGTACACCGTGTGTGGTGACAGTTGTCTGAGGACGTGC
GCGGACGTGTCTCTCGGGGCCGCGGACTGCAGGCCGCGGTGCGTGGAGGGGTGCGCCTGC
CCAGTGGGACAGGTGTTGGACAACAACAACGTGTGCGTACCGGTTGGACTGTGTCCGTGC
TACCATAATGGAATGGAATTTAAACCCGGTTATAAAGAAGTGAGAGCTGGAAAGAGGGAG
AGAGAGCTTTGTACATGTGTGGGTGCTCGCTGGTCGTGTGTCCCGGCCACGTCGGAGGAC
ATCGTTAACTACCCCCCGGCCGAGGACCTGCGCTCAGCATGCAGCGCCTCCGACCATAAG
GAGTTCACCACCTGCGAGATCGCTGAACCCCTCACCTGCAAGAACATGCACCTGCCGCCG
ACGGTGACCTCGCAAGAGTGTCGTCCGGGTTGCCAGTGTAAGAAGGGCTACGTGATGGAT
GCGGCTAGTAAGAAGTGCGTTCTACCCTCTGAATGTCCCTGTCACCACGGCGGTAGAAGC
TATCAAGACGGAGATACCATGCAGGAGGGATGTAACACGTGCTCTTGTAATGGTGGTAAG
TGGTCGTGCACGTCTCGTCCGTGTGCTGGCGTGTGCAGCGCCTGGGGTGACTCACACTTC
ACCACCTTCGACGGACAGCACTATGACTTCGAAGGAGTTTGTACTTACCTCTTCGCCAAA
GGAACTGTCGACGGGAAGGATGGATTCAGCGTTGAGATACAGAACGAGCCTTGCGGAACA
ACAGGCGCAACCTGCTCAAAATCCGTCACACTTCGAGTGGGTAATGGCGGAGCTGATGAT
GAAACCGTAGCTCTCACTAAGAGCGCTCAACTACCTGATACATCCAAATTGAAACGTATT
AAGCTGCGGCTGGCGGGAGCATACGTGTTCCTGGACGTCCCCTCCCTTGGCGCCAGTCTT
CAATGGGATCGAGGACTGCGGGTCTACGTGAAACTGGACACGATGTGGCAGAATAGGGTC
AAGGGACTCTGCGGTAATTTCAATTCGGACATGCGGGACGACTTCCAGACGCCGTCTGGT
GGAGGTTTCTCGGAGACCTCGTCTCTCATCTTCGCCGACTCCTGGAAACTAAAACCCAAC
TGTCCCAAGCCGCAAGAAGTTACAGATCACTGTAAGGAGCGACCTCACAGACAAGCCTGG
GCATCTAGTATGTGCGGTGTGTTGAAACAATTCCCGTTCACTCTGTGCCACTCGGAGCTA
CCGGCCGGGGAGCACGTGGCTCGCTGCGAGTCAGACGCCTGCGCGTGCGACTCCGGCGCC
GACTGCGACTGCGCGTGCGCAGCGATCGCAGCTTATGCAGCAGCGTGCGCAGCTAGAGGA
GTTACCTTCAAATGGCGTACTCAGGAACTCTGTCCGATGCAATGCGATGAAGAATGCTCC
AACTATAACGAGTGTATGTCACCGTGTCCACCAGAGACCTGCGAAAACACGCTCGAATAC
GATAAGATAAAGGCGGCTTGTGAGAACGAAACTTGCGTCGAAGGTTGCAAACTAAACGAA
AACAAAACCTGCCCCGATGGTACAATATACTCGAACAGTTCTCTGAAGGAGTGCGTACCT
CGAGCAAAGTGCAAACCAGTTTGCATGACCCTTCCGGACGGTAAAGAGGTGCTGGAGGGG
GAGGTCATGGAAGAGGATGCGTGTCACACTTGTAGATGTTCAAAGAATGAGAAAATTTGC
ACTGGACAACCATGTGCCACCGACGAGCCAGTACCCACGGGTCCTCCAGAGACGACACCT
TACGACCAGCCCCTTACCTGTAAGACGGGTTGGACCGAGTGGATCAGCAGAGCAGTGCCC
GAAATCACCGCCAGTGGAGCCTCCGTCGATAACGAACCCCTGCCGGAAGTCAACGAACTT
ACCATCGGCTCTCCGATGTGCAAGAAAGAGATGATGACTAAGATCGAATGTCGTACCGTC
GGCGACCACCGCAGTCCCAAAGAGACGGGGCTCAATGTAGAATGTAGTCTGGAGAAAGGC
TTGATATGTTCTGAACCTGAGAAAAATTGCTCCGACTTCGAGATACGAGTTTACTGCGAA
TGCGAGGAAAAGCCATTCCAGTGTTTGGACTCGTCGCACCCAAGTTACCCTCACCCGACG
GAATGCTCTCAGTTCTACGAGTGTACTCCGGAGCTCGGGGCTCCTGGGACCTTGCACGCC
GTCCTCAAGAACTGTGGGGAGGGCCTTGTGTACAACCCCACCATCATGGTGTGCGACTGG
CCAGCATCTGTCGCGCTAGTAAGGCCCGAGTGCGCCAATATTACCACTACCGTTACCACT
CCCACGACTACTGTCGGCACGACCGCCGCGTTTGTCACAGAATCTTCCACGTCTCCTTCT
ACTTCGACTGTGATTCCTTTAGATGTCACAACAGCAGTGTGTCCTCCTCATCAAGTTTAC
AAGGAGTGCGCGTTCCCGTGTGACAAACTGTGCGACCACTTCAAGCATACCCTGCAAGCC
GAAGGACAGTGCTTGAACGGAGAGAAATGCGTTGCTGGCTGTGTGGATGTGCCGGTGTCG
ATGGTGGAGTGCGGGTTCGGATCCATGTGGCGTGATCGAAGTACATGCGTTCCCATCAAA
GACTGCACCTGCGATGACGACGGATACGTTATTAAGCCCGGCGGAGTGGTTGTAGAGGGT
TGTCGAAAATGTCAATGCCTGGACAACGTATTGCATTGCGATTCTTCAGAGTGTGTTTCT
ATCAAGACCCCGATAGAGGGATCCACGCACGCTCCCATGATCAGCACTACTCCATTGCCT
ACTACTGTAACCACACCGAGTACCACTACCACTACATTAACTACTCCCATGTCTGTGGCA
ACTTCCGGTTCCACTGCAGCTCCTACAACTGTTTCTCCCTCCACTTCCACTACTCCATTG
ATTATACAAACGACGGTATCTCCACCACCTGAATGCAGTCCGGATAGGTATATAAATCTG
TTGTGGTCTGATGAGCCGCTGCCGCCCACCTCGTTCAGCGCCAGCTCCTCCGCCAATAAT
CTCTTCCAGCCGCAGTTTGCGGTTCTTAACGGACATCCACTGGATGTGTCTGCTGGTAGT
TGGAATCCGGCTCACATGGACAAAAACCAGTACATCCAAGTGGAACTGCCCCAGAAGGAG
CCAGTCTACGGTATATTAATGCAAGGCAGTCCCTTGTTCGATCAATATGTCACCAGCTAC
GACGTTATGTACGGAGATGATGGAAATGTCTTCTCGCCTGTCAATGGGCCCGACGGACAG
CCGAAGGTTTTCCGTGGACCGGTAGATAACAACACTCCTCTTAAACAAATGATCGAACCG
CCGATAGAAGCGAAGTTTGTACGCATCCGACCACTCACTTGGCACGAGGAGATTGCGGTG
CGCTTCGAGCTTATCGGTTGCCAGGAAGTCACTTCTACGACTAGTACCTCAACAACAACA
ACGACATCTACCACAACCACTTCTACAACAACAGCCTCTCCCACAACAGTAATGGAAACA
ACATCGGTTCCGATTACTACGCTGGAGCCCCTACAGTGTACGGAACCGCTGGGAATCGGT
GCAAACCTACCGATAGATATGATAGAAGTAAGCTCGAATAACGACGCCCGGCCGCTCCTC
AAACTGAACTCTGAGCGAGGGTGGACTCCGCTATACAGCACTCCCGGGGAATGGATCATG
TTTAACTTCACCTCGCCTCGTAATATAACCGGGATCAAGACGAAAGGCGGTGCCAACGGC
TGGGTAAGCGGGTACCACGTCATGTATACATCAGACTTCTCGAAATTCAATCCCGTCATA
GACGCCAGCGGCGAACCCAAGCTGTTCCCGGGCAACTTCGACGATGACACAGAAGTCCTT
AATGAGATAAGGCCTCCACTCCACGCGCAGTACCTGAAGGTGTTACCGATTCGATGGTAC
AATAATATAGAGATGAGGGTCGAACCAATCGGATGCTTTGAACCGTATCCCACTCCTGAG
CCTCCCCGCGACGAGCGCGTGCCCGTCCCCTGTCCTCTGTGCCCCGGCGTGCCCACCGCG
GCCTGTTCTTGTCCCAACAACACTTACTACGACGGAGAGAACTGTGTATCCCGGGACCAG
TGTCCGTGTCTTGTTGGGTTTATAACGTACCCGGTTGGTGCTACTTATCGCGGAGACGCT
TGCTCGCAGTGCGTGTGTAAACTGGGAGGAGTTCCCTCTTGCAGCCCCGCCGCTGACTGC
CGCTGTGAAGAGGATCTGGTACCTACCTTAACGGAATCCTGCAAGTGTCTCTGCGAGCCG
TGTGCTAACGGCACTAAGATCTGTCCGACGAGCAAGCTTTGTCTTCCGTTAGAGAAATGG
TGCGATGGACTCCAGGACTGTCCGGACGACGAACGTGACTGTACCACCCTAGCGCCAGTA
ACCGAGACCGTTGTCACCACGGTGGTGTCAACAGTGGCCCATAAACCGCGCACCACTGAA
ATTACCACTATAGCTCCAACTACTACCACCGAAAAACCCATGGAATGTCCAAAAGTGGAA
TGTCCGCCGGGTTATTTTGTCAAACAAATCTCGACTCAACCCACCTACTCATGGTCTTCA
ACGTCCGATCTGCCACCACCGAGACCCAGATACTCCTACCAGAGATACTACAAGGGAGGC
TTCTCTAAAGGTGGAAGACGTGGATACGCTAAGGGAGGTTTCTCTAAGACTGCCTACTCC
AAAGGCGGATTTTCAAAAGGAGGCTTTAGTGGTCCACGATCACCGCAACAGAACGAAGCC
TTCACACTAGACAAGCCGGCCCTGAGTAGCGCCAGCCAGAGTAAGCAGCAATGTGTACAG
TTCAAGTGCGTGCCGGCGCTCCCGCCTCCCCCCCGCCCGGGCTCCACACCTGCACCTGTG
GTGTGCTCCGCCCCCACATGCCCGAAGAACTACGCACTCAAGCTAGAACATGTACCACTG
GAGACCAACCAGTGTCCACAATACGTATGCGTGCCTCCTCCGGAACGTCCGGTGTTCTGT
AACGTGACCGGCCGCACGTTCACAACTTTCGATGGCTCCGAATACAAATATGACGTTTGC
TTCCACATTCTCGCGAGGGAAAATCGTTTGGGCGCTTGGACAGTTCTCGTTCGCAAAAAG
TGCCGAGTAGAAGGATGCGCTAACGAACTTTTCGTCTGGCAAGATGACCAGATCATCCTG
GTCAAGCCGAACCTCATGATTGAGTACGACAACTACGAGTACACCGTAGAACAGACTAGC
AAGATCTGTTTCCAGAAGAACAGCTTCGACGTTAATCGTCTCGGGAGTGGAATATCGATT
AAGTCCAGGAAATACAACTTCACGGTTCAGTACAGTTCAGATGGAGATGTCAAGATCGGG
GTGTCCAAGAAGTACATGGGTCAGGTGGACGGTCTTTGCGGATTGTTTGACGGTGATTCG
TCCAACGACCGGTCGCTGCCGGACGGTCGCCTCGCGGCCGGGGTCGAGGCTTTCGGCCGT
TCCTGGGCCAAGCCCGGCCTGTCGCCGGACGCGTGCAGGACAAAGGTCATCCCTCCAGAG
AAACAGAAGCGCGTTTGGGACCTCTGTAAAGCTATTACCGAGGAACCTTTATCTCAATGT
GCGAAGGTGCTCAACCTAGACAAGTGGCGGAGCATTTGCTTGGAAAAGATCTGTAAGTGT
GCGGAGCTAGTTATAAACGGCACCAAGATGACTGAAGAGAACTGTCGCTGTCTGCTAGTG
GAGAAGCTGGTGGCGGAGTGTCTGGCAGCTGACAAGAATATCGACGTGACAGGATGGAGG
ATCAAGATGGGTTGTCCCGCGGAATGCCCAGCTCCTCTGGTCCACTACGACTGCTATCGA
CGTGGCTGCGAGCCGTCCTGTGCACCCATCACGTCGGGAGCCGCGCGCTGTGACGCAGCT
GACGGACAGTGCTCTCCCGGCTGCTACTGTCCCGAGGGCAAACTCAGGAAGGGAGATGCC
TGTGTGGCGCCATCTGACTGTCTGGACTGCACCTGTACCGGTGTGGGCACTCCGGCCAAA
TACACCACCTTCGAGGGCGACGACCTTCCCTTCCTCGGAAACTGCACCTATCTCGTATCA
AGAGACAGGCAGGAGAAAGGGGAAAGTCAATATCAGGTGTACGCTACGAATGGACCGTGT
GAAGGAGCGGGCGGCACCTGCGTGACGTCACTGCACGTGCTGCGAGCTGCCAACCTGCTG
CACGTCACTAAGGATCCGAACACTATGAAGCTGATAACAACAGTGAACGGCGAGCGAGTG
TTCAAGTATCCTTTCAGCAGCGACTGGGCGGTCATCACCCTCGTCAATGGACAAGACGTT
AACGTTTTACTTCCGGAGCTGCATGTTGAGGTGATGGTGTTGCAGTCTAAACTTCAGTTC
ACTGTGAGCGTCCCGTCACACGACTACAGCAACCGCACGGAAGGTCTGTGTGGAGTGTGC
GCCGGGTACCAGGACCAGCTCATCACCAGCAATGGAACCGTCACCGACGACTTTGAATTG
TACGGCAAGAGTTGGCAGGCGAGCCCGGAAGTACTGACGAAACTGGAAGTGCCACCTCAG
GAGCAGTGCGGCGACATCCCACCGCCACCACCCTGTGTGCCTCCTCCACCGGAAAGCAAT
CCGTGTTACAACCTAAACAACGTAGAGAAGTTTGGAGCTTGCCACGCGCTTGTAGAGCCT
CAGTCCTACATAGAGCAGTGCGAGTCAGAGCTCTGCGAGTTGAACTCGACTGACGCTTGT
CCGGTGCTGGAGCGGTACGCGGCCGAGTGTCGCAAACAGGGCGTTTGCCTCGACTGGAGA
AGCGATCTATGTCCATACCCATGCGACGAGCCACTCGTATACAGAAAGTGCGTGGACTGT
GAGAGAACTTGTGAAAATTACGAAGAACTGAAGGACAACCCAAAACTCTGCGATAAACAA
CCCGTCGAAGGATGCTTCTGTCCGGAAGGAAAGGTGAGAGTGAACAACACGTGTATCGAA
CCGAGCAAATGCTTCCCATGTGATACAAAGAAGGAACACTACGCCGGGGACGAGTGGCAA
GAAGACGCGTGCACTCATTGCACGTGCAGTAAGTCGGGCGAGAGCGCACACGTGTCGTGT
ACAACGCGTACGTGCGCGCCGCCCGTGTGCGCTGACGGAGAGGAACGTGTGCCGGCCGCC
ACACCACCAGGAGCCTGTTGCAAGGAGTACCTGTGCGTTCCTAAACCGCCGGACGTGGTC
TGCGACGAACCGAAGAAGATGGAATGCGGGTTCGGACAAGTTTTGAAGCTGAAGAGCAAA
CCCGATGGATGTTCAGAATTCGTCTGCGAATGCAAGCCGGAAAGCGAATGTGAACCTCTT
CCCGATGAGAGTGAAGTGGAGATGTTGGAGCCGGGGATGGAGCGCGTCGTGGACCGCTCG
GGATGTTGTCCGCGAGCCTCGCTCCACTGCCGCCCCGAGGCCTGCCCCGCGGCCCCCGAT
TGTCCCGCACTACATAACCTACGTACTACCAATGTCACGGGCCAGTGTTGTCCCGAACAC
AAGTGCGAACTGCCCAAGGACAAATGCTTTGTGACTCTGGAGTGGGAGGCTGCGCCAAAA
GGAGGAGAGAAGGCCCGTCCGACGCCACAGGTTATGTTGAAAGATTTGGATTCAGCCTGG
CTGGACGGACCGTGCCGCTCGTGTCGCTGCGAATCAACGGCCGCGGGTCCGTCCCCCCAG
TGTCACGTGACCTCTTGCCCCACTCTCACCCCCACCGATCAGTTCGTGCTAGAGCCTCGT
CCGGTGCCCTTCGCCTGCTGCCCCGAACCATACCAAGTTGGAGAAAATTGGACGTCTCCG
AACGACCCCTGCGAGTCCTACCAATGTTCACAGGTCGGTGAGGGACAGCTGGAGAAAGTC
ACCACACTACAGAATTGTCACACCGACTGTGATCTAGGCTGGCAGTACTTCCCGTCTGAA
GACAGTAGCCAGTGCTGTGGGCGATGCCGTCCTGTGTCGTGTGTGGTGGACGGAGTGACC
AGGGACGTGGGTGCCAAGTGGACCTCAACTGACTTCTGCACCAACTACACTTGTGCAGAC
ATCAACGGCACTCTCCAAGTCCAAAGCTCCAACGAGTCGTGTCCGGAACCATCGATGACT
ATGAAGAAAATGTTCGTACTGAGTGAGGAACACGAACCGGGTCACTGCTGTGTGCGCCGA
GAGCCCGTCGCTTGTCGGGATGGAGACCGCATCTATCAGGAGGGCCAAAGTTGGCAATCG
TCCGATCCTTGCAAGAACTACACCTGCTCCCGAGACGAGGCGGGTTCTTTGGCTAGGGGG
GAGAGCGTGGAGGCCTGTCACAAGGAATGCCCCGAAGGACACGAGTACTTGGAACCTGAC
AAAGGAATATGCTGCGGCCACTGCGTGCAGACACAATGCGTGGTCGGTGACGAGCTGAAG
AAACCAGGTACAAGCTGGCAGTCCGCGGACAACTGCACTACGTTCTCTTGCGAGAACAAT
ACGGGACAGGTGGTGGTGACGTCATCCCGGGAGCTCTGCCCCGACGTTAGCCACTGCGAC
CCTCAGGACCTCGTCAACGATACCTGCTGCCAGATATGCAAGGAGAAGCCACAGGATCTC
AGCAAGTGTGTGCCGAAAGCGGTGCCGCAGTCAGAGTCCGTGGGTCTTATACGAGTATTC
ATGGGTCGTTTCGGGATGTGCGTCAACAGGGAACCCTTGAAGGACTTCAAGGAGTGTCGC
GGCTCATGCGACTCCGGAACACTATATAACAATCAGACCGGAAGTCACGACTCGTCGTGT
GAGTGTTGTCAAGCGACCAGGTACGAGCCGGTGGTGGTGTCGCTGCAGTGTGAGGACGGC
TCGCACCGCGACCACCGCGTCGCCTCGCCCGCACACTGCGCCTGCCGTCGCTGCGCTGAA
CTACCATCCTTCCAGCAGAGTACCACCAATCGACATTCATACTTTAGGATACGTCCCGAC
CCCCGGCTACAGCCACCGCGGGGACGTCGACTACGAGATACCGGCCATCTACAACAGGTT
CAGCGTACCGCCCAGGGAATACTAGAGGGGCGAAGGGATAAATATGCATATGTAGAGTTT
AATAGAAAATATAACACCGATGTAATCACACATTCCCAACAATTTTTATTTATAATCCAA
TACTAA

Protein sequence:

MNLIFRFFILLLGITICQGGYGAHYDGQDAQSDVRASPHAPAYINNMRFGPTGYPRRYTG
TKTGYGGTKTGYAGTKTGYTGTKTGYAGKKTGYAGTKTGYGGTKSGYAETRSWHSSGSAP
TYNEFYGRANEPYSAKCQVECKNNGICIDTNTCQCPPGFHGQYCEFEKKPCLMFPPLPMN
SQRKCSQDYCTITCAEGHRFIDGTTVANMQCMNGQWQPTRADLSSIPDCQPECDPPCLNG
GVCLSVNTCQCPADYRGPQCQYAASACDVRKLAFNGGYNCFGDSEKFSCKLSCPSGASLS
SSNTDEYTCDVIVITPSNYKSTLPPYALHSSNHTDISQSEHSTEANKYGIKKPVTIVVQD
FTPKSGTCITWAGVHYKTFDGKIYSFQSPCQHVLLRDSVEHKFTVAVRHPECEHGDCSSE
LTVYLQEKMYTFAVSDDGSVLFRTSKRLMPIPAALPGIRVSMPSDQLIINLDLGLTLKWD
TKNSVVAEASVLLWNKTEGLCGTLDGNPENDLTTKEKTIALTKSVMIASWELNKIGDTCD
SSPTETSQCFSKTDADMKSALQFCTKIFTKDKFRKCSKAMDVSQLLEACQWDYCACLTSL
TPEECACRTVSVYAKECLRHGVEEMRSWRDSDTCPMRCLEGKVYKSCGPEVQASCAFPTA
SNSSCVEGCFCPEGELLEGGRCVQKSECPCRVRNQSFPPGTVMPKKCNTCTCEAGQWTCT
SAACGARCGAVGDPHYTTFDGLRYDFMGHCTYTMLKTDNLTIDVENVACSGAITEAMNLA
PYKGDGKPSCTKAVNLMYNGATIHLKQGGFILVNGKEVDSLPVSVGDIRIRAASSLFLIV
QLPIKVDLWWDGNTRVFVDVPPSFKDSTKGLCGTFNLNQKDDFLTPEGDVEQTALAFANK
WKTREFCDDVSTKEPEHPCKANMQNKDTAETYCSKLKSKIFEACHWYVDVQPYYEDCLYD
MCACAGDVSRCLCPIIGDYADACARSAVMVQWRYHVKECELQCTGGQEYTVCGDSCLRTC
ADVSLGAADCRPRCVEGCACPVGQVLDNNNVCVPVGLCPCYHNGMEFKPGYKEVRAGKRE
RELCTCVGARWSCVPATSEDIVNYPPAEDLRSACSASDHKEFTTCEIAEPLTCKNMHLPP
TVTSQECRPGCQCKKGYVMDAASKKCVLPSECPCHHGGRSYQDGDTMQEGCNTCSCNGGK
WSCTSRPCAGVCSAWGDSHFTTFDGQHYDFEGVCTYLFAKGTVDGKDGFSVEIQNEPCGT
TGATCSKSVTLRVGNGGADDETVALTKSAQLPDTSKLKRIKLRLAGAYVFLDVPSLGASL
QWDRGLRVYVKLDTMWQNRVKGLCGNFNSDMRDDFQTPSGGGFSETSSLIFADSWKLKPN
CPKPQEVTDHCKERPHRQAWASSMCGVLKQFPFTLCHSELPAGEHVARCESDACACDSGA
DCDCACAAIAAYAAACAARGVTFKWRTQELCPMQCDEECSNYNECMSPCPPETCENTLEY
DKIKAACENETCVEGCKLNENKTCPDGTIYSNSSLKECVPRAKCKPVCMTLPDGKEVLEG
EVMEEDACHTCRCSKNEKICTGQPCATDEPVPTGPPETTPYDQPLTCKTGWTEWISRAVP
EITASGASVDNEPLPEVNELTIGSPMCKKEMMTKIECRTVGDHRSPKETGLNVECSLEKG
LICSEPEKNCSDFEIRVYCECEEKPFQCLDSSHPSYPHPTECSQFYECTPELGAPGTLHA
VLKNCGEGLVYNPTIMVCDWPASVALVRPECANITTTVTTPTTTVGTTAAFVTESSTSPS
TSTVIPLDVTTAVCPPHQVYKECAFPCDKLCDHFKHTLQAEGQCLNGEKCVAGCVDVPVS
MVECGFGSMWRDRSTCVPIKDCTCDDDGYVIKPGGVVVEGCRKCQCLDNVLHCDSSECVS
IKTPIEGSTHAPMISTTPLPTTVTTPSTTTTTLTTPMSVATSGSTAAPTTVSPSTSTTPL
IIQTTVSPPPECSPDRYINLLWSDEPLPPTSFSASSSANNLFQPQFAVLNGHPLDVSAGS
WNPAHMDKNQYIQVELPQKEPVYGILMQGSPLFDQYVTSYDVMYGDDGNVFSPVNGPDGQ
PKVFRGPVDNNTPLKQMIEPPIEAKFVRIRPLTWHEEIAVRFELIGCQEVTSTTSTSTTT
TTSTTTTSTTTASPTTVMETTSVPITTLEPLQCTEPLGIGANLPIDMIEVSSNNDARPLL
KLNSERGWTPLYSTPGEWIMFNFTSPRNITGIKTKGGANGWVSGYHVMYTSDFSKFNPVI
DASGEPKLFPGNFDDDTEVLNEIRPPLHAQYLKVLPIRWYNNIEMRVEPIGCFEPYPTPE
PPRDERVPVPCPLCPGVPTAACSCPNNTYYDGENCVSRDQCPCLVGFITYPVGATYRGDA
CSQCVCKLGGVPSCSPAADCRCEEDLVPTLTESCKCLCEPCANGTKICPTSKLCLPLEKW
CDGLQDCPDDERDCTTLAPVTETVVTTVVSTVAHKPRTTEITTIAPTTTTEKPMECPKVE
CPPGYFVKQISTQPTYSWSSTSDLPPPRPRYSYQRYYKGGFSKGGRRGYAKGGFSKTAYS
KGGFSKGGFSGPRSPQQNEAFTLDKPALSSASQSKQQCVQFKCVPALPPPPRPGSTPAPV
VCSAPTCPKNYALKLEHVPLETNQCPQYVCVPPPERPVFCNVTGRTFTTFDGSEYKYDVC
FHILARENRLGAWTVLVRKKCRVEGCANELFVWQDDQIILVKPNLMIEYDNYEYTVEQTS
KICFQKNSFDVNRLGSGISIKSRKYNFTVQYSSDGDVKIGVSKKYMGQVDGLCGLFDGDS
SNDRSLPDGRLAAGVEAFGRSWAKPGLSPDACRTKVIPPEKQKRVWDLCKAITEEPLSQC
AKVLNLDKWRSICLEKICKCAELVINGTKMTEENCRCLLVEKLVAECLAADKNIDVTGWR
IKMGCPAECPAPLVHYDCYRRGCEPSCAPITSGAARCDAADGQCSPGCYCPEGKLRKGDA
CVAPSDCLDCTCTGVGTPAKYTTFEGDDLPFLGNCTYLVSRDRQEKGESQYQVYATNGPC
EGAGGTCVTSLHVLRAANLLHVTKDPNTMKLITTVNGERVFKYPFSSDWAVITLVNGQDV
NVLLPELHVEVMVLQSKLQFTVSVPSHDYSNRTEGLCGVCAGYQDQLITSNGTVTDDFEL
YGKSWQASPEVLTKLEVPPQEQCGDIPPPPPCVPPPPESNPCYNLNNVEKFGACHALVEP
QSYIEQCESELCELNSTDACPVLERYAAECRKQGVCLDWRSDLCPYPCDEPLVYRKCVDC
ERTCENYEELKDNPKLCDKQPVEGCFCPEGKVRVNNTCIEPSKCFPCDTKKEHYAGDEWQ
EDACTHCTCSKSGESAHVSCTTRTCAPPVCADGEERVPAATPPGACCKEYLCVPKPPDVV
CDEPKKMECGFGQVLKLKSKPDGCSEFVCECKPESECEPLPDESEVEMLEPGMERVVDRS
GCCPRASLHCRPEACPAAPDCPALHNLRTTNVTGQCCPEHKCELPKDKCFVTLEWEAAPK
GGEKARPTPQVMLKDLDSAWLDGPCRSCRCESTAAGPSPQCHVTSCPTLTPTDQFVLEPR
PVPFACCPEPYQVGENWTSPNDPCESYQCSQVGEGQLEKVTTLQNCHTDCDLGWQYFPSE
DSSQCCGRCRPVSCVVDGVTRDVGAKWTSTDFCTNYTCADINGTLQVQSSNESCPEPSMT
MKKMFVLSEEHEPGHCCVRREPVACRDGDRIYQEGQSWQSSDPCKNYTCSRDEAGSLARG
ESVEACHKECPEGHEYLEPDKGICCGHCVQTQCVVGDELKKPGTSWQSADNCTTFSCENN
TGQVVVTSSRELCPDVSHCDPQDLVNDTCCQICKEKPQDLSKCVPKAVPQSESVGLIRVF
MGRFGMCVNREPLKDFKECRGSCDSGTLYNNQTGSHDSSCECCQATRYEPVVVSLQCEDG
SHRDHRVASPAHCACRRCAELPSFQQSTTNRHSYFRIRPDPRLQPPRGRRLRDTGHLQQV
QRTAQGILEGRRDKYAYVEFNRKYNTDVITHSQQFLFIIQY
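
Consistency check (sketch):

The nucleotide and protein records above can be cross-checked by translating the CDS and comparing the result with the protein sequence. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The helper name `translate_cds` is hypothetical and not part of any database tooling. A CDS length of 12186 nt corresponds to 4062 codons, so a consistent protein record would contain 4061 residues plus the terminal stop codon.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in
# T, C, A, G order. Assumption: this table applies to the gene shown here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate a CDS to protein (hypothetical helper, for illustration).

    Whitespace and line breaks are ignored, so the 60-column record can be
    pasted in directly. Codons with ambiguous bases are reported as 'X'.
    A single terminal stop codon, if present, is dropped.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError(f"CDS length {len(seq)} is not a multiple of 3")
    protein = "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First 60 nt of the nucleotide record above.
first_line = "ATGAATTTAATTTTTCGTTTCTTTATTTTATTATTAGGAATTACTATCTGTCAAGGTGGC"
print(translate_cds(first_line))  # MNLIFRFFILLLGITICQGG
```

The printed residues match the first 20 residues of the protein record. Running the helper on the full nucleotide record and comparing against the full protein record would extend the check to the whole model.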