New model in OGS2.0 | DPOGS209630  |
---|---|
Genomic Position | scaffold154:- 342091-371229 |
See gene structure | |
CDS Length | 12186 |
Paired RNAseq reads   | 66251 |
Single RNAseq reads   | 150082 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006693 (9e-08) |
Best Drosophila hit   | hemolectin (0.0) |
Best Human hit | SCO-spondin precursor (4e-131) |
Best NR hit (blastp)   | PREDICTED: similar to Hemolectin CG7002-PA [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to Hemolectin CG7002-PA [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0005529 sugar binding GO:0042803 protein homodimerization activity GO:0007599 hemostasis GO:0005576 extracellular region GO:0035006 melanization defense response GO:0042060 wound healing GO:0007155 cell adhesion GO:0008061 chitin binding GO:0006030 chitin metabolic process GO:0042381 hemolymph coagulation |
InterPro families    | IPR006207 Cystine knot, C-terminal IPR000421 Coagulation factor 5/8 type, C-terminal IPR000742 Epidermal growth factor-like, type 3 IPR001007 von Willebrand factor, type C IPR002557 Chitin binding domain IPR001846 von Willebrand factor, type D domain IPR008979 Galactose-binding domain-like IPR002919 Protease inhibitor I8, cysteine-rich trypsin inhibitor-like IPR002172 Low-density lipoprotein (LDL) receptor class A repeat IPR014853 Conserved-cysteine-rich domain IPR006210 Epidermal growth factor-like IPR006552 VWC out IPR013032 EGF-like region, conserved site |
Orthology group | MCL10206 |
Nucleotide sequence:
ATGAATTTAATTTTTCGTTTCTTTATTTTATTATTAGGAATTACTATCTGTCAAGGTGGC
TACGGCGCACACTACGACGGACAAGATGCTCAATCGGATGTACGCGCGTCGCCACATGCT
CCAGCATATATTAATAATATGAGATTTGGACCAACCGGCTACCCGAGAAGATATACAGGC
ACCAAAACTGGATACGGTGGCACAAAAACAGGATATGCTGGCACGAAAACGGGATATACG
GGAACAAAAACCGGATATGCCGGCAAAAAGACAGGCTACGCTGGCACTAAAACTGGATAC
GGTGGCACTAAGTCTGGTTATGCTGAAACAAGATCTTGGCATTCTTCTGGCTCCGCGCCG
ACCTACAATGAGTTCTATGGACGGGCCAACGAGCCTTACAGCGCAAAATGTCAGGTGGAA
TGCAAAAATAATGGGATCTGCATCGACACCAACACTTGTCAATGTCCACCGGGTTTTCAC
GGACAGTACTGTGAGTTCGAGAAGAAACCGTGTCTGATGTTTCCACCACTGCCCATGAAT
TCACAGAGGAAATGCTCACAGGATTACTGCACAATTACTTGCGCGGAAGGCCATAGGTTC
ATAGATGGGACGACGGTAGCAAATATGCAATGCATGAATGGTCAGTGGCAGCCCACTCGT
GCTGACCTTTCATCAATACCTGACTGTCAGCCAGAATGTGATCCTCCGTGCTTGAATGGA
GGTGTTTGTTTGTCCGTCAATACGTGTCAATGTCCCGCAGACTACAGAGGTCCTCAATGC
CAATATGCTGCCAGTGCTTGTGACGTTCGTAAACTGGCATTCAACGGCGGCTACAACTGC
TTCGGTGACAGCGAGAAATTTTCCTGCAAACTTAGCTGCCCGTCAGGAGCATCTCTCAGT
TCTTCTAACACTGATGAGTACACCTGTGATGTCATCGTTATCACTCCATCAAACTACAAG
AGCACCCTCCCCCCTTATGCGCTACACTCATCAAACCACACCGACATCAGTCAGTCGGAG
CATTCAACAGAAGCTAACAAGTATGGCATAAAGAAGCCGGTTACCATTGTTGTACAAGAC
TTTACTCCAAAGAGTGGCACTTGTATAACCTGGGCAGGTGTTCACTATAAGACCTTCGAT
GGAAAGATATATAGCTTCCAATCTCCTTGCCAACATGTGTTACTGCGTGACTCCGTAGAA
CATAAGTTTACAGTCGCCGTGAGACACCCTGAATGCGAACACGGCGACTGTTCGTCTGAA
CTTACTGTCTACTTGCAAGAAAAGATGTACACGTTCGCTGTCTCTGATGACGGATCCGTC
TTGTTTCGCACCAGCAAACGTTTGATGCCGATCCCGGCAGCACTGCCCGGCATCCGCGTG
TCCATGCCTTCTGACCAACTCATTATCAACCTGGACCTTGGACTTACTCTCAAATGGGAC
ACTAAAAACTCGGTGGTCGCCGAAGCTTCAGTTTTGCTATGGAACAAAACCGAAGGTCTG
TGTGGAACGCTGGACGGGAATCCGGAAAACGATTTGACCACAAAAGAGAAAACTATAGCA
TTGACTAAATCTGTTATGATAGCTTCTTGGGAACTCAACAAAATTGGAGACACTTGCGAC
AGCAGTCCAACTGAGACCAGCCAGTGTTTTTCTAAAACAGACGCGGACATGAAGAGCGCT
CTACAGTTCTGTACCAAAATATTCACCAAGGATAAGTTCAGAAAGTGTTCTAAGGCAATG
GACGTTTCACAATTATTGGAAGCTTGTCAATGGGATTACTGCGCATGTCTCACTAGCCTT
ACTCCGGAAGAGTGTGCGTGTCGTACAGTGTCAGTATACGCCAAAGAGTGTTTGCGACAC
GGCGTCGAGGAAATGCGCTCCTGGAGGGACTCCGACACATGCCCGATGAGGTGCCTCGAA
GGGAAAGTCTACAAGTCCTGCGGTCCGGAAGTCCAGGCCAGCTGTGCATTCCCTACGGCG
AGCAACTCGTCTTGTGTGGAGGGCTGTTTCTGTCCGGAGGGTGAGCTACTGGAGGGCGGG
CGATGCGTGCAAAAATCTGAGTGTCCCTGTAGAGTGCGCAACCAGAGTTTCCCCCCTGGA
ACTGTTATGCCCAAGAAATGTAATACCTGTACTTGCGAGGCGGGCCAGTGGACTTGCACG
TCGGCAGCGTGCGGAGCTCGGTGCGGTGCCGTGGGAGACCCTCATTACACCACCTTCGAT
GGACTGAGATATGATTTCATGGGCCACTGCACGTACACCATGCTCAAAACGGACAACCTC
ACCATTGACGTCGAAAATGTAGCCTGCTCGGGCGCCATCACCGAGGCCATGAACCTCGCT
CCTTACAAGGGAGACGGCAAGCCATCCTGTACGAAAGCCGTCAACTTAATGTACAACGGC
GCCACCATACACCTCAAGCAGGGCGGATTCATTCTCGTCAACGGCAAGGAAGTCGACTCC
TTACCTGTCAGCGTTGGTGATATTAGGATACGAGCTGCCTCATCTCTGTTCCTCATCGTT
CAACTACCTATCAAAGTTGATCTCTGGTGGGACGGCAATACTCGAGTGTTCGTGGACGTA
CCACCGTCCTTCAAAGACAGTACTAAGGGTCTATGTGGAACATTCAATTTGAATCAAAAA
GATGACTTCCTGACTCCCGAGGGAGACGTCGAGCAGACAGCTCTGGCCTTCGCAAACAAG
TGGAAGACCAGGGAGTTTTGTGACGACGTCTCCACCAAAGAGCCGGAGCACCCGTGCAAG
GCCAACATGCAGAACAAGGACACCGCCGAGACCTATTGTAGCAAATTGAAGAGTAAAATA
TTTGAAGCTTGTCACTGGTACGTGGACGTGCAGCCTTACTACGAAGACTGTTTGTACGAC
ATGTGCGCGTGCGCCGGGGACGTGTCGCGCTGCCTGTGCCCCATCATCGGGGACTACGCT
GACGCGTGCGCTCGCAGTGCGGTCATGGTGCAATGGAGATACCACGTCAAGGAGTGCGAG
TTGCAATGTACGGGAGGTCAGGAGTACACCGTGTGTGGTGACAGTTGTCTGAGGACGTGC
GCGGACGTGTCTCTCGGGGCCGCGGACTGCAGGCCGCGGTGCGTGGAGGGGTGCGCCTGC
CCAGTGGGACAGGTGTTGGACAACAACAACGTGTGCGTACCGGTTGGACTGTGTCCGTGC
TACCATAATGGAATGGAATTTAAACCCGGTTATAAAGAAGTGAGAGCTGGAAAGAGGGAG
AGAGAGCTTTGTACATGTGTGGGTGCTCGCTGGTCGTGTGTCCCGGCCACGTCGGAGGAC
ATCGTTAACTACCCCCCGGCCGAGGACCTGCGCTCAGCATGCAGCGCCTCCGACCATAAG
GAGTTCACCACCTGCGAGATCGCTGAACCCCTCACCTGCAAGAACATGCACCTGCCGCCG
ACGGTGACCTCGCAAGAGTGTCGTCCGGGTTGCCAGTGTAAGAAGGGCTACGTGATGGAT
GCGGCTAGTAAGAAGTGCGTTCTACCCTCTGAATGTCCCTGTCACCACGGCGGTAGAAGC
TATCAAGACGGAGATACCATGCAGGAGGGATGTAACACGTGCTCTTGTAATGGTGGTAAG
TGGTCGTGCACGTCTCGTCCGTGTGCTGGCGTGTGCAGCGCCTGGGGTGACTCACACTTC
ACCACCTTCGACGGACAGCACTATGACTTCGAAGGAGTTTGTACTTACCTCTTCGCCAAA
GGAACTGTCGACGGGAAGGATGGATTCAGCGTTGAGATACAGAACGAGCCTTGCGGAACA
ACAGGCGCAACCTGCTCAAAATCCGTCACACTTCGAGTGGGTAATGGCGGAGCTGATGAT
GAAACCGTAGCTCTCACTAAGAGCGCTCAACTACCTGATACATCCAAATTGAAACGTATT
AAGCTGCGGCTGGCGGGAGCATACGTGTTCCTGGACGTCCCCTCCCTTGGCGCCAGTCTT
CAATGGGATCGAGGACTGCGGGTCTACGTGAAACTGGACACGATGTGGCAGAATAGGGTC
AAGGGACTCTGCGGTAATTTCAATTCGGACATGCGGGACGACTTCCAGACGCCGTCTGGT
GGAGGTTTCTCGGAGACCTCGTCTCTCATCTTCGCCGACTCCTGGAAACTAAAACCCAAC
TGTCCCAAGCCGCAAGAAGTTACAGATCACTGTAAGGAGCGACCTCACAGACAAGCCTGG
GCATCTAGTATGTGCGGTGTGTTGAAACAATTCCCGTTCACTCTGTGCCACTCGGAGCTA
CCGGCCGGGGAGCACGTGGCTCGCTGCGAGTCAGACGCCTGCGCGTGCGACTCCGGCGCC
GACTGCGACTGCGCGTGCGCAGCGATCGCAGCTTATGCAGCAGCGTGCGCAGCTAGAGGA
GTTACCTTCAAATGGCGTACTCAGGAACTCTGTCCGATGCAATGCGATGAAGAATGCTCC
AACTATAACGAGTGTATGTCACCGTGTCCACCAGAGACCTGCGAAAACACGCTCGAATAC
GATAAGATAAAGGCGGCTTGTGAGAACGAAACTTGCGTCGAAGGTTGCAAACTAAACGAA
AACAAAACCTGCCCCGATGGTACAATATACTCGAACAGTTCTCTGAAGGAGTGCGTACCT
CGAGCAAAGTGCAAACCAGTTTGCATGACCCTTCCGGACGGTAAAGAGGTGCTGGAGGGG
GAGGTCATGGAAGAGGATGCGTGTCACACTTGTAGATGTTCAAAGAATGAGAAAATTTGC
ACTGGACAACCATGTGCCACCGACGAGCCAGTACCCACGGGTCCTCCAGAGACGACACCT
TACGACCAGCCCCTTACCTGTAAGACGGGTTGGACCGAGTGGATCAGCAGAGCAGTGCCC
GAAATCACCGCCAGTGGAGCCTCCGTCGATAACGAACCCCTGCCGGAAGTCAACGAACTT
ACCATCGGCTCTCCGATGTGCAAGAAAGAGATGATGACTAAGATCGAATGTCGTACCGTC
GGCGACCACCGCAGTCCCAAAGAGACGGGGCTCAATGTAGAATGTAGTCTGGAGAAAGGC
TTGATATGTTCTGAACCTGAGAAAAATTGCTCCGACTTCGAGATACGAGTTTACTGCGAA
TGCGAGGAAAAGCCATTCCAGTGTTTGGACTCGTCGCACCCAAGTTACCCTCACCCGACG
GAATGCTCTCAGTTCTACGAGTGTACTCCGGAGCTCGGGGCTCCTGGGACCTTGCACGCC
GTCCTCAAGAACTGTGGGGAGGGCCTTGTGTACAACCCCACCATCATGGTGTGCGACTGG
CCAGCATCTGTCGCGCTAGTAAGGCCCGAGTGCGCCAATATTACCACTACCGTTACCACT
CCCACGACTACTGTCGGCACGACCGCCGCGTTTGTCACAGAATCTTCCACGTCTCCTTCT
ACTTCGACTGTGATTCCTTTAGATGTCACAACAGCAGTGTGTCCTCCTCATCAAGTTTAC
AAGGAGTGCGCGTTCCCGTGTGACAAACTGTGCGACCACTTCAAGCATACCCTGCAAGCC
GAAGGACAGTGCTTGAACGGAGAGAAATGCGTTGCTGGCTGTGTGGATGTGCCGGTGTCG
ATGGTGGAGTGCGGGTTCGGATCCATGTGGCGTGATCGAAGTACATGCGTTCCCATCAAA
GACTGCACCTGCGATGACGACGGATACGTTATTAAGCCCGGCGGAGTGGTTGTAGAGGGT
TGTCGAAAATGTCAATGCCTGGACAACGTATTGCATTGCGATTCTTCAGAGTGTGTTTCT
ATCAAGACCCCGATAGAGGGATCCACGCACGCTCCCATGATCAGCACTACTCCATTGCCT
ACTACTGTAACCACACCGAGTACCACTACCACTACATTAACTACTCCCATGTCTGTGGCA
ACTTCCGGTTCCACTGCAGCTCCTACAACTGTTTCTCCCTCCACTTCCACTACTCCATTG
ATTATACAAACGACGGTATCTCCACCACCTGAATGCAGTCCGGATAGGTATATAAATCTG
TTGTGGTCTGATGAGCCGCTGCCGCCCACCTCGTTCAGCGCCAGCTCCTCCGCCAATAAT
CTCTTCCAGCCGCAGTTTGCGGTTCTTAACGGACATCCACTGGATGTGTCTGCTGGTAGT
TGGAATCCGGCTCACATGGACAAAAACCAGTACATCCAAGTGGAACTGCCCCAGAAGGAG
CCAGTCTACGGTATATTAATGCAAGGCAGTCCCTTGTTCGATCAATATGTCACCAGCTAC
GACGTTATGTACGGAGATGATGGAAATGTCTTCTCGCCTGTCAATGGGCCCGACGGACAG
CCGAAGGTTTTCCGTGGACCGGTAGATAACAACACTCCTCTTAAACAAATGATCGAACCG
CCGATAGAAGCGAAGTTTGTACGCATCCGACCACTCACTTGGCACGAGGAGATTGCGGTG
CGCTTCGAGCTTATCGGTTGCCAGGAAGTCACTTCTACGACTAGTACCTCAACAACAACA
ACGACATCTACCACAACCACTTCTACAACAACAGCCTCTCCCACAACAGTAATGGAAACA
ACATCGGTTCCGATTACTACGCTGGAGCCCCTACAGTGTACGGAACCGCTGGGAATCGGT
GCAAACCTACCGATAGATATGATAGAAGTAAGCTCGAATAACGACGCCCGGCCGCTCCTC
AAACTGAACTCTGAGCGAGGGTGGACTCCGCTATACAGCACTCCCGGGGAATGGATCATG
TTTAACTTCACCTCGCCTCGTAATATAACCGGGATCAAGACGAAAGGCGGTGCCAACGGC
TGGGTAAGCGGGTACCACGTCATGTATACATCAGACTTCTCGAAATTCAATCCCGTCATA
GACGCCAGCGGCGAACCCAAGCTGTTCCCGGGCAACTTCGACGATGACACAGAAGTCCTT
AATGAGATAAGGCCTCCACTCCACGCGCAGTACCTGAAGGTGTTACCGATTCGATGGTAC
AATAATATAGAGATGAGGGTCGAACCAATCGGATGCTTTGAACCGTATCCCACTCCTGAG
CCTCCCCGCGACGAGCGCGTGCCCGTCCCCTGTCCTCTGTGCCCCGGCGTGCCCACCGCG
GCCTGTTCTTGTCCCAACAACACTTACTACGACGGAGAGAACTGTGTATCCCGGGACCAG
TGTCCGTGTCTTGTTGGGTTTATAACGTACCCGGTTGGTGCTACTTATCGCGGAGACGCT
TGCTCGCAGTGCGTGTGTAAACTGGGAGGAGTTCCCTCTTGCAGCCCCGCCGCTGACTGC
CGCTGTGAAGAGGATCTGGTACCTACCTTAACGGAATCCTGCAAGTGTCTCTGCGAGCCG
TGTGCTAACGGCACTAAGATCTGTCCGACGAGCAAGCTTTGTCTTCCGTTAGAGAAATGG
TGCGATGGACTCCAGGACTGTCCGGACGACGAACGTGACTGTACCACCCTAGCGCCAGTA
ACCGAGACCGTTGTCACCACGGTGGTGTCAACAGTGGCCCATAAACCGCGCACCACTGAA
ATTACCACTATAGCTCCAACTACTACCACCGAAAAACCCATGGAATGTCCAAAAGTGGAA
TGTCCGCCGGGTTATTTTGTCAAACAAATCTCGACTCAACCCACCTACTCATGGTCTTCA
ACGTCCGATCTGCCACCACCGAGACCCAGATACTCCTACCAGAGATACTACAAGGGAGGC
TTCTCTAAAGGTGGAAGACGTGGATACGCTAAGGGAGGTTTCTCTAAGACTGCCTACTCC
AAAGGCGGATTTTCAAAAGGAGGCTTTAGTGGTCCACGATCACCGCAACAGAACGAAGCC
TTCACACTAGACAAGCCGGCCCTGAGTAGCGCCAGCCAGAGTAAGCAGCAATGTGTACAG
TTCAAGTGCGTGCCGGCGCTCCCGCCTCCCCCCCGCCCGGGCTCCACACCTGCACCTGTG
GTGTGCTCCGCCCCCACATGCCCGAAGAACTACGCACTCAAGCTAGAACATGTACCACTG
GAGACCAACCAGTGTCCACAATACGTATGCGTGCCTCCTCCGGAACGTCCGGTGTTCTGT
AACGTGACCGGCCGCACGTTCACAACTTTCGATGGCTCCGAATACAAATATGACGTTTGC
TTCCACATTCTCGCGAGGGAAAATCGTTTGGGCGCTTGGACAGTTCTCGTTCGCAAAAAG
TGCCGAGTAGAAGGATGCGCTAACGAACTTTTCGTCTGGCAAGATGACCAGATCATCCTG
GTCAAGCCGAACCTCATGATTGAGTACGACAACTACGAGTACACCGTAGAACAGACTAGC
AAGATCTGTTTCCAGAAGAACAGCTTCGACGTTAATCGTCTCGGGAGTGGAATATCGATT
AAGTCCAGGAAATACAACTTCACGGTTCAGTACAGTTCAGATGGAGATGTCAAGATCGGG
GTGTCCAAGAAGTACATGGGTCAGGTGGACGGTCTTTGCGGATTGTTTGACGGTGATTCG
TCCAACGACCGGTCGCTGCCGGACGGTCGCCTCGCGGCCGGGGTCGAGGCTTTCGGCCGT
TCCTGGGCCAAGCCCGGCCTGTCGCCGGACGCGTGCAGGACAAAGGTCATCCCTCCAGAG
AAACAGAAGCGCGTTTGGGACCTCTGTAAAGCTATTACCGAGGAACCTTTATCTCAATGT
GCGAAGGTGCTCAACCTAGACAAGTGGCGGAGCATTTGCTTGGAAAAGATCTGTAAGTGT
GCGGAGCTAGTTATAAACGGCACCAAGATGACTGAAGAGAACTGTCGCTGTCTGCTAGTG
GAGAAGCTGGTGGCGGAGTGTCTGGCAGCTGACAAGAATATCGACGTGACAGGATGGAGG
ATCAAGATGGGTTGTCCCGCGGAATGCCCAGCTCCTCTGGTCCACTACGACTGCTATCGA
CGTGGCTGCGAGCCGTCCTGTGCACCCATCACGTCGGGAGCCGCGCGCTGTGACGCAGCT
GACGGACAGTGCTCTCCCGGCTGCTACTGTCCCGAGGGCAAACTCAGGAAGGGAGATGCC
TGTGTGGCGCCATCTGACTGTCTGGACTGCACCTGTACCGGTGTGGGCACTCCGGCCAAA
TACACCACCTTCGAGGGCGACGACCTTCCCTTCCTCGGAAACTGCACCTATCTCGTATCA
AGAGACAGGCAGGAGAAAGGGGAAAGTCAATATCAGGTGTACGCTACGAATGGACCGTGT
GAAGGAGCGGGCGGCACCTGCGTGACGTCACTGCACGTGCTGCGAGCTGCCAACCTGCTG
CACGTCACTAAGGATCCGAACACTATGAAGCTGATAACAACAGTGAACGGCGAGCGAGTG
TTCAAGTATCCTTTCAGCAGCGACTGGGCGGTCATCACCCTCGTCAATGGACAAGACGTT
AACGTTTTACTTCCGGAGCTGCATGTTGAGGTGATGGTGTTGCAGTCTAAACTTCAGTTC
ACTGTGAGCGTCCCGTCACACGACTACAGCAACCGCACGGAAGGTCTGTGTGGAGTGTGC
GCCGGGTACCAGGACCAGCTCATCACCAGCAATGGAACCGTCACCGACGACTTTGAATTG
TACGGCAAGAGTTGGCAGGCGAGCCCGGAAGTACTGACGAAACTGGAAGTGCCACCTCAG
GAGCAGTGCGGCGACATCCCACCGCCACCACCCTGTGTGCCTCCTCCACCGGAAAGCAAT
CCGTGTTACAACCTAAACAACGTAGAGAAGTTTGGAGCTTGCCACGCGCTTGTAGAGCCT
CAGTCCTACATAGAGCAGTGCGAGTCAGAGCTCTGCGAGTTGAACTCGACTGACGCTTGT
CCGGTGCTGGAGCGGTACGCGGCCGAGTGTCGCAAACAGGGCGTTTGCCTCGACTGGAGA
AGCGATCTATGTCCATACCCATGCGACGAGCCACTCGTATACAGAAAGTGCGTGGACTGT
GAGAGAACTTGTGAAAATTACGAAGAACTGAAGGACAACCCAAAACTCTGCGATAAACAA
CCCGTCGAAGGATGCTTCTGTCCGGAAGGAAAGGTGAGAGTGAACAACACGTGTATCGAA
CCGAGCAAATGCTTCCCATGTGATACAAAGAAGGAACACTACGCCGGGGACGAGTGGCAA
GAAGACGCGTGCACTCATTGCACGTGCAGTAAGTCGGGCGAGAGCGCACACGTGTCGTGT
ACAACGCGTACGTGCGCGCCGCCCGTGTGCGCTGACGGAGAGGAACGTGTGCCGGCCGCC
ACACCACCAGGAGCCTGTTGCAAGGAGTACCTGTGCGTTCCTAAACCGCCGGACGTGGTC
TGCGACGAACCGAAGAAGATGGAATGCGGGTTCGGACAAGTTTTGAAGCTGAAGAGCAAA
CCCGATGGATGTTCAGAATTCGTCTGCGAATGCAAGCCGGAAAGCGAATGTGAACCTCTT
CCCGATGAGAGTGAAGTGGAGATGTTGGAGCCGGGGATGGAGCGCGTCGTGGACCGCTCG
GGATGTTGTCCGCGAGCCTCGCTCCACTGCCGCCCCGAGGCCTGCCCCGCGGCCCCCGAT
TGTCCCGCACTACATAACCTACGTACTACCAATGTCACGGGCCAGTGTTGTCCCGAACAC
AAGTGCGAACTGCCCAAGGACAAATGCTTTGTGACTCTGGAGTGGGAGGCTGCGCCAAAA
GGAGGAGAGAAGGCCCGTCCGACGCCACAGGTTATGTTGAAAGATTTGGATTCAGCCTGG
CTGGACGGACCGTGCCGCTCGTGTCGCTGCGAATCAACGGCCGCGGGTCCGTCCCCCCAG
TGTCACGTGACCTCTTGCCCCACTCTCACCCCCACCGATCAGTTCGTGCTAGAGCCTCGT
CCGGTGCCCTTCGCCTGCTGCCCCGAACCATACCAAGTTGGAGAAAATTGGACGTCTCCG
AACGACCCCTGCGAGTCCTACCAATGTTCACAGGTCGGTGAGGGACAGCTGGAGAAAGTC
ACCACACTACAGAATTGTCACACCGACTGTGATCTAGGCTGGCAGTACTTCCCGTCTGAA
GACAGTAGCCAGTGCTGTGGGCGATGCCGTCCTGTGTCGTGTGTGGTGGACGGAGTGACC
AGGGACGTGGGTGCCAAGTGGACCTCAACTGACTTCTGCACCAACTACACTTGTGCAGAC
ATCAACGGCACTCTCCAAGTCCAAAGCTCCAACGAGTCGTGTCCGGAACCATCGATGACT
ATGAAGAAAATGTTCGTACTGAGTGAGGAACACGAACCGGGTCACTGCTGTGTGCGCCGA
GAGCCCGTCGCTTGTCGGGATGGAGACCGCATCTATCAGGAGGGCCAAAGTTGGCAATCG
TCCGATCCTTGCAAGAACTACACCTGCTCCCGAGACGAGGCGGGTTCTTTGGCTAGGGGG
GAGAGCGTGGAGGCCTGTCACAAGGAATGCCCCGAAGGACACGAGTACTTGGAACCTGAC
AAAGGAATATGCTGCGGCCACTGCGTGCAGACACAATGCGTGGTCGGTGACGAGCTGAAG
AAACCAGGTACAAGCTGGCAGTCCGCGGACAACTGCACTACGTTCTCTTGCGAGAACAAT
ACGGGACAGGTGGTGGTGACGTCATCCCGGGAGCTCTGCCCCGACGTTAGCCACTGCGAC
CCTCAGGACCTCGTCAACGATACCTGCTGCCAGATATGCAAGGAGAAGCCACAGGATCTC
AGCAAGTGTGTGCCGAAAGCGGTGCCGCAGTCAGAGTCCGTGGGTCTTATACGAGTATTC
ATGGGTCGTTTCGGGATGTGCGTCAACAGGGAACCCTTGAAGGACTTCAAGGAGTGTCGC
GGCTCATGCGACTCCGGAACACTATATAACAATCAGACCGGAAGTCACGACTCGTCGTGT
GAGTGTTGTCAAGCGACCAGGTACGAGCCGGTGGTGGTGTCGCTGCAGTGTGAGGACGGC
TCGCACCGCGACCACCGCGTCGCCTCGCCCGCACACTGCGCCTGCCGTCGCTGCGCTGAA
CTACCATCCTTCCAGCAGAGTACCACCAATCGACATTCATACTTTAGGATACGTCCCGAC
CCCCGGCTACAGCCACCGCGGGGACGTCGACTACGAGATACCGGCCATCTACAACAGGTT
CAGCGTACCGCCCAGGGAATACTAGAGGGGCGAAGGGATAAATATGCATATGTAGAGTTT
AATAGAAAATATAACACCGATGTAATCACACATTCCCAACAATTTTTATTTATAATCCAA
TACTAA
Protein sequence:
MNLIFRFFILLLGITICQGGYGAHYDGQDAQSDVRASPHAPAYINNMRFGPTGYPRRYTG
TKTGYGGTKTGYAGTKTGYTGTKTGYAGKKTGYAGTKTGYGGTKSGYAETRSWHSSGSAP
TYNEFYGRANEPYSAKCQVECKNNGICIDTNTCQCPPGFHGQYCEFEKKPCLMFPPLPMN
SQRKCSQDYCTITCAEGHRFIDGTTVANMQCMNGQWQPTRADLSSIPDCQPECDPPCLNG
GVCLSVNTCQCPADYRGPQCQYAASACDVRKLAFNGGYNCFGDSEKFSCKLSCPSGASLS
SSNTDEYTCDVIVITPSNYKSTLPPYALHSSNHTDISQSEHSTEANKYGIKKPVTIVVQD
FTPKSGTCITWAGVHYKTFDGKIYSFQSPCQHVLLRDSVEHKFTVAVRHPECEHGDCSSE
LTVYLQEKMYTFAVSDDGSVLFRTSKRLMPIPAALPGIRVSMPSDQLIINLDLGLTLKWD
TKNSVVAEASVLLWNKTEGLCGTLDGNPENDLTTKEKTIALTKSVMIASWELNKIGDTCD
SSPTETSQCFSKTDADMKSALQFCTKIFTKDKFRKCSKAMDVSQLLEACQWDYCACLTSL
TPEECACRTVSVYAKECLRHGVEEMRSWRDSDTCPMRCLEGKVYKSCGPEVQASCAFPTA
SNSSCVEGCFCPEGELLEGGRCVQKSECPCRVRNQSFPPGTVMPKKCNTCTCEAGQWTCT
SAACGARCGAVGDPHYTTFDGLRYDFMGHCTYTMLKTDNLTIDVENVACSGAITEAMNLA
PYKGDGKPSCTKAVNLMYNGATIHLKQGGFILVNGKEVDSLPVSVGDIRIRAASSLFLIV
QLPIKVDLWWDGNTRVFVDVPPSFKDSTKGLCGTFNLNQKDDFLTPEGDVEQTALAFANK
WKTREFCDDVSTKEPEHPCKANMQNKDTAETYCSKLKSKIFEACHWYVDVQPYYEDCLYD
MCACAGDVSRCLCPIIGDYADACARSAVMVQWRYHVKECELQCTGGQEYTVCGDSCLRTC
ADVSLGAADCRPRCVEGCACPVGQVLDNNNVCVPVGLCPCYHNGMEFKPGYKEVRAGKRE
RELCTCVGARWSCVPATSEDIVNYPPAEDLRSACSASDHKEFTTCEIAEPLTCKNMHLPP
TVTSQECRPGCQCKKGYVMDAASKKCVLPSECPCHHGGRSYQDGDTMQEGCNTCSCNGGK
WSCTSRPCAGVCSAWGDSHFTTFDGQHYDFEGVCTYLFAKGTVDGKDGFSVEIQNEPCGT
TGATCSKSVTLRVGNGGADDETVALTKSAQLPDTSKLKRIKLRLAGAYVFLDVPSLGASL
QWDRGLRVYVKLDTMWQNRVKGLCGNFNSDMRDDFQTPSGGGFSETSSLIFADSWKLKPN
CPKPQEVTDHCKERPHRQAWASSMCGVLKQFPFTLCHSELPAGEHVARCESDACACDSGA
DCDCACAAIAAYAAACAARGVTFKWRTQELCPMQCDEECSNYNECMSPCPPETCENTLEY
DKIKAACENETCVEGCKLNENKTCPDGTIYSNSSLKECVPRAKCKPVCMTLPDGKEVLEG
EVMEEDACHTCRCSKNEKICTGQPCATDEPVPTGPPETTPYDQPLTCKTGWTEWISRAVP
EITASGASVDNEPLPEVNELTIGSPMCKKEMMTKIECRTVGDHRSPKETGLNVECSLEKG
LICSEPEKNCSDFEIRVYCECEEKPFQCLDSSHPSYPHPTECSQFYECTPELGAPGTLHA
VLKNCGEGLVYNPTIMVCDWPASVALVRPECANITTTVTTPTTTVGTTAAFVTESSTSPS
TSTVIPLDVTTAVCPPHQVYKECAFPCDKLCDHFKHTLQAEGQCLNGEKCVAGCVDVPVS
MVECGFGSMWRDRSTCVPIKDCTCDDDGYVIKPGGVVVEGCRKCQCLDNVLHCDSSECVS
IKTPIEGSTHAPMISTTPLPTTVTTPSTTTTTLTTPMSVATSGSTAAPTTVSPSTSTTPL
IIQTTVSPPPECSPDRYINLLWSDEPLPPTSFSASSSANNLFQPQFAVLNGHPLDVSAGS
WNPAHMDKNQYIQVELPQKEPVYGILMQGSPLFDQYVTSYDVMYGDDGNVFSPVNGPDGQ
PKVFRGPVDNNTPLKQMIEPPIEAKFVRIRPLTWHEEIAVRFELIGCQEVTSTTSTSTTT
TTSTTTTSTTTASPTTVMETTSVPITTLEPLQCTEPLGIGANLPIDMIEVSSNNDARPLL
KLNSERGWTPLYSTPGEWIMFNFTSPRNITGIKTKGGANGWVSGYHVMYTSDFSKFNPVI
DASGEPKLFPGNFDDDTEVLNEIRPPLHAQYLKVLPIRWYNNIEMRVEPIGCFEPYPTPE
PPRDERVPVPCPLCPGVPTAACSCPNNTYYDGENCVSRDQCPCLVGFITYPVGATYRGDA
CSQCVCKLGGVPSCSPAADCRCEEDLVPTLTESCKCLCEPCANGTKICPTSKLCLPLEKW
CDGLQDCPDDERDCTTLAPVTETVVTTVVSTVAHKPRTTEITTIAPTTTTEKPMECPKVE
CPPGYFVKQISTQPTYSWSSTSDLPPPRPRYSYQRYYKGGFSKGGRRGYAKGGFSKTAYS
KGGFSKGGFSGPRSPQQNEAFTLDKPALSSASQSKQQCVQFKCVPALPPPPRPGSTPAPV
VCSAPTCPKNYALKLEHVPLETNQCPQYVCVPPPERPVFCNVTGRTFTTFDGSEYKYDVC
FHILARENRLGAWTVLVRKKCRVEGCANELFVWQDDQIILVKPNLMIEYDNYEYTVEQTS
KICFQKNSFDVNRLGSGISIKSRKYNFTVQYSSDGDVKIGVSKKYMGQVDGLCGLFDGDS
SNDRSLPDGRLAAGVEAFGRSWAKPGLSPDACRTKVIPPEKQKRVWDLCKAITEEPLSQC
AKVLNLDKWRSICLEKICKCAELVINGTKMTEENCRCLLVEKLVAECLAADKNIDVTGWR
IKMGCPAECPAPLVHYDCYRRGCEPSCAPITSGAARCDAADGQCSPGCYCPEGKLRKGDA
CVAPSDCLDCTCTGVGTPAKYTTFEGDDLPFLGNCTYLVSRDRQEKGESQYQVYATNGPC
EGAGGTCVTSLHVLRAANLLHVTKDPNTMKLITTVNGERVFKYPFSSDWAVITLVNGQDV
NVLLPELHVEVMVLQSKLQFTVSVPSHDYSNRTEGLCGVCAGYQDQLITSNGTVTDDFEL
YGKSWQASPEVLTKLEVPPQEQCGDIPPPPPCVPPPPESNPCYNLNNVEKFGACHALVEP
QSYIEQCESELCELNSTDACPVLERYAAECRKQGVCLDWRSDLCPYPCDEPLVYRKCVDC
ERTCENYEELKDNPKLCDKQPVEGCFCPEGKVRVNNTCIEPSKCFPCDTKKEHYAGDEWQ
EDACTHCTCSKSGESAHVSCTTRTCAPPVCADGEERVPAATPPGACCKEYLCVPKPPDVV
CDEPKKMECGFGQVLKLKSKPDGCSEFVCECKPESECEPLPDESEVEMLEPGMERVVDRS
GCCPRASLHCRPEACPAAPDCPALHNLRTTNVTGQCCPEHKCELPKDKCFVTLEWEAAPK
GGEKARPTPQVMLKDLDSAWLDGPCRSCRCESTAAGPSPQCHVTSCPTLTPTDQFVLEPR
PVPFACCPEPYQVGENWTSPNDPCESYQCSQVGEGQLEKVTTLQNCHTDCDLGWQYFPSE
DSSQCCGRCRPVSCVVDGVTRDVGAKWTSTDFCTNYTCADINGTLQVQSSNESCPEPSMT
MKKMFVLSEEHEPGHCCVRREPVACRDGDRIYQEGQSWQSSDPCKNYTCSRDEAGSLARG
ESVEACHKECPEGHEYLEPDKGICCGHCVQTQCVVGDELKKPGTSWQSADNCTTFSCENN
TGQVVVTSSRELCPDVSHCDPQDLVNDTCCQICKEKPQDLSKCVPKAVPQSESVGLIRVF
MGRFGMCVNREPLKDFKECRGSCDSGTLYNNQTGSHDSSCECCQATRYEPVVVSLQCEDG
SHRDHRVASPAHCACRRCAELPSFQQSTTNRHSYFRIRPDPRLQPPRGRRLRDTGHLQQV
QRTAQGILEGRRDKYAYVEFNRKYNTDVITHSQQFLFIIQY