New model in OGS2.0 | DPOGS209103  |
---|---|
Genomic Position | scaffold773:- 316-29502 |
See gene structure | |
CDS Length | 9315 |
Paired RNAseq reads   | 3204 |
Single RNAseq reads   | 7466 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002251 (2e-25) |
Best Drosophila hit   | huntingtin (8e-15) |
Best Human hit | huntingtin (1e-68) |
Best NR hit (blastp)   | PREDICTED: similar to Huntington disease gene homolog [Tribolium castaneum] (8e-97) |
Best NR hit (blastx)   | PREDICTED: similar to Huntington disease gene homolog [Nasonia vitripennis] (5e-95) |
GeneOntology terms    | GO:0005515 protein binding GO:0008219 cell death GO:0006915 apoptosis GO:0000052 citrulline metabolic process GO:0019244 lactate biosynthetic process from pyruvate GO:0048167 regulation of synaptic plasticity GO:0034399 nuclear periphery GO:0003714 transcription corepressor activity GO:0005215 transporter activity GO:0009887 organ morphogenesis GO:0007029 endoplasmic reticulum organization GO:0016023 cytoplasmic membrane-bounded vesicle GO:0016197 endosome transport GO:0030073 insulin secretion GO:0007610 behavior GO:0019805 quinolinate biosynthetic process GO:0051881 regulation of mitochondrial membrane potential GO:0005634 nucleus GO:0005794 Golgi apparatus GO:0006917 induction of apoptosis GO:0006888 ER to Golgi vesicle-mediated transport GO:0006916 anti-apoptosis GO:0007005 mitochondrion organization GO:0008542 visual learning GO:0009790 embryo development GO:0021756 striatum development GO:0008017 microtubule binding GO:0005246 calcium channel regulator activity GO:0005625 soluble fraction GO:0007420 brain development GO:0007569 cell aging GO:0016234 inclusion body GO:0021988 olfactory lobe development GO:0021990 neural plate formation GO:0035176 social behavior GO:0042445 hormone metabolic process GO:0048341 paraxial mesoderm formation GO:0050809 diazepam binding GO:0051592 response to calcium ion GO:0005737 cytoplasm GO:0007212 dopamine receptor signaling pathway GO:0007626 locomotory behavior GO:0022008 neurogenesis GO:0030424 axon GO:0046902 regulation of mitochondrial membrane permeability GO:0048666 neuron development GO:0000050 urea cycle GO:0006606 protein import into nucleus GO:0007369 gastrulation GO:0008340 determination of adult lifespan GO:0009952 anterior/posterior pattern formation GO:0043524 negative regulation of neuron apoptosis GO:0047496 vesicle transport along microtubule GO:0055072 iron ion homeostasis GO:0007283 spermatogenesis GO:0008088 axon cargo transport GO:0051938 L-glutamate import |
InterPro families    | IPR016024 Armadillo-type fold IPR000091 Huntingtin IPR011989 Armadillo-like helical IPR000357 HEAT |
Orthology group | MCL12845 |
Nucleotide sequence:
ATGAATGTGTTAGAAAAAGCCGAAAAGGCTTTGGAATATTTAAAACACAATGAAGGAATT
GCGAAAGGTCATGAGCTTCAAACGGCAGCAGGGACTCTCGGCCGTTGTTTAGGAGCACTA
GGAAGTCGGCAAAACTGCGCTAGACACTACGCTAATTTGTTACATACAGCCTTGCCTACC
CTTCTTACTTTAGCTAGTAATGAAAGTGCTGAAGTCAGATTACTGGGAGACGAAGCTCTG
AATAGAGCTGTAGCTGGTGGTTTTGCTTTTCATTCTTATAAAACAAATATTATACTGCAA
AATCAAATAGATGCTACGAAAAGTGCACGATGGATAAAAGCCGCATTATCTCGTATTTGC
CTCGGCGATTGTTGGCTTCGACCGGGTATTGGCAAGATTCGCTCTCAAGCCCAGACGTTG
TTCCCAAAACTCACCCAAATAGTGCGAACAGGGGAAGTAAATCTCATAGTGGAAGCCTTA
GAGCATAATCTGCCAAGGATACTGAAGGCTTTAGCGGAGTACACGACGGATGAAGAGATC
TTTGATTTATCCAAAGCTTTACTTTCTCACATCGAGAATGTCGAACCAGCTTTGAGACGC
GGCATATCAAACTGTGTAGCCAACTTGTGTTCACACAGAGAATTGCTGCTTACAAGCGTG
CTCGGAATGATGTATGAGAGGTTGTGGCCGTTAACACAGAATGGCTGTGCGGTGATCGGC
TGGTTCTGTGTAATCAAAGCCATATTCCAGATAAATGACTTCGATAAGTTTAAAGAAAAT
GATTTGTTTGGTGTTCAGGACTATTTGGAGTTATATCAACTGATAATAAACTATATAGAA
GTATCGACTGAACATAACGCCCAGAACACACTATTGGAGTGTTTGGCCGTACTTTTGTCA
CAAGCTAAAGATGAATATAGGATTGCTTTGTTAGAAAAACACCCCCACGCCATATGCCTT
AATGATAGACAGAGCGTTAAAGGACATAAGAGGAATATGAGTTATTCAAGTGCTATATCA
TCAGTTACTATACCAAGTATGGTGTTAGAGGGACGAGAGTCTGAGATATATTTCACTGGA
GCTCTACAGTTAGGAAGTGTTATGAGAATTGATGAACTAACGGCCAGTACTGATGAACTA
CCCATACAGACACCAGATTCTCCCGTCACTGAATCCGACCTGGAGGCGTCGGATACAGAG
CAGCTGTCGCGAGAGGTGTCCGAAAAGCTCAAGGATTACGAGGAGTATGGAGAAGACATC
AGAGATAGTGGGCTTAAGATAAACATCGGTTCATCGGATGAGGATGACGTCAGACTGAGA
TACTGTTCGCGTCTGCTGGCATCAAAGTTCCTTCTCACTGGACATAAAGGAGATTTCATA
TCTGACAGATTGGTTCGCGTGTCTGTGAAGTCATCGGCTCTGTCTTGTTTGTCAGTCATT
TCGTCTCTCTACCCTCAAGCCTTGACCTTATATCTGGATAAAGAAGAAGATGTTAGATTT
CAGAATTATTCTGGTACATCGGAGGGTACGGAGACGAATCCGGACCAAATAAACGAATAC
ATCCTGGAGCGTAACGTCTGTTACCAGGATTCCATAACGGAGTCGCTCAGTCAGGAACTA
CTGAACCGCAGCACCGAGAGTCAGATATTCAAAAAGGATAAATCTTCGGATAAATCGTCG
GATAAATCGCTGGAAAGTGATATGAAGAGTTCAAAAGATGTGCACAGTGCTATACTAGAT
AGTAGAGCTAACCTGATGGCTGAAAGTACCAATCCCAATATGACGAACAGCAACTTCACG
TCCAACTCGAACATGACCAGCAGCATAGACCATGTGTCGACCGGATATCCGATGTCCAGC
AGCGGGGACGTGCTGACCTCCAGCGCAGACCTGGATATGAAATTTGATCATTTCGGTGAA
TCTATAACAGCTCTGGAAGACGTTTTCCATGAGAAAACTGTGACTGTTAAAGAAATTGAC
AAAGACAAAGTCGATAAGGACTGTTTTAAACATCAGCTTTTATTTGATATATTTTCTTTG
GAGGGTCACAGTGACCCTCAAATACGTGGTATTGTTAGGGTCTGTATGGGTAATTATTTG
TGTGCTGCCCTAGAGTCGGCCCACGGTGATTATAACAGGTGGAAATGTTACAGTCAACTG
CCTAAAGATGTTAGTGAGGACATTAGTGTTGAGAAACTCATGGAAACTATATTGAAGGGT
GTTCATGATGAGATTCACTCAACAGTGAATCACACTCTATCCGCCTTGGGTCGGCTGTGT
AATGTTCTATGCCACAGCGTTCACTGGTCACTCATAACAGACTCACTCAATTCACTCATA
ACAGCCCAGTACAATACATACTGGTTATGTAGAGTGAATCTCGCCAAGTTATACGAGAAG
CTGCCGTATGAAAGACTCTTCACGCTGCATCCTGAATATTATTCGAGGATCCAACTTATA
ATGGATGCATTGTTTCACCTGTGTGGCGATCAAGATCAGAAAGTAAGGACGGCGGCTGCT
CAGGCCATAGCTAAGATAATACCTAAAGCGTATCCATCACGTAAATCCACTCCGATATCA
GTGATGGCCGAAATATCGAAGCGGCAGAGCAAAATATTCACGTTCGACTCGAACAGACTG
TCATTGGAGCTCGTACGTGATATATATTTCTATAACGATCTCCCGGAACAGTTGAAAGAT
GGCGCCACCGTCACCAACAGAGACAGCCTGAAGATGTTCCTGGGGGATTTAATGGGAAAA
CTAATGGCTTCCAATAGTAAATATTATACGCAAGGTCTGATAGAAGTTCTTCAAGCCGTA
TGCCTTAAATGGTCTCCGTGGAAGCATATAGAGTCGTTTACCGGCCAAGGGATTCTGGAC
TATTGTGTGGATGACCTAGACTATTGCAATAGCTCAGTAGTTGTGAGGACTATGTTAATG
GATATATGCCGTTTGGTGTATCCAGTCGAAATCCACAACGTAATGCGCCATAAGACAGCG
AATAGGGATATATTTGAAAGGGACACTCAAGTTAAGGAGAAATGGCAACATCTGTATCAA
GACTTTGAGGGTACAGAGTATACAAAAATCGACAATAGGGTTGCCAATATAGCAGAAAAA
TTTCTACAAGTGACATTGAAAATGTTAAATGTTTTGGTACATTTGATAGAGGAGATCAAC
CCTAATATACATTTGAATAAGAGCGGTATTGCGTTGCCCGGAAGTCCTGTTAGGAGGAAA
ACTCAAGAATCAATACCAAGAAAGAGTAGTTTGACTGAAGAAGACAAAAAAAAGCGACCT
CTACCCGCTGCAAGCTTGAAGGCTAACTTTAGTGGACACTTCTTTAATGAACCGTTTTAC
ATGAGGCTGTATGAGAACTTGAGAGCTACGTACTCCAATCATAAGATCAATCTAGATCCG
AAAAGCAGTATATTTTATGCGTTTCTGTCCACAACACTGGACTGTCTATCCATTCAACTG
GAACTGGCCACTGAGAAGGAGTTCGGTCATGTCACAGAGGAAATATTATTCTATCTGAAA
GTTATAATGCCGCTGTGTCCGAACAAAACCGTGCATTGTATAACGCAACTGTTGAAGTGT
CTCTTCGCCACGAACATGATCAATCAGTACAACGACTTCATGAGCATAGCGGACAAAGTT
CTAGAAGATTCTACGAGTTTCTACGAGCATGTCATGCTAGTGAGCGCTATAGATGACGAC
AGAAGCAGCGTCAGCTCCAACTGCAGCCAGATGTTGGAGGTCAGGAACACGAGGATGCTG
GATGAGAGGAAAATGCTCATGAATTTGGAGAATTTTGCCAAAAACCGCACGGACAGGAAG
TGGAGCACCAACAAGAAGGAACTAGAGAGATATATAAGACTGTTCGAACCTGTCGTCATA
CAGTCTCTAAAGGCCTACACCATGCAGAACGACATACCCCTACAATGTTCGGTGCTATAC
CTCCTCTGCGCGTTACTGTCACTGCGCGTGAACTACTGTATGCTGGACAGCGACCAGGTG
TTCATAGGATATCTCATGAAACAGCTGGACAACATCGAACAGCATGAGATACCGAACTGT
TGTCAGCTGGTCAACAGTATAATGCTATTCCTGGTACAACTGTCGTCGTCCAAACACAAC
ACGAAACAGATCATTGAGATCCCAAAGCTGCTTCAGCTATGTGATGGTTTGTTAGCGGCT
GGGGCTCAGGACGAGTGTGTGAGTGGCCTGGCGCCTGTAGCAGCGAGAGTGTTCAGCAAC
ATGGCCGGAGTGAGTGTACTGGGTCGAGCTCAACAGCAAGAGCTCCAGGCGACTAAGGAG
GTTCTATTCTATATGTTACAGAAGACGATGCATTTACCTCAAGTCCTGAGTCTAGTATCG
ACTGTTCTATCTCTTTCCCAAGAACATCCGGAGAGCTACTATCGCTGGTCGGAACTAGCC
AGCGACTCATTGCTCAATTTGCTCAGCGAGCGTACAGTTGGCGACTCATTTTGCTCAGTG
GAATCTATAGAAATGCTATTGGAATGTGTCTACAGAGACGTATTGTTGGAACAGTCGAGG
GTTGAGATTGTACTGAAGATATTGTTCAAGTGTCCGCCAGATAAGGAAACGTCACCAAAG
AATCTGAAGATCAGATATCTGTCTATAATAATGATACTATTACGGAAAGTGCTGATACTG
ATACCGGAAACGGAAATACTATTGTCAATAAACTATCTTAAGTCGACATGCATTTCACCC
CAGTCTATATTCTTCAATGTTAAACAGAACGTGGACCCTTTGAATGTTCAGAATGTTAAC
GAAAACTGCGCGAACCTTTCGCCGGATGTTATATTAGTGAGGTTTTTGTTCAAAACTTTG
ACTTATGCTATAATGGAAATGGATGGGTTCCAAGCTAATTCAGACTTTATTACGAACCAA
GATAAGGATGAAAATACGTTGTTGTATTCTGTATGCGTGAATATAATAGTTCAAGCAAAA
AACATGCTACACTTGACCCCCGGCTGTATGTTCCCTCTAACAGCAAAAACAGCGCAAACA
ATCCTGCACAACGAACAGAGCGGCCTCAACACCGGCCTGTACAGTCCCGAAGAAAACATA
CCGCTGGATGCACTGAATGTTATATGTCTGAAGCTAGCTCACTCGATGCCATTGCTGGCT
GTGCACTGGTCGCATCTGTTGATACGCCTTAACTTCTTGTCACACAAGTACTGGCACAAG
CTGATAGGCTCCGCTCTCATGACCCCCAGCCCTCACGGATGTGACTCCAGCGTGATGAAG
GTTGACTTGTTACAGACGGCCTGTGTTATGGCTTATTGTGAATATTTTGTAGAGAGAGGA
ACTTGGACGGTGGAGGTAAGCGTTTACCGCGCGTTAATTAGGGGCCCCTTAAACCTACAC
CTGGGTCGCAAGTCCCGGGCACTGTTGAAGCTCCTCTCCCTACAACGCGATGGACCCGGC
AGCCGGACGCTGGATAATGGTTTCAACGAGGCCGTTCACCTGTCCTGGCTGTTAGTGAAC
AGAATACATATCCTCATATCACAGTACAAGGAACGCGTCGTACATTCTCTTATACAGCGT
GTTCAGATAACCCCTGGAGCTTCTGGGTTGTTGCTGCAGGCTGTGGCCGCCAGGTGTCAA
GGATGTCTTAAGGACGAGTTCGCTTTAAACACGTACAAAATACTATCACTCTCGCACGAG
AGTCAATCCGGCGCATTAGTGTTCCTGATATGTCGCATCATTGGAAAGTTGGACCCGTCC
ATAGCTGCGAGATTCGCAAAGCTAGCTATAGATAGATGTAAGGCTTTGAAGGATTTGCCC
TCTGACGTCATCAACGCTCAACTATCAAAGGAGGATGTACAAGTTGCTTTGGAGCTGTTA
CAAAGAGACAAAGTACATATACGGTATCCAAAGCTAGTGATGTCTTTGAACTCGCTGGCT
TCATCAGCCTTGGATCTCTCGCCGCTGGATCTGTCCCAGGAACGTGATATAAATCCTCAT
TACATACATACAACTACACCCGACGATGCGTGGCTTATCAGCCAAATAAAGAGCAGATGT
TGTCAGTCGGATTGCAAAATGCCTAAAGTACCAAAATATAAAGATCTCGCAAACCTGTTG
AAGAATTTATCACTAGAAGATGTGACGACTGTGATGTCCTGCTCCGAATTCGATAGGAAG
ATATTGAGTATGTGCTTCAAGATATCATGTGAAGATTTTACAAGGGATTTCTTGAATGCC
ACATCGGCGCAGGAGTTGAGAGAGATGGAGATGGAGTCGTTGAAAGAGGAGGAAGGGAAT
AGATTTAAAGATGAAATGTCGTCCTCGATAGACAGCAATAAACTCAGCGTGGTCAACACG
AACGTGTTCAAGAGAAACGACAATAAGGAATCGAAATCCCAATTCTTCATACCGATAGTG
GATGACGTCGCCGAGAAGTTTGGTAATCTAGAGATATGCGAGGAGGAAATGGAGATTGTT
GTGCCGGACATGCCTAAGCTGTACCAGGCGTCACTCGCGACGCTGGACAAATCCTTGGCG
GATATCCTCAAGTTGTTCCCGAAACAGAACCGGCCATTGAGCCAATCCGAAAACTTTAAC
CTAAATATAGACCACACCTTGGACAGGTACACCAGGAGGTGCCATTTAGTCTTCCAAGAT
AAATTATTCTATCAAGAGTTCATGACTATACAATCAGCTCTGACGGGTTTTCTGAATTCC
GTTGATGGGATTCTGACCCTTATTGACGAAACGGATTGCGATTTACTCGAAAAATGCATA
GAGAATATAATACCGAACAATTTTGCACGCAACATGGCCATGTTTTCGATAGTATCGCTG
CAGTATTTATCATTTTTGATAAAAAACAAAAAAGTCGTCGAAACTCCAGTCAATACCGAC
GTGTCATTCCGTTCGCCGGCCGTTAACACTGAAAATGTGTGCGTGGACTACGTCATACTC
GTAACGATCGATAATGTGTCCAGAGCCTTAAGTATCGATGAGGTGTGGACTAAGCTTAAC
GCTGATAACTACATGAATAAAACACAGTCAGCGATAACCTGTCTGTACGCGGTCCTTAAG
TATTTAGTTAAGGACACCAAACCCCTTATATTGAAGTCTCACATGCCGTACCGGGATGCG
GGTCCCAAGCCCGACATAATAATAACAGGCAATAAGTTGGCGACGCTCATAGAGTATTGG
GAGGAGAATTTCTACACGAGGACAAACAGACGGGTCCTCGGTGCGAGATATAAGAAACCG
ATTGAAAGTTTATTAATCAGTTTGTCGAGGTTAGAAATAGTGACCAACATAGCGCTGATA
CCGCCCATAGCTTGGTCCTATGTTGAAGTCAAAATGAAGAATGATAATTTGGAGAAAATA
GACATCCCGCTGCATCCGCTGCAGGACATGGATGTGCTCGAGGCGTTCATACTCCGGGTG
AATCTAATCGGTTGGTCCACTAAGAAGCAATTCGAAGAAATTTGGGTAGGTTTGTTGGGA
GCCTTACAAGGGAACGGTACACACTGGGCTATCAACGGTATCACACAACTGCTTCTCACA
ACGGCCCCCTCCGTTAGGGGACGGATGATACACGTGCCGAGGAAATACGTCAAGATCAAC
GAGGGAATGCAACGGCTAAGGAACATTTTGGTTGGAACATCAATATATGATATCTTTAGA
GACGTGAATTTAGAACGCGTGCCGCTCATCAAGGACGGCTTCGACGGCTATCATCAGGGA
CAGTTCAGTACTGAATATTTGAAGTACGCTTCAGATATATCAAACGAGGCGTCATACAAA
GTACGTAAAGCTGTGAAACGGAAGAGAAAAAATAAAGATATAGATGTGAACAGCTGCATA
CAACTACTGATGGACGTCACGACCGACATGTTGGATCCTAAGTCGTCAACCGGCGTGGCG
GGTCGTCTATACGTGATCAAGTCCCTGGAGCATACATGGTGTGTGGTGTCGTCATCGTCC
CAGTGGTCTCGGTCGGCGGCCCTCCTGTCCCGGGCTCCGCGGGCCGCCGCCGCCGCGCCT
CTCCTACACGCCCTCGCCGGCTGTTTCGCTGTACTACAAGGAGACTTCCAGGAGGTGTCC
AGTGTTCACGAGAAGCTTCTCAAGTCGTTATCATCTGGGGCGGCTGCAGTCCGTCAGGCG
GCGCTGAGGGGCTGGCTGTTACAGCTGGCGGGGAGAGGGCCGGGGGACGCCGCTGGGGCG
CTCCGGGACAGTGTGAGGTGTGGAGGGGATAACAGGCTATCACCACACGAGCAGAGTCTC
AACTGGTCGGTACTGTTCACACTAGTGGAGCTGGGTCACAGTGATCTAATGCATACAGCT
GTTGACTTTGTATTGAATAAACCTAGACATTATTGTACGGATCTTGTTGTTAAGGGTATA
ACTTCCGTGCTGCGGCAACAAGTCCTGTCGAAGGACCTCAAGAAGTCTATCATAGAAAAA
CTGCTGGACAATATGAAGATGTACTCGGAGCACCACGCGGTTCAGATACTGATGGTGCAT
TTGTTCTCTGCGGACAGTAAGCTGATAAGTCCAAGATTCGAAACCGACGTGTCCAACATG
GACCCGGACGTGCTGATGAACTCAATGGAACGCATAACGCTGCTGTACAAAGTCCTGAAG
CAGTGTAAATATAGAGAGAACCAACAGATCTGTACCGCGACACTGAAATATTTCCTGCGA
GAGACTCTACCGCCGGCCGCGACTCTGAGTAGAGTCGTGATAGAGTATTTGGAGTGCTGC
AAGGAAACGGAAAGGCTAAATATGACTGCACTCAAAGAATTCAACAATAACATAGAATGT
GCTATCATGAACGCTGATATTGTGTTCGAGGTATTCAACACCTCTATATCTCAAGATCAA
TTGCCAGTTCTAAGCGGTTGGATATTTGAAGCTCTCTGTCATTTACTCTCAGGGAAGATA
TCACATAAGCTGGTCCCGTACTGTTTGCTCACGTTATTGGTGTCCGCATCCGCCAACGCC
AACATAAGGACGTTACATCCGCTAACATATTACATATTCAGACAGGGACTGCATAATAAT
TCCATGTACATGAGGAATAACACGGAAGAAAATGATAAAAATGATGTTTTAACCCCGAAT
AGGACTGATTCGGGCCAAAACTGGGGTATATTTGGTGATTTTACAAGACATACGCCAATG
TCGTTCACCGACAGACGTCTTCTATGCGTAGTGGCTCTGCATTCAAATTTCAGCTCCAAC
CAGTTGGAGAGGTTGAAGCAACTGTGCGAAGGGAACGAGTTTTTGGGAGATCTGATGAGA
TGTTTAACGGAATAA
Protein sequence:
MNVLEKAEKALEYLKHNEGIAKGHELQTAAGTLGRCLGALGSRQNCARHYANLLHTALPT
LLTLASNESAEVRLLGDEALNRAVAGGFAFHSYKTNIILQNQIDATKSARWIKAALSRIC
LGDCWLRPGIGKIRSQAQTLFPKLTQIVRTGEVNLIVEALEHNLPRILKALAEYTTDEEI
FDLSKALLSHIENVEPALRRGISNCVANLCSHRELLLTSVLGMMYERLWPLTQNGCAVIG
WFCVIKAIFQINDFDKFKENDLFGVQDYLELYQLIINYIEVSTEHNAQNTLLECLAVLLS
QAKDEYRIALLEKHPHAICLNDRQSVKGHKRNMSYSSAISSVTIPSMVLEGRESEIYFTG
ALQLGSVMRIDELTASTDELPIQTPDSPVTESDLEASDTEQLSREVSEKLKDYEEYGEDI
RDSGLKINIGSSDEDDVRLRYCSRLLASKFLLTGHKGDFISDRLVRVSVKSSALSCLSVI
SSLYPQALTLYLDKEEDVRFQNYSGTSEGTETNPDQINEYILERNVCYQDSITESLSQEL
LNRSTESQIFKKDKSSDKSSDKSLESDMKSSKDVHSAILDSRANLMAESTNPNMTNSNFT
SNSNMTSSIDHVSTGYPMSSSGDVLTSSADLDMKFDHFGESITALEDVFHEKTVTVKEID
KDKVDKDCFKHQLLFDIFSLEGHSDPQIRGIVRVCMGNYLCAALESAHGDYNRWKCYSQL
PKDVSEDISVEKLMETILKGVHDEIHSTVNHTLSALGRLCNVLCHSVHWSLITDSLNSLI
TAQYNTYWLCRVNLAKLYEKLPYERLFTLHPEYYSRIQLIMDALFHLCGDQDQKVRTAAA
QAIAKIIPKAYPSRKSTPISVMAEISKRQSKIFTFDSNRLSLELVRDIYFYNDLPEQLKD
GATVTNRDSLKMFLGDLMGKLMASNSKYYTQGLIEVLQAVCLKWSPWKHIESFTGQGILD
YCVDDLDYCNSSVVVRTMLMDICRLVYPVEIHNVMRHKTANRDIFERDTQVKEKWQHLYQ
DFEGTEYTKIDNRVANIAEKFLQVTLKMLNVLVHLIEEINPNIHLNKSGIALPGSPVRRK
TQESIPRKSSLTEEDKKKRPLPAASLKANFSGHFFNEPFYMRLYENLRATYSNHKINLDP
KSSIFYAFLSTTLDCLSIQLELATEKEFGHVTEEILFYLKVIMPLCPNKTVHCITQLLKC
LFATNMINQYNDFMSIADKVLEDSTSFYEHVMLVSAIDDDRSSVSSNCSQMLEVRNTRML
DERKMLMNLENFAKNRTDRKWSTNKKELERYIRLFEPVVIQSLKAYTMQNDIPLQCSVLY
LLCALLSLRVNYCMLDSDQVFIGYLMKQLDNIEQHEIPNCCQLVNSIMLFLVQLSSSKHN
TKQIIEIPKLLQLCDGLLAAGAQDECVSGLAPVAARVFSNMAGVSVLGRAQQQELQATKE
VLFYMLQKTMHLPQVLSLVSTVLSLSQEHPESYYRWSELASDSLLNLLSERTVGDSFCSV
ESIEMLLECVYRDVLLEQSRVEIVLKILFKCPPDKETSPKNLKIRYLSIIMILLRKVLIL
IPETEILLSINYLKSTCISPQSIFFNVKQNVDPLNVQNVNENCANLSPDVILVRFLFKTL
TYAIMEMDGFQANSDFITNQDKDENTLLYSVCVNIIVQAKNMLHLTPGCMFPLTAKTAQT
ILHNEQSGLNTGLYSPEENIPLDALNVICLKLAHSMPLLAVHWSHLLIRLNFLSHKYWHK
LIGSALMTPSPHGCDSSVMKVDLLQTACVMAYCEYFVERGTWTVEVSVYRALIRGPLNLH
LGRKSRALLKLLSLQRDGPGSRTLDNGFNEAVHLSWLLVNRIHILISQYKERVVHSLIQR
VQITPGASGLLLQAVAARCQGCLKDEFALNTYKILSLSHESQSGALVFLICRIIGKLDPS
IAARFAKLAIDRCKALKDLPSDVINAQLSKEDVQVALELLQRDKVHIRYPKLVMSLNSLA
SSALDLSPLDLSQERDINPHYIHTTTPDDAWLISQIKSRCCQSDCKMPKVPKYKDLANLL
KNLSLEDVTTVMSCSEFDRKILSMCFKISCEDFTRDFLNATSAQELREMEMESLKEEEGN
RFKDEMSSSIDSNKLSVVNTNVFKRNDNKESKSQFFIPIVDDVAEKFGNLEICEEEMEIV
VPDMPKLYQASLATLDKSLADILKLFPKQNRPLSQSENFNLNIDHTLDRYTRRCHLVFQD
KLFYQEFMTIQSALTGFLNSVDGILTLIDETDCDLLEKCIENIIPNNFARNMAMFSIVSL
QYLSFLIKNKKVVETPVNTDVSFRSPAVNTENVCVDYVILVTIDNVSRALSIDEVWTKLN
ADNYMNKTQSAITCLYAVLKYLVKDTKPLILKSHMPYRDAGPKPDIIITGNKLATLIEYW
EENFYTRTNRRVLGARYKKPIESLLISLSRLEIVTNIALIPPIAWSYVEVKMKNDNLEKI
DIPLHPLQDMDVLEAFILRVNLIGWSTKKQFEEIWVGLLGALQGNGTHWAINGITQLLLT
TAPSVRGRMIHVPRKYVKINEGMQRLRNILVGTSIYDIFRDVNLERVPLIKDGFDGYHQG
QFSTEYLKYASDISNEASYKVRKAVKRKRKNKDIDVNSCIQLLMDVTTDMLDPKSSTGVA
GRLYVIKSLEHTWCVVSSSSQWSRSAALLSRAPRAAAAAPLLHALAGCFAVLQGDFQEVS
SVHEKLLKSLSSGAAAVRQAALRGWLLQLAGRGPGDAAGALRDSVRCGGDNRLSPHEQSL
NWSVLFTLVELGHSDLMHTAVDFVLNKPRHYCTDLVVKGITSVLRQQVLSKDLKKSIIEK
LLDNMKMYSEHHAVQILMVHLFSADSKLISPRFETDVSNMDPDVLMNSMERITLLYKVLK
QCKYRENQQICTATLKYFLRETLPPAATLSRVVIEYLECCKETERLNMTALKEFNNNIEC
AIMNADIVFEVFNTSISQDQLPVLSGWIFEALCHLLSGKISHKLVPYCLLTLLVSASANA
NIRTLHPLTYYIFRQGLHNNSMYMRNNTEENDKNDVLTPNRTDSGQNWGIFGDFTRHTPM
SFTDRRLLCVVALHSNFSSNQLERLKQLCEGNEFLGDLMRCLTE