New model in OGS2.0 | DPOGS205071  |
---|---|
Genomic Position | scaffold1352:+ 251-18763 |
See gene structure | |
CDS Length | 8586 |
Paired RNAseq reads   | 4350 |
Single RNAseq reads   | 11811 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006816 (0.0) |
Best Drosophila hit   | rudimentary, isoform A (0.0) |
Best Human hit | CAD protein (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to carbamoyl-phosphate synthase large chain [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to carbamoyl-phosphate synthase large chain [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0006207 'de novo' pyrimidine base biosynthetic process GO:0004070 aspartate carbamoyltransferase activity GO:0004088 carbamoyl-phosphate synthase (glutamine-hydrolyzing) activity GO:0004151 dihydroorotase activity GO:0006541 glutamine metabolic process GO:0070406 glutamine binding GO:0005524 ATP binding GO:0004087 carbamoyl-phosphate synthase (ammonia) activity GO:0005737 cytoplasm GO:0016597 amino acid binding |
InterPro families    | IPR005483 Carbamoyl-phosphate synthase, large subunit, CPS-domain IPR001944 Glycoside hydrolase, family 35 IPR001317 Carbamoyl-phosphate synthase, GATase domain IPR013781 Glycoside hydrolase, subgroup, catalytic core IPR002474 Carbamoyl-phosphate synthase, small subunit, N-terminal IPR013817 Pre-ATP-grasp fold IPR013815 ATP-grasp fold, subdomain 1 IPR013816 ATP-grasp fold, subdomain 2 IPR005480 Carbamoyl-phosphate synthetase, large subunit, oligomerisation IPR011607 Methylglyoxal synthase-like domain IPR011761 ATP-grasp fold IPR017926 Glutamine amidotransferase type 1 IPR006130 Aspartate/ornithine carbamoyltransferase IPR002195 Dihydroorotase, conserved site IPR005479 Carbamoyl-phosphate synthetase, large subunit, ATP-binding IPR019801 Glycoside hydrolase, family 35, conserved site IPR006274 Carbamoyl-phosphate synthase, small subunit IPR006275 Carbamoyl-phosphate synthase, large subunit IPR002082 Aspartate carbamoyltransferase, eukaryotic IPR017853 Glycoside hydrolase, superfamily IPR016185 PreATP-grasp-like fold IPR008979 Galactose-binding domain-like IPR011059 Metal-dependent hydrolase, composite domain IPR006132 Aspartate/ornithine carbamoyltransferase, carbamoyl-P binding IPR006131 Aspartate/ornithine carbamoyltransferase, Asp/Orn-binding domain IPR005481 Carbamoyl-phosphate synthase, large subunit, N-terminal IPR006680 Amidohydrolase 1 |
Orthology group | MCL10751 |
Nucleotide sequence:
ATGTTGGCAACATTGCGATTTCTATCCGTATCAATATGTGTGATGCAATTGATTTTGGCA
CAACCGACGACCATCCAAGAAACAGATCCAATAGTGTTACAGATAACAGACAACAACCAA
GGAGGTCAGCTATCGCAAAAAGACTTTCAGAACGCACGTAATATAAGCATTGTAGGTGAT
GACTTTATGCTCGATGGCAAACCGCTCCGTATTGTGTCAGGATCTGTGCACTACTACAGA
CTACCGGCAGAATATTGGAGAGATAGGTTACGGAAAATTAGAGCCGCCGGTTTAAACGCT
GTTTCCACATATGTGGAGTGGAGCAGTCATGAGGAAGAAGAAGGTGCCTATTCTTTTGAA
GGTGACAAGGATATTGCCAGATTTCTAAAGATTGCCGCAGAAGAGAATTTGTATGTGCTA
CTTCGACCTGGGCCGTACATTTGTGCTGAAAGAGATCTAGGTGGACTACCGTATTGGCTT
TTGAGCAAATATCCTGATATCAAATTGCGAACTACAGACGGAAATTTTATAGCAGAAACC
AAAAAGTGGATGGCTAAACTATTTGAAGAAGTTAAACCTTTTCTATTAGGCAACGGAGGC
CCTATCATATTGGTCCAGGTAGAAAATGAGTATGGCAGTTACGGAGCTTCAAAGGAGTAT
ATGAAGCAAATTCGGGACATAATAAAATCTCACGTAGAAGACGCAGCACTACTTTACACC
ACCGATGGTCCCTACAGATCATACTTCATCGACGGATCTATATCCGGTACACTTACAACT
ATAGATTTCGGACCAACGACTAGTGTTATTAACACTTTTAAAGAGCTGAGAGCATACATG
CCTGTTGGTCCTTTGATGAATTCCGAATTCTATCCTGGATGGCTAACACATTGGAGTGAA
CACATTCAGCAGGTGTCCACTGACCGTGTAACTTTTACTCTTCGAGACATGTTGGAAAAC
AAGATTAATTTAAATTTTTACGTTTTCTTTGGAGGAACTAACTTCGAATTTACATCTGGT
GCTAACTATGGAAGATTTTATCAACCCGATATAACATCTTATGATTATGATGCTCCATTA
TCCGAAGCTGGCGATCCTACGGAGAAGTATTACGCTATACGAGACGTACTGTCCAATTAC
GATTTAGTGCCAGATGATATACCAGTTCCAGTACCATCAAAAAAAGGAGCCTATGGACGA
ATTGAAGTAGCAAACAAAATTAATCTGCTATCAACAGAAGGACGTTCCAGTTTAGGAATT
AAATACAAAGACGTGGAAGGTGCGAAATTACCAACGTTTGAGGAATTGAAACAAAGAAGC
GGTCTCATGCTTTATGAAATGACACTCAATGGAACCGGTGGAGTTTTGAATATAAAAAAA
CCACGAGATTTCATATTTGTTTACGTTGATAAGAAACTGCAAGGAGTTATAAGCAGAATG
ATGATGTTATATTCGCTCAGTATAAACTCAAAACCAGGCTCTACTTTGTCGTTGCTCGTT
GAGAATCAAGGTCGTATAAATTTTGGAAACCGAATTCACGACTTCAAGGGCATACTTGGC
TCTGTGTTATTAAACAATAAAACCTTAGAAGGTCCCTGGTCTGTAACTGGTTACTCATTA
GATGTTAAGAAGAGTAAATTGTTGAGTGATGACAATATCTCTGCCTTCACTGAGGATGCT
TTATCAGACGGTCCCATGATGTTCGAAGGACAGTTCGTGATTCCTGAAGGAGAAGAGCCA
TTGGACACTTTCATTGATACAACCAATTGGGGGAAGGGTTACATATTCGTCAACGGGTAC
AACTTAGGAAGATATTGGCCAAAGGTTGGACCCCAAATTACTCTTTATGTACCAGGTGTA
TGGCTGAAACCAGCACCAGCGGTTAATTCAATAAAAGAAATGGTTGAAAGCGAGGAATTA
AGTTGCTCAGTCGGAAACCCGTGCAGCCTGGTTTTGGCAGATGGCACAGTCTTTAGTGGA
AGAAGTTTCGGGGCCAATGTACCCGTAGAGGGTGAAGTGGTATTCCAAACTGGTATGGTA
GGGTATCCCGAATCATTGACAGACCCTTCCTACCACGCACAGTTGTTGGTCCTTACGTAC
CCATTAATCGGAAACTATGGTGTTCCTGACGATAAGGATAAAGACGAGCATGGGCTGCCA
AGATGGTTTGAATCGAGTCGTATATGGGCTGCTGGGTTAATAGTTGGTCAAATAAGCACT
CAAGCCTGCCACTGGCGTGCGAAACGATCTCTTGGTAAGTGGTTGGAGGCCAACGGTATA
CCTGGCCTCTGTGTTCCCCCTTTTGGACCTCTGCCACCCCTGAAGGATCCAAATTCCCGC
AATTTAGTAGCAGAAGTATCTATAAAGGAACCTAAGATATTTAATGAATCAGGAAAAGTA
ACCATAATGGCTATTGACTGCGGTCTAAAATATAATCAAATAAGATGCTTAATAAAAAGG
AACGCTAGGGTCGTATTAGTGCCTTGGAATTACAAATTTGAGACCAATTCATATGACGGT
CTGTTTATAAGCAACGGTCCTGGTGACCCCGAGGTTTGCAAAAAGGTTGTTGAAAATTTG
AATGATGTAATCAGCAGCGAAACTATTACTAAACCAATATTTGGTATATGTCTCGGTCAT
CAACTGCTTGCGACTGCTGCGGGATGTAAAACCTACAAAACAAAATATGGAAACCGTGGA
CATAATTTACCGTGTACACATTCAGGTACCGGCAGATGTTTTATGACTTCTCAAAATCAC
GGCTACGCTGTTGATGCCAATACCCTCCCTAAAAATTGGGAAATATTGTTTACCAACGAA
AATGACAAAACCAATGAAGGCATAATACACAAGACACTTCCATATTTTAGTGTTCAATTT
CACCCAGAGCATACAGCTGGTCCCACTGATTTAGAATGTCTTTTTGATGTGTTTATTGAT
ACAGTCACAGCATATAAAAACAATGTAACATGTGTTGTGAAAGACTTAATATGTGAAAAA
CTTAAATTTACGCCGACAATTTATGAAAGACCGAAGAAAATATTGATTCTTGGTTCTGGT
GGTTTATCTATTGGGCAGGCAGGTGAATTCGATTATTCTGGATCTCAAGGTGTTAAAGCT
ATGCAAGAAGAAAAAATTCAAACTGTTCTTATTAATCCTAATATTGCAACAGTTCAAACA
TCCAAAGGTCTCGCAGATAAAGTATATTTCTTACCCATTACACCAGAATATGTAGAACAA
GTTATTAAGGCCGAAAGACCAACAGGTATTTTACTCACTTTTGGTGGACAAACAGCTTTA
AATTGTGGAGTGGAATTACAAAAAAACAAAGTATTTGAAAAATACAATGTAAGCGTTTTG
GGAACACCGGTACAATCAATAGTCGACACGGAAGACAGAAAGATATTTGCTGAAAAAATT
AATGCCATTGGAGAAAAAGTTGCACCTAGTGCTGCTGTAGCCTCTATTGAAGAAGCTTTA
AATGCAGCACGTCAAATCGGATATCCGGTTATGACCCGATCGGCGTTTTCGCTTGGAGGT
CTTGGATCAGGATTTGCAAATGATGAAGAAGAGTTAAAAAAACTAGCTCACCATGGGTTA
TCACATTCCGACCAGTTAATTATTGATAAGTCCTTAAAAGGATGGAAAGAAGTTGAATAC
GAAGTTGTGAGGGATGCATATGATAATTGCATAACAGTATGTAATATGGAAAATGTAGAC
CCTTTGGGAATACATACCGGAGAGTCCATTGTTGTTGCTCCTAGTCAAACTTTATCAAAC
AAAGATTATTATATGTTAAGAAATACTGCTATTAAAGTAATCAGACATTTTGGGATTGTT
GGTGAATGCAACATTCAATATGCACTAAATCCTAACTCTGAAGAGTTTTATATCATAGAA
GTGAACGCACGATTATCTAGAAGTTCAGCTTTAGCTAGTAAAGCTACGGGTTATCCGTTA
GCATATGTTGCTGCAAAGCTAGCCCTTGGAATTCCATTACCCGCAATAAAAAACTCTGTA
ACGGGGGTTACAACAGCATGTTTTGAACCGAGTTTAGACTACTGTGTCGTTAAAATTCCA
AGATGGGATTTAGCGAAATTCAATCGAGTTAGTACAAAGATTGGAAGTTCTATGAAAAGT
GTTGGTGAAGTTATGGCCATAGGAAGGAATTTTGAGGAAGCATTCCAAAAAGCTTTAAGA
ATGGTCGATGAAAACGTTAATGGTTTCGATCCGTACCTTAAAAAAGTTAATGAAAATGAA
CTGCGAGAGCCAACAGATAAGCGAATGTTTGTTTTAGCAGCAGCTCTTAGACAAAATTAC
AGTGTTGAAAAATTGTACGAGTTAACTAAAATAGACCGATGGTTCCTTGGAAAATTTAAA
AATATTATAGATTACTATCAAACACTTGAGTCCATAGATTCGGGATCAATTACTTCTGAT
ATATTAAAAACAGCAAAACAAATGGGATTTTCGGATAAACAAATTGCTGTTGCTATTAAA
AGTACAGAATTAGCAGTTCGAAAATTACGAGAAGAATTCAAGATTACTCCGTTTGTTAAA
CAAATAGACACAGTGGCAGCTGAATGGCCCGCATCTACCAATTATTTATATCTTACTTAC
AATGGTTGTACCCATGATTTAGTTTTCCCTAAAGATTTGACTATGGTACTCGGTTCCGGA
GTATATAGAATAGGAAGTTCTGTTGAATTTGATTGGTGTGCTGTTGGCTGCTTAAGAGAA
TTGAAAAAGCAGGGCAAAAAAACAATTATGGTCAATTACAACCCTGAAACTGTCAGCACA
GATTATGACATGAGTGACCGACTATATTTCGAAGAAATTTCCTTTGAAGTTGTTATGGAT
ATATATAACATTGAACAACCTCATGGAGTAATATTATGTATGGGAGGACAGTTACCCAAT
AATATTGCTATGGATTTACACAGACAGCAGGCTGTTATATTAGGAACCTCCCCTGATATG
GTTGATAATGCTGAAAATAGATTTAAATTTTCTCGCATGCTCGACCGTAAGGGTATTCTG
CAACCAAGATGGAAAGAATTAACTAATCTTGATTCAGCAGTAAAGTTCTGTGAAGAAGTT
GGATATCCATGTTTAGTTCGTCCATCATATGTTTTAAGTGGAGCTGCGATGAATGTAGCA
TATTCAAACCAAGATTTGGAAACGTATCTGAAATCTGCTAGTGAAGTGAGTAAAGAACAT
CCAGTAGTGATATCAAAATACATTTCAGACGCAAAAGAAATAGATGTTGATGCCGTTGCC
GCAGATGGGGTTATTCTTTGTATGGCTGTATCAGAACACGTAGAAAATGCTGGAGTGCAT
TCTGGAGATGCTACATTAGTAACACCACCGCAAGACATCAATGATGAAACATTGGACAAA
ATCAAAGAAATAGCGAGAATTATTGCAGAGACACTTGATGTTACCGGGCCATTCAATATG
CAACTTATAGCAAAGGACAATGAATTAAAGGTTATAGAGTGCAATGTGAGGGTTTCAAGA
TCATTCCCATTTGTTTCAAAAACATTGGATCATGATTTTGTGGCAATGGCAACAAAAGTT
ATCCTCGGTTTACCGGTTGAACCTGTGAATATAATGGGTGGCTGTGGAAAAGTTGGCGTC
AAAGTGCCACAATTCTCATTTTCAAGATTATCAGGAGCCGATGTCACACTTGGGGTAGAA
ATGGCATCCACCGGTGAAGTTGCTTGTTTTGGTGAAAATCGTTATGAAGCTTATCTAAAA
TCTTTAATGAGTACTGGCTTTAGAATTCCCAAAAAAGCTATTTTACTTTCTGTGGGAACA
TTTAAGCATAAAATGGAGTTATTGCCAAGTGTTCGAATATTACAAAAATTAGGATATAAA
TTGTATGCCAGTATGGGTACTGGGGATTTCTATATGGAACATGGAGTTGAGATTGAAAGT
GTGCAATGGACTTTTGACCACATTGGGGATCTAGAGGACGATAGATCAGATGGAGAATTA
ATGCATTTAGCCGATTTTATGGCTCGAAGAGAATTGGATTTAGTAATAAACTTGCCTATG
AGAGGTGGAGCCCGGCGCGTCTCTTCATTTACTACACATGGCTATCGAACCCGCCGTTTA
GCCGTAGACTATGCAGTTCCTTTAGTTACTGATGTGAAATGCGCTAAACTTCTAGTTCAG
GCTATGCTGCGGTGTAGTGGTGCGCCGCCAATGAAAACAAAACTTCTAGTTCAGGCAATG
CTGCGGTGCAGTGGTGCGCCGCCAATGAAAACACATACAGATTGTATGACTTCTCGAAAC
ATACTTAAACTACCAGGGTTTATCGATGTTCATGTTCATGTTCGTGAACCAGGGGCGACA
TACAAAGAAGATTTTAATTCCTGTACAGCTGCTGCATTGGCTGGAGGTATCACTATGATT
TGCGCTATGCCAAACACAAATCCTCCGGTAATCGATCGCGTGTCATATGACTATGTTTCC
ACATTGGCACGTGTAAGTGCTCGTTGTGACTACGCTTTATTTGTGGGAGCTTCAACGACG
AATTGTGATACAGCTGCTGAATTAGCACCTCAAGCGGCAGCATTAAAAATGTATCTCAAT
CAAACTTTCACTACCTTAAGGTTGGACGATATGACTGTTTGGCAACGACATCTTCAGAAC
TGGCCCAAAAAAATGCCTATATGTGCTCACGCTGAGCGTGAAAAGACTGGCGCAATCATT
TTGATGGCGTCTCTGTTGGACAGACCTATTCATATATGTCATATCGCTAGGAAAGAAGAA
ATTTTGATCGTGAAAGCGGCCAAAGAAAAAGGACTTAAAGTAACTTGCGAAGTATGTCCA
CATCATCTTTTTTTAAGCACAGATGATGTAAGTAGCATTGGTGAAGGACGTGCTGAAGTC
CGTCCTGTCTTATGTAGCCCACAAGACCAAGCTGAGTTGTGGAAAAATATGGATATTATT
GATGTATTTGCAACAGACCATGCTCCTCATTCCGTCGAAGAAAAGAATTCTGAAAAGCCT
CCCCCAGGCTTCCCTGGTCTTGAAACTATCTTGCCTCTACTTCTAAACGCTGTTCACGAA
GGACGTTTAACAATAGATGATTTAATTAATAAATTTCATAAAAATCCAAGAAGAATTTTT
AATCTACCTGAACAACAAAATACATATGTTGAGGTGGACATGGATTATGAATGGGTAATT
CCTCAGGCATTGGAATTCTCAAAGTCTAAATGGACCCCCTTTGCTGGGAAACGGGTATGT
GGAGCTATTCATCGCGTGACTCTACGAGGCGAAATAGCCTACGTTGAAGGCCAAGTCTTG
GTACCGCCTGGATTTGGTCAAAACGTACGTGACTGGCCAGCACCAAAAAAACTTGCTCAC
CCCAGCATTGTATCTGAGAAACTAGAAAAAGAGACCAGTCGGCCAAACTCTTCATTAGAT
TTTCACAGTTCTCTGGACTTTATTAAAGTTAACGATCTAGACGTAGACCAAGCTGAAGTC
TCTAAGTCAGATCAAAATAAACTGAATGTTCATTTCCACGAAGACTCCGGTTTAAGAAGT
GTTTCACCCTTAATTCCACAAACTACTACTAGACAGAGATTGGACAGTTCGTCTTACCCG
TCTCATGCAGCGCCACCTAATCGTCAAAGAAGTGATTTATTTGGAAAGAGCATATTAACA
GTGGACACATTCGGAAAAGAAACATTGAATGATATATTCAATTTAGCTCAATTTATGAAA
ACTAATGTTACAAAAGGGCGCGTATTAGATGACATTTTACGGGGTAAAGTCATGGCATCA
ATATTCTACGAAGTAAGTACGAGGACGAGTTGTAGTTTCGCCGCTGCTATGCAAAGACTT
GGCGGATCTGTGATCCACACGGACGCAACGAGTTCCTCAGCCAAAAAAGGTGAAACACTA
GAGGACAGTGTGACTGTTATGGCCAGCTACGCAGACGTCGTAGTATTGCGTCATCCAGAA
CCCGGCGCGGTGACACGTGCCTCAAGACACTGTCGGAAACCAATCATAAATGCTGGGGAT
GGTGTAGGTGAACATCCCACTCAAGCCCTTCTGGATGTTTTTACAATTCGAGAGGAAATC
GGTACCGTTAATGGCTTAACCATAACGATGGTCGGTGATTTGAAAAATGGAAGAACCGTT
CACTCTTTAGCCCGACTTCTTACTCTGTATCAAGTACAACTACAATACGTAAGTCCGCCT
GGACTTGGAATGCCTAAACATATAATGGATTACGTAGCATACAAAGGCATTCCTCAAAAA
GTATATGAACGCTTAGAAGATGTTCTCGGCGAAACTCATGTTCTCTATATGACAAGAATT
CAACGTGAACGATTTGAAAGTGAGCAGGAATACGAAAAGATGCGAGGCCTGTTGGTGGTT
ACACCACAACTTATGACACGAGCCAAACGCCGTATGATAGTTATGCATCCACTTCCTCGT
GTCGATGAAATTTCACCAGAGTTTGATACCGATCCACGGGCAGCTTATTTCAGACAAGCG
GAATACGGGATGTATGTCCGTATGGCGTTACTTTCCATGGTCGCTGGAGTTAATCCTCTC
ACGTAA
Protein sequence:
MLATLRFLSVSICVMQLILAQPTTIQETDPIVLQITDNNQGGQLSQKDFQNARNISIVGD
DFMLDGKPLRIVSGSVHYYRLPAEYWRDRLRKIRAAGLNAVSTYVEWSSHEEEEGAYSFE
GDKDIARFLKIAAEENLYVLLRPGPYICAERDLGGLPYWLLSKYPDIKLRTTDGNFIAET
KKWMAKLFEEVKPFLLGNGGPIILVQVENEYGSYGASKEYMKQIRDIIKSHVEDAALLYT
TDGPYRSYFIDGSISGTLTTIDFGPTTSVINTFKELRAYMPVGPLMNSEFYPGWLTHWSE
HIQQVSTDRVTFTLRDMLENKINLNFYVFFGGTNFEFTSGANYGRFYQPDITSYDYDAPL
SEAGDPTEKYYAIRDVLSNYDLVPDDIPVPVPSKKGAYGRIEVANKINLLSTEGRSSLGI
KYKDVEGAKLPTFEELKQRSGLMLYEMTLNGTGGVLNIKKPRDFIFVYVDKKLQGVISRM
MMLYSLSINSKPGSTLSLLVENQGRINFGNRIHDFKGILGSVLLNNKTLEGPWSVTGYSL
DVKKSKLLSDDNISAFTEDALSDGPMMFEGQFVIPEGEEPLDTFIDTTNWGKGYIFVNGY
NLGRYWPKVGPQITLYVPGVWLKPAPAVNSIKEMVESEELSCSVGNPCSLVLADGTVFSG
RSFGANVPVEGEVVFQTGMVGYPESLTDPSYHAQLLVLTYPLIGNYGVPDDKDKDEHGLP
RWFESSRIWAAGLIVGQISTQACHWRAKRSLGKWLEANGIPGLCVPPFGPLPPLKDPNSR
NLVAEVSIKEPKIFNESGKVTIMAIDCGLKYNQIRCLIKRNARVVLVPWNYKFETNSYDG
LFISNGPGDPEVCKKVVENLNDVISSETITKPIFGICLGHQLLATAAGCKTYKTKYGNRG
HNLPCTHSGTGRCFMTSQNHGYAVDANTLPKNWEILFTNENDKTNEGIIHKTLPYFSVQF
HPEHTAGPTDLECLFDVFIDTVTAYKNNVTCVVKDLICEKLKFTPTIYERPKKILILGSG
GLSIGQAGEFDYSGSQGVKAMQEEKIQTVLINPNIATVQTSKGLADKVYFLPITPEYVEQ
VIKAERPTGILLTFGGQTALNCGVELQKNKVFEKYNVSVLGTPVQSIVDTEDRKIFAEKI
NAIGEKVAPSAAVASIEEALNAARQIGYPVMTRSAFSLGGLGSGFANDEEELKKLAHHGL
SHSDQLIIDKSLKGWKEVEYEVVRDAYDNCITVCNMENVDPLGIHTGESIVVAPSQTLSN
KDYYMLRNTAIKVIRHFGIVGECNIQYALNPNSEEFYIIEVNARLSRSSALASKATGYPL
AYVAAKLALGIPLPAIKNSVTGVTTACFEPSLDYCVVKIPRWDLAKFNRVSTKIGSSMKS
VGEVMAIGRNFEEAFQKALRMVDENVNGFDPYLKKVNENELREPTDKRMFVLAAALRQNY
SVEKLYELTKIDRWFLGKFKNIIDYYQTLESIDSGSITSDILKTAKQMGFSDKQIAVAIK
STELAVRKLREEFKITPFVKQIDTVAAEWPASTNYLYLTYNGCTHDLVFPKDLTMVLGSG
VYRIGSSVEFDWCAVGCLRELKKQGKKTIMVNYNPETVSTDYDMSDRLYFEEISFEVVMD
IYNIEQPHGVILCMGGQLPNNIAMDLHRQQAVILGTSPDMVDNAENRFKFSRMLDRKGIL
QPRWKELTNLDSAVKFCEEVGYPCLVRPSYVLSGAAMNVAYSNQDLETYLKSASEVSKEH
PVVISKYISDAKEIDVDAVAADGVILCMAVSEHVENAGVHSGDATLVTPPQDINDETLDK
IKEIARIIAETLDVTGPFNMQLIAKDNELKVIECNVRVSRSFPFVSKTLDHDFVAMATKV
ILGLPVEPVNIMGGCGKVGVKVPQFSFSRLSGADVTLGVEMASTGEVACFGENRYEAYLK
SLMSTGFRIPKKAILLSVGTFKHKMELLPSVRILQKLGYKLYASMGTGDFYMEHGVEIES
VQWTFDHIGDLEDDRSDGELMHLADFMARRELDLVINLPMRGGARRVSSFTTHGYRTRRL
AVDYAVPLVTDVKCAKLLVQAMLRCSGAPPMKTKLLVQAMLRCSGAPPMKTHTDCMTSRN
ILKLPGFIDVHVHVREPGATYKEDFNSCTAAALAGGITMICAMPNTNPPVIDRVSYDYVS
TLARVSARCDYALFVGASTTNCDTAAELAPQAAALKMYLNQTFTTLRLDDMTVWQRHLQN
WPKKMPICAHAEREKTGAIILMASLLDRPIHICHIARKEEILIVKAAKEKGLKVTCEVCP
HHLFLSTDDVSSIGEGRAEVRPVLCSPQDQAELWKNMDIIDVFATDHAPHSVEEKNSEKP
PPGFPGLETILPLLLNAVHEGRLTIDDLINKFHKNPRRIFNLPEQQNTYVEVDMDYEWVI
PQALEFSKSKWTPFAGKRVCGAIHRVTLRGEIAYVEGQVLVPPGFGQNVRDWPAPKKLAH
PSIVSEKLEKETSRPNSSLDFHSSLDFIKVNDLDVDQAEVSKSDQNKLNVHFHEDSGLRS
VSPLIPQTTTRQRLDSSSYPSHAAPPNRQRSDLFGKSILTVDTFGKETLNDIFNLAQFMK
TNVTKGRVLDDILRGKVMASIFYEVSTRTSCSFAAAMQRLGGSVIHTDATSSSAKKGETL
EDSVTVMASYADVVVLRHPEPGAVTRASRHCRKPIINAGDGVGEHPTQALLDVFTIREEI
GTVNGLTITMVGDLKNGRTVHSLARLLTLYQVQLQYVSPPGLGMPKHIMDYVAYKGIPQK
VYERLEDVLGETHVLYMTRIQRERFESEQEYEKMRGLLVVTPQLMTRAKRRMIVMHPLPR
VDEISPEFDTDPRAAYFRQAEYGMYVRMALLSMVAGVNPLT