Monarch geneset OGS2.0

DPOGS205071
TranscriptDPOGS205071-TA8835 bp
ProteinDPOGS205071-PA2944 aa
Genomic positionDPSCF300074 - 23938-44563
RNAseq coverage392x (Rank: top 31%)
Annotation
HeliconiusHMEL0121180.085.59% 
BombyxBGIBMGA006816-TA0.083.48% 
Drosophilar-PA0.067.42% 
EBI UniRef50UniRef50_F4X7C60.068.34%CAD protein n=312 Tax=Eumetazoa RepID=F4X7C6_ACREC
NCBI RefSeqXP_972190.10.071.77%PREDICTED: similar to carbamoyl-phosphate synthase large chain [Tribolium castaneum]
NCBI nr blastpgi|910901530.071.77%PREDICTED: similar to carbamoyl-phosphate synthase large chain [Tribolium castaneum]
NCBI nr blastxgi|910901530.071.87%PREDICTED: similar to carbamoyl-phosphate synthase large chain [Tribolium castaneum]
Group
Gene OntologyGO:00068074.3e-125nitrogen compound metabolic process
GO:00040864.3e-125carbamoyl-phosphate synthase activity
GO:00059757.9e-123carbohydrate metabolic process
GO:00038247.9e-123catalytic activity
GO:00431697.9e-123cation binding
GO:00045532.5e-114hydrolase activity, hydrolyzing O-glycosyl compounds
GO:00165971.9e-100amino acid binding
GO:00167431.9e-100carboxyl- or carbamoyltransferase activity
GO:00065201.9e-100cellular amino acid metabolic process
GO:00040702.3e-100aspartate carbamoyltransferase activity
GO:00062072.3e-100'de novo' pyrimidine base biosynthetic process
GO:00055241.9e-88ATP binding
GO:00168741.9e-88ligase activity
GO:00081524.6e-81metabolic process
GO:00065417.5e-34glutamine metabolic process
GO:00168107.3e-20hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds
GO:00167871.6e-08hydrolase activity
KEGG pathwaytca:6609000.0 
 K11540 (CAD)maps-> Alanine, aspartate and glutamate metabolism
    Pyrimidine metabolism
InterPro domain[1093-2145] IPR0062750Carbamoyl-phosphate synthase, large subunit
[709-1066] IPR0062744.3e-125Carbamoyl-phosphate synthase, small subunit
[119-446] IPR0137817.9e-123Glycoside hydrolase, subgroup, catalytic core
[126-444] IPR0019442.5e-114Glycoside hydrolase, family 35
[2633-2939] IPR0061301.9e-100Aspartate/ornithine carbamoyltransferase
[2640-2937] IPR0020822.3e-100Aspartate carbamoyltransferase, eukaryotic
[115-450] IPR0178533.2e-95Glycoside hydrolase, superfamily
[1290-1482] IPR0138161.9e-88ATP-grasp fold, subdomain 2
[1215-1418] IPR0054794.6e-81Carbamoyl-phosphate synthetase, large subunit, ATP-binding
[1106-1120] IPR0054835.2e-63Carbamoyl-phosphate synthase, large subunit, CPS-domain
[1479-1631] IPR0054802.6e-60Carbamoyl-phosphate synthetase, large subunit, oligomerisation
[1615-1737] IPR0138171.2e-59Pre-ATP-grasp fold
[708-847] IPR0024742.8e-54Carbamoyl-phosphate synthase, small subunit, N-terminal
[1631-1751] IPR0161854.4e-48PreATP-grasp-like fold
[887-1063] IPR0179262.1e-46Glutamine amidotransferase type 1
[2639-2782] IPR0061325.2e-42Aspartate/ornithine carbamoyltransferase, carbamoyl-P binding
[2787-2936] IPR0061313.3e-36Aspartate/ornithine carbamoyltransferase, Asp/Orn-binding domain
[885-899] IPR0013177.5e-34Carbamoyl-phosphate synthase, GATase domain
[564-701] IPR0089796.1e-30Galactose-binding domain-like
[2014-2175] IPR0116072.7e-27Methylglyoxal synthase-like domain
[1764-1820] IPR0138154.9e-21ATP-grasp fold, subdomain 1
[2417-2538] IPR0110597.3e-20Metal-dependent hydrolase, composite domain
[1095-1210] IPR0054812.6e-16Carbamoyl-phosphate synthase, large subunit, N-terminal
[2186-2234] IPR0066801.6e-08Amidohydrolase 1
Orthology groupMCL10572 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205071-TA
ATGACACATTTATCTTCTAATAAAGTAAGTCATTTTCAAGTTTCCAAAGGAGTGGCGAAGCGGACGGTCGTGTGCAACTGGCTCTTTCGCTCTAATAACCTAGCTTCGCCCGTGATTGTGAAGTCGGGGCCGAGAGTTCACTGGCACTCGAGACTCTACCCGGGGTCATTGCCGCCAATGAAATATTTACCAAACATGTTGGCAACATTGCGATTTCTATCCGTATCAATATGTGTGATGCAATTGATTTTGGCACAACCGACGACCATCCAAGAAACAGATCCAATAGTGTTACAGATAACAGACAACAACCAAGGAGGTCAGCTATCGCAAAAAGACTTTCAGAACGCACGTAATATAAGCATTGTAGGTGATGACTTTATGCTCGATGGCAAACCGCTCCGTATTGTGTCAGGATCTGTGCACTACTACAGACTACCGGCAGAATATTGGAGAGATAGGTTACGGAAAATTAGAGCCGCCGGTTTAAACGCTGTTTCCACATATGTGGAGTGGAGCAGTCATGAGGAAGAAGAAGGTGCCTATTCTTTTGAAGGTGACAAGGATATTGCCAGATTTCTAAAGATTGCCGCAGAAGAGAATTTGTATGTGCTACTTCGACCTGGGCCGTACATTTGTGCTGAAAGAGATCTAGGTGGACTACCGTATTGGCTTTTGAGCAAATATCCTGATATCAAATTGCGAACTACAGACGGAAATTTTATAGCAGAAACCAAAAAGTGGATGGCTAAACTATTTGAAGAAGTTAAACCTTTTCTATTAGGCAACGGAGGCCCTATCATATTGGTCCAGGTAGAAAATGAGTATGGCAGTTACGGAGCTTCAAAGGAGTATATGAAGCAAATTCGGGACATAATAAAATCTCACGTAGAAGACGCAGCACTACTTTACACCACCGATGGTCCCTACAGATCATACTTCATCGACGGATCTATATCCGGTACACTTACAACTATAGATTTCGGACCAACGACTAGTGTTATTAACACTTTTAAAGAGCTGAGAGCATACATGCCTGTTGGTCCTTTGATGAATTCCGAATTCTATCCTGGATGGCTAACACATTGGAGTGAACACATTCAGCAGGTGTCCACTGACCGTGTAACTTTTACTCTTCGAGACATGTTGGAAAACAAGATTAATTTAAATTTTTACGTTTTCTTTGGAGGAACTAACTTCGAATTTACATCTGGTGCTAACTATGGAAGATTTTATCAACCCGATATAACATCTTATGATTATGATGCTCCATTATCCGAAGCTGGCGATCCTACGGAGAAGTATTACGCTATACGAGACGTACTGTCCAATTACGATTTAGTGCCAGATGATATACCAGTTCCAGTACCATCAAAAAAAGGAGCCTATGGACGAATTGAAGTAGCAAACAAAATTAATCTGCTATCAACAGAAGGACGTTCCAGTTTAGGAATTAAATACAAAGACGTGGAAGGTGCGAAATTACCAACGTTTGAGGAATTGAAACAAAGAAGCGGTCTCATGCTTTATGAAATGACACTCAATGGAACCGGTGGAGTTTTGAATATAAAAAAACCACGAGATTTCATATTTGTTTACGTTGATAAGAAACTGCAAGGAGTTATAAGCAGAATGATGATGTTATATTCGCTCAGTATAAACTCAAAACCAGGCTCTACTTTGTCGTTGCTCGTTGAGAATCAAGGTCGTATAAATTTTGGAAACCGAATTCACGACTTCAAGGGCATACTTGGCTCTGTGTTATTAAACAATAAAACCTTAGAAGGTCCCTGGTCTGTAACTGGTTACTCATTAGATGTTAAGAAGAGTAAATTGTTGAGTGATGACAATATCTCTGCCTTCACTGAGGATGCTTTATCAGACGGTCCCATGATGTTCGAAGGACAGTTCGTGATTCCTGAAGGAGAAGAGCCATTGGACACTTTCATTGATACAACCAATTGGGGGAAGGGTTACATATTCGTCAACGGGTACAACTTAGGAAGATATTGGCCAAAGGTTGGACCCCAAATTACTCTTTATGTACCAGGTGTATGGCTGAAACCAGCACCAGCGGAAATGGTTGAAAGCGAGGAATTAAGTTGCTCAGTCGGAAACCCGTGCAGCCTGGTTTTGGCAGATGGCACAGTCTTTAGTGGAAGAAGTTTCGGGGCCAATGTACCCGTAGAGGGTGAAGTGGTATTCCAAACTGGTATGGTAGGGTATCCCGAATCATTGACAGACCCTTCCTACCACGCACAGTTGTTGGTCCTTACGTACCCATTAATCGGAAACTATGGTGTTCCTGACGATAAGGATAAAGACGAGCATGGGCTGCCAAGATGGTTTGAATCGAGTCGTATATGGGCTGCTGGGTTAATAGTTGGTCAAATAAGCACTCAAGCCTGCCACTGGCGTGCGAAACGATCTCTTGGTAAGTGGTTGGAGGCCAACGGTATACCTGGCCTCTGTGATATTGATACCCGAGCCCTTACATTCCGTCTCCGGGAAGGAGTAACTCTTGGGAGGATTGTACAAGGTGTTCCCCCTTTTGGACCTCTGCCACCCCTGAAGGATCCAAATTCCCGCAATTTAGTAGCAGAAGTATCTATAAAGGAACCTAAGATATTTAATGAATCAGGAAAAGTAACCATAATGGCTATTGACTGCGGTCTAAAATATAATCAAATAAGATGCTTAATAAAAAGGAACGCTAGGGTCGTATTAGTGCCTTGGAATTACAAATTTGAGACCAATTCATATGACGGTCTGTTTATAAGCAACGGTCCTGGTGACCCCGAGGTTTGCAAAAAGGTTGTTGAAAATTTGAATGATGTAATCAGCAGCGAAACTATTACTAAACCAATATTTGGTATATGTCTCGGTCATCAACTGCTTGCGACTGCTGCGGGATGTAAAACCTACAAAACAAAATATGGAAACCGTGGACATAATTTACCGTGTACACATTCAGGTACCGGCAGATGTTTTATGACTTCTCAAAATCACGGCTACGCTGTTGATGCCAATACCCTCCCTAAAAATTGGGAAATATTGTTTACCAACGAAAATGACAAAACCAATGAAGGCATAATACACAAGACACTTCCATATTTTAGTGTTCAATTTCACCCAGAGCATACAGCTGGTCCCACTGATTTAGAATGTCTTTTTGATGTGTTTATTGATACAGTCACAGCATATAAAAACAATGTAACATGTGTTGTGAAAGACTTAATATGTGAAAAACTTAAATTTACGCCGACAATTTATGAAAGACCGAAGAAAATATTGATTCTTGGTTCTGGTGGTTTATCTATTGGGCAGGCAGGTGAATTCGATTATTCTGGATCTCAAGGTGTTAAAGCTATGCAAGAAGAAAAAATTCAAACTGTTCTTATTAATCCTAATATTGCAACAGTTCAAACATCCAAAGGTCTCGCAGATAAAGTATATTTCTTACCCATTACACCAGAATATGTAGAACAAGTTATTAAGGCCGAAAGACCAACAGGTATTTTACTCACTTTTGGTGGACAAACAGCTTTAAATTGTGGAGTGGAATTACAAAAAAACAAAGTATTTGAAAAATACAATGTAAGCGTTTTGGGAACACCGGTACAATCAATAGTCGACACGGAAGACAGAAAGATATTTGCTGAAAAAATTAATGCCATTGGAGAAAAAGTTGCACCTAGTGCTGCTGTAGCCTCTATTGAAGAAGCTTTAAATGCAGCACGTCAAATCGGATATCCGGTTATGACCCGATCGGCGTTTTCGCTTGGAGGTCTTGGATCAGGATTTGCAAATGATGAAGAAGAGTTAAAAAAACTAGCTCACCATGGGTTATCACATTCCGACCAGTTAATTATTGATAAGTCCTTAAAAGGATGGAAAGAAGTTGAATACGAAGTTGTGAGGGATGCATATGATAATTGCATAACAGTATGTAATATGGAAAATGTAGACCCTTTGGGAATACATACCGGAGAGTCCATTGTTGTTGCTCCTAGTCAAACTTTATCAAACAAAGATTATTATATGTTAAGAAATACTGCTATTAAAGTAATCAGACATTTTGGGATTGTTGGTGAATGCAACATTCAATATGCACTAAATCCTAACTCTGAAGAGTTTTATATCATAGAAGTGAACGCACGATTATCTAGAAGTTCAGCTTTAGCTAGTAAAGCTACGGGTTATCCGTTAGCATATGTTGCTGCAAAGCTAGCCCTTGGAATTCCATTACCCGCAATAAAAAACTCTGTAACGGGGGTTACAACAGCATGTTTTGAACCGAGTTTAGACTACTGTGTCGTTAAAATTCCAAGATGGGATTTAGCGAAATTCAATCGAGTTAGTACAAAGATTGGAAGTTCTATGAAAAGTGTTGGTGAAGTTATGGCCATAGGAAGGAATTTTGAGGAAGCATTCCAAAAAGCTTTAAGAATGGTCGATGAAAACGTTAATGGTTTCGATCCGTACCTTAAAAAAGTTAATGAAAATGAACTGCGAGAGCCAACAGATAAGCGAATGTTTGTTTTAGCAGCAGCTCTTAGACAAAATTACAGTGTTGAAAAATTGTACGAGTTAACTAAAATAGACCGATGGTTCCTTGGAAAATTTAAAAATATTATAGATTACTATCAAACACTTGAGTCCATAGATTCGGGATCAATTACTTCTGATATATTAAAAACAGCAAAACAAATGGGATTTTCGGATAAACAAATTGCTGTTGCTATTAAAAGTACAGAATTAGCAGTTCGAAAATTACGAGAAGAATTCAAGATTACTCCGTTTGTTAAACAAATAGACACAGTGGCAGCTGAATGGCCCGCATCTACCAATTATTTATATCTTACTTACAATGGTTGTACCCATGATTTAGTTTTCCCTAAAGATTTGACTATGGTACTCGGTTCCGGAGTATATAGAATAGGAAGTTCTGTTGAATTTGATTGGTGTGCTGTTGGCTGCTTAAGAGAATTGAAAAAGCAGGGCAAAAAAACAATTATGGTCAATTACAACCCTGAAACTGTCAGCACAGATTATGACATGAGTGACCGACTATATTTCGAAGAAATTTCCTTTGAAGTTGTTATGGATATATATAACATTGAACAACCTCATGGAGTAATATTATGTATGGGAGGACAGTTACCCAATAATATTGCTATGGATTTACACAGACAGCAGGCTGTTATATTAGGAACCTCCCCTGATATGGTTGATAATGCTGAAAATAGATTTAAATTTTCTCGCATGCTCGACCGTAAGGGTATTCTGCAACCAAGATGGAAAGAATTAACTAATCTTGATTCAGCAGTAAAGTTCTGTGAAGAAGTTGGATATCCATGTTTAGTTCGTCCATCATATGTTTTAAGTGGAGCTGCGATGAATGTAGCATATTCAAACCAAGATTTGGAAACGTATCTGAAATCTGCTAGTGAAGTGAGTAAAGAACATCCAGTAGTGATATCAAAATACATTTCAGACGCAAAAGAAATAGATGTTGATGCCGTTGCCGCAGATGGGGTTATTCTTTGTATGGCTGTATCAGAACACGTAGAAAATGCTGGAGTGCATTCTGGAGATGCTACATTAGTAACACCACCGCAAGACATCAATGATGAAACATTGGACAAAATCAAAGAAATAGCGAGAATTATTGCAGAGACACTTGATGTTACCGGGCCATTCAATATGCAACTTATAGCAAAGGACAATGAATTAAAGGTTATAGAGTGCAATGTGAGGGTTTCAAGATCATTCCCATTTGTTTCAAAAACATTGGATCATGATTTTGTGGCAATGGCAACAAAAGTTATCCTCGGTTTACCGGTTGAACCTGTGAATATAATGGGTGGCTGTGGAAAAGTTGGCGTCAAAGTGCCACAATTCTCATTTTCAAGATTATCAGGAGCCGATGTCACACTTGGGGTAGAAATGGCATCCACCGGTGAAGTTGCTTGTTTTGGTGAAAATCGTTATGAAGCTTATCTAAAATCTTTAATGAGTACTGGCTTTAGAATTCCCAAAAAAGCTATTTTACTTTCTGTGGGAACATTTAAGCATAAAATGGAGTTATTGCCAAGTGTTCGAATATTACAAAAATTAGGATATAAATTGTATGCCAGTATGGGTACTGGGGATTTCTATATGGAACATGGAGTTGAGATTGAAAGTGTGCAATGGACTTTTGACCACATTGGGGATCTAGAGGACGATAGATCAGATGGAGAATTAATGCATTTAGCCGATTTTATGGCTCGAAGAGAATTGGATTTAGTAATAAACTTGCCTATGAGAGGTGGAGCCCGGCGCGTCTCTTCATTTACTACACATGGCTATCGAACCCGCCGTTTAGCCGTAGACTATGCAGTTCCTTTAGTTACTGATGTGAAATGCGCTAAACTTCTAGTTCAGGCTATGCTGCGGTGTAGTGGTGCGCCGCCAATGAAAACAAAACTTCTAGTTCAGGCAATGCTGCGGTGCAGTGGTGCGCCGCCAATGAAAACACATACAGATTGTATGACTTCTCGAAACATACTTAAACTACCAGGGTTTATCGATGTTCATGTTCATGTTCGTGAACCAGGGGCGACATACAAAGAAGATTTTAATTCCTGTACAGCTGCTGCATTGGCTGGAGGTATCACTATGATTTGCGCTATGCCAAACACAAATCCTCCGGTAATCGATCGCGTGTCATATGACTATGTTTCCACATTGGCACGTGTAAGTGCTCGTTGTGACTACGCTTTATTTGTGGGAGCTTCAACGACGAATTGTGATACAGCTGCTGAATTAGCACCTCAAGCGGCAGCATTAAAAATGTATCTCAATCAAACTTTCACTACCTTAAGGTTGGACGATATGACTGTTTGGCAACGACATCTTCAGAACTGGCCCAAAAAAATGCCTATATGTGCTCACGCTGAGCGTGAAAAGACTGGCGCAATCATTTTGATGGCGTCTCTGTTGGACAGACCTATTCATATATGTCATATCGCTAGGAAAGAAGAAATTTTGATCGTGAAAGCGGCCAAAGAAAAAGGACTTAAAGTAACTTGCGAAGTATGTCCACATCATCTTTTTTTAAGCACAGATGATGTAAGTAGCATTGGTGAAGGACGTGCTGAAGTCCGTCCTGTCTTATGTAGCCCACAAGACCAAGCTGAGTTGTGGAAAAATATGGATATTATTGATGTATTTGCAACAGACCATGCTCCTCATTCCGTCGAAGAAAAGAATTCTGAAAAGCCTCCCCCAGGCTTCCCTGGTCTTGAAACTATCTTGCCTCTACTTCTAAACGCTGTTCACGAAGGACGTTTAACAATAGATGATTTAATTAATAAATTTCATAAAAATCCAAGAAGAATTTTTAATCTACCTGAACAACAAAATACATATGTTGAGGTGGACATGGATTATGAATGGGTAATTCCTCAGGCATTGGAATTCTCAAAGTCTAAATGGACCCCCTTTGCTGGGAAACGGGTATGTGGAGCTATTCATCGCGTGACTCTACGAGGCGAAATAGCCTACGTTGAAGGCCAAGTCTTGGTACCGCCTGGATTTGGTCAAAACGTACGTGACTGGCCAGCACCAAAAAAACTTGCTCACCCCAGCATTGTATCTGAGAAACTAGAAAAAGAGACCAGTCGGCCAAACTCTTCATTAGATTTTCACAGTTCTCTGGACTTTATTAAAGTTAACGATCTAGACGTAGACCAAGCTGAAGTCTCTAAGTCAGATCAAAATAAACTGAATGTTCATTTCCACGAAGACTCCGGTTTAAGAAGTGTTTCACCCTTAATTCCACAAACTACTACTAGACAGAGATTGGACAGTTCGTCTTACCCGTCTCATGCAGCGCCACCTAATCGTCAAAGAAGTGATTTATTTGGAAAGAGCATATTAACAGTGGACACATTCGGAAAAGAAACATTGAATGATATATTCAATTTAGCTCAATTTATGAAAACTAATGTTACAAAAGGGCGCGTATTAGATGACATTTTACGGGGTAAAGTCATGGCATCAATATTCTACGAAGTAAGTACGAGGACGAGTTGTAGTTTCGCCGCTGCTATGCAAAGACTTGGCGGATCTGTGATCCACACGGACGCAACGAGTTCCTCAGCCAAAAAAGGTGAAACACTAGAGGACAGTGTGACTGTTATGGCCAGCTACGCAGACGTCGTAGTATTGCGTCATCCAGAACCCGGCGCGGTGACACGTGCCTCAAGACACTGTCGGAAACCAATCATAAATGCTGGGGATGGTGTAGGTGAACATCCCACTCAAGCCCTTCTGGATGTTTTTACAATTCGAGAGGAAATCGGTACCGTTAATGGCTTAACCATAACGATGGTCGGTGATTTGAAAAATGGAAGAACCGTTCACTCTTTAGCCCGACTTCTTACTCTGTATCAAGTACAACTACAATACGTAAGTCCGCCTGGACTTGGAATGCCTAAACATATAATGGATTACGTAGCATACAAAGGCATTCCTCAAAAAGTATATGAACGCTTAGAAGATGTTCTCGGCGAAACTCATGTTCTCTATATGACAAGAATTCAACGTGAACGATTTGAAAGTGAGCAGGAATACGAAAAGATGCGAGGCCTGTTGGTGGTTACACCACAACTTATGACACGAGCCAAACGCCGTATGATAGTTATGCATCCACTTCCTCGTGTCGATGAAATTTCACCAGAGTTTGATACCGATCCACGGGCAGCTTATTTCAGACAAGCGGAATACGGGATGTATGTCCGTATGGCGTTACTTTCCATGGTCGCTGGAGTTAATCCTCTCACGTAA

Protein sequence:

>DPOGS205071-PA
MTHLSSNKVSHFQVSKGVAKRTVVCNWLFRSNNLASPVIVKSGPRVHWHSRLYPGSLPPMKYLPNMLATLRFLSVSICVMQLILAQPTTIQETDPIVLQITDNNQGGQLSQKDFQNARNISIVGDDFMLDGKPLRIVSGSVHYYRLPAEYWRDRLRKIRAAGLNAVSTYVEWSSHEEEEGAYSFEGDKDIARFLKIAAEENLYVLLRPGPYICAERDLGGLPYWLLSKYPDIKLRTTDGNFIAETKKWMAKLFEEVKPFLLGNGGPIILVQVENEYGSYGASKEYMKQIRDIIKSHVEDAALLYTTDGPYRSYFIDGSISGTLTTIDFGPTTSVINTFKELRAYMPVGPLMNSEFYPGWLTHWSEHIQQVSTDRVTFTLRDMLENKINLNFYVFFGGTNFEFTSGANYGRFYQPDITSYDYDAPLSEAGDPTEKYYAIRDVLSNYDLVPDDIPVPVPSKKGAYGRIEVANKINLLSTEGRSSLGIKYKDVEGAKLPTFEELKQRSGLMLYEMTLNGTGGVLNIKKPRDFIFVYVDKKLQGVISRMMMLYSLSINSKPGSTLSLLVENQGRINFGNRIHDFKGILGSVLLNNKTLEGPWSVTGYSLDVKKSKLLSDDNISAFTEDALSDGPMMFEGQFVIPEGEEPLDTFIDTTNWGKGYIFVNGYNLGRYWPKVGPQITLYVPGVWLKPAPAEMVESEELSCSVGNPCSLVLADGTVFSGRSFGANVPVEGEVVFQTGMVGYPESLTDPSYHAQLLVLTYPLIGNYGVPDDKDKDEHGLPRWFESSRIWAAGLIVGQISTQACHWRAKRSLGKWLEANGIPGLCDIDTRALTFRLREGVTLGRIVQGVPPFGPLPPLKDPNSRNLVAEVSIKEPKIFNESGKVTIMAIDCGLKYNQIRCLIKRNARVVLVPWNYKFETNSYDGLFISNGPGDPEVCKKVVENLNDVISSETITKPIFGICLGHQLLATAAGCKTYKTKYGNRGHNLPCTHSGTGRCFMTSQNHGYAVDANTLPKNWEILFTNENDKTNEGIIHKTLPYFSVQFHPEHTAGPTDLECLFDVFIDTVTAYKNNVTCVVKDLICEKLKFTPTIYERPKKILILGSGGLSIGQAGEFDYSGSQGVKAMQEEKIQTVLINPNIATVQTSKGLADKVYFLPITPEYVEQVIKAERPTGILLTFGGQTALNCGVELQKNKVFEKYNVSVLGTPVQSIVDTEDRKIFAEKINAIGEKVAPSAAVASIEEALNAARQIGYPVMTRSAFSLGGLGSGFANDEEELKKLAHHGLSHSDQLIIDKSLKGWKEVEYEVVRDAYDNCITVCNMENVDPLGIHTGESIVVAPSQTLSNKDYYMLRNTAIKVIRHFGIVGECNIQYALNPNSEEFYIIEVNARLSRSSALASKATGYPLAYVAAKLALGIPLPAIKNSVTGVTTACFEPSLDYCVVKIPRWDLAKFNRVSTKIGSSMKSVGEVMAIGRNFEEAFQKALRMVDENVNGFDPYLKKVNENELREPTDKRMFVLAAALRQNYSVEKLYELTKIDRWFLGKFKNIIDYYQTLESIDSGSITSDILKTAKQMGFSDKQIAVAIKSTELAVRKLREEFKITPFVKQIDTVAAEWPASTNYLYLTYNGCTHDLVFPKDLTMVLGSGVYRIGSSVEFDWCAVGCLRELKKQGKKTIMVNYNPETVSTDYDMSDRLYFEEISFEVVMDIYNIEQPHGVILCMGGQLPNNIAMDLHRQQAVILGTSPDMVDNAENRFKFSRMLDRKGILQPRWKELTNLDSAVKFCEEVGYPCLVRPSYVLSGAAMNVAYSNQDLETYLKSASEVSKEHPVVISKYISDAKEIDVDAVAADGVILCMAVSEHVENAGVHSGDATLVTPPQDINDETLDKIKEIARIIAETLDVTGPFNMQLIAKDNELKVIECNVRVSRSFPFVSKTLDHDFVAMATKVILGLPVEPVNIMGGCGKVGVKVPQFSFSRLSGADVTLGVEMASTGEVACFGENRYEAYLKSLMSTGFRIPKKAILLSVGTFKHKMELLPSVRILQKLGYKLYASMGTGDFYMEHGVEIESVQWTFDHIGDLEDDRSDGELMHLADFMARRELDLVINLPMRGGARRVSSFTTHGYRTRRLAVDYAVPLVTDVKCAKLLVQAMLRCSGAPPMKTKLLVQAMLRCSGAPPMKTHTDCMTSRNILKLPGFIDVHVHVREPGATYKEDFNSCTAAALAGGITMICAMPNTNPPVIDRVSYDYVSTLARVSARCDYALFVGASTTNCDTAAELAPQAAALKMYLNQTFTTLRLDDMTVWQRHLQNWPKKMPICAHAEREKTGAIILMASLLDRPIHICHIARKEEILIVKAAKEKGLKVTCEVCPHHLFLSTDDVSSIGEGRAEVRPVLCSPQDQAELWKNMDIIDVFATDHAPHSVEEKNSEKPPPGFPGLETILPLLLNAVHEGRLTIDDLINKFHKNPRRIFNLPEQQNTYVEVDMDYEWVIPQALEFSKSKWTPFAGKRVCGAIHRVTLRGEIAYVEGQVLVPPGFGQNVRDWPAPKKLAHPSIVSEKLEKETSRPNSSLDFHSSLDFIKVNDLDVDQAEVSKSDQNKLNVHFHEDSGLRSVSPLIPQTTTRQRLDSSSYPSHAAPPNRQRSDLFGKSILTVDTFGKETLNDIFNLAQFMKTNVTKGRVLDDILRGKVMASIFYEVSTRTSCSFAAAMQRLGGSVIHTDATSSSAKKGETLEDSVTVMASYADVVVLRHPEPGAVTRASRHCRKPIINAGDGVGEHPTQALLDVFTIREEIGTVNGLTITMVGDLKNGRTVHSLARLLTLYQVQLQYVSPPGLGMPKHIMDYVAYKGIPQKVYERLEDVLGETHVLYMTRIQRERFESEQEYEKMRGLLVVTPQLMTRAKRRMIVMHPLPRVDEISPEFDTDPRAAYFRQAEYGMYVRMALLSMVAGVNPLT-