Monarch geneset OGS2.0

DPOGS210300
TranscriptDPOGS210300-TA10878 bp
ProteinDPOGS210300-PA3625 aa
Genomic positionDPSCF300305 - 75350-105829
RNAseq coverage823x (Rank: top 16%)
Annotation
HeliconiusHMEL0028840.081.44% 
BombyxBGIBMGA013894-TA0.068.10% 
DrosophilaCG15828-PC0.028.30% 
EBI UniRef50UniRef50_UPI0002064CAF0.028.99%UPI0002064CAF related cluster n=1 Tax=unknown RepID=UPI0002064CAF
NCBI RefSeqXP_001850736.10.030.77%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700463600.030.77%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastxgi|1700463600.030.76%conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00053191.2e-68lipid transporter activity
GO:00068691.2e-68lipid transport
KEGG pathway 
InterPro domain[121-410] IPR0152551.2e-68Vitellinogen, open beta-sheet
[119-495] IPR0158192.5e-64Lipid transport protein, beta-sheet shell
[7-124] IPR0110305.3e-19Vitellinogen, superhelical
[426-535] IPR0094545.7e-19Lipid transport, open beta-sheet
[229-255] IPR0158177.7e-19Vitellinogen, open beta-sheet, subdomain 1
[5-86] IPR0017473.1e-12Lipid transport protein, N-terminal
[277-480] IPR0158181.4e-10Vitellinogen, open beta-sheet, subdomain 2
[3081-3220] IPR0018462.8e-08von Willebrand factor, type D domain
Orthology groupMCL10529 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210300-TA
ATGAATATAATTGGCGATTCTCTGATTCCTGTACCCACTAGATTGACGGCTATAGACGCGTTCAGGCGAACACCCTGTACGGAGACCAGGGAATACTTCTTAGAAACATACAGAGCCGACCTCGTTGATGTTGAAGTCCGAATCGCTTCGTATTTGCAAGTCATGAAATGCCCCAATCTAAGTACAATTCGGAAAATTTTCCACTCCCTCAAAAAAGAACCTGTTAACCAAGTGGCAACCTTCGTTTGGAGTCACCTGAACAATTTAGGTCAATCTTCTCTACCATCAAGAGTGGAAATCCAAGGACTTTTATCTGGTAACACAGTACCCCAATTGGAAGATAGTCCAGATTTTAGAATGTTCTCAAGGAATTATGAGCAGAGCGTCTTTTTCGATCAATACAATGCTGGTGGTAACTATGAAGCCAATGTGATATTCTCTCCCGACTCATACATACCCAGATCATTGTCTCTCAATTTGACTGTTGATATGTTTGGAGAATCCATCAATCTGCTGGAGATAAAAGCACGCGGTGAAGGCTTCGAGAGATACTTTGAAAATCTTTTCGGTAACGATGGGCCGCTAAGCAAGAATAAAATAAACGACAAAATCAGCAAAATGCGCTTCTTCCGTTCAACAAACGAAGCTGATGATATTAGGGAAAAAATTGATGATTTAGAATTCAATAATAATGCGCTGAAACACAGGTTCCCCACCGCCGAATTGGGGTTCAAAGTTTTTGGGAACGAGATATCTTTTTGGAGCGCTGAAGGGGACGAGGAGATAAGGAAGTCTTTGGAGAGATTGAACCCCAAGTTGAGAATACTGGAGATATTATCGGGGAAGGAAATTTCCTACAACAAAGCGTCTCTCTTCCTAGACACGACCTTCTCAGTGCCAACGGGCTGCGGTCTGCCGCTTGGCATGAATCTCATGGGAACTAGTTACGTAGACACAAAACTATCAGGGAACGTCGTTGATAAGTTTAGTCAATCTAGGAATTTGGATTTCGAAGGCAAGATACGGCCTAGCGTTGCTGTGAACATAGCGGCTACAATGAGTGTGTCGGCCGGTTCGCTGGCTAATAGCGGAGTGCGGTTGTCGTCACGTCTGTACACAGCAACAGCGCTGGAAGCGAAGCTATCAATACGCGGCCTAGGAGTTATAAGACTAGATTTGTCCTTACCTAGAGAAAAACAGGAGATATTTGCTGCCAAATCGGAGCTGTTAATTCTTCATGGAGATCAAGAGATTCAACAGCAGGGTCTTAACAAGAATAGAATAGAACAGAACACTTGTAGTTGGACTACATTCGACAAAGCAATCGGTATTAAAGTCTGCGCGGCGTATCAGTTCCCGAATATGACGAATTTGAAAAACGCCCCATACTTCATATTAAGCGGCCCAGCAAAGTATATTGTGTCCCTAGAAAAGGCTGATCCCTCGGCTAATACGTACGCCTTCCAATACAAATGGGACAAAAACGACACTACCGATGTACTGGACTTCTCATTCGACACACCGAACAGCAAGGAAAAACGTAAATTCAACGCTGTTCTACTATTATCGAACACTTCAAGCATAGCCGAAGTAACGCTGCTATCTTCGGAGAGCACTATAAAGGCCAAAGCTCTATATAGAAATATGCCTTATGATAAGTCCTTAGAGGCCAGTTTGGACGTAGATGGCAGAAAACAGTTTGACAGTGTTATGGCATTACAACGTCATGATATAAAACATGGCTACGTATGGCTCCCGCATGCCTATTGGGTGGTGAACGATCAAAAGATCGCTGAATTATCAGGTAGTATCAAAGTGAAATCCAAAGGCGGTGTGACTCAATGGGATATTACAGCCGATTTCCAAACAAAGCAGCTAGCGAGTCGCCTCATTGGATATTATACCGTAAACGGACCGACACAAGGCACAAAATTGCAGTTAGATTACCAGTTTTATAAAACTGCCAAGCAAACTATTAAAATTGAGGGTGTTTATAGTAAACGGGCTATGGCTTACCGATATGACGTCTATGGTGAACTATCGATGCAATTTACCGCCTACCCCATATATAACTTCTATTCTGTATTACGTAATGTGCAAACTCAAAATCATCTAGACATAGGTTTGAATGTAAGTTCGTCTAAAGATCTTAAACATGATCCGGCCTTTACGTTTGTATACAAAAAGGTGGATAAACTGAACGGTCTCAAATTAGACACGCAAATGTCCTTAAAAAGACCACGACCGGTGCTTATGAAGTTTCAATATGATCAAACAGGACCAAAATATTCGGCTCTAGCACTTCTGAACTTCAATCCAAAATCTCGTGAGATTTTAATATCAGGTTACGTATTCGCACCTCCTGGCACACAACTCTACATGGACGCCGAACTGAACGTGACTCTACCAACGTTACATCCATGCGCTATAAAGACGAAACTACATGAGAAACGACCTAATGAATTTCAGGTAAACGCAGTTGGCGTCTGGTTTACCGGCGTGGACTTCAACATTGACGCAGTTTACCAAGACACCTCCAAGACAAATCTGGCGTCACACAGGCTCAAGGTCCTGATTAATAGCAGCCACTTCAAAGACATAGCTGTTGACGCTAGATTTACTCAAGATAATAGACAAATAACGTTCATTGCTGAGGGGGAATACAACGAAGACACTTACAAAAGCTTAGTACGATATCTTCTTCTGTCGGAACAAAACTTCACAGCGTACGGTGAAGTTGACATCAGCGGTAGAGCTTACAGTTTGAACCTAAACGCAGATCTCAATAACAACACAAACGTCAACATGGATATACATTTCGATCAACTCCGCGATGTTCATATATCTTACCAAAGATCTGTCACACCAACCCAAAAACGTTTGAGCGCTTCACTCAATTGGGATGCAAACCGAGATCCTTCGCAGAAATTAAGCATAGACGCCCGCTTGAACCACAAAGGACAGTGGCATCACTCCGGGCAGGTCACGCTACACTATCCAGGCCGTGTTGTCAATGGAGAGTTTGAATTTTTGCTTAAAGATTGGTACTGCGAATGGTTGGTCAGAATGGGCTGGGCGAGTCCTACGGCTGTAGTGTGGCGAGTTAAGGCATATTCAGAGGCTCGTGAAGAAACTGTATACGCGCTGTTATCAAACTTGGCCACTCCCTTATCAGGATGGGAAGATACCAGTTTCAATGTGATGTGGCGTTACCGAGATAATCTTCAAGCTATCAATGGCAGTATGAACTGGCAAGAAGATTATTTGGCCTTTAGTTTGTTAGCCGACTATCTCTTCAGTACTTCCAAGTTCTACGGCGAGATCAATGCCTCTGTCAACTCTACTATACCGACTCTACCTAGAGCTGCCGCTGTTGCCAAACATACTGTTGTTTGGAAGAAGAGCGCTGATACTCTATTAAGCTTCCAGTACAACGAAGATGGTCTCCTAATGGTGAATTCCTCATGGGCCATTGACAAGGGTCAGAATGAAAACAATATAACTGGGAGAGTAACGCTTCTTACACCTTTCCAGGGCTATACCAAAGGATTTCTTAGAACAGAATTCATTTTGGGACACAAAAGAGATATAAAAGGCGTCACATATTTAGACTTAGAGGAGAAGGTTGTGAAGATATATGTCAATGGTCATATGCGTCGTATAACTAACTGCATGTTAGTAGTGAACGTCACAAGTCCAATACCACAGTTCTCACAAACAACAGCTAGATTCGGTTTCGTTGAAACAGACAGGCACTTGGTCGCCATGATTGTTACACCGAATTCTACTACTGGGATCGAAGTACTACTGCAATTAAATACGATGCAAGACTTTTCAATATTTGGGCACGTCGCACTTCCCATACAGTACTTGAACAGGGCAATGATTACTGCCAAAAGAGGTTCCGAAGAGGTCGACTTTAGAGTTGGTTGGGGAACAATGGACTTTGGTTTCACAGGAATATGGCAGTGGAAGAACATATTGAACTTCGTGTACGTCTACAAATTGTACACTCCGTTAGACGGATTCGAAGAAAACGGCCTGGTATTGAAAAATATATATGGAAACGGTCTGGATACTGAGCTTTCAGTACGACTTTCTAAGCACAAGTTCGGCATATCAATACTGTTAGTTGATAACGGCAAAGGTCTTTTGGACGTAATCAAGGACAGTTTCCAACATAAAGCTGGTGATTCTGATATGTTCGTGGAAAATTTCGACACGGAGGCCACTATTAATTTGGACACCCTGTATTATCCTACTATAAACTTCCACGCTCATATGTTAAAGTTTGTTGGTCCCGACGAAGAAGATATATTGGAAGCGAACGCAACACTCCACCTCCCAAACAAATCACCGATTGTACTGACAGATGTTTTCATACTAGAGGAATACACAGTTATGAGGAACACTTTGAATTTAGTGACACCATTCCAAGCGGTGAAGCAATTAAAGTCTGTATACACAGTGGACATTGCGATAGGAGAGAAATTTAATGTGACCTGTGTAGCCTTGCTTTATAATGGGACGTATTGGCATGATATATCATATAAAATATACTACGAGACGGAAACGGGTGAAGATGAAGCGTACAAATCATATCTGGCCTCCGTGGGTATAGCGACACCGCTGGCAGTTCTGCCAGCCTTAGAGGCTAGAGTCTCGGCCCGTCTGGAAGATGCTTTATGGAAATTGTCCGCAGATATAGCTATGCCGTCGTTCACTGTTACCGCACTCGCAAGACTTGAGCTAGATGATCCGTTTGTCGAAACTTCAGGCAGTTTGAACCTGACCTCGCTTTATTTAGAAGATTACTTTATCAAGATGCAATTCAAAAAGGATTTCTCTGACGTTGAGAGCGTTGTCGGCGGCGGAATTCACATACAGCAGGGCGAACAAAATAATTATGTGTTCGCTGACGTAGTATGGCGTCCGCCGCCCTCTCGCCACGTTCGCTTCACAGCTCGCGGCGCCCTCGTGCCGGTACTGGAACCAGCGGAGATAGCGTTCCAATTCTCAGAGGAAGAACGCACGAGAACCCTCACACTAGACCTAACCAGCGCTGACGGTTACTACTCCTTGAAGGCGGACCAAAGACCGTTATCAATAAACGTAGCTCTTAGTACGCCGCATAAGAGATTTAGAGCAATGAAAATAATCGGTGAGTTCGCGGGAGAAGACATCAAAGGATCGTTCATAACAGACACCACCGAATATAGCGTTAGCGGAAAAATGGTCAATAAAAATCCCCTTGAGCTATCTCTGATTTTGGTACCTAAAGGTCAAGGTCAACAAATATCAATCCGAATTAAATGTGAGTCATCCCCGACATCCTACGCTTTGACGGCACACATTATAGGCCCCATAGAAGCCACAGTACGGGCGAAGGCTGAGGTCGAAAAGAATTACACTGATATATTCTTTAAGGTTGATCTGCCAAAAGTGAGGAGTAAAGAGATCTTCTTCAAGACTCGCGTTGATTCCTACCCCATGTTACGTCGCGTGGTTAGCGTACAAGCCGCTTCACCCATACAAAAACTCAGTTTCGTCAAGGGTGATGCAGATTTTGTTTTTGGTCCTAAAACCGGTTATCTTCTATGTAAATACGAGCTTCCTGATATGAAAGGGGATGGGGATTTAAAATGGAGCTTTTTACTGGGCGATTTATATATACGGGCGATAGGTGACCAATTGGTAAAGCAAATACAAAGGAGCGTTGACTTAGATATTTACTTTGGAAACACAACGGAGGAGGGCCTGCCTAAAACGAACGCTGGTTTTAGAATGGATTTGGATCATATCTGGCAAATAGGTGCGAATGCGTCATTCGGCTTCATAATGGAACAGCGTCTGAACCTGTTCATCAACGCCGTTCTTCCGAAGCCAAATGTTGATGTACATTCATTGATGTTAGACGCACGTCTCAACGAGTCGCCTGTGACCGTGGAGGCGTTATACTACACGGACGTTACTGCGGTCAAAGCCGGTCTCAAAGGACAGTTGCTGAGCCTGGCAGAGAGTTTCGATGGTAATGCAACAGTCATTTGGACGGCAAATTCCCAGCACAAGAGCATAGACAATATCGTTTCGTACAAATGGGATGTGAACGGATCTAAGTACATCGAGTACTCGCTCAACACTCCCTTGCACGAAACGCAAAATACCTTCAAACTAAAAGGCTCATATCAAAGGGATTTCGTTCACGGATATCAAGTTGTTAAAGGTCGCATGCATGCGCCCGGTACGAGACAAATCGGTGAAGTAGATATCACGTACGGCGGTGTTAGGCACACTGATGGATACTTCAATATGACAACACCATTCACTACACTGCCCTGGCTCAAGAGCATATTTGATATAAATAATGCTGAAGAGATTTCCGACAACAAAGTCGACTTGTTCTGGCCAAACAAATCTGCTACAGTTAACACGACGCACGTTTACAAAAAATTTGACAAAGGATTCACTCAAACCGGCACTATATCACTTTCCGTTCCATTGAACACTAAACACTTGGTCAACACGAAATATTATTATATAGAGGAAGAAAAAACAAGCAATGGTAATGCAACGATAGATTTTGATCAAGAGAGATTCGTGAAGGGCTCGTTTAATAAGGTACTTAGTAAGAGTGAGAGGAATTTGGACCTTGAGACTATGAACATAGAGGTCGAAAACGTACATACACCAGTAGGTGTTAAATATATCCACGAATACGATGACACTGGTAATGTTGACGTAAAACAAGCGACAGTATTCCACTTACTGAACGCAACCAAATTCAACGTGACGGGAAAACTAGATACTCACACATACGACATCGGCAAAGAGATAAAATTAACAGCAATACACGGGAATCGAACCTGGAACTTCGAGAACAAATACGAAGCGGCTGATAAGGAATTGAAACAGGGCAGCAAAGTCACGTGGGCCGAGGACGTGTGGATACATTACGACATACATGTTACTAACATGAGTGAGGCGGACACAGAATCTCAGAACATAATTCTGAACATCCTGTATCCAAGGCGAACGTTCCAAGCTCGTGGAGTGTACCGTCTACAGGATGCTCTGTTAGATGGTAATATTGTATTACTGTGGGATGTCAGAGGTGAGAACAAGACTGCGGAGTTGAAAGGCAAATGGGAAAATCCAGCGATGGAGGGAGGGAATTTACATGATATGTATCTAGCCTTATCTCACCCATCTTTCAGAAAGGATGTAACACTCAAAGGCCAATATCTAACAACGGCGTCGGTGATGTCAAATCTGTCTATGGAACTGCAATATTCGGACTACGAGAAGGAGTATTTGAAAGTGCAATCAATACTCATGGATAATTCCAACGGACCAATCAGAGACTATAAATACATTTTAAGATGCACTCATCCGGCCACGAGCCTGGATTTGGATATGAAATCGGACATAAACATCCACAGCCGCTGGTACTTCATAGACAACTACTACAGATTCCAGAAATCAATGTTCTATGAGAAATTGAGAAGCAACAAACTATTGATAGATTTAAATAATAGCGGCGTTACCTGGGAGCGTGCAAACGAAACTTACTTCTACAAGATGAATGGCACGTGGTCTTTGGTTTACCCTCGATATATAGCAAGTGGTTTGATCCGACGACCCAACGCCAACGACACCTTAGTAGCTGAACTATCAATGGTGGATAAGTCTTTGGTGGCGCACTACAACAGCACTGATGATATATCATACCACTTGATAGGCAAAATCATAGACACTAGATCTGCTAGGTTAGACTCGTGGAGGAACTACGATGACGTCACAACCGTAGATCTAGCTTCTTACATACGTTTGAATCACTCACGCCTTCTAACCAGCGCTGTAGTATGGCGGCCGGAAATATTCAGCGAGGCTAAATCACAAGCTATATATACACTGAAGATATTATATGAGCAAATCAATGATACATTACTCGTTATCAAAGAAGCTCCGATGGAAGCACATCTTGCTTTAAGGAATATATGGAGCGATGCGAAGCCGAGGGTCCGGGAGTTTCTAGACGATTTGAACGATTTACACGTTATTAAGGACGATTTGGACGAATTCGAAGGCTTCTTGAAAGAATCGTACGATCACAATGACTTTTACGTTAAGGATATAGTGGAATTCACGTATTACGTGCTAGACGAAATGGCTATAAGGAATCATTTGGAAAGTTTGCCCGGATTCGTAAACGACATGTGGGGAATGATGGGCAACACCAGCCAGTCCATCAAACAGAGCCTAACATACGTCGTTGATTCAATAAAGAAGGCGTACGCTAATTTCTTGGAAACCGTCAACAAAATCCTGGAAGCCGATTTGATGGAGCTCGTGTCGGATAAACTGGAAGCCATGATATTACAATACGATAACTTCATTAGAGATCTGCACATGAAGTTTTTGGATTACTGGGAAGAGACCTGGGTCAACGCTACCACGAGATTGTCGAAATACTGGCACGATTTACTAAAGTCCATAGAGCCGTTGTTCTTCAAAGTGGTCCACTACAGTGAAGCGTTTGTTGTGACTATCTGGCGAGGCATAATGGACTTTTTCCATGAGAGGACCCACGAATTGACGGATACGCCTTACTTCAATTACGTCTCGACGTTCGGTCACGAAATGGATCGCATATACAAAGACCTCATACACAACGACATCATAACAAACATAAAGAAGTACACCAAGAAACTGGTTAATGTTATTTGGGCCAAAATAGAAAAATACATTCCGTTTACCGAAGAATTCAAACAGTTGTTCAATGAATTCAAGAACGCCTGGGAAAACTTCCTGAAGACTCCGCAAGTGGTCTACGTGAGGGAGAAGTACAATGAAGCCTACGTCCGTCTTCGATGGTGGTATGACTATTTCCTGATCGGCGAAGCTTTGGACCAAATTTGGGGAATAGTTTACGCGAAGGTCACCGACCTCGCGAAAACCGCACTACAATACGAAGAACTACACAGAACTCCCAAAACGAACTTTATATTCGATCCGCAGAAGGGTGAAATAATTCTTGAACAGAAATTACCGATGTCGTGGCACGCGTTCAACAGGACACCAGACTTCAGTGAAATATCGGAATACAAAGCTGTCAGAGATTTCATGGACCAATGGTTGATCAGTAACAGGAGTGTATGGGCCTATTACTACGATATAAGACCCTATATGGACTTCAATAATATCCTGCCACCTTTTGCGGGTATGGCAATGATGACGGCTCAAGGAACACTGGTGACCTTCGACAAACAAGTATTCACGGTAACCGAACCCGGGACCTTCCTCCTGACTAAGGATTACAGACAAAACAACTTCACCATACTAATGGAAAGTAACGATCAGGGGAGATATGATCTAGTTATATTGACGAAGAAGAATTTGGTCTTTATCGACCTGTACAGACAGCAAGTGTCGCTCGGTCGGACAATGCCATTGAATCTACCCGCAGTGATTGACGATCTGATAGTAGACAGACAGACTGACATCATATCAGTCGAGGGTCACAGCGGCTTGGGAGTGGAGTGCAATCTACTGTACCATACATGCAAGTTAGAAGTCGCGGGTTGGTTCTACGCTAGTCTTGGCGGTCTCCTCGGTACGTACAATAATGAACAGTACGATGAACTGCAACTACCCTCCGGTATCATGCAGACAGATAAACGAGCTCTGTCAAAATCATGGGCTATTCGGCACAGCAACGGGACCATCGCTGATAGAACAAATAACACGGCCTGTGACAAATTCTTCAGGAATAAAGTATCGCCGCTTCATCCTTGCTTCACATTGATCGATGCCGTTCCATTCCATTCTGAGTGTATGTCGGGAGCGGACGCCTGTTCCCTGGCAGGTGCATACCTCCAGCTATGCGAACACCAGCACGTGCCAGCGCATATACCAGATCACTGTGTCCAGTGTACAACACCGACAGGAGACATCATAGAAGAAGGATCGTTTCTCCAACTCCAGAATATTCCATCTTCAATGGATGTGGTATTCGTGGTTGAAGCCCAGAACTGTAATAAAAATATACGTAAGGCCAAAAACATCGATTTGTTCGTGGAGACATTGGACAGCAAACTACAAGGAAATGGGTTCTCTGATAACAGATACGCTGTAGTTGTATACGGAGGTCGCGGCGTGTTCAGTCGCGCTCGAGCTCTGTACGTCAACAACAAGCCGTTCACAGACGCGGTTGATATACCGAGATATTTTGAAGCATTCCAGATCGAAAAGTCAACAGCTGACCGTAACCGGTCCTCGGAGGCGTTGCGTGCTCTACAGACAGTGAGCACCCTGCCCCTAAGAGCGGGCGTACCTCGAATTGTCATACTGTTCCCGTGCCGCTCGTGTGGCAGCGGAGAGGAGCTGGACTACTCAACCATCTACCACAATTTAATGGAGAACTCGATCACGCTCCACATACTTATGGACGGCGATTTTTCACTGTCAAAGAAACGAGTCGCTAAATATTTGTTTGGAATGGATAATTCCGTCGCCTACACTAATAAAGACTACGAGCGGCTGACAGGCGACGCCGGGTTGAAGAAACAAGTTAGATTACCAAAGGAAAAACTTGGACTGTGTAGCTCATTGGCTCTAGAAACTAATGGTACAATCTGGGCGGGTTCCAAATTAGAGTCGGACCGCGCTGCCGCTCGTCGTTTCTCGACGGTTTGCGGCGGTAGAGTTTCCCGTGTCACGCCGTGTGCCGCTCCACGCTGCGAATGCCGTAACGCAGCGTTACACTGCAGGCCGTGCGCCAATCACGATCCGTTGGAGCTATCTTTTTGGAACTCCGATGACATCGATGAACTCATTGACCTCGCAATGGATCCACCAACATTACCCTCGTTTAGATGA

Protein sequence:

>DPOGS210300-PA
MNIIGDSLIPVPTRLTAIDAFRRTPCTETREYFLETYRADLVDVEVRIASYLQVMKCPNLSTIRKIFHSLKKEPVNQVATFVWSHLNNLGQSSLPSRVEIQGLLSGNTVPQLEDSPDFRMFSRNYEQSVFFDQYNAGGNYEANVIFSPDSYIPRSLSLNLTVDMFGESINLLEIKARGEGFERYFENLFGNDGPLSKNKINDKISKMRFFRSTNEADDIREKIDDLEFNNNALKHRFPTAELGFKVFGNEISFWSAEGDEEIRKSLERLNPKLRILEILSGKEISYNKASLFLDTTFSVPTGCGLPLGMNLMGTSYVDTKLSGNVVDKFSQSRNLDFEGKIRPSVAVNIAATMSVSAGSLANSGVRLSSRLYTATALEAKLSIRGLGVIRLDLSLPREKQEIFAAKSELLILHGDQEIQQQGLNKNRIEQNTCSWTTFDKAIGIKVCAAYQFPNMTNLKNAPYFILSGPAKYIVSLEKADPSANTYAFQYKWDKNDTTDVLDFSFDTPNSKEKRKFNAVLLLSNTSSIAEVTLLSSESTIKAKALYRNMPYDKSLEASLDVDGRKQFDSVMALQRHDIKHGYVWLPHAYWVVNDQKIAELSGSIKVKSKGGVTQWDITADFQTKQLASRLIGYYTVNGPTQGTKLQLDYQFYKTAKQTIKIEGVYSKRAMAYRYDVYGELSMQFTAYPIYNFYSVLRNVQTQNHLDIGLNVSSSKDLKHDPAFTFVYKKVDKLNGLKLDTQMSLKRPRPVLMKFQYDQTGPKYSALALLNFNPKSREILISGYVFAPPGTQLYMDAELNVTLPTLHPCAIKTKLHEKRPNEFQVNAVGVWFTGVDFNIDAVYQDTSKTNLASHRLKVLINSSHFKDIAVDARFTQDNRQITFIAEGEYNEDTYKSLVRYLLLSEQNFTAYGEVDISGRAYSLNLNADLNNNTNVNMDIHFDQLRDVHISYQRSVTPTQKRLSASLNWDANRDPSQKLSIDARLNHKGQWHHSGQVTLHYPGRVVNGEFEFLLKDWYCEWLVRMGWASPTAVVWRVKAYSEAREETVYALLSNLATPLSGWEDTSFNVMWRYRDNLQAINGSMNWQEDYLAFSLLADYLFSTSKFYGEINASVNSTIPTLPRAAAVAKHTVVWKKSADTLLSFQYNEDGLLMVNSSWAIDKGQNENNITGRVTLLTPFQGYTKGFLRTEFILGHKRDIKGVTYLDLEEKVVKIYVNGHMRRITNCMLVVNVTSPIPQFSQTTARFGFVETDRHLVAMIVTPNSTTGIEVLLQLNTMQDFSIFGHVALPIQYLNRAMITAKRGSEEVDFRVGWGTMDFGFTGIWQWKNILNFVYVYKLYTPLDGFEENGLVLKNIYGNGLDTELSVRLSKHKFGISILLVDNGKGLLDVIKDSFQHKAGDSDMFVENFDTEATINLDTLYYPTINFHAHMLKFVGPDEEDILEANATLHLPNKSPIVLTDVFILEEYTVMRNTLNLVTPFQAVKQLKSVYTVDIAIGEKFNVTCVALLYNGTYWHDISYKIYYETETGEDEAYKSYLASVGIATPLAVLPALEARVSARLEDALWKLSADIAMPSFTVTALARLELDDPFVETSGSLNLTSLYLEDYFIKMQFKKDFSDVESVVGGGIHIQQGEQNNYVFADVVWRPPPSRHVRFTARGALVPVLEPAEIAFQFSEEERTRTLTLDLTSADGYYSLKADQRPLSINVALSTPHKRFRAMKIIGEFAGEDIKGSFITDTTEYSVSGKMVNKNPLELSLILVPKGQGQQISIRIKCESSPTSYALTAHIIGPIEATVRAKAEVEKNYTDIFFKVDLPKVRSKEIFFKTRVDSYPMLRRVVSVQAASPIQKLSFVKGDADFVFGPKTGYLLCKYELPDMKGDGDLKWSFLLGDLYIRAIGDQLVKQIQRSVDLDIYFGNTTEEGLPKTNAGFRMDLDHIWQIGANASFGFIMEQRLNLFINAVLPKPNVDVHSLMLDARLNESPVTVEALYYTDVTAVKAGLKGQLLSLAESFDGNATVIWTANSQHKSIDNIVSYKWDVNGSKYIEYSLNTPLHETQNTFKLKGSYQRDFVHGYQVVKGRMHAPGTRQIGEVDITYGGVRHTDGYFNMTTPFTTLPWLKSIFDINNAEEISDNKVDLFWPNKSATVNTTHVYKKFDKGFTQTGTISLSVPLNTKHLVNTKYYYIEEEKTSNGNATIDFDQERFVKGSFNKVLSKSERNLDLETMNIEVENVHTPVGVKYIHEYDDTGNVDVKQATVFHLLNATKFNVTGKLDTHTYDIGKEIKLTAIHGNRTWNFENKYEAADKELKQGSKVTWAEDVWIHYDIHVTNMSEADTESQNIILNILYPRRTFQARGVYRLQDALLDGNIVLLWDVRGENKTAELKGKWENPAMEGGNLHDMYLALSHPSFRKDVTLKGQYLTTASVMSNLSMELQYSDYEKEYLKVQSILMDNSNGPIRDYKYILRCTHPATSLDLDMKSDINIHSRWYFIDNYYRFQKSMFYEKLRSNKLLIDLNNSGVTWERANETYFYKMNGTWSLVYPRYIASGLIRRPNANDTLVAELSMVDKSLVAHYNSTDDISYHLIGKIIDTRSARLDSWRNYDDVTTVDLASYIRLNHSRLLTSAVVWRPEIFSEAKSQAIYTLKILYEQINDTLLVIKEAPMEAHLALRNIWSDAKPRVREFLDDLNDLHVIKDDLDEFEGFLKESYDHNDFYVKDIVEFTYYVLDEMAIRNHLESLPGFVNDMWGMMGNTSQSIKQSLTYVVDSIKKAYANFLETVNKILEADLMELVSDKLEAMILQYDNFIRDLHMKFLDYWEETWVNATTRLSKYWHDLLKSIEPLFFKVVHYSEAFVVTIWRGIMDFFHERTHELTDTPYFNYVSTFGHEMDRIYKDLIHNDIITNIKKYTKKLVNVIWAKIEKYIPFTEEFKQLFNEFKNAWENFLKTPQVVYVREKYNEAYVRLRWWYDYFLIGEALDQIWGIVYAKVTDLAKTALQYEELHRTPKTNFIFDPQKGEIILEQKLPMSWHAFNRTPDFSEISEYKAVRDFMDQWLISNRSVWAYYYDIRPYMDFNNILPPFAGMAMMTAQGTLVTFDKQVFTVTEPGTFLLTKDYRQNNFTILMESNDQGRYDLVILTKKNLVFIDLYRQQVSLGRTMPLNLPAVIDDLIVDRQTDIISVEGHSGLGVECNLLYHTCKLEVAGWFYASLGGLLGTYNNEQYDELQLPSGIMQTDKRALSKSWAIRHSNGTIADRTNNTACDKFFRNKVSPLHPCFTLIDAVPFHSECMSGADACSLAGAYLQLCEHQHVPAHIPDHCVQCTTPTGDIIEEGSFLQLQNIPSSMDVVFVVEAQNCNKNIRKAKNIDLFVETLDSKLQGNGFSDNRYAVVVYGGRGVFSRARALYVNNKPFTDAVDIPRYFEAFQIEKSTADRNRSSEALRALQTVSTLPLRAGVPRIVILFPCRSCGSGEELDYSTIYHNLMENSITLHILMDGDFSLSKKRVAKYLFGMDNSVAYTNKDYERLTGDAGLKKQVRLPKEKLGLCSSLALETNGTIWAGSKLESDRAAARRFSTVCGGRVSRVTPCAAPRCECRNAALHCRPCANHDPLELSFWNSDDIDELIDLAMDPPTLPSFR-