Monarch geneset OGS2.0

DPOGS214120
TranscriptDPOGS214120-TA8364 bp
ProteinDPOGS214120-PA2787 aa
Genomic positionDPSCF300014 - 1641811-1655548
RNAseq coverage387x (Rank: top 31%)
Annotation
HeliconiusHMEL0113870.093.52% 
BombyxBGIBMGA006169-TA0.085.38% 
DrosophilaNipped-A-PA0.049.35% 
EBI UniRef50UniRef50_E2AUX10.062.19%Transformation/transcription domain-associated protein n=8 Tax=Formicidae RepID=E2AUX1_CAMFO
NCBI RefSeqXP_001607204.10.067.19%PREDICTED: similar to ENSANGP00000029084 [Nasonia vitripennis]
NCBI nr blastpgi|3454801610.067.21%PREDICTED: transformation/transcription domain-associated protein-like [Nasonia vitripennis]
NCBI nr blastxgi|3454801610.067.23%PREDICTED: transformation/transcription domain-associated protein-like [Nasonia vitripennis]
Group
Gene OntologyGO:00054882.9e-90binding
KEGG pathway 
InterPro domain[2332-2333] IPR0119892.9e-90Armadillo-like helical
[776-1755] IPR0160245.9e-21Armadillo-type fold
Orthology groupMCL12356 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214120-TA
ATGATGGCTTCAATGTCGACCGGCGGGCCCGGGGACCCGCATACACAGATGAATACCTATCGAAGCTATGTGACTATGCTAGCAGACCCCGGTGCCAAAGACGAGATAAAATTAAAAGCTGCCCAGGAGCTTAGTGAAAATTTTGAGGTTATTTTGAGTTCTCCCCAGTATCCACAGTTTTTGGATCATTCTTTAAAAATTTTTCTAAAAATATTGCAAGAAGGAGAACCACATTTTATAGCGGAGTACAATATACAGCAAGTAAGAAAACTTATACTTGAGATGATACATAGGCTTCCAATCAGTGAGACTTTGAGGCCATATGTAAAAAGCATACTTATACTTATGCTTAAATTGATGGAAATAGAAAATGAAGAAAATGTTCTTGTATGCTTAAAAATTTTCATGGAACTACATAAGCAATATAGACCTGCTTACACCACAGATGTTGATATTCACAAGTTTCTGCAGTGGGTGAAGGGAATATACTCAGATTTGCCGAACCATTTGCCTAAAATATTTGAACCCAAACCAACAATCAGAGTTAAGGACTTATCTGAGGTCAATATAGACCAGCTTTTACAAGAAACTTACACAACAACCCCTATACACACTGAGAAGAAATTGTTGGATGGAAGTGTGGTCACTTATAATCTTATACCAAGATCAGTATTATCTCTGAAAGTAATACAAGAGCTGCCAATTATAGTTGTTCTGATGTATCAATTGTATAAACAAAATGTACATCAAGAGGTTAGCAACTTTATACCCTTGATTATGGAAACTATAACCTTGCAGCCAGCAACAACTCACAGGCAGTCATCGTCATTTAATAAGGAAGTGTTTGTGGACTTTATGGGAGCACAAATCAAAACATTGGCCTTTTTAGCATACATAATAAGAATATATCAAGATACAATTGCAAATCATGCTAATTTAATGGTTAAGGGAATAATTGGACTTTTGACATTATGTCCACCTGAAGTGGCACATCTGAGAAAAGAATTAGTAATTGCCACCCGCCATATTTTAGCTACCGATTTAAGACTAAAATTCGTTCCATATATGGAGCGTCTTTTTGATGAGGAAGTGTTACTTGGTGGCGGCTGGACGGTACATGAATCTTTGCGGCCTCTTGCTTATTCAACTTTGGCTGACTTAGTGCATCATGTTCGTCAACATTTACCTTTAACGGATCTTGCCATTGCAGCTCATTTATTTTCTAAAAATGTCCATGATGAATCCTTGCCCACCAGCATCCAGACTATGTCGTGCAAGCTGTTACTAAACTTGGTTGACTGCATCCGTCAACGCTCTGACAGTGAAGCCGGTGCGCCGCAGGGAAGACACTTGCTCATGCGTATACTTGAAGTGTTTGTACTAAAGTTTAAAACAATTTCTAAACTACAGCTACCTGCATTAATGGCTAAATGCAAACAAAACACGCCTGCAACACCGAATGGGACAAATGGTACACCGAATCCGAGTACGCCTACAACCCCGGCTCCTAGTACATCGTCGGCGGAAGTTAAAGTTGAGGAAGAGAAACCTACACCGGATTTCTTAGATAGCATAAACAAGACTGAGGAAAAATCTAAAATCGGATTCCCAACATCTCAAATGACCAACCTCAATGTCGGAGATTACAGAACTCTCGTAAAAACGCTGGTCTGCGGTGTTAAAACTATCACATGGGGCTGTGCTTCATGTAAAACAACAACCAACACGGAGGGTGCTGCAACAACGACTATAACTGGTCAGAAGCAGTTCAGTGCGCGGGAGACGTTAGTGTTCATCAGGTTGGTGCGATGGGGCTTACAATCTTTGGACATCTACACGCTGGCGGCACCACGAGCACCCGCCCAGCCACCCGCACAACCACACGTCCGCTCCAAGGAAGAGAAGGAAGTTCTAGAACATTTCAGTGGGGTTTTCTCCATGATGAACCCGCAAACGTTTCAAGAGATTTTCACTGCGACGATATCTCATATGGTTGAAAGAATTAACAAAAACCCTACTTTGCAAATTATTGCAAACACGTTTTTATCCAATCCGGCGACATCGCCAATATTCGCTACGGTGCTTGTAGAATATCTGTTAAAGAGAATGGAAGAAATGGGTACTAATATAGAGAGATCGAATTTATATCTAAGACTATTTAAGCTTGTTTTTGGTTCTGTTAGCTTGTTTCCCACTGAGAATGAACAAATGCTCCGCCCTCATTTACATAGTATAGTAAACAAAGCCATGGACTATGCCATGACAGCAAAAGAACCTTACAATTATTTCTTGCTTTTAAGAGCATTATTCCGTTCCATCGGTGGGGGTAGTCATGACTTGTTATACCAAGAATTCTTGCCATTGTTACCTAATTTGCTAGAAGGTTTGAACCGCCTACAGAGTGGTCTTCATAAACAACATATGAAAGATCTGTTTGTTGAGTTGTGCTTGACCGTGCCTGTGAGACTTAGCAGTCTTTTGCCTTATTTGCCAATGTTAATGGATCCTTTGGTATCTGCTCTAAATGGATCTCACACCTTAATTTCACAAGGTTTGCGAACCTTGGAACTATGTGTGGATAACTTACAACCTGACTTCTTGTATGAGCATATCCAGCCAGTTAGAGCAGACCTAATGCAAGCGCTTTGGAGGACGCTGCAGAATAATGAAGTTGCGCGAATAGCATTCAGAGTGCTAGGGAAGTTTGGCGGAGGAAATAGAAAAATGATGATTGAACCACAGAGGCTGGAGTACCGCGAAACGGACGCCCCTCCGCCTGCCGTCCAAGCATATTTCCAAGATCAACCAAAACCCATAGACTTCGAGGTAGATAAAGTTATAGAAACCGCATTTTCCGCGCTCAAGTCAAGCACAACAGATCCTTTTTACCGCCGTCAATGTTGGGAAGTCCTACGTTGTTATTTGGCTGCCTCTTTGAATCTTGACGACGACAAGGCAACTCTTCAAAAACTGTTCAATCATCCCAGTTTCCTGGAAGGCAAAATACCGGCACAAAACGGACCTTACTATAAATGTACTAACAGCATTGTTAGAAACACTCATCGTACTGCTCTTACGGGAATGTTTGTCGCCGCAGCTATTAAGGAATTACGACATCATGTTCTTCCCACTATGGTGTCTCTTGTTCGTCATTACACATTAGTTGCTATCGCCCAAGAAGCTGGGCCTTTTGCAGGTTCAGGTGGTCCTAAGGAAGGCCTCGATGCTTTAGTCCTAGTTGATGCTATAGCGGTAGTAATGGGCCATGAAGAGAAGGAACTATGCAAGCCAGGTCACTTGGCTTTAGTGTTAATGATCGAAACAGCTGCTACAGTTTTAGGAAGCAAAGAGCGCGCTTGCCGATTGCCATTAATGGAGTACTTAGCAGAGCGCATGTCTGCACTATGTTATGAAAGGGCATGGTATGAAAAGCTAGGCGGATGTATTGCTGTAAAATTCATGTTTGAGAAAATGGCTCCGGAATGGGTGTACAAGCACGTGTTCACCTTTTTAAAGGCTGTATTATTTGTTATGATGGATCTTACCGGTGAAGTGTCTTCGGGTGCCATCGATATGGCCACTGTTAACCTGGAGCGTTTGGTCCGTGTTTGTGTGACCGGCCCGGGCGGACAAGGCGTTGAACCGGAAGGGGAAGTAGCAGCGGCTAAAGCACGTGCTCTACACGATGTCTTGCAGGAACTTGTCTTACAAGTCACAAGTCCGCATCTTCTTGTTAGGCAACAGGCGATGAAATCTTTGGAGTTAATTGCGGAATTGCAAAATAAGACGGTCACTGAAGTAATGGATCCACACAGAGAGGTGTTGGCTGACATTATACCACCTAAAAAACATTTACTTCGTCATCAACCAGCCAATGCGCAAATGGGACTTATGGACGGCACCACATTCTGTACGACCCTCAAACCGAGACTTTTCACTATCGATTTGAACATTAACGAACACAAGGTATTCTTTCGTGAACTATTGTCCCTGTGCGAGGCGGAGGACGCTATGTTAGGCAAGCTGCCGTGCTATAAAGGCGTCAACCTCGTGCCTTTACGGACCTCCGCTTTACGAGCACTCGCCGCCTGCCATTATATTCAGGAGAAGCCTTGCCGGGAAAAGATATTCCAAGTGCTTTATAAGAGTCTCGAGAAGAACGACCAGGAATTACAACAGGCGGGATTCGAGTGTATGCAGAAATTCCTATCAGGTTTCCAAATCGACATGGAGATGGTGCATCCTGTGATGCGACCCTTGTTACTTACTCTGGGAGACCATCGCAACCTCAGTGTCAACGGAGCTAAACGATTATCCTTTCTGACTCAACTCTTCCCCTCGACCTTTAGCGAGAAGTTGTGTGAACAATTGCTGCAGTTGCTGAAGAAGCTCCTCGACTACTCCATCCAAACAAACAGAGGGGGCAATTTCTTACAAAGCGTGTCGAAGAATATGGAAAATGAACAGAAAATTATAATACTAATTGGCATTTTCCACCAAATACCCGCCGCGTCGCCACGTTTCATCGATGTGCTGTGTCGTCTTATCTTCCACACGGAGAAGTCGCTCATGATAGAAGCCGGATCTCCCTTTAGAGAGCCGCTTGTAAAGTTTTTGTTGCGGTATCCTAAAGAGACCTTGGACTTTATAATGAGTGACAACAACATTAAAGACCAGCAATGGAGTCGTTTCCTCGTGTTTTTAGTGAAGCATTCAGAAGCGGGTCCCGCTTTCAGGGAAGCTCTACATACCACGAAAAAAGCTAGGCTAATGCAATTATTAGCGGCTAACAGCGGCGCGGCAGCTATACCACAGGCTGATAAAGCTGAAATGCAGTTCCAAGCTGTTAGGGTTATATCGCTGCTGATAAAGTATGACGACCAGTGGCTGTCAACGCAACACGATCTCATCGAGCTTTTGAAAAGAATATGGTGCAGCGATCAGTATCATGAAGTACATAAGAAAGTTGAAAACGTCGACTGCACCCACTGGAAGGAACCGAAACTTATTGTTAAAATCTTGTTACACTACTTTTGTCATCATCCGTCCAACATTGATCTATTGTTCCAATTGTTGCGAGCATTGTGTGATAGATTCATACCAGATTTCCAATTCCTACGAGATTTCTTAGAGAACACAGTGGCACAGAATTACACGGTGGAATGGAAGCGATCAGCGTTCTTCCGCTTTGTGGAACACTTCGCTAGCGACGCCATGTCGCAGGAGTTGAAGGCGAAGGTTCTACAAATGATTCTAATACCGTGCTTCGCTGTTAGCTTCGACAAGGGACAGAAGATTGTGGGCGGCCCACCGGCGCCCTATCAGGACAATCCAGATAACGTCGTCTCGGTGTTCATCAATAACGTGATCGACCCCGAGAATCCGTTCGCGTGTTCGGACGCGGTCCGTATTTCGCTTCTCCAGTTCGCGTGTCTTCTGCTGGAACAGGCCTCCCCACACATACACGACGCTAACAATAAGAAACAAGGCAACAAACTCCGCAGACTCATGACCTTCGCCTGGCCGTGTCTCCTGGCTAAGAACTACGTGGATCCCGCTACCAGATATCACGGTCATCTCCTGTTAAGTCACATTATTGCCAAGTTTGCGATACACAAAAGGATTGTTTTGCAAGTTTTTCATAGTCTCCTCAAAGCGCACGCGGTGGAGGCGCGGGCAGTGGTCCGCGCTGCTCTGGAGATCCTCACACCAGCCATGCCTCAGCGGATGGAGGATGGTAACACTATGCTCACACACTGGACCAAGAAGATCATAGTAGAAGACGGACATTCAGTGCAACAACTTTTCCACATTTTGCAACTCGTTGTGCGCCATTATAAGGTGTACTACCCAGTGAGGCATGCTCTGGTGGGTCACATGGTAGCGGCTATGCAGCGGTTGGGCTTCTCAGCGACCGCCTCGCTCGAACACCGCCGTCTCGCTGTAGACCTGGCCGAGGTCGCGACGTCCCCAAGCGGCGCCATGAAACGCGTGTCGTCGGACGAGAGCGGTAATGAAGCCCGCAAAGCGCTGAACACGGGCTGGGCCAGTCCGCAGGCGAGCGTGTCCAGGCTCGAGCCTGATGCAGCCAAGCCGCTTGACAGACAACACGTTGATGTCGTTGTCAATTTGCTGCTGAGGCTCGCTTGTCAGGTAAACGAGGGTGGAGTGACGGGTGCCAGTGCGGCGGGAGCTGCGGGTGCGGGCGGGGCGGGGGGTTCCCCGGGCGAACAACTGTCACGACGCTGCGTGCTACTGCTGCGAGCCGCGCTCAAGCCTGATGTGTGGCCACACCTCTGCGAGCCCAAACTGGCCTGGCTAGATAAAGTATTTTCGACGGCGGACACGAATGCAGCGGCTTGTGCTAACGCATGCACAGCCCTTGAACTGCTAGTGTTCCTCCTTGGCGTGCTCCGACGAGAGCAGATCCTCGCCGCTCTGAAACCTCTACAACGCGGTCTGGCGGCGTGCGTCGCCTCTACTAACACCAAGATAGTGCGCCTCACACACAACCTACTCGCTAAACTCACTGCCCTGTTCCCAACTGAACCTTCGGGGGCCGCTCAAGCGTCAAAGTATGAAGAGTTAGAAACGTTATACGCCAGCGTGAGCAAATACGCGTTCGAAGGCCTGGCGGCGTATGAGAAGTCTGCCGGCGCTACTGGTGCGACCGCAAGTGGAGCTGCGAGTGGGGCAGCCGCTTTATTAGGACCTCTCATGATGTTGAAGGCCTGCTGTGCGTCTAGTCCGGGATACGTGGACCGCTTGCTATTACCTCTCATGAGGGTTCTACAGAGAATGGCTAGAGACCACGTCGCTCCAAACCCTGATGCAGCAACTTCAGATTTACTCATATTGGCCCTTGATCTCCTCAAAGCTAGGGTATCTGTGATGCCAGTTGAAACGAGGAAAACCTTTATTGGTACAATACTAGTGGGTCTCATTGAAAAAACTAACGATGCTAAGGTAATGAAAGCCATTACCCGTATGGTTGAAGAATGGGTGAAATGGAAGGGCACGGGAGCTGGTGCTGCTCCGTCGTTGAGAGAAAAGTCTATACTGCTCGTGAAGCTCATGCAATACGTTGAGAAGCGCTTCCCCGACGACCTAGAGTTGAACGCACATTTTCTGGATCTCATTAACTATGTTTATCGAGATGAGCACCTAAAAATGACGGAATTGTCCATGAAGTTAGAACCGGCATTCTTGGCCGGACTACGATGCCCGCAACCTCATATACGGGCTAAATTCTTTGAAGTGTATGACGGTAGCGTGCGGAAAAGGGTGTTCGACCGCCTGCTTTACATTATATGTTCTCAAAACTGGGAACACATTGGACAACATTTTTGGATTAAACAGTGCCTCGAGTTACTACTTGTTACATGTGTTTCTAGCACACAAATAAGATTGTCCAATTCAAAATATTTGCTGCCAAACATAACCGCCGTGATCAACTGGGCGGATAGTGAAGAAAGAAAATCGTTCGTAATATTCAGTAGCGTAAAAGAGGAATCAGTGGATGGATTCAGTGATTCCCTCGACCCGGACAAGGAGGACGTGTTGGATATGGATTTAGATTCGAGTTCGAATACAAAAGATGATTTGACCAAAAACGTTCCAAACAGGCAACGCGCCCTGAATCAGATAGTTGGTAAGCAGTGTGAGTTCGTGGAGTTGGCTCGTCGTGTGCGCACAGAGCAGCTAGTGACTGCGGCTGCTCAGCTCGCGCATATGGACGACTCGCTGGCACATCACACTTGGCTCACTATGTTCCCGGCGCTGTGGGCGGCGTTAGACGACAGGCAGCTTGCGACAATCATGAATGAAATAGTACCATTTATAATATCGGGAGTACATGTTATTCAAAGAGACCAACCACTGAGCGCGCTCAACACTTTCATAGAAGCACTCGCGAGATGCAATCCTCCAATCTCAATAAAGCCACCGATGATGAAATATCTCGGAAAGACACACAATCTGTGGCACAGAATGACATTGAATTTGGAACAGATGGCCATTGATCAGGCTAGTGGTCGCGCGAATCGCGAAGCGTTGGACATTTTTGATTATGATGTTGAAAGTACAACACCCACCGAAGTTCTAGACTCTTTGAGTGACATGTACGAACTATTACAAGAAGAGGACATGTGGTCTGGTTTGTGGCAAAAACACGCGCGGTACAGGGAGACGAATGTAGCTATAGCCTATGAACAGCAGGGCTTCTTCGAGCAGGCGCAGGCCGCGTACGACGTCGCTATGGCTAAACTAAAACAAGAATATTCAGCAAATCCGTCGTCTTATAATATGCACAAGGAATGTACTCTATGGACCCAACATTGGATCAAATGTGCCAAAGAGACGTGA

Protein sequence:

>DPOGS214120-PA
MMASMSTGGPGDPHTQMNTYRSYVTMLADPGAKDEIKLKAAQELSENFEVILSSPQYPQFLDHSLKIFLKILQEGEPHFIAEYNIQQVRKLILEMIHRLPISETLRPYVKSILILMLKLMEIENEENVLVCLKIFMELHKQYRPAYTTDVDIHKFLQWVKGIYSDLPNHLPKIFEPKPTIRVKDLSEVNIDQLLQETYTTTPIHTEKKLLDGSVVTYNLIPRSVLSLKVIQELPIIVVLMYQLYKQNVHQEVSNFIPLIMETITLQPATTHRQSSSFNKEVFVDFMGAQIKTLAFLAYIIRIYQDTIANHANLMVKGIIGLLTLCPPEVAHLRKELVIATRHILATDLRLKFVPYMERLFDEEVLLGGGWTVHESLRPLAYSTLADLVHHVRQHLPLTDLAIAAHLFSKNVHDESLPTSIQTMSCKLLLNLVDCIRQRSDSEAGAPQGRHLLMRILEVFVLKFKTISKLQLPALMAKCKQNTPATPNGTNGTPNPSTPTTPAPSTSSAEVKVEEEKPTPDFLDSINKTEEKSKIGFPTSQMTNLNVGDYRTLVKTLVCGVKTITWGCASCKTTTNTEGAATTTITGQKQFSARETLVFIRLVRWGLQSLDIYTLAAPRAPAQPPAQPHVRSKEEKEVLEHFSGVFSMMNPQTFQEIFTATISHMVERINKNPTLQIIANTFLSNPATSPIFATVLVEYLLKRMEEMGTNIERSNLYLRLFKLVFGSVSLFPTENEQMLRPHLHSIVNKAMDYAMTAKEPYNYFLLLRALFRSIGGGSHDLLYQEFLPLLPNLLEGLNRLQSGLHKQHMKDLFVELCLTVPVRLSSLLPYLPMLMDPLVSALNGSHTLISQGLRTLELCVDNLQPDFLYEHIQPVRADLMQALWRTLQNNEVARIAFRVLGKFGGGNRKMMIEPQRLEYRETDAPPPAVQAYFQDQPKPIDFEVDKVIETAFSALKSSTTDPFYRRQCWEVLRCYLAASLNLDDDKATLQKLFNHPSFLEGKIPAQNGPYYKCTNSIVRNTHRTALTGMFVAAAIKELRHHVLPTMVSLVRHYTLVAIAQEAGPFAGSGGPKEGLDALVLVDAIAVVMGHEEKELCKPGHLALVLMIETAATVLGSKERACRLPLMEYLAERMSALCYERAWYEKLGGCIAVKFMFEKMAPEWVYKHVFTFLKAVLFVMMDLTGEVSSGAIDMATVNLERLVRVCVTGPGGQGVEPEGEVAAAKARALHDVLQELVLQVTSPHLLVRQQAMKSLELIAELQNKTVTEVMDPHREVLADIIPPKKHLLRHQPANAQMGLMDGTTFCTTLKPRLFTIDLNINEHKVFFRELLSLCEAEDAMLGKLPCYKGVNLVPLRTSALRALAACHYIQEKPCREKIFQVLYKSLEKNDQELQQAGFECMQKFLSGFQIDMEMVHPVMRPLLLTLGDHRNLSVNGAKRLSFLTQLFPSTFSEKLCEQLLQLLKKLLDYSIQTNRGGNFLQSVSKNMENEQKIIILIGIFHQIPAASPRFIDVLCRLIFHTEKSLMIEAGSPFREPLVKFLLRYPKETLDFIMSDNNIKDQQWSRFLVFLVKHSEAGPAFREALHTTKKARLMQLLAANSGAAAIPQADKAEMQFQAVRVISLLIKYDDQWLSTQHDLIELLKRIWCSDQYHEVHKKVENVDCTHWKEPKLIVKILLHYFCHHPSNIDLLFQLLRALCDRFIPDFQFLRDFLENTVAQNYTVEWKRSAFFRFVEHFASDAMSQELKAKVLQMILIPCFAVSFDKGQKIVGGPPAPYQDNPDNVVSVFINNVIDPENPFACSDAVRISLLQFACLLLEQASPHIHDANNKKQGNKLRRLMTFAWPCLLAKNYVDPATRYHGHLLLSHIIAKFAIHKRIVLQVFHSLLKAHAVEARAVVRAALEILTPAMPQRMEDGNTMLTHWTKKIIVEDGHSVQQLFHILQLVVRHYKVYYPVRHALVGHMVAAMQRLGFSATASLEHRRLAVDLAEVATSPSGAMKRVSSDESGNEARKALNTGWASPQASVSRLEPDAAKPLDRQHVDVVVNLLLRLACQVNEGGVTGASAAGAAGAGGAGGSPGEQLSRRCVLLLRAALKPDVWPHLCEPKLAWLDKVFSTADTNAAACANACTALELLVFLLGVLRREQILAALKPLQRGLAACVASTNTKIVRLTHNLLAKLTALFPTEPSGAAQASKYEELETLYASVSKYAFEGLAAYEKSAGATGATASGAASGAAALLGPLMMLKACCASSPGYVDRLLLPLMRVLQRMARDHVAPNPDAATSDLLILALDLLKARVSVMPVETRKTFIGTILVGLIEKTNDAKVMKAITRMVEEWVKWKGTGAGAAPSLREKSILLVKLMQYVEKRFPDDLELNAHFLDLINYVYRDEHLKMTELSMKLEPAFLAGLRCPQPHIRAKFFEVYDGSVRKRVFDRLLYIICSQNWEHIGQHFWIKQCLELLLVTCVSSTQIRLSNSKYLLPNITAVINWADSEERKSFVIFSSVKEESVDGFSDSLDPDKEDVLDMDLDSSSNTKDDLTKNVPNRQRALNQIVGKQCEFVELARRVRTEQLVTAAAQLAHMDDSLAHHTWLTMFPALWAALDDRQLATIMNEIVPFIISGVHVIQRDQPLSALNTFIEALARCNPPISIKPPMMKYLGKTHNLWHRMTLNLEQMAIDQASGRANREALDIFDYDVESTTPTEVLDSLSDMYELLQEEDMWSGLWQKHARYRETNVAIAYEQQGFFEQAQAAYDVAMAKLKQEYSANPSSYNMHKECTLWTQHWIKCAKET-