Monarch geneset OGS2.0

DPOGS208029
TranscriptDPOGS208029-TA8970 bp
ProteinDPOGS208029-PA2989 aa
Genomic positionDPSCF300203 - 124016-139130
RNAseq coverage400x (Rank: top 30%)
Annotation
HeliconiusHMEL0146020.068.65% 
BombyxBGIBMGA001497-TA0.068.74% 
Drosophilaash1-PC0.039.93% 
EBI UniRef50UniRef50_D6WB780.045.75%Putative uncharacterized protein n=3 Tax=root RepID=D6WB78_TRICA
NCBI RefSeqXP_971447.10.045.66%PREDICTED: similar to set domain protein [Tribolium castaneum]
NCBI nr blastpgi|2700014770.045.75%hypothetical protein TcasGA2_TC000311 [Tribolium castaneum]
NCBI nr blastxgi|2700014770.036.31%hypothetical protein TcasGA2_TC000311 [Tribolium castaneum]
Group
Gene OntologyGO:00055152.6e-41protein binding
GO:00036773.4e-24DNA binding
GO:00056341.1e-16nucleus
GO:00180241.1e-16histone-lysine N-methyltransferase activity
KEGG pathwaytca:6600940.0 
 K06101 (ASH1L)maps-> Tight junction
    Lysine degradation
InterPro domain[2149-2271] IPR0012142.6e-41SET domain
[2683-2864] IPR0010253.4e-24Bromo adjacent homology (BAH) domain
[2100-2147] IPR0065601.1e-16AWS
[2595-2657] IPR0110112.4e-10Zinc finger, FYVE/PHD-type
[2524-2601] IPR0014871.1e-06Bromodomain
[2605-2655] IPR0130831.3e-06Zinc finger, RING/FYVE/PHD-type
Orthology groupMCL13059 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208029-TA
ATGAAGACAGTAAAATATCACGCATTCGCGTCTTCGGCTGCGGAGGGCGGTGCGTGCGGCAGCAGTACATCTTCTTCCACGTCATCGTCATCGTCTTCTTCGTCATCGTCATCCTCGTCAGACTCGGACACAGCGGACGCTGCGGGTGTTCACGCCTCGCCAGATGACTCCAGTGCAGCCCTGGCAGCGGCGTGTCAGTTGCAGGTCATCGACGATGCAGACCTCGCTGAACTATTTCCAGAACAAACTTCGAGCGCGCCGACGCAACAGCCCAATTTCACGACCTTTTCAGTTCACAATGCATCGCCGGAAAACGACGACAAATCCATAGCCGACAATGGCGATGGTTCAAGTAGTGAGATGGAACTAACACCACAGCTAGTAACAGCGGCTATACAGAGAGCGACGGCGGACTCATCAGGCTCGGAGAATGAGTGCTCCAATTCCGATACAGTGCATCAAAATACGTCGCATTACGCTTCTAGTTTATTGCAACAGTTTGTAGCACAAACTCAACTACTGAGCAGCACAGCGCCGCTTCCATCTATGAACTCTGCTGGATCTTCGTCGTTGAGCTGTGCACTGTCAACGAACTCAATAGATGGGGTCGGAGCCATCAGCGATTGTGTACTAGGACAAATCAATAGTCTACCAGAAATACCCGCGATCTCGTCCAACTATTTAAATAACCAACATTTGAGTCCGCAACAGACAGAAGAATTATCTCAAATAAATAAAGATCTAGAGGAAATTTCATCAGTCACGGAGGCTGTCGGTCTATCGATAACAAATCCGCCTAGTTTGGAAGACTGTGTCGATAACAACGACTTTATGAACTTGGACATAGCTTCAGGTGGATCAGAAATAGGCAGTGCGAGTGATCTTCTAAAGAGTTCGCCTATTACTGTAACTAATGAATCCGGATCAGTTAGTCAAATAGACGCTCAAAAGCTAGATACAATCAGTACAATTTCTGTAGAATCTCTGGATGATGTTAAGAACATTATTGTCGAATCTAGAAGGAAGAGAGGTCGACCTCGCAAAGTTAGGAAAATAGACGAATACGATAATAAATGTGTTGAATCTCATAAGAATACTAAATCTAGAGTTAATGTTATAAATGACTTCGATAACAACAATGATCCTCCGAACGTTTCGCCCGATTCTGGAATTTTGTCAAATCACAATTCTCCAACACATTCACCATTGAGAAGGCATAACGAAGAAACCCAAGGTAGGCACAACAGGAAATCAGTCCAGAAAGAAAACACTAAGGTGGAGAGAAAAAGTGTACGCTCAAAGTCCAAAACCAGAAGTAGGGGACATAGATCAGATTCGAGCGATTGTGATTTCCATAAGAAAAGATTAGAAAATGACATCAAACAAATCAAAACCGAGGCACCTTCACCTATACCGCTCAAACAAGAGCAAAATAAATTAGACAAAACGAAAAAGAATGACCAAAAATTAGACATAGCGGCATTAGACAGAATGCTGTATGCCACAGACAGAGTTTTATATCCACCCAGAAAGAAGGTCGGTCGTAAACCGCAGAACAAAACAAAAAATGCTAAAGCTACGCCAAAAACTTTACGAGATAAAAATCAATACGACTCCGCGGACAGCGACAACGATTCACTATCGTGCAACAAATCCGTTCTGTCCGGTGTTTACGCTAAGAGGAAAGAGGTGAACAGCAAACTAGCTAATGTGTCCAAGAGCAATAAAACTAGTAAAACCAACAATTCATGGAGAGATCATCAGAGTGAGAACGAAGCCGCCGCTGACGATCCTCTGGACCCGACGTGGAAACAAATAGATCTAAACCCGAAATATAAAGACATTTTATCGGGCTACAAAAGCGATTATGAATTCAAACCCTATAAAAGTTGCAGTAGGTTAATAGAGTCGGGTTATAAAAGTGATTATGGCTGCAGATCAGGTTATAAAACTGATTGCTATCGCTCCGGCTACAAGAGCGATCATAAGTCAGGTTACAAGAGCGATAAATCTGGATATAAAACAGATTACAGCATAAGGAGTATGCGGCGGCGAACGAGGAAGTTGAAGAAAACGAGGTCCGTAAGAAATAGGTCTTATTACAAAAATCAAAAGCATTTCGTATCGGACCAAGAAATAGTGCTACTGTCTAATAAAACATTCAGTAGCTTAAACTTGGGACACAGTTCCAGTGATTCGGAATGCGAACCTTATTTAAGGAAACCAAATGCTAGTCCAAAATACGTAAGCGTCTGTACTAAATATCCGCTACCATCGAAATATAACTACGCCTTCTCTAAATCAAACCAAAAACATATATTGAGACCGAACTCCGCTGACCCGTTTGCCCCCGGTCCGGTATTCAGCGGATTGCAACGGCCGGTTACCACTTATAATATCTTTAAAGTGAGTCATAATAATAACAACACACTACTTAAGTCGCCCGTTAAAAGTGGTAATTGTGCTAATATCTTGCCCTCATTAAAACCATCTTTCACTCATAAATTCGGCGTCTCTCCAATTTGTTCCACTCTTAAATCAAAAAAGGAAGAAATCTCCAAACATTTTACTAGAACGCTATCACCTAAAGGCAGAAAAATTTCTAAAAAGGATTCACCGATACGAAAGAAGGAGACACTGAAACCATTACCCAGTATATTTGAAAAGGCTAAAAATATCTATGTACCGAATAAGTCATTATCAAGAAATGTTTTTTTGAATTTTAAAAGGCCTCATATAAAACCGACGTTCCATACAAAAAAACCAAGAAGAACCGTTGTACTCGCACCTCCAAAAATTAATTTGAAGCCTACAGAAAGGCCATCTCGAATAACAATTAAGACTGAAAGTAAACATCACAGGCACAGAAGTAAACGTAGACATCGGTCGAGATCAGTGTCAAGGTGTAGAGAGAGTAGAAACTCTGTGGACTCAGTAGATCAAAAATTTACCCAAAACATGGATGTGCTGATCGAGAATTTTATGAAACTTTGCCAAATTGCTCCAAAATTCGTCGCTAGTGCTCCTCAAATACAGCAAAGAAATGTTGAAAAAGATAAGCCCCAGCCGGAGACTGCGCCGACTAAAGTAATAAAAAGGGCGTCTAAAAAACGAAAGACCTCGGATAATCAGGAAATAGCAACGCCTACTTCAAAGCGTAGACACAAGAAACAGTTGACGGAATCACAAAATAAAGGAGGGAAAGAGACGAATGAACACAAACTGCCCCTCAAGAAGAGACATTACCACATCAATTCCTCTACGAGCAACTCTTTGAGCCTCAGTTTGGTCACAGCGGAGTTTGATGAAAACTCGAAGACTGGAAGTGGTTCCGAACAGTTCTTATGTACTGAAACTGACTGTATGGGCGATGTGAATGATAAGTTGTCCGACGAATCTTCTAAACAGAAATCAACATCTGAAAAGAGTAAAAGGGATGCGAAGAATATCCCAACGGATTCCAAATCTCAAAGTCTTGAATCTCTTCCGAAACCGACTAAATCTTCCACGGACACGTGCCCTTCTAAAGAAAAAATTTATGAAACTTCGGAAAAACTGAAAGCTGTTCATAAAATGGTTCATGATTTAGAGAAGTCTTTACCGAAATCCAAAGAATCCGACGGTAAAGCGGAGCTTTTAAAGCCTGAAAACAACAAAACATTATCGCCGAGATCAGACACTAAAGCATCCCCACTAAGAAATTCTGCTCCCATAGTCACACCGAAAAAGCGTCATAGATTAGAAGCCGAAAAGGTTTTATCCCACAACAGTTTGGACCAAGTAGTTCAGTCCCTGTCTAAAAAATTATCGGAGGATAAAATAACGTCACCAGTCTCTAAGGATATTAATCAAAACTCTTCCAACGATACTACAAAGAAGGGTGATGTTTTGAATTCTAATACGGAAGCAGTTCAAAACGCAAACAACAATACATCAGTCGATCCACTTAAGAGCATGTCAGCTCGTACTTTATACAAAAGTGCAATTCCTCCAGCCCAAAAATCAGAGATAATGACCCGTAAGAAAAACAGACTCGAAGGTCTTACTAGTAATCTAGTTTCCAAGATCAATCCGAGCGCAGCAACGAAAGTGCTGGACACTCTTTTGAATAATAACATACGGAAATCTATTGAATCGAGAATTCTTGAAAAAGAAAAGAACAGTTCGGATACTTTACATAAATTATACGATGATAAATATAAAGGAAAGGATATTACCCACAATACAAGAGCCAGTGTCATAAAATCTCCAGTGTCTAAAGGGAAGGTGATTGAATCTAAGAAACCTAAAATCATAGAACCAATTGTAGAGATAATTCCCGTGGTGAGCGTCGACAAGCCCACAGGAATCTTTGAACCTTCCATCGATTTAGAAGATCAAATACCGAAATCTTCAATTTGTGTTTCTAATCTGCTTGCTGAAAGCAACAGAAACAGGAATAAATCTAGAGGGAATGATCAGAAAAATTCAAACGGTTTATTGGCTATAGACAATGAGTCTGAAATCCCTTTGGCTTTAATTTCCGAAACGTCTGATGCCATAATTAGGCCCAAAAGAGGTGAATCCATTGCATCAGTTTTATCCGATAAAATACAGGAAACAAGCAGTGGTCATAACTTGAGGCAAACTAAACGAAATTTACCTTCAGACAACGATGATACTAATGATAAGAAAAAGAAAAAGTCTTCTAACAGTATAATACGAGAAAGTAAATTGGTTTTGCCAACTAAAGTACTATTGTCCAAAATGCAAACTGATAAGTTAATAGCGAATGCTGAATTGAATAAGAAATCAGGTTTACCAGCTAAGGCCGTAGAAACAAACGTCACTTCAAATATTAAAGCTGCAGAAGTATCCAAGAAGAGAACAAGACGACGCAAAGCCATCAATCGTACCGGTTTTCCAAGTATTAAAAGAAAGAAGAAGAAGATAGAACCGAGTCTCTCAGCCAACATCATGTCCGACGGTCATTTCACATCTGAAGATACCGATCATTCTGTCTTTGAAAGAGTCCCAAAAGACGGTGAAGCGACAAGCACTTTCTTAGAACGCACAAATAGTAAAAAGCAAGAATTAAAAGTTGTCTTAAACAAGGAAGATATTCCCAAACAAGGTCGTCTCACAGTAGTTGCTCTTGAGAAACTACAAGGAAAGGAAATCTCAACGGACAACCAGAGTAGAAGTCACAGCATCGCGAAGAACATCACTGCCAAGAAGACAGGATCTTCTATTCTGAGAGCACCAGCATTGCATCTAAAACAAAACAAACCTGACAGAGAAATTAAAAATCACATAAACAAATGGGAGGTGCTCAGTGAAACTGATAGCATTCCCTCTCTAACCAGCTCTTTAAGCAACGATCCCGAGGATAGTATCCCTTTGAGCTTATTAAATCTCAAGGCTGGGAAGCAAACGAGTCGATTGGATAACTTGGAGAGGTTGAAGCGGAAAACACGAGCTATGTCACCCTCACACGAAATAGAAGAGATATTCTCAAAGAGGAAAATTGTAGAGAAGATCCCGAAAGTAGGGCTACGACCAAAATCAAGCTTGGCCATACTGTATCCGAGTGAACGAAGGCATACGAGAAGCTTCGATAATCTGGAAGACGGAAAGATGAAGACCAAAAGAATTGAGACTAAGAAAACGACCGCGGAAATTGTTGACAAAACTTCAAAGCCGATGAGCATAGCGACCAGGAGAAAGTCAAGGTCGTGTCAAGTCAACAAGAAAGTCACCGAAATACATTCCAGCTCACGAGAGAGTTCGTTAGACACCGTGGTCAGTCGGAGGATTCATTCCAAGTCACGGGAACCGTCCTTGGACACACTTCACGACAACGACGAAAACGAACCCCTGCCGCTTCACGAAAAAGAAATAGATTTCGAGAAAAGCATCGACGTACTGTCGAAGACTATCATCTGTAAGAAGCGAGTCGCTTCATCCCGCGACGAAAGTCCAGTCAACGGAGTAGACGTCAGAGACAAACCAGTCGTCTCCAAGAAGAACCCTCGCCTGAGAAAGAAGTTTTTGGTGGCCGGACTGTTCTCCGACTATTACAAAGACGATCCAAAACCGGATGGGAAAGGTAAAAATCTGGTCACTCAGACGGAGTTTCCGCCGGGTTTGCTCGCGCCGCCGCCCTACTGCGAGCGGTGGGTACGCAAGAGACTTCAGCATTTCACATTGCCTTACGACATATGGTGGGAACAACACTACAACCAGCCCGTGCCCTCCTGGAACTATAAGAAGATACGGACAAACGTGTACTACGACGTGAAGCCATCAGCGGAGGAGTGTGAGAGCGTGGCGTGTAACTGCGCGCCCTCCTCCGCCTGCAACGAAGACTGCATCAACAGACTCGTGTACTCCGAGTGCTCGCCGCAACTCTGTCCGTGCGGGGATAAATGCAAGAATCAACGTATTCAGCGGCACGAGTGGGTCCCGGGGCTGGAGAAGTTCATGACGGAGAACAAGGGCTGGGGCGTCCGCACCAAGCAGATGATCCGCTCCGGGGACTTCATACTCGAGTACGTGGGGGAGGTCGTCTCCGACAAGGAGTTTAAGGAGCGGATGGCGACCCGTTACGCTCGCGACACGCACCACTACTGCCTCCACCTGGACGGGGGGCTGGTCATAGACGGACACCGGGTGGGCGGAGACGGCCGGTTCGTCAACCACTCCTGCAGACCCAACTGCGAGATGCAGAAGTGGACCGCCAACGGTACATTCAGGATGGCGTTGTTCGCGTTGAGAGACATCGAGCCGGATGAAGAGCTCACTTACGACTACAACTTCTCACTGTTCAATCCAGCCGTCGGTCAGCCATGCAAATGCGATTCTGAAGACTGCAGGGGCGTCATAGGCGGAAAGTCCCAGCGGATCACAAAGCAGCCGGTCAAGTCACAGAACAGGACGGCGTCGAACGCCTCCAACCAGTCCGGCGGCTCGGGCAACCAGCCCCGGGTCGGCCGGCCGAGGAAGGCGGCCAAGTGTAACAAGAAGCCGGAACAACAGAACGTGTGCGCCGGGGACGTCAAGAACATGACCATACTCAAATACCAACAACACCTGAACAAGCTGTGGCAGGAGCCCGCACCCAGACCACTCACCGCCAAGGAGAGGACGCTCGTCAAGGAGCGGCACTGCTTCCTGTTCAGGAACCTGGAAAATGTGCGTCGCATCCGCGAGCGTCTGACCCTCCCCCTACCTCCATCCCCTCCGCCAGCTGGCCCTGCACCCGCTGCCCCGCCTCCTCCACCGCCTCCCCCCACTCCTCCCGCAGCCCCCGCCGTTGTGAACGTGAATCCCCTGGAGCTCCCCGACACTATGAACCCAGCCGTGTTCCTTAAAAGACTCCAAACACTCAGAGCGAGTAAAGAGGAAACTATGAAGGAACTGACTCGCTTGGAAGACGACCCTTCCTTGGATGTCAAGACACGCTTGACCAGGGTGTTCAGAGCTTTATATAACACTGTGGTCAATGTTAAAGACGAGGAAGGTAAACATGTCAGCGGTTTGCTGATGAAGTTGAAGGCGGGTCAAGTGAAGTCGGATGCACAGACTGTAGTAGACCTCAGCACTATACAGGCTCATATAGAGGGGGGGAACTACGAAACCATCGCTCAGTTCGACGGCGACATGAACACCCTGTTCACCAGCATCGTACGAGAACAAGGCAAGGCGACGGCTCTCGGCGCCGCGGCCACCCAACTTAAAAAGGTCTACAACTCGACGAAATCGGATTTCGCCGAACATTTGATTAAGATAATCGGTCCACAGGAATCTCTGCCTAATGGATTCATACAGAAAACTAAACCTGAGGAGGTGATCCTGTGTATCTGCGGGCTTCACGTGGAGGAGGGTCTCATGGTGCAGTGCGGCCTGTGCGGCGTCTGGCAACACGCGCGCTGCATGCGCCTGGCCGACACGCGCCTCACGCATCACTGTCACTACTGCAACCCCGCACCGGTCGATCGCGAAATACCTCTGGATGAATACACTGAGGAGGGTCACCAGTTCTATCTGTCGCTGATGCGAGGCGACTTACAAGTACGGCAAGGGGACACGGTGTACGTGCTGAGAGATATACCCATAGACGAGAGAAGACCTGACGTCACTCACAGGACCGGCGACACTAGCGACTCGCCCAAGACGAAGCGCCTCGACAGGAAGAAGGTTAAGAACATCGGCAAGGGGAAGGACGGGAAGGAGAAGGCGGACGAGACTAGTGAGGTTGAAGTACGCAAACATACATATCAAACTATCGGGGAAGTCCCTGTGTCTGAGCTGGATATATTCAGAGTGGAACGGCTGTGGAAGCACAAGGACACACAGGAGAGATACGTGTACGGTCATCACTACCTCCGACCTCATGAGACTTTCCACGAACCTACCAGGAAATTCTTCCACAACGAAGTAATGCGTGTCCCGCTGTACGAAGCCGTGCCTATCGAGCTCGTGATGTCTCAGTGTTGGGTGATGGACCTCAACACGTACTGCAAGGGTCGGCCAGTGGGTGCTCGCGAACAGCACGTGTATATCTGTGAGCTACGCGTGGATCGCTCGGCTCGTCTGTTCACACGAGTCTCTCGGCCGAAGTACCCGCTATGTACTAGGACTTACGCCTTCGATCACTTCCCCACAAGACTGAAACTCACAAGGACATATGCGCCTCACGAGGTATTGCCCGAATATCTGAAAGGTCGTGCTGCTAAGAATGCTGTTACAAACGATAAAGTTAAAAGTAACCAAGAAACAAAAAAGAAATCATCTTCGGCCGTCGCCGCGTCTACGACACCGGCTAAGGCGCTGTCGAAGTCGGAGCGTCGAGAGCAGCAGAAGGATCGTGTGAACAACATCGCCCGCGAGCTTCTATCGCGCGGCGGGCAGAGGGGGGCGGTGGACGCCTCGTACCTGCTAGCGCCGCGACCCGCCAGGCAACATAGACGCCGGCCGAGGACCTCCTGA

Protein sequence:

>DPOGS208029-PA
MKTVKYHAFASSAAEGGACGSSTSSSTSSSSSSSSSSSSSDSDTADAAGVHASPDDSSAALAAACQLQVIDDADLAELFPEQTSSAPTQQPNFTTFSVHNASPENDDKSIADNGDGSSSEMELTPQLVTAAIQRATADSSGSENECSNSDTVHQNTSHYASSLLQQFVAQTQLLSSTAPLPSMNSAGSSSLSCALSTNSIDGVGAISDCVLGQINSLPEIPAISSNYLNNQHLSPQQTEELSQINKDLEEISSVTEAVGLSITNPPSLEDCVDNNDFMNLDIASGGSEIGSASDLLKSSPITVTNESGSVSQIDAQKLDTISTISVESLDDVKNIIVESRRKRGRPRKVRKIDEYDNKCVESHKNTKSRVNVINDFDNNNDPPNVSPDSGILSNHNSPTHSPLRRHNEETQGRHNRKSVQKENTKVERKSVRSKSKTRSRGHRSDSSDCDFHKKRLENDIKQIKTEAPSPIPLKQEQNKLDKTKKNDQKLDIAALDRMLYATDRVLYPPRKKVGRKPQNKTKNAKATPKTLRDKNQYDSADSDNDSLSCNKSVLSGVYAKRKEVNSKLANVSKSNKTSKTNNSWRDHQSENEAAADDPLDPTWKQIDLNPKYKDILSGYKSDYEFKPYKSCSRLIESGYKSDYGCRSGYKTDCYRSGYKSDHKSGYKSDKSGYKTDYSIRSMRRRTRKLKKTRSVRNRSYYKNQKHFVSDQEIVLLSNKTFSSLNLGHSSSDSECEPYLRKPNASPKYVSVCTKYPLPSKYNYAFSKSNQKHILRPNSADPFAPGPVFSGLQRPVTTYNIFKVSHNNNNTLLKSPVKSGNCANILPSLKPSFTHKFGVSPICSTLKSKKEEISKHFTRTLSPKGRKISKKDSPIRKKETLKPLPSIFEKAKNIYVPNKSLSRNVFLNFKRPHIKPTFHTKKPRRTVVLAPPKINLKPTERPSRITIKTESKHHRHRSKRRHRSRSVSRCRESRNSVDSVDQKFTQNMDVLIENFMKLCQIAPKFVASAPQIQQRNVEKDKPQPETAPTKVIKRASKKRKTSDNQEIATPTSKRRHKKQLTESQNKGGKETNEHKLPLKKRHYHINSSTSNSLSLSLVTAEFDENSKTGSGSEQFLCTETDCMGDVNDKLSDESSKQKSTSEKSKRDAKNIPTDSKSQSLESLPKPTKSSTDTCPSKEKIYETSEKLKAVHKMVHDLEKSLPKSKESDGKAELLKPENNKTLSPRSDTKASPLRNSAPIVTPKKRHRLEAEKVLSHNSLDQVVQSLSKKLSEDKITSPVSKDINQNSSNDTTKKGDVLNSNTEAVQNANNNTSVDPLKSMSARTLYKSAIPPAQKSEIMTRKKNRLEGLTSNLVSKINPSAATKVLDTLLNNNIRKSIESRILEKEKNSSDTLHKLYDDKYKGKDITHNTRASVIKSPVSKGKVIESKKPKIIEPIVEIIPVVSVDKPTGIFEPSIDLEDQIPKSSICVSNLLAESNRNRNKSRGNDQKNSNGLLAIDNESEIPLALISETSDAIIRPKRGESIASVLSDKIQETSSGHNLRQTKRNLPSDNDDTNDKKKKKSSNSIIRESKLVLPTKVLLSKMQTDKLIANAELNKKSGLPAKAVETNVTSNIKAAEVSKKRTRRRKAINRTGFPSIKRKKKKIEPSLSANIMSDGHFTSEDTDHSVFERVPKDGEATSTFLERTNSKKQELKVVLNKEDIPKQGRLTVVALEKLQGKEISTDNQSRSHSIAKNITAKKTGSSILRAPALHLKQNKPDREIKNHINKWEVLSETDSIPSLTSSLSNDPEDSIPLSLLNLKAGKQTSRLDNLERLKRKTRAMSPSHEIEEIFSKRKIVEKIPKVGLRPKSSLAILYPSERRHTRSFDNLEDGKMKTKRIETKKTTAEIVDKTSKPMSIATRRKSRSCQVNKKVTEIHSSSRESSLDTVVSRRIHSKSREPSLDTLHDNDENEPLPLHEKEIDFEKSIDVLSKTIICKKRVASSRDESPVNGVDVRDKPVVSKKNPRLRKKFLVAGLFSDYYKDDPKPDGKGKNLVTQTEFPPGLLAPPPYCERWVRKRLQHFTLPYDIWWEQHYNQPVPSWNYKKIRTNVYYDVKPSAEECESVACNCAPSSACNEDCINRLVYSECSPQLCPCGDKCKNQRIQRHEWVPGLEKFMTENKGWGVRTKQMIRSGDFILEYVGEVVSDKEFKERMATRYARDTHHYCLHLDGGLVIDGHRVGGDGRFVNHSCRPNCEMQKWTANGTFRMALFALRDIEPDEELTYDYNFSLFNPAVGQPCKCDSEDCRGVIGGKSQRITKQPVKSQNRTASNASNQSGGSGNQPRVGRPRKAAKCNKKPEQQNVCAGDVKNMTILKYQQHLNKLWQEPAPRPLTAKERTLVKERHCFLFRNLENVRRIRERLTLPLPPSPPPAGPAPAAPPPPPPPPTPPAAPAVVNVNPLELPDTMNPAVFLKRLQTLRASKEETMKELTRLEDDPSLDVKTRLTRVFRALYNTVVNVKDEEGKHVSGLLMKLKAGQVKSDAQTVVDLSTIQAHIEGGNYETIAQFDGDMNTLFTSIVREQGKATALGAAATQLKKVYNSTKSDFAEHLIKIIGPQESLPNGFIQKTKPEEVILCICGLHVEEGLMVQCGLCGVWQHARCMRLADTRLTHHCHYCNPAPVDREIPLDEYTEEGHQFYLSLMRGDLQVRQGDTVYVLRDIPIDERRPDVTHRTGDTSDSPKTKRLDRKKVKNIGKGKDGKEKADETSEVEVRKHTYQTIGEVPVSELDIFRVERLWKHKDTQERYVYGHHYLRPHETFHEPTRKFFHNEVMRVPLYEAVPIELVMSQCWVMDLNTYCKGRPVGAREQHVYICELRVDRSARLFTRVSRPKYPLCTRTYAFDHFPTRLKLTRTYAPHEVLPEYLKGRAAKNAVTNDKVKSNQETKKKSSSAVAASTTPAKALSKSERREQQKDRVNNIARELLSRGGQRGAVDASYLLAPRPARQHRRRPRTS-