Monarch geneset OGS2.0

DPOGS207288
Transcript: DPOGS207288-TA; 8271 bp
Protein: DPOGS207288-PA; 2756 aa
Genomic position: DPSCF300008 + 537957-550431
RNAseq coverage: 137x (Rank: top 55%)
Annotation
Heliconius: HMEL016330; 0.0; 93.54%
Bombyx: BGIBMGA012021-TA; 0.0; 91.93%
Drosophila: zfh2-PA; 3e-90; 53.09%
EBI UniRef50: UniRef50_D6WH70; 0.0; 62.12%; Zn finger homeodomain 2 n=2 Tax=Tribolium castaneum RepID=D6WH70_TRICA
NCBI RefSeq: XP_969252.1; 0.0; 59.41%; PREDICTED: similar to Zinc finger homeobox protein 3 (Zinc finger homeodomain protein 3) (ZFH-3) (Alpha-fetoprotein enhancer-binding protein) (AT motif-binding factor) (AT-binding transcription factor 1) [Tribolium castaneum]
NCBI nr blastp: gi|270004902; 0.0; 62.12%; Zn finger homeodomain 2 [Tribolium castaneum]
NCBI nr blastx: gi|270004902; 0.0; 59.32%; Zn finger homeodomain 2 [Tribolium castaneum]
Group
Gene Ontology: GO:0003677; 2.1e-20; DNA binding
    GO:0006355; 2.1e-20; regulation of transcription, DNA-dependent
    GO:0043565; 2.4e-20; sequence-specific DNA binding
    GO:0003700; 2.4e-20; sequence-specific DNA binding transcription factor activity
    GO:0005515; 2.7e-20; protein binding
    GO:0003676; 1.4e-05; nucleic acid binding
    GO:0008152; 5.2e-05; metabolic process
    GO:0003824; 5.2e-05; catalytic activity
KEGG pathway:
InterPro domain: [2075-2139]; IPR012287; 2.1e-20; Homeodomain-related
    [2075-2137]; IPR001356; 2.4e-20; Homeobox
    [2073-2148]; IPR009057; 2.7e-20; Homeodomain-like
Orthology group: MCL11569; Multiple-copy universal gene
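
The coordinates in this record can be sanity-checked with a few lines of Python. The sketch below is illustrative only: the variable names are made up for this example, and it assumes that both the genomic position and the InterPro ranges are 1-based, inclusive intervals, which the record itself does not state.

```python
# Values copied from the record; names are hypothetical, chosen for this sketch.
GENOMIC_START, GENOMIC_END = 537957, 550431
TRANSCRIPT_BP = 8271
PROTEIN_AA = 2756
INTERPRO = {
    "IPR012287": (2075, 2139),  # Homeodomain-related
    "IPR001356": (2075, 2137),  # Homeobox
    "IPR009057": (2073, 2148),  # Homeodomain-like
}


def span_length(start, end):
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# Length of the genomic interval covered by the gene model.
genomic_span = span_length(GENOMIC_START, GENOMIC_END)

# Length in residues of each annotated domain range.
domain_lengths = {acc: span_length(s, e) for acc, (s, e) in INTERPRO.items()}

# Every domain range should fall within the protein.
all_inside = all(1 <= s <= e <= PROTEIN_AA for s, e in INTERPRO.values())

print(genomic_span, genomic_span - TRANSCRIPT_BP, domain_lengths, all_inside)
```

Under that assumption the genomic interval is longer than the transcript. The difference would be consistent with intronic sequence, although the record does not list exon boundaries.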
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207288-TA
ATGCCTACGCCGCTGCAGCCCGGCGGCCCCGCTAGCGGCCCGCCTCCGGAGCGTAAGCGGCGCCGCAAACGCGACGACCCACAAAGTTCCGCCGCCCTTGAACCTGACGAAGACGACGGCGAAATGAGCCCTGAGGAAGAGCCGCGCAATACTCCCGCTGCTCCTGCAGCACCCACACCCGCGCCATCGTCTGCGCCTGCACCTACATCACCTCCTGCCGCACCAGACGCAGTTGATCTTACTTCACGTCGAGATTCACCACCACTATCGTCTGATGTAGAACATTTTGATGGTAAAATCGTTTACAATCCTGATGGATCTGCATACATTATCGAGGACCCAGAAATATCAGAAGGTGAAACGAGTCTTCCTGACTTACCTAAAATAGAACCCGGTTGCATAGTGGATAGCAGAGAAAGCAATATCACAGAGAGGCAGCTTGAGTTTCCACAAATTGCAAGTGCATTCTATGTGTCCAGAAATCCTTCCTTGTACGGTGCTTTGTACGGAAGATTAGCAGCTGAAAGAGCGCGCGCAAGGCCGGATGCTCCGGTTATGCATAGTTACAGAGTGTTCAGTTTTCGTGGAGGAAAAGACGCTCCTAGATCTTCATCTCCAGTGCCAGAATGCCCTGTTAGTGTGCCGGTTAAACCGATTCTTATGTGTTTTATATGTAAATTGAGTTTTGGTTATGCCAAATCTTTTGTGGCACATGCCCAATCGGATCACAGTTTATCACTGCTTGATTCCGAAAGAGACGCTTTATCAAGAGAAAACGCTTCAGCGATTATCCAGTGTGTAGGAAAAGACAAAGAAGCTTTGGTATCATTTTTAGAACCTCTAGGTTCGGCTGCGCCACGAGCTTCTGTTTCTGGGAGCCCTGCAAATATATCTGAATTATCAAGTCCTTCTATTGACAAGTTACAAGATAACATGACTACAGAAAGAGAAAATGATTTATCGTTACCCAATGGCTCGTGTGATGACAGACGGCCAAGTCCACCTACCTGGAGGCCGCCAAGATCATTGGCTGAATCATTGATGCCACAACATTCCATGATCTCAGTGGCTCATCCACATACTATAAACGCTTCAATACAATCACGTGCTAGCCCTAACTCTTCACCACCTTTTCAAGCTGCACCCCCAGCTTTTCTGTCTGGAACAACGATAGGAGTATGTCCTGACCATCTAGGAGGACGACCATCTGGTGCAGATTGTCCAAAATGTGAGCTTATCTTAAATTCTGGAAGGTTAGGTGGTCCATTAGCAGGAATGCATAGCAGAAATTCTTGTAAAACCCTAAAATGTCCAAAATGTAACTGGCATTACAAGTATCAAGAAACGCTAGAAATACATATGAAAGAAAAACATCCAGAAGCAGAAACAAGCTGTATTTATTGTATTGCTGGTCAGCCTCATCCACGATTAGCACGGGGAGAAACTTATACATGTGGATACAAACCATACAGATGTGAAGTCTGTAACTACTCTACTACTACAAAAGGAAACCTGAGCATACATATGCAATCAGATAAACATTTAAATAATATGCAAGAGTTACAAAATGGAGGCAACCCTGGAGAAGGAAATCTACCACCAGCCCAACATACTCCACCAGGTGCCCATAAGCCTCCACTGCCACATCATTCGCCATTGGGACAAAAACCAAAACCGACATTTAGGTGTGACGTATGTAACTATGAAACAAACGTTGCACGAAATTTAAGAATACATATGACATCCGAAAAGCATACACATAACATGCTTGTACTACAACAAAATGTAAAACACATGCAAACGCTATCTGCTTTACATCACAGGCAACAGAGTCAGCAACAGCTGGAGAACTTGTTACATTTTCATGGCGGTGATGCGCCTCCACCAAATCCTGAGGCAGCTTTAGCTGATATGGCATACAATCAAGCCCTTATGATTCAACTAATGACTGGTGGACCTGGGCCTTCGCCCCCAGAATTAGGTGCCCATCTTGACGTTGG
GTTGAATCCAGAGGCCATGGAACCTCCACCAGAGCCGGCAGATCCAGAACCTGAAAGAACATTTCACTGTTGTATTTGCAATTGTTTTTCAACGGATTCTTTAGAAGCCCTGGGCCATCATTTAGCTCAAGATCGCACAAAAATCAGAGAACAAGAAATATTGGCCCTAGTTGCTGGACATTACGTATGTAAATTATGCACATACAAAACTAATTTGAAAGCAAACTTCCAATTGCATTGCAAGACAGACAAGCATCTTCAAAGGCTACAGCATGTGAACCATGTGAAAGAAGGTGGACCCCGAAATGAATGGAAACTGAAGTTTTGTGGTGGAGTCGGCACTGGAGGAGCTGGAGTAGGCGGTGTTCAAGTCAGATGTTGTGCTTGTGATTATTACACAAATTCTGCTCATAAATTACAATTACATGCCGCTGGAGCTAGACACGAAGCAGCTGCATTGTTATTACGTCATCTTAGAGAATGCGCTTCAAGAATACCCCGTGAACGTCCTCGTGTATATCGTTGTGCACTTTGTGGGTTCAATGCTCCCCACAGACTACCACTTCTACAACACGTGCGTTCTGTGAAACATTTACAAATGGAACAAATTCATCAACTTCAACGTCGCTCCGAGGGCAAAGATCCAACTCCTGATGTAGCTGAACTCTTCCAAGTTATTCCTCAACCTCCAGAGCTGCCCTACGATCAACAAGATAATGATGTTAAGGAGCCAGTTGACCAAAAGCCGGAACTGACACAAGAACAGAAAATGATGCGATTTTTAGAACAACACCAACAACAACAGCACCTGCAGCAGCAACAAACACAGCCTATACAACAACAATCTCAACAGACTGTTATCGATAAAGAAGAAGAACAAGATGTTTCAGGTCAACATACCTGTCCTTACTGCAATTTTAGCTGTGGAAGCGAAAGTAAACTAACTGTCCATGTTAATTCTGTACATGGTGATACTGCACGTCATTTTATTTGCCCTCTGTGTCAAGATGCATTTAAAGACAGACCATCTCTCGAGAGGCATGTAATGCAAATCCACTCTGTAAACTCGGAAGGATTACAACGTCTTCTATTACTGGTTGACCAAAGTCATTGGCTTAATGGTGGAACGCAACCACAGAGAGACGAATCGCGACAGAATGAAGAAAATGAAAGAGAAATTTCATCGCCTCGCTCTGAAGGGAGCGTAGACGGAGAAACTGAACGATGTTTAACATGTAATCGCACATTCCGCAACGTCGATGAATTGTGTCAACATCAGAACGAATCTGGCCACCTAGAATTGAAACAAACTCCCCAAGGTCCGGGATATGTATGTTGGAAAAAGGGATGCAACAGATATTTTGATTCTGCTCATGCATTACAAAATCACTTTAGAGAAGCTCATGCACGTAATTCAATTGCAAACATGTCAGTATCGGAAAAACATGTATATAAATATCGCTGCAATCAGTGTAGCTTGGCCTTTAAAACGGTAGAAAAGCTGCAATTACATTCCCAGTACCATGTCATTCGTGATGCTACTAAATGTGTTTTGTGTGGGCGGAGTTTTAGATCCATTTTAGCTCTTCAAAAACATGTTGAGACTTCGCACCCTGAACTTTCAGAGGAGGAATTAAATGCTTTCAAGCGTAGTCTAGCTTCAAATCCATTATTACAAAGTAACCAAGGAGTTGCTTTAGACGCCACCACTGTAGATTTGTTACGTAAGGAGTCACTCAGAACCCCAGAAGATGAGCTTGGTGAAATCGAAGACAGGGATTCAAGTGCTACTGCTGCTGATGAATCTGGCCACAATGACGCCGAAAACTCAGATGATTCAATTATTTATAAAGACCAACAATTTCTTGAAGACTACCTTAATTCTCAAGCGATGGCTGAAGATTCTTATAATGATCCTAACAGAAAATATAAATGTCATCGCTGCAAAGTTGCTTTTACTCGACAATCCTATTTGACTGCTCATAATAAGACCCTTCTGC
ATAGAAAAGGCGAAAAATTAACCTACCCTATGGAGAAATATCTTGACCCTAACAGACCCTTTAAATGTGACGTATGTAAAGAATCTTTTACTCAGAAAAATATATTACTTGTTCATTATAACAGTGTGAGCCATTTGCATAAACTCAAGCGCGCTATGCAAGAGCAACAAAATAATAATAATCCTCCAGTTTCGCCAAGCGCAGGTACAGCGCCCTCGAATTTAACTTTAACACCTAAAAGTACCTCCAGCGAAGAAGACGATCGTAAACGATATAAATGTAACATATGCAAAGTTGCATACACCCAAGGCAGCACCCTAGATATACACATGAGATCGGTATTACATCAAACTCGAGCTGGCAAACTACAAGAGTTAGCAGCCGCTGGTCACGTGGACTTATCTCGGCCCTTAGTAGAACAACCGGACAGAAACGACCCCGCTAAAATTTTACAAGACGTTTTGTCGCCAAAAAATACATCCCCTTCATCTACTAGCAGCGGGGGGCCTCGCTCTTCACCCCCCGCACGCCCAGGTAGTCCGCGATCCCCTCGCACAGGTAGCGCCTCGTGCGAGCGTTGCCACGCGTCATTCCCAACTGGGGAGTTACTCGACGCACACCGTGCAACCTCATGCCCGTTTGGAGATGCGCGGGCTCATTCGCCGCTCGGTGACGCAGAGGCAGCCGCTTTAGACGAGATGGTAGCCAAGGGCAATCCGCCCAAACGAAATTCACAAATGTATAAGCAACTCTTAGAAACATTTGGATTCGACCTCGTGATGCAGTATAATGAAAATCAGCGGCGCAAATTGCAAGAGGAACGTGAAATGGCCCGTGTACCTAGTCCTCCCCCTCCCCCTCCGGAAGAGAAACCTCCCGATGGAGAGATAAAATCGACGTGTCAACATTGTAACAAGGAGTTTTCTAGTGTGTTTGTTTTAAAGACTCATTGTGAAGAAGTGCATAAGGACAAAGTACCACTTGAATTTCTAGAACAGTTCGCAGAACAGTTTAAATCAGAATATGAAAGGAAATCTGGTGCACCGAACTCACCACGTGCTGCCTCCCCAGCACCTCAAGGCGACCGGTCACCATCACCTCGCGGTGATAACAGTAGCAACTTCAACGATAGTGCTAATGGTCAGGGTGAGGCTCAAGCCGGTGCCTTATTGGCAGCTCAAATGCAAGAAATGCAGGCTGCCTTAAATATGATACAGCTACAGCAACTCGGTCAACTTCATCCTATGATGGCTCAAATGTTATCCCTGGGTCTTCCACTGGGCTTGAACATGAGTGCTTTGGCTGCTATGAACCTGCAGCCTCCACTGGTTTCCCTGATGCTTCCTCCGCCACCCTTCGACGCCATGCCTTTTTCACACGATGCTCAATTAAAACAACAGCAGCTGTTGCAGCAACAGCAGCAGGCAAATGCTGCAGCTGGTCAAAAACGCGCTCGTACTCGTATTACTGACGAGCAATTGAAAATTTTACGCTCTCACTTTGACATCAACAACTCACCAAGTGATGAAGCTATTGCTAAAATGGCCAAACAATCTGGACTTGCTACTAAGGTTATCAAGCATTGGTTCAGAAACACTCTGTTTAAAGAACGTCAACGTAATAAAGATTCACCATACAACTTCAATAATCCACCGTCAACGACACTTAATCTAGAAGAATATGAAAAGACTGGGGAAGCTAAAGTGACTCCTTTAGATTCCTCGGCCAGCTCTGATGAAGGAAAACCACCGCCTGAAAAAAAAGCAAAAACTGAAGACATGACTGCTTTGAACATGAGTCATCAACATGAAGTAAAATCCGAACCAAATGATGAACCTACTATGGAAGAGAAATACAACTCATATGATGATCGCCATCAAGAAGATAGAAGTTTTAATTTTCCTCAAGTTCCTCAGAGCCCAGCATTAAGTACAGCTGAAGTTGTACAAAATATGTCACGACCGCAGACACCTACACATATATCATTGAATTCTTTGATT
CAATCGCAACTTGACTCTATACCGGCAACAAGCATACCCCAGCCACCTCATCCATCTATGCTACCACCGAAGTTGAATCCGAATTTCACGTCCCCAAACTCCGCGCCCCCGAACGTCCTACCTTTGACCCCAAATCGCAGCCTAAGCCCTGGCAGGGTTCCGGCCGATTTCGGACTTTCCGGGGGCAACTCGAATGGATCCAACAGCTCGGGGTCTTCTGGCAAACGCGCCAACAGAACTAGATTTACTGACTACCAAATCAAAGTTTTACAAGAATTTTTTGAAAATAACGCATACCCAAAGGATGATGACCTAGAATACCTGTCAAAGCTACTAGGATTAAGTCCTAGGGTAATCGTAGTATGGTTTCAAAACGCACGTCAAAAAGCTAGAAAGGTATACGAAAACCAGCCCGCAGCCGATCCACCAGCTGGATCAATGGATGACGCTAATAGATTTCAAAGAACACCTGGCTTGAACTACCAATGTAAGAAGTGCCAACTTGTATTTCAGAGATATTATGAACTAATAAGACATCAAAAGACTCATTGTTTTAAGGAAGAGGATGCGAAACGATCAGCGCAGGCGCAGGCGGCTGCTGCTCAAGTGGCTGCTACCCTTAGCAGCGAAGACTCTAATTCAAGTACCGTAGAACACCACGTTCCGCATGTGCCAGTATCACCGGCACCACCTCGAACTCCTACGCCTGCAGCCCCAAATTATCCAGTATCCCCGGCCCCTACAACTCCTCACACACCTGTCACTCCTCTCACTCACCAACCGAGAGAAGAAAAAGATGGAAATTTTCAGTGTGATAAATGTAACTTAGTCTTCCCACGATTCGATCTTTGGCGAGAACATCAATTAGTTCATATTATGAACCCTAATTTATTTCCTTCCTATCCACCAGATTCCCCTTTTGGAATCTTACAACAACATGCTCAGCTACAACAACTCAATGCTTCACTCACGAATGATGAATCACGCCATCCTCTGGTAGCAGCACTCAACCAACAAGTAGCGAAGAGAAAATTTGATGAATTTGAAGAATCTGAAACTGGTGAACAACCAAAAGATAAGCGACTTAGGACCACCATACTACCTGAACAATTGGATTATCTCTATCAGAAGTACCAAATAGAATCCAATCCATCACGAAAAATGTTAGAAAATATTGCTCGTGAAGTTGGATTAAAAAAAAGAGTGGTACAAGTTTGGTTCCAAAATACGCGAGCAAGAGAAAGAAAAGGGCAATTTCGGGCTCATGCACAAGTAATAAATAAGCGTTGTCCATTTTGTCCGGCTCTATTCAAAGTTAAGAGTGCTTTAGAAAGCCATTTAAGCACTAAACACGCCGACCAGTGCGCACGAGGGGAAATCAACGTTGACGCTCTACCAGACGAAGAACTTAGTACAGAGTCAACACCAAGTTTTGGTTCGCAACAAAGTGATAGACAGCAAAATTTTTCCCAAGCGGGGGCTCCCATGTTGCCCCCTATTTTTCCACCTTTTCACTCAGACATGGAGAAGTTTATCAAACAGTACAGTGAAGAATCAATGAAACGATATGTCAGTGAGCTTCAAGCACACGCAGCAGCTCAACAGAATGGTAATAGTAATGAAATCTCTGAGCAACGTGAACATGGAAAATCGGAAATACCACTAGATCTAAGTAAACCCGTCGACTTATCACGTCCAGGATCAGATGCTGATGAACGATCTGATACCGCATCCGAAACTATGGAGTTTTATGAGGAAGATGAACCTACGTCACCTATAGCCGGTCAACAGCATACGCCGAGACCTCCTGGTAAACGATTTCGTACTCAAATGTCTTCAGTTCAAGTCAAAATAATGAAATCATTATTTAGCGACTATAAAACTCCTACAATGGCCGAGTGCGAAGCACTTGGTCGAGAAATCGGCTTACCAAAGCGAGTAGTTCAAGTATGGTTCCAAAATGCTAGAGCCAAGGAGAAGAAAGCGCGTCTTGCAGCAGG
ATTAGCTGAAGTATCTGATGCGCAACCACCAGAAGAATGTCGAGTATGTGACTTTAAATACAGTCATAAGTATTCGGTACAGGACCACGTGTTCACACGAGGGCACATCGCTGCGGTGCGCGCTCGATTAGAGAGCAGCGCAGGCAGCGGAGAGGACAGCACGCTTGCGTTAATGCAGATGGCGGCGCGACTGGAGAGCGGCCTTGGGGGCGAGCTGCATAACGCATTTCTGCGGCCACAACTCGCCGGCAACGGTGAGTGTACAACTTGA

Protein sequence:

>DPOGS207288-PA
MPTPLQPGGPASGPPPERKRRRKRDDPQSSAALEPDEDDGEMSPEEEPRNTPAAPAAPTPAPSSAPAPTSPPAAPDAVDLTSRRDSPPLSSDVEHFDGKIVYNPDGSAYIIEDPEISEGETSLPDLPKIEPGCIVDSRESNITERQLEFPQIASAFYVSRNPSLYGALYGRLAAERARARPDAPVMHSYRVFSFRGGKDAPRSSSPVPECPVSVPVKPILMCFICKLSFGYAKSFVAHAQSDHSLSLLDSERDALSRENASAIIQCVGKDKEALVSFLEPLGSAAPRASVSGSPANISELSSPSIDKLQDNMTTERENDLSLPNGSCDDRRPSPPTWRPPRSLAESLMPQHSMISVAHPHTINASIQSRASPNSSPPFQAAPPAFLSGTTIGVCPDHLGGRPSGADCPKCELILNSGRLGGPLAGMHSRNSCKTLKCPKCNWHYKYQETLEIHMKEKHPEAETSCIYCIAGQPHPRLARGETYTCGYKPYRCEVCNYSTTTKGNLSIHMQSDKHLNNMQELQNGGNPGEGNLPPAQHTPPGAHKPPLPHHSPLGQKPKPTFRCDVCNYETNVARNLRIHMTSEKHTHNMLVLQQNVKHMQTLSALHHRQQSQQQLENLLHFHGGDAPPPNPEAALADMAYNQALMIQLMTGGPGPSPPELGAHLDVGLNPEAMEPPPEPADPEPERTFHCCICNCFSTDSLEALGHHLAQDRTKIREQEILALVAGHYVCKLCTYKTNLKANFQLHCKTDKHLQRLQHVNHVKEGGPRNEWKLKFCGGVGTGGAGVGGVQVRCCACDYYTNSAHKLQLHAAGARHEAAALLLRHLRECASRIPRERPRVYRCALCGFNAPHRLPLLQHVRSVKHLQMEQIHQLQRRSEGKDPTPDVAELFQVIPQPPELPYDQQDNDVKEPVDQKPELTQEQKMMRFLEQHQQQQHLQQQQTQPIQQQSQQTVIDKEEEQDVSGQHTCPYCNFSCGSESKLTVHVNSVHGDTARHFICPLCQDAFKDRPSLERHVMQIHSVNSEGLQRLLLLVDQSHWLNGGTQPQRDESRQNEENEREISSPRSEGSVDGETERCLTCNRTFRNVDELCQHQNESGHLELKQTPQGPGYVCWKKGCNRYFDSAHALQNHFREAHARNSIANMSVSEKHVYKYRCNQCSLAFKTVEKLQLHSQYHVIRDATKCVLCGRSFRSILALQKHVETSHPELSEEELNAFKRSLASNPLLQSNQGVALDATTVDLLRKESLRTPEDELGEIEDRDSSATAADESGHNDAENSDDSIIYKDQQFLEDYLNSQAMAEDSYNDPNRKYKCHRCKVAFTRQSYLTAHNKTLLHRKGEKLTYPMEKYLDPNRPFKCDVCKESFTQKNILLVHYNSVSHLHKLKRAMQEQQNNNNPPVSPSAGTAPSNLTLTPKSTSSEEDDRKRYKCNICKVAYTQGSTLDIHMRSVLHQTRAGKLQELAAAGHVDLSRPLVEQPDRNDPAKILQDVLSPKNTSPSSTSSGGPRSSPPARPGSPRSPRTGSASCERCHASFPTGELLDAHRATSCPFGDARAHSPLGDAEAAALDEMVAKGNPPKRNSQMYKQLLETFGFDLVMQYNENQRRKLQEEREMARVPSPPPPPPEEKPPDGEIKSTCQHCNKEFSSVFVLKTHCEEVHKDKVPLEFLEQFAEQFKSEYERKSGAPNSPRAASPAPQGDRSPSPRGDNSSNFNDSANGQGEAQAGALLAAQMQEMQAALNMIQLQQLGQLHPMMAQMLSLGLPLGLNMSALAAMNLQPPLVSLMLPPPPFDAMPFSHDAQLKQQQLLQQQQQANAAAGQKRARTRITDEQLKILRSHFDINNSPSDEAIAKMAKQSGLATKVIKHWFRNTLFKERQRNKDSPYNFNNPPSTTLNLEEYEKTGEAKVTPLDSSASSDEGKPPPEKKAKTEDMTALNMSHQHEVKSEPNDEPTMEEKYNSYDDRHQEDRSFNFPQVPQSPALSTAEVVQNMSRPQTPTHISLNSLI
QSQLDSIPATSIPQPPHPSMLPPKLNPNFTSPNSAPPNVLPLTPNRSLSPGRVPADFGLSGGNSNGSNSSGSSGKRANRTRFTDYQIKVLQEFFENNAYPKDDDLEYLSKLLGLSPRVIVVWFQNARQKARKVYENQPAADPPAGSMDDANRFQRTPGLNYQCKKCQLVFQRYYELIRHQKTHCFKEEDAKRSAQAQAAAAQVAATLSSEDSNSSTVEHHVPHVPVSPAPPRTPTPAAPNYPVSPAPTTPHTPVTPLTHQPREEKDGNFQCDKCNLVFPRFDLWREHQLVHIMNPNLFPSYPPDSPFGILQQHAQLQQLNASLTNDESRHPLVAALNQQVAKRKFDEFEESETGEQPKDKRLRTTILPEQLDYLYQKYQIESNPSRKMLENIAREVGLKKRVVQVWFQNTRARERKGQFRAHAQVINKRCPFCPALFKVKSALESHLSTKHADQCARGEINVDALPDEELSTESTPSFGSQQSDRQQNFSQAGAPMLPPIFPPFHSDMEKFIKQYSEESMKRYVSELQAHAAAQQNGNSNEISEQREHGKSEIPLDLSKPVDLSRPGSDADERSDTASETMEFYEEDEPTSPIAGQQHTPRPPGKRFRTQMSSVQVKIMKSLFSDYKTPTMAECEALGREIGLPKRVVQVWFQNARAKEKKARLAAGLAEVSDAQPPEECRVCDFKYSHKYSVQDHVFTRGHIAAVRARLESSAGSGEDSTLALMQMAARLESGLGGELHNAFLRPQLAGNGECTT-
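
The two FASTA records above wrap across several lines, and the protein ends with a "-" that appears to mark the stop codon. A minimal sketch of how the records could be read and checked against each other follows. The helper names `read_fasta` and `cds_matches_protein` are hypothetical, the toy records stand in for the full sequences, and the check assumes the transcript is coding sequence only (no UTRs), which the stated lengths are consistent with: 8271 = 3 x (2756 + 1).

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def cds_matches_protein(cds, protein):
    """True if the CDS length equals 3 * (residues + 1 stop codon).

    ASSUMPTION: a trailing '-' or '*' in the protein is a stop marker,
    and the nucleotide record contains no untranslated regions.
    """
    residues = protein.rstrip("-*")
    return len(cds) == 3 * (len(residues) + 1)


# Toy records for illustration; the nucleotide record is wrapped on purpose.
toy = """>toy-TA
ATGCCT
ACGTGA
>toy-PA
MPT-
"""

recs = read_fasta(toy)
print(cds_matches_protein(recs["toy-TA"], recs["toy-PA"]))
```

To apply the same check to this entry, the full text from "Nucleotide sequence:" onward could be passed to `read_fasta` and the records looked up under `DPOGS207288-TA` and `DPOGS207288-PA`.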