DPGLEAN14791 in OGS1.0

New model in OGS2.0DPOGS208927 
Genomic Positionscaffold31:- 26153-56300
See gene structure
CDS Length13383
Paired RNAseq reads  6889
Single RNAseq reads  17230
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002502 (3e-09)
Best Drosophila hit  CG5591 (5e-146)
Best Human hithistone-lysine N-methyltransferase MLL3 (1e-77)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC000606 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC000606 [Tribolium castaneum] (0.0)
GeneOntology terms





  
GO:0030528 transcription regulator activity
GO:0023034 intracellular signaling pathway
GO:0003677 DNA binding
GO:0005515 protein binding
GO:0005634 nucleus
GO:0008270 zinc ion binding
GO:0006911 phagocytosis, engulfment
InterPro families





  
IPR001965 Zinc finger, PHD-type
IPR001841 Zinc finger, RING-type
IPR018518 FY-rich, N-terminal subgroup
IPR011011 Zinc finger, FYVE/PHD-type
IPR013083 Zinc finger, RING/FYVE/PHD-type
IPR019787 Zinc finger, PHD-finger
IPR003888 FY-rich, N-terminal
Orthology groupMCL10236

Nucleotide sequence:

ATGGACGGACAAGACGAAGGGGTGTTAGATGTGGATTTATTAGCTGACGATGGCTCTGAT
GAAGAATGCGAAGCTCAGCCGTCTTCAGACTTTTATGCAGGACCCGCGTCTACACCGACG
TCATGCGCGTCATCTCCGCGGGGCGAGGAGCCTGCTTCTCCTCTCGCCGTGGTGTCGCAA
CCAACATTTTTCCACCACCAAACTTATACTCGCCCGTTTTTCACATCGGGAAGACGTGGA
CCAGGACGGCCGCGAAAAGAGGGCGCGAAATTAGCCCGAGAAGGAAAGATTGTCAGAAGA
AATAGAGGCAGTGCTGGATCGGTAAGAGGAGCTAAAAGGCATCGCACTAGTCGTGATGAG
GCTCTTGATGATATGATGGATGAAGATGATTTCACTATGCCTGCACCCGAAGAACCCCCC
TATGCACCTGAAAAATGGCCAGGGAAATTGTGTGCCTTATGCAACTTAAGTGAGCGTAGT
CAACTTGGTCAAGGAGAAATGAGGCAAATAGTTTGTAATACAGAAAGTGAAGGTGGTACC
ACACCAGGGGTCTCCAACTCTGGTGGAGCAACACCAACCAGTATAACAACGCCTAGTACA
CCAACATTTCCTTTACCGCCTGGTCTTACTTCGCCTTCACCTGAAATGTTAGATTCAAAT
CAACCACAACAATCTCTGCCTTTGAGTAGACGTCAAAAAGCATTTATGAAGTGCAAAACT
CCCCTTTACAACATGGAACATACAGATGAGTTATCCATTATTGGTTGGACTGAATCATTA
GAGCTTCCTGCAGTAGTGTCATCGGGAATGTTTTATGTACACCGGTGCTGTCTTGAATTC
AGTCCTCCTTATCAAGATCAAGTGGCGATAGCATGTAATAATGATGATGACAAAGAACAG
CTGGAGGAAGCGAGGATAAGAGGAATTGTGATTGCTGCTTTGGGAAGAAAATGTGCCTTT
TGTCAAAGACATGGTGCTAGTATACCCTGCAAGATGAGTTGCAGTAAATATTTTCACCTA
CCATGTGTTCTTGCATCCGGCGGTTTCATGGATTTTCAAACAAAGTCATCATTTTGTAAG
GATCATTTACATCAAGCACCATTAATATGTACAGCGGATATAGACTGTCGGACGTGTCGC
ACCATCGGCGACATGGCGAATCTGATGACGTGCGTCACTTGCGGTGGACACTACCACGGA
ACATGCGTCGGTCTCGCTCAACTTCCTGGCGTGCGCGCCGGTTGGTCGTGTAGATCGTGC
CGCGTGTGTCAGGTGTGTCGCGGCGAGGCGGGAGGCGGCGCAGGCGGCGAGGCGCGGGCC
GTGGCCTGCGAACACTGCGACAAGTTATACCACGCGGCGTGCTTACGACCCGTTATGGCC
ACTGTACCCAAATACGGTTGGAAGTGCAAGTGCTGCCGCGTGTGTTCGGACTGTGGTGCT
CGCTCACCGGGCGCGGGACCATCGTCGCGATGGCACGCGCATTACACAGTGTGTGACTCT
TGTTACCAACAGCGCAATAAGGGCTCGTGTTGTCCGCTATGCCGCCGCGCCTACCGCGCT
GCCGCTTATAGAGACATGATACGCTGTTCCGCCTGCCGGAGATACGTACACGGGATGTGC
GATCCTGAAGCAGAGCCTCAGAACTACAAGCAAAAGAAAGGAGAGAATTCATCATATGAG
TACACATGCCCTATATGTAAAACTAATCAGCAAATGGCTGGTTCTAAATCAGGCTCATTT
GATGAAGACACAATGGCTTCGACATCACAGGATTCATCATTTGGTGATGAAAATACACAA
GAGCAGGATCCTCTTGCTATTGAGACTAAGCCTGACGTTGGACTCGGTAAAGGAAAACCA
TATACTGTATCATCCAAAGTAGCTAAGAAAAAAATAGGTGGATACAAGCTTAAGGGAGGT
TTCCCTCCAGCAGGAAAAATAGGTTTTCAAAAACGACAGAGATCCGTTTTAGATTTCGGA
AGGAAGAGAGGGTCTAAACCAAAGATGAGGGGTGTGTTCGGAGTCCCTGGCCTGGGCTTG
CAGAGACCTCAGGCTCCTGACACCAAAACCTCCGAGGACGATCCAGGAGTTGAAAATAAA
CTAGTCTTATGTTCGAGTAAAGACAAGTTTGTGTTAACTCAAGATCTATGTGTTATGTGC
GGTGCTGTGGGGACAGACTCCGAAGGGTGTCTCATAGCTTGCTCCCAGTGCGGACAGACA
TATCATCCTTATTGCGTTAATATTAAGGTGTCTCAAGTGATAGTGAGTCTCGGCTGGCGT
TGTCTGGACTGTACGGTGTGCGAGGGCTGCGGCTCCCGCGGAGACGAGCCGCTGCTGGTG
CTGTGTGACGACTGCGACACGGCATGGCACACGTACTGCGCGCGCCCCGCCCTGGCCGAG
GTGCCGCGCGGCGCCTGGCGCTGCGGCCGCTGCAGACGATGTCTCGTGTGCGGCACGAGA
GATACCGCGCTGTGGTGCGACAACTACACCGAGTGCGCTCCGTGTGCGTCCCTAGTGATG
TGCTGCGTGTGTTCAGAGCCGTATTCGGACGGGGAATTAATAATCCAGTGCACCGCTTGT
TCGCGCTGGCTGCACGCGGCTTGCGACTCCATTCGGTCCGAGGCGGATGCGGAGACGTGC
TGCCGCGCGGGCTACAAGTGTACGTGGTGTCGCGGACGAGAGGTGCCCCCGCCGCACGCT
CTGCCGGGCTCCCGCGCGCCCAGTCCTCGGCCGCGTCTTACACCGCACGCCCTCGGCCTC
GCGGGCGAGTACTACGTGGACGACGTCTGTCTTTCGCAGCGAGGAGCCCACCACATGAAG
CAACTGGAGGCTGAGCTCGGCATCACACACACGAGACGCAAACGACGCTTTAAGAACGAC
AACACCGAGAAGGATGCCGAAATAATGGCATCTATTGAAACAGTGGTTCAGAACGTAGAA
GGGGAGATGGGAGACGACACGAAGCCTGACAGCACCGCGGAAGTAAAAGAGGAACCTGGC
TTGTCATCGACCAACTTCAAGGAAGGCATTCTATGGAATGTTGCAAATGATGGACCAGTT
CCAGAAGGCTTCACTATTTACACAACCGACCTCGGACTGACGGTTCTTCGGCGAAAACGA
CAAAGAAATCTTAACAAGCTCGGCATAGGGGGCTTTGTCGTTCGACAGAGACAAACTACA
AAAGCTCAGGCTGATGAGGAGAAGGATGGAGACGGAACACAGGGCTCTGGAGAGTCACCC
AGCAACAAGCGCAAGCCTCGTCGAAAGCCGCGGTCTAAGTTGATGGAGCAATTTCCGTCG
TACATGCAGGAAGCGTTTTTCGGGAAAGACTTGCTCGAGCCCGCGAAACCTGCAATGGGT
TCTACCGGCTCGGCGGATGGAGAAGGTCTGTTGGGTGATCGAGACAACTCCGACAGCGAG
TCCGACGACGTATTGTCAGCTCTACATACGTTTAACGAACACGACACCACCGTTTTTATA
ACCCTCAATACAGAAGAGACGGAGCTCCTCCAGTCGCTTAAACCTAAGGACGAAAAGGAC
GATCCGTCCACATTGAACCTGGAGCACGTCAAGATAAAGACTGAGTGTGAGGAAGGTCAA
CTGAAGCACACCGAGGACTCGACGGCCCTCAAGAATGCGATTCTCGGCCCGCAGCCCAGC
CCCGACCACGCCGACCACGCGCCGCACCCGCCCGCCAACGAACAGAACACCGAGAGCGCT
CCACAATCTGCCGTAATGTCCGCTAAAGAGGAGCTGTCCCTCCTGGGGGTGTCCCTGGAC
ACGCTGCCTGACATGGACAGCAACGACGTTGACGAGATATTCAAGGGAGTGCTCACAGAT
GACTCGCAGGAATCTCAAGAATCATCAGTTTCCTATGTGAATTCTATGGCCGGCACCCCG
TACTCTCAGCAGCGGCAGCAGTTACAGAGTCCGATGGATTATGCTTCTCCTTACCATACA
GAATTCGGTAGTAGTGTTGATCTCTCTCTCATGACGAGAAATGTTTCGTACAGCAGTTCT
CTCAGTCCGCTGTACGAGTGCGGTTCAGGCGCCGGCTCGTGGCCCGACGCGCCCGCGCCA
CCGCCCTCTTATAACCAACGCAGCGCCGATAAAATGCGTGCCGACGAAAGGCAGCCCGCC
GGGCGCGACCCCGCCCCCACCCCGACAGCCCTGCCCGGACCCGCCCCTCACCCCCCGGCA
CACCCTCCTCCCCACGCTACTCCTCACCGTTTCCCTCTGTACTCCACCAACGAGGACATC
AACCGCCAGCTGCGGGACCTGCTGCAGCGACATCCCGAGAAGATGTGGCCTGGACCTTCT
GGCGAGGAGTCCCCAGCGGGGTGTGTGGAGGGCGGAGGCCCGTCGTTCCGACCGCCGCTG
CCGGTGCCGCTCCGACCCCGCGCTCCCTTACATCACGCGCATAGACCAGAACATATGATC
ATTCAGCATCGCCTGGTGTTTTCGGACGGCGGTCAGATGCAGCAGAATGTCCAGAACTCC
TCGCTGCAACAGATCATGGAGCAGAAACAGGTCACCCAAGGAAACATGAGCAGTAATGAC
AAACAAGACAGCGGGCCAGAGCAGGGAACCGATGAGATGGACATTGGGGACATGGACAAG
CTGGAACAAGACGCCGGGAACATTGGAGAGGTTGATATCTTGACTGGTCTCGGAGCGGAG
GACGACGAGGAACTGCTGGAGTCGCTGACGGCCGAGATCGGGGAACAGTTCAACATACTG
GAGTACGCTGACCCCGAGCTGGCCGCGCTCAACGACGAGACGCTGCTGGACGGACTCGAC
CTGCCCGACGACGTGCAGCACCACAAGAGGGGGAACACTGATGACGCTGATCAAAAAGGA
GACAACAAAATGGACAGCGACTTGCAGACGGCAGTTAAGAATGTAGAGCCGAAAACTGAG
ATGAAAAGTGAACCGGAAGTGAAAACTGAATTGCAAGACGTCAAGCCGGAGATTCCGGTG
TCATCGAGCATGGGACAAATGTTCCCGGACGGCGTCCCAAGAGTGATCACGCAGAATCAA
CAGATAGCTTTACACTCGGCCATGCAAGCCATGCAGGCCATGCAGGCGGTGAAGGCAGAG
GTGAAGGCGGAGGGGGACGACAAGCAGGACGGGGACTGGCCGCGCCCTGTTTTCTCACAC
ATGTACGGTCTGCAGCTACCCGCACAGACACAGGGCGGTGTGAGTGCGAGCACTCAGAGT
ATCGTGAACGCTATGACCGCTCAAGTACAAGCAGCGCTGGCCGCGGGTCGCGCCATCGCC
CCCGGGACAAGACTGTTGGGGGCCGACGGGGCCGTCGGGGTCGTCAGGCACGATCGCTCT
GTGGCGCTGGCTGTACTTCCCCATTCAGCCGCTCGTGGTATGGTGAACGCCGCTCGTCCC
CACGCGCCGCCGCCGCTGCGTCAGGACACGCCGGTTGCCGGAATACATAGTACAGTCCTC
CCAGCAGGCGCCGCTCGCAGTGCTCCTCCGCCGCCATACCCGGGACTGCGCGCTCCGCCG
CCGCCGCCGTACCCGCAGCTCCAGCAGAACGTAAACCAGGAGCAACCGTTGCTTCTGGAG
GAGCTGTTGGAGCAGGAGAAGCGCGAGCAGGAGCGGGAAGGGGGCGACTGGGGAGCGGCG
GGGCCCGGGCCGGCGCCCCCAGCGCCACCCGCCGCTGCCGCTGCCCCCACAGTGCCACTT
TTCGCTGGTTCTGCCCCACCCGCACCTCCTGCGCCTCCAGACCGTGCGCTCTCAGACGCC
GATCACCATGCTCGCCTTGTCTACGAGCAGTGGCTCAAGCAGTACAACACCTTCGCGGCC
GACCAGCTGCGCTACTATGAGCAGGAAGTGCAGAAGTTGCGCAAGATACGCAAGTCGCTG
AATAGTAAGCAACGTCAATTGAGAAAATCCGGAAATGAGTTAATGCCTAACGATGCTGCT
GAACTCCAGCGGGTATCGACAGAGCAGCAGGCGCTTCAAAAACATTTAGAAGCTGCTAGA
AAACAAGCTAGACAACATAGTATGCTTATACAGGAGTACGAAACAAAACAGCGTCAAACA
AATCCACAGCTAGCCCAACAGACCATTATAAATCAATCACAAATCGGCAGTCAAACAGTC
TTTGCCCAAAATCAAAACATTACACAAAATCAACAAATGCAACAACAGACAATCAATCGG
ACTATGCAAATAGGCCTCCAAGGGTCCCAAATACAACAGCAAACGTTAATACGAACACAG
CAAACTGTTCTCGGCTCTGACGGACAACCGATGTCGCAGCAAAGAATCGGCCTTGTCACC
TCACCAATGAATACACAAGGGAGGATCCTACAAGGTGGTCGGACGCTTGTAATCGAGCAG
GCGGGGGCACCACAGCAGCAACTAGTCCGACAGCTGTCCGGTGTACAGGATAGGCCGCAG
TCGGTCGGAGGCATCGTCCAATTCAGCCAGCAGCAACTTGGTGGCGTGCGAGGACCTATG
CCTGCTGGTGGGCCCGCTCCGAGAACACCAGCACGTCCAGCGATGTCGCCACTTCATGTG
CAGTCACCTCACTCGCAGTCCCCTCATCACTCGCTTGCGCCGCAGTCCCCTCTGCATCCC
ATGCAGTCACCGATGCAGTCTCCACTGCACTCGCAACAGTCGCCGCTTCACCACACGTCA
CAGTCGCCCTTGCATCCATCGACGCAATCGCCTCTCCACTCGCAACAGTCGCCGATGCAT
GTGCAGCAGAATTCTAATATGACACAGCAGAGTCCGATGCATCCACAACAGAGTCCGATG
CATCCACAACAGAGTCCGATGCTCCCACAGCAAAGCCCGATGCATCCTCAACAAAGTCCT
ATGCATCCCCAGCAATCTCCTATACACCAACAGTCACCCATGCATTCTCAGAATCAAATG
ACTACGCAAAATTCTCCGATGCATCCTCAGCAAAGTCCTATTCATCCCCAACAAAGTCCT
ATGCACCCGCAGCAGAGTCCCATGCATCCACAGCAGAGTCCGATGCACTCACAGCAGAGT
CCAATGCATCCTCAACAATCTTCGATGCATCCTCAGCAGAGTCCAATGCATTCCCAACAA
TCACCGATGCATATGCAACAGTCACCTATGCACACACAGCAGTCGCCGATGCAATCTTCG
ATGTCTCCGCAGCATTTCTTGCAGCAACTGCAAGCTCAAAGGACGAGTCCGCAATTGCCA
ACTCCCAGTCGTAGTCCACAGGCTGACTACACCTCGGGAAGATTTCCCCGGCCAACATCG
GGTTGTTCTCCACTACCGACCCGTTTTGCAAGACCGCCTCATCACCAGCAAGGTATGAGA
GTAGGAGTGCCTTTCGTTAATCGTCAACAAACATCACCACTAGGAAGTCCGCAACCTCTT
CCATCACCGGGAAATAATACCAATGAAATGACACGTCAACAACTGCTTCAGCGTCAGCAG
TACAGCCCTATGGGAGGTGCGCCATCATCTCCATCGTTATCTCGTTCGCCGCACGTTGCA
CGGACGCCGACACCGTCTGCGTCACCCGCCGCCTCTCCTGCGCCGCATCACGACCACGGT
GGCGGGCACCATCACCACGTAGCTCCCATTCCCCCTGGCTATAGATACTTTAAGCCCGGC
CTGTACGGCGGAGCACCCGTCTGGCCTAACAAGGACGACGACAGGCGACGGGCAACTCAC
ATAAACAAAGTATCCATTCTGAAACGCCGAACCCCACCCAGACCTAGATTCACGGGTAAG
ACGGCGCCTTGTACCGACCGCTCGGAGGACACTGGAGATTCGACTGACACCAGGAAATAC
GTTCTCTATTCTTCCGATAGCGTTGAATATTCTACTACTTCGGACGACTCAGCACGTATG
AGCTTTTCTAAAGACGATGACGATGACGATATTCAAGAATTTGTCGAACACTTGGATGAT
GATGACATTGTTGTAGTTGAGCCTGGCGCTATCTCTCCAGAGGAGCGAATAGCCACAGAA
GAAGATTTCGAAGAACTTATCGACAGTGGGAAACCAGAAGATGAGGTGGAGGAAGAAGTT
TCTCAAGAAAAGCCCATTACGCGGACAGAAACAACGACTAAGACTGTCGCAGTAATAAGT
CTAGCACCAGCAAAGCAGCCGTCATCTAATATGCAGCAATATAAATTACTAAATCCCTCT
CAAAGCTCTGCGTTAACACGCCTACCGGTATTAAGTGCTACAGTGATTTCACCGCACTCT
AACGCAAAAATTTTAAACGAGGTCTCTCAAGCTACGGTTGCATCCGTTTCTATCGCTAAT
CACACTATATCAGTTCCAGTATTAAAAAGTCTTTCTGTCCCAATCGCTAATGTCACCGGT
CATGCTAGCGGTATGGCAAAATCACAAATTAAAAAAGTTCCCTTGAATATAAGTAATTCA
CCGATGACAGTAAACATGTCATCCTCTCTATCAAAGCCAATAAGCTCGGTTATAAGTGTA
ATATCATCGAATGTCAAAAATGCTACGGTTAACCCGGTGATAACGTTGGCTACATCTAAC
ACAGTTGCTGGTGCGACAAGAGTTCCAGTACCTTTACTGACGCAACATCAAGTTAAACAG
AAAATAGTAAAAACTATAGAAAAACCAATGGCTTTATCGTTGTCTAACACAAAATTGTCA
AGTCTTTCGGTAACGCATGACGCAACATTACCAGCCAAAATTTTCCAAGATGATGTCGCT
TCACCTGATAGTACTGTAGCAACGGAAAGTAATTTAGAAATGCAAAACCCTACTTCATCA
CGACATATAATGTCGATCTCGCCACAAGATCACTCACAGAAAGAGCAAAAATCTAGCATT
TTAAAGGAGTCAACTAAAGATATCGATATAGAGTCTGAAATAAGTCAACAAACTGAACTT
CCTCACACAATTTGTCTCTCGGACAGAACCCCCCTTTTACTTGGCAAAGCGGATATTAAA
AGTCCCGACCCCATACCCGAAAAAATACCTGATAATATCATGGAAGGTGACGATGAAGAT
AAGCCTGAAGATGACCTTCAGAGTCACCTCCTACTATCATATAATAAAACAATACAAGAT
CAAAGTATTCTAAATCCGCCTTCCACTGATAAAAAGCCAGAAGAAAATGTGGTAAAAACA
ATAAAAGCCATTGCTAATCAAACCAAGGAAATGGTCATAGACTCTAATATGCAAATTACT
TCGGATTTAACACAAGATTCAGTGCAGATTTCGATACCATCTCCTACACCGTCGCAAGAA
CGATATTTGAACGACATTACTATGCGTGAACATTCGGAAACGGTTGAAGGCAACCAAAAA
CATCTGCGAACTATAATGTCTTCGCTCAATACTAACACTTCCAAAGTTGAAACACAAACC
GTTATGAGGAAATCTAGTGAACCTACTACTCCAACGCAAGTTAATTTTGAGAATTTGCTT
CCATCGTCTAAAGTTGAAGTAACAACTTCAAGACCATCGCCGATTCAAAGAATGGAAAAA
CCTCAAACTTCCACTTCTTGTTCTGACCCAGTACCAATGAGTGCTGCACAAGTAATAGGA
TCAAGGGTCAATGCGTTGTCTTCTGTGGGACAAGTTCGGAAATCACCAACAGTTTCGCCT
ATTAGTTCACCAGTTGGGATAGGGAATCAAAACCTTATGAAATCTCCAGCACAATCTCCT
TTAATAAATAATCAATCGTATAATGTAAATGATGAACCCCAGACAACGTCTATGTCTACG
CAAATATTACAAATGCCAGCCCTATCTAAAATACATAGTAGTTCATCAGTACCTACGTCA
CTTATGCACGGAAACAATATTATTACAACTAGTGTAAGCAATTTTCAAAAGTCACAAACT
TTACCTTCGAGTATTTTAGGCCATACGTTATTACAACCGACGAGGCAAATAAATACAAGT
AATTTACAGTTCAATCCACAAAGTATAAGTTCTAGTCAACCGCCTGCCTTAGTCATGACT
TCGAGGACAATAATTGGTAACAAAGAACCAGCGCCCAATGTTACCGTTCGTACTCACAGC
ATTGTCAGCCCTGGAATGAATCAAATGCAAAATAAAGTGTCACAGGGCTCAATTAACTTT
ATAACATCTTCGAAACTTTTACACACACAACTGACATCACCATTAAAGCGATCAAAATCG
ACAGATGAACCAAAGAGTGAAGTGATCGTAGGTCACATTCAACCTACAAAACGACACAGT
GTTGAATCAGTGACGGTTAAATCTGAACCAATGGAAACTGAAGAATCTAATAATGTGCCA
ACATCAACGCCTGAAAGTTCTGTAAATAAAATAATACAGCAAAACGTAGTGAATCAGAGA
AACGATGAATCCCAGAATGTGTTACTGAAACAATTACTACAAACTACTACAACTTCATCA
AGTGTAGTTCCTCAAAGAACTATGACAATCCAAAGAACGGCACCTGCGCTAGGCACTATT
CCTTCGTTAGAAGCACAACTGGCTAGACCATCGATACCACCTCCCACGTTATCTCTATCA
CAAGAGGTCGATATACCTAAAAATTCACCCCGACATATAACATCTGTGAGTTCCCATTTC
GCAACTCGTTCAATGCAGTCAACTATGTCTACATCTACTTTATCTCCTCTCATACATACT
ACAACGCAATCTTTAATGGACGTAAGAAAGCAACCGATGAAAATAATGAATAAGGAGGAA
ACCACACCGCTGCCAGAGTGCTCTCCAACAATGAAAACTTTGCCAATGTATCCGATGAGT
ATAGATAACCAACATCAACCTATAAAGAAGGAAGTCGCACCACCTCAACAAAGTCCCGTA
CATCGTCCATTTACTCCCTTAGACGTAAAGAAAGAATTGCTAGATGAAAGCTCTCAGCAA
TCAGCGACTTCTGGTGTATCAACAGCTTCAGATCAAGGAAAACTGGAACAACCGATGAAA
GAAGAATATCCTGAAAGCATTAGTATGGAGCACTCATCAGACATAAACCCGTCGGAAACG
CCTTCTGAGGCTAAGAAACGTAAGCGTCGAGAATACCAACAAAAGAAACGTAAACAAATG
CAATTGAACATGAAAGCTGCTGCCGAAAACAACATGAGCTCTATCAATGCTAAAAAGAGA
CCACGTAAGGGCTCAAGGTATGAAGAGGACTATGATACGTTTATAGATAATTTGATGGCG
CAATTGAGACTACTACCGCCTATGCAAATTCAAGAACCCTCTTTAACTACTAATTTTGCT
GTTTGTCCTTTATTTGGTTCCGGTGATTTAACTAAGCTCAAAAATCAGGACTATGATATT
CTTAAGGGTGACTTATTAGGGGAGTTCGGTAATGCTAAAATTCCTAACGTAGCGGATTAT
TACAATACTAAACCATTTGGCGATGAAGAACCCTTACCAGAGAAATCTACGGCTTCTACT
CAAAGAGGATTTTACGATCAAGAATTTCAACCCATTATATTTGATGAAGATCCGGAGGAT
AAAAAACTTGACTTTATTTGCAAAGAAAGAGATACGGAAACACCTGATACTATTGTTAGT
TGTTCAAGTCCTGAAGGTTATGAAATTGAGCTTGACGATAGTTTTCCTTTTCTTAAACCT
ATCGATGATGATGATGAAGATGAAGACGAAGAAGCATCGCCATCCGGAAAAGTTTCCCCC
ATCATACCACTTATAGCTCCTATTCCTATTAGAGTTAAACCTTGTTCACTGTATCAGTTA
AAAGACGAAGAAGATAATCAAAAATCACTTAAGGGCTTGGATTCAGATGCTCCTCTCAAG
ACGAAAGGGGACGACTCTCCTAGTAGTACAGAGAGTAATGAAAACGTTACAGTAACTCTT
ACTCTTACATCTGGTGCTGCGGAAGATATATTAGGTGTATTGAAAGAACTGGCTGGAATA
TTGCATATCCCTCCGCCGACATCGTATCAAATCATCGAACGCACGGCTACCCCACCATCA
CACAAGTTGGGTCTATATAGATCTAAAGGCAAGGATGGCAAAGAAGGAACGCCTATTGAC
ATACAAAGTATATTAAACGGAGCTGCCAAATTCTGTCGCCATTGTGACGTCGTTATTCTA
GATTCAGTGGTGAGAGCGAAGGCATCAGAGTTTCCTCTACTAGCAGCAAACAAAGGAAAT
GCTGAAGAAATTCTTTGTGATAGTGATTCGGAGCTATATTTCTGTAGTACGCAGTGTTAT
GAACGATTCGCCTGGCGACCAACTAATATTATATTGGACGGCAAAACTAAAATAGCTAAT
GTTAAAGAAGATAATAAATCGGATGTCGATACTAATCTTTCCAAAGACAGAGACGATTTC
GACACAGCATCCACTGAAAGCATGGAAACTGACGATTTGGATATAAAACCAGACATCAAA
GACGAAAAGATGGATTTGTCTTTCATAGACTCCTTAGATAACGACGAATTGATGAAAGAG
GTCGGTGATGACGTCAGCGCTTTGAACGAAGACTTGAAGAGTATGGAACAGGACGAAAAG
AGTAATCAAAGCGTCGAGACGGAGAAGTTCAAGGGAATTCGGTACAAAACCTGGTCACCC
GGCTGCATCGGACCACCCATAAAATATAAACGACCTACGGATCGAGAGCTCACCGAACTT
GTGTTCAGGACGGGAGTGGCCATCATGCCGGTCACCAATGAAGACACCCGGAAATGTGAA
CTATGCGGTATACAAGGAGACGGTGTGGCGGACGGTGTGTCGAGGCTGCTCAACTGTGAC
GTCGACAGATGGGTTCACTTGAATTGCGCCCTGTGGTCGGAGGGCGTTTATGAGACAGTC
AGTGGAGCCTTAATGAACGTGGACACGGCATTGGTAGCTGGCTCTAATGCGACGTGTGCG
GTGTGTCGCCGCCTAGGAGCTACCGTGCGCTGCTTTAAAGTCCGCTGCGGCAGTGTCTAC
CACTTGGGGTGTGCTGTCAAAGACAATTGCGTTTTCTACAAGAATAAAACCGCTTTTTGC
GCATCTCACGCCCCTAAGAACGAAAAAGACAACGAGCTTACGACTCTCAGTGTCGGTCGT
CGGGTGTACGTGTGTCGCGACGAGCAACGCCAGGTGGCGTCTGTGATGCTGCACTCCGAC
ACCAACCACCTCATCCGCGTGGGAGGACTCATCTTCCTCAGTCCTGGACATCTTCTCTCC
CATCAACTCGCAGCTTTCCATACGCCAAATTACATCTACCCCATCGGCTACAAAATCGTT
CGTTTCTACTGGTCGATGCGTCGCGCGAACAGTCGCTGTCGCTACGTGTGCTGGATCTCA
GAGGACGACGGCCGACCGAGGTTCCACGTGCGAGTACAAGACGAGTCCAGGCACGAGATG
AGTGCGCCCACACCGCGCGCCGCCTGGGCTGCCGTGAGTACATACCGACTCTGTACCGTA
TGA

Protein sequence:

MDGQDEGVLDVDLLADDGSDEECEAQPSSDFYAGPASTPTSCASSPRGEEPASPLAVVSQ
PTFFHHQTYTRPFFTSGRRGPGRPRKEGAKLAREGKIVRRNRGSAGSVRGAKRHRTSRDE
ALDDMMDEDDFTMPAPEEPPYAPEKWPGKLCALCNLSERSQLGQGEMRQIVCNTESEGGT
TPGVSNSGGATPTSITTPSTPTFPLPPGLTSPSPEMLDSNQPQQSLPLSRRQKAFMKCKT
PLYNMEHTDELSIIGWTESLELPAVVSSGMFYVHRCCLEFSPPYQDQVAIACNNDDDKEQ
LEEARIRGIVIAALGRKCAFCQRHGASIPCKMSCSKYFHLPCVLASGGFMDFQTKSSFCK
DHLHQAPLICTADIDCRTCRTIGDMANLMTCVTCGGHYHGTCVGLAQLPGVRAGWSCRSC
RVCQVCRGEAGGGAGGEARAVACEHCDKLYHAACLRPVMATVPKYGWKCKCCRVCSDCGA
RSPGAGPSSRWHAHYTVCDSCYQQRNKGSCCPLCRRAYRAAAYRDMIRCSACRRYVHGMC
DPEAEPQNYKQKKGENSSYEYTCPICKTNQQMAGSKSGSFDEDTMASTSQDSSFGDENTQ
EQDPLAIETKPDVGLGKGKPYTVSSKVAKKKIGGYKLKGGFPPAGKIGFQKRQRSVLDFG
RKRGSKPKMRGVFGVPGLGLQRPQAPDTKTSEDDPGVENKLVLCSSKDKFVLTQDLCVMC
GAVGTDSEGCLIACSQCGQTYHPYCVNIKVSQVIVSLGWRCLDCTVCEGCGSRGDEPLLV
LCDDCDTAWHTYCARPALAEVPRGAWRCGRCRRCLVCGTRDTALWCDNYTECAPCASLVM
CCVCSEPYSDGELIIQCTACSRWLHAACDSIRSEADAETCCRAGYKCTWCRGREVPPPHA
LPGSRAPSPRPRLTPHALGLAGEYYVDDVCLSQRGAHHMKQLEAELGITHTRRKRRFKND
NTEKDAEIMASIETVVQNVEGEMGDDTKPDSTAEVKEEPGLSSTNFKEGILWNVANDGPV
PEGFTIYTTDLGLTVLRRKRQRNLNKLGIGGFVVRQRQTTKAQADEEKDGDGTQGSGESP
SNKRKPRRKPRSKLMEQFPSYMQEAFFGKDLLEPAKPAMGSTGSADGEGLLGDRDNSDSE
SDDVLSALHTFNEHDTTVFITLNTEETELLQSLKPKDEKDDPSTLNLEHVKIKTECEEGQ
LKHTEDSTALKNAILGPQPSPDHADHAPHPPANEQNTESAPQSAVMSAKEELSLLGVSLD
TLPDMDSNDVDEIFKGVLTDDSQESQESSVSYVNSMAGTPYSQQRQQLQSPMDYASPYHT
EFGSSVDLSLMTRNVSYSSSLSPLYECGSGAGSWPDAPAPPPSYNQRSADKMRADERQPA
GRDPAPTPTALPGPAPHPPAHPPPHATPHRFPLYSTNEDINRQLRDLLQRHPEKMWPGPS
GEESPAGCVEGGGPSFRPPLPVPLRPRAPLHHAHRPEHMIIQHRLVFSDGGQMQQNVQNS
SLQQIMEQKQVTQGNMSSNDKQDSGPEQGTDEMDIGDMDKLEQDAGNIGEVDILTGLGAE
DDEELLESLTAEIGEQFNILEYADPELAALNDETLLDGLDLPDDVQHHKRGNTDDADQKG
DNKMDSDLQTAVKNVEPKTEMKSEPEVKTELQDVKPEIPVSSSMGQMFPDGVPRVITQNQ
QIALHSAMQAMQAMQAVKAEVKAEGDDKQDGDWPRPVFSHMYGLQLPAQTQGGVSASTQS
IVNAMTAQVQAALAAGRAIAPGTRLLGADGAVGVVRHDRSVALAVLPHSAARGMVNAARP
HAPPPLRQDTPVAGIHSTVLPAGAARSAPPPPYPGLRAPPPPPYPQLQQNVNQEQPLLLE
ELLEQEKREQEREGGDWGAAGPGPAPPAPPAAAAAPTVPLFAGSAPPAPPAPPDRALSDA
DHHARLVYEQWLKQYNTFAADQLRYYEQEVQKLRKIRKSLNSKQRQLRKSGNELMPNDAA
ELQRVSTEQQALQKHLEAARKQARQHSMLIQEYETKQRQTNPQLAQQTIINQSQIGSQTV
FAQNQNITQNQQMQQQTINRTMQIGLQGSQIQQQTLIRTQQTVLGSDGQPMSQQRIGLVT
SPMNTQGRILQGGRTLVIEQAGAPQQQLVRQLSGVQDRPQSVGGIVQFSQQQLGGVRGPM
PAGGPAPRTPARPAMSPLHVQSPHSQSPHHSLAPQSPLHPMQSPMQSPLHSQQSPLHHTS
QSPLHPSTQSPLHSQQSPMHVQQNSNMTQQSPMHPQQSPMHPQQSPMLPQQSPMHPQQSP
MHPQQSPIHQQSPMHSQNQMTTQNSPMHPQQSPIHPQQSPMHPQQSPMHPQQSPMHSQQS
PMHPQQSSMHPQQSPMHSQQSPMHMQQSPMHTQQSPMQSSMSPQHFLQQLQAQRTSPQLP
TPSRSPQADYTSGRFPRPTSGCSPLPTRFARPPHHQQGMRVGVPFVNRQQTSPLGSPQPL
PSPGNNTNEMTRQQLLQRQQYSPMGGAPSSPSLSRSPHVARTPTPSASPAASPAPHHDHG
GGHHHHVAPIPPGYRYFKPGLYGGAPVWPNKDDDRRRATHINKVSILKRRTPPRPRFTGK
TAPCTDRSEDTGDSTDTRKYVLYSSDSVEYSTTSDDSARMSFSKDDDDDDIQEFVEHLDD
DDIVVVEPGAISPEERIATEEDFEELIDSGKPEDEVEEEVSQEKPITRTETTTKTVAVIS
LAPAKQPSSNMQQYKLLNPSQSSALTRLPVLSATVISPHSNAKILNEVSQATVASVSIAN
HTISVPVLKSLSVPIANVTGHASGMAKSQIKKVPLNISNSPMTVNMSSSLSKPISSVISV
ISSNVKNATVNPVITLATSNTVAGATRVPVPLLTQHQVKQKIVKTIEKPMALSLSNTKLS
SLSVTHDATLPAKIFQDDVASPDSTVATESNLEMQNPTSSRHIMSISPQDHSQKEQKSSI
LKESTKDIDIESEISQQTELPHTICLSDRTPLLLGKADIKSPDPIPEKIPDNIMEGDDED
KPEDDLQSHLLLSYNKTIQDQSILNPPSTDKKPEENVVKTIKAIANQTKEMVIDSNMQIT
SDLTQDSVQISIPSPTPSQERYLNDITMREHSETVEGNQKHLRTIMSSLNTNTSKVETQT
VMRKSSEPTTPTQVNFENLLPSSKVEVTTSRPSPIQRMEKPQTSTSCSDPVPMSAAQVIG
SRVNALSSVGQVRKSPTVSPISSPVGIGNQNLMKSPAQSPLINNQSYNVNDEPQTTSMST
QILQMPALSKIHSSSSVPTSLMHGNNIITTSVSNFQKSQTLPSSILGHTLLQPTRQINTS
NLQFNPQSISSSQPPALVMTSRTIIGNKEPAPNVTVRTHSIVSPGMNQMQNKVSQGSINF
ITSSKLLHTQLTSPLKRSKSTDEPKSEVIVGHIQPTKRHSVESVTVKSEPMETEESNNVP
TSTPESSVNKIIQQNVVNQRNDESQNVLLKQLLQTTTTSSSVVPQRTMTIQRTAPALGTI
PSLEAQLARPSIPPPTLSLSQEVDIPKNSPRHITSVSSHFATRSMQSTMSTSTLSPLIHT
TTQSLMDVRKQPMKIMNKEETTPLPECSPTMKTLPMYPMSIDNQHQPIKKEVAPPQQSPV
HRPFTPLDVKKELLDESSQQSATSGVSTASDQGKLEQPMKEEYPESISMEHSSDINPSET
PSEAKKRKRREYQQKKRKQMQLNMKAAAENNMSSINAKKRPRKGSRYEEDYDTFIDNLMA
QLRLLPPMQIQEPSLTTNFAVCPLFGSGDLTKLKNQDYDILKGDLLGEFGNAKIPNVADY
YNTKPFGDEEPLPEKSTASTQRGFYDQEFQPIIFDEDPEDKKLDFICKERDTETPDTIVS
CSSPEGYEIELDDSFPFLKPIDDDDEDEDEEASPSGKVSPIIPLIAPIPIRVKPCSLYQL
KDEEDNQKSLKGLDSDAPLKTKGDDSPSSTESNENVTVTLTLTSGAAEDILGVLKELAGI
LHIPPPTSYQIIERTATPPSHKLGLYRSKGKDGKEGTPIDIQSILNGAAKFCRHCDVVIL
DSVVRAKASEFPLLAANKGNAEEILCDSDSELYFCSTQCYERFAWRPTNIILDGKTKIAN
VKEDNKSDVDTNLSKDRDDFDTASTESMETDDLDIKPDIKDEKMDLSFIDSLDNDELMKE
VGDDVSALNEDLKSMEQDEKSNQSVETEKFKGIRYKTWSPGCIGPPIKYKRPTDRELTEL
VFRTGVAIMPVTNEDTRKCELCGIQGDGVADGVSRLLNCDVDRWVHLNCALWSEGVYETV
SGALMNVDTALVAGSNATCAVCRRLGATVRCFKVRCGSVYHLGCAVKDNCVFYKNKTAFC
ASHAPKNEKDNELTTLSVGRRVYVCRDEQRQVASVMLHSDTNHLIRVGGLIFLSPGHLLS
HQLAAFHTPNYIYPIGYKIVRFYWSMRRANSRCRYVCWISEDDGRPRFHVRVQDESRHEM
SAPTPRAAWAAVSTYRLCTV