DPGLEAN21663 in OGS1.0

New model in OGS2.0DPOGS212487 
Genomic Positionscaffold840:+ 43818-60181
See gene structure
CDS Length13164
Paired RNAseq reads  3121
Single RNAseq reads  7652
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010221 (5e-15)
Best Drosophila hit  trithorax, isoform A (3e-96)
Best Human hithistone-lysine N-methyltransferase MLL isoform 2 precursor (2e-77)
Best NR hit (blastp)  mixed-lineage leukemia protein, mll [Aedes aegypti] (2e-124)
Best NR hit (blastx)  mixed-lineage leukemia protein, mll [Aedes aegypti] (3e-118)
GeneOntology terms













  
GO:0008354 germ cell migration
GO:0005634 nucleus
GO:0003677 DNA binding
GO:0048096 chromatin-mediated maintenance of transcription
GO:0016571 histone methylation
GO:0035097 histone methyltransferase complex
GO:0051568 histone H3-K4 methylation
GO:0042800 histone methyltransferase activity (H3-K4 specific)
GO:0006355 regulation of transcription, DNA-dependent
GO:0008270 zinc ion binding
GO:0005515 protein binding
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0043565 sequence-specific DNA binding
GO:0005875 microtubule associated complex
GO:0007411 axon guidance
InterPro families









  
IPR001965 Zinc finger, PHD-type
IPR018518 FY-rich, N-terminal subgroup
IPR018516 FY-rich, C-terminal subgroup
IPR001214 SET domain
IPR003616 Post-SET domain
IPR011011 Zinc finger, FYVE/PHD-type
IPR013083 Zinc finger, RING/FYVE/PHD-type
IPR003889 FY-rich, C-terminal
IPR003888 FY-rich, N-terminal
IPR019787 Zinc finger, PHD-finger
IPR001628 Zinc finger, nuclear hormone receptor-type
Orthology groupMCL16872

Nucleotide sequence:

ATGGGAAGGTCTAGGTTCCCCGGTAAACCACTAAAATTCCACAACAGAAAGCGCATCAGC
GTTCTATCGGGCACTGTTTACCATGAAACTGCTCCAGAGAATAGTTCAGCCAGCCGAGGC
TCGGCTGCCAACGACTTTTCTGGGGAGAAAGAGGAAGAAGAGAAGAAGGAGAATAATAGT
GAAAATGATTCAAAAATAAATGGTGATAATAAGACCAACAATAATTTAGATAAATCAGAC
AATCAGGATAACGCTGAATCATCAAAGTCTCCTGTCCGAACAAGGTCCCATAATAAAATG
GAGCCTCCAAAAAACGATAAACCTAAACCAGTGAAAAAGACAGTAACATTTGGTACAGTT
GAAAGTTGTGAGGATATATTCTTGCCATTAAAGAAGGTTCCAATAAAACATCATGCACCA
CTCGTGCCAATTATAAAGAAAAAATCAGCACTCAAACAAACTGAATATTCTACATTCAAT
AGTATTTTGAAGCCAGCCAAATTAACAGATCTCAGCTCCAAATCCTCGTTGGAACCTCAA
GAAGGTTATTTAAGGAAGAACCTAAATCCTTTATCAAAATCAACAAATAAATTTTCTCAA
TTTAGTGATATCAGAAGTACTTCACCGCCTCCAAGAATCACATCGCCTATTGAAAAGAAG
AATAGCACTAATAAGTTTGAATTGCCCATAAGGAGTAGTCATTCATCAAGAGTTATAAAA
CCTAATAAAAGGTTTATCGATGGTGATGAACAATCATCACTTCCTAGTCTCTCCTCTGGC
AAAGTTCTTAAGAAACCAAAATTGAAGAGGCTAGTTTTCAACAACTTACATGATGACGAG
GACAGTGCTGAAGATGAGGAATTAGATAAAACAAATGATGTTGAAAAAACTCGAGAAAAA
TCTTTACCTACATCAAGTTTGTTTAAGGATAAAACAACAAATTCATTTTCAAACCTCAGC
AACACTGGGAACAGAAAGGTTATCAGAGACTTCTTTTCATATGACAAAGAACCTGAGTGT
AATCAAGATGAAAATTCGAATGTGGAGAGCCTTGATAGGATAAAGTCACCAGATGGTAAT
AAAGAAGAAAGATCAGGACATGCTATGCGAAAGCTCATGGATATTTCTGATGCAAGCTCA
ACTAGTACTAGTGCCTCTGAGTCAGAAGAAAGCTGGGATAGTGACAGTGGGGAAGCTACT
GGTGAGGAAGGTGAACGACAACCACCATTGAGTCCTGAGAAAGATAAGACATCTGCCTCA
CAGTTATTAAGCGGGAAGGTGATTATAAGGGAAGCAAGGTTACAACTTACTTCTACAGCC
ACTTCCACAACTGGGCTTGATGGACCCTTCTCAGCTTTAACGGCAGTATCATCCAATAAC
CAACCAGCAACAGTCACTTGCGGGGTTTGTGGAGCAGTGAGATTCTACAAATTTGTGAGA
CAGGCAAGAAAATTTGGAATATATTCCTGTGAGTCATGCCGAAAATTTATTTCGAAATTA
CTGAAGAAAGACAAAATGGTTTTACGAACATCACCTGTGGTTACTCATTGTGTGCGGGGA
GAAGGCAATTGTCATGTTCCACCAGTAGTTCGCTCTCAGCAGTGGAAATTACTCAAGTGT
ACTAATAAATCCCGATGTCCTGCTTGTTGGCTAAAGCTATGTCTCAAAGCTTTCCAAGTT
CCACCTCACATTAGAGCCACTTTGACTGCTATGCTGCCGTCTTATATGAGAAGTGGATCC
AAACCCTCTGGGCCCTTGACTCAAACACATCCATTAAGAATAGCATCCAGTACATCACCA
ACCCAGTCAACATTCAGTACTTTGGCTTCTGAGAGTGAAAATACTCTCTTTGCCGTAAAA
CCAGTTAAAAAACCGTCAGATGACACCGATAAAACTGAGGAAACTAAAACAGCTAAAGGT
CTCCAAACTAACGACGAATCCAAGGAAGATGTTAATGCGAGCAGAAAACGTCGTATTACT
CGGACTAAAGTTAGAAAGAAAGATAAAAGCGAGTCCAAACAAACTGAGGATACGAAGCGA
CAAAAAATGGAACTCAAAGGACCTAGAGTTAAACATGTTTGTAGAAGTGCCTCCATCGTC
CTTGGACAGCCAATTGCAACATTCCCCACTCAAGAAGAGAAGGATAAAGACAAAACTCAG
CAATCAGATGACGATTCAACCATGCCAGTTATTGACAAGGATAGAATTACGCCAGATTCA
TCTTGTGATTCAGAACACGATGTTGAGAATAAACCAGTACAAGTGGTACCAGAAGCCATA
ATGAAGGAAAGTGATCAAGAAATATTAGAACCCAAGAAGAGAAAGATTGAAGTTGCTCCC
AAACCACAAACCAAAAATGAGTCCAGTGAAGATGAGACTGTCATGGATATTGTTCCAAGA
AAACCTACTTTGAAACCTTTTACCAACATCACTAATGTACGCGGGCGGGCCCAGAAGTGT
GGACAAAACAGACCTGAACTTATTTGTGTCGATTTTTGGGAGAGCTATGATCCTGATGAA
GTTTGCAATTCTGGCTTTGGTTTGATTGGGACTATGCCATTCACTCTCGCCAAACTTTGC
TTCTTATGCGGCAGTGCTGGTCGAGAAAAGATGTTAGTATGCAGCAGCTGTTGTGAGTGG
TATCATGTGTGGTGCGCTGAGGAGGCTGGCGGTGGAGGGTCGTGGACTTGCGCCCGTTGC
GTATGGTGCGCTGCATGCGCTCGACCTGCTGCAAGACTGCGCTGCCGCTCCTGTGCGAGA
CCATACCACGCCGCTTGTCTGCCGTCTGCGCCACCTGACCATCGAAGTGACTGGCCACAG
ATCTGCAGTTCATGTCTAAAATGCAAGAGTTGTGACAGCAACAGGGTGAACAAATTTGTT
GGCAGCTTACCTTTCTGCAGACCTTGCTTCAAGTTACGTCAAAAAGGCAATTATTGCCCT
CTATGTCAAGCTTGTTACAGAGATAATGATTTTGATAGCAAGATGATGGAGTGCGGTTGG
TGTGCTAGATGGGTTCATGCAAGCTGTGAAGGCCTGTCTGGTGAGGGATACCAGCTTTTA
TCCGCATTACCACCATCTATTGAATACATATGCTGTAAATGTATGCCTAATGACCCTCCC
TGGAGAAAAATGCTAACAGAACATCTCAAAGGAAGACTACTACATCTTCTTAAATTACTG
GCCAAGAACAAAAAGGCTTGCGCATTATTGAAATTGACGCCTCATAAGAATACTCCAGTT
CCAAATAAACCATATCGATTGATGTCACCCCAAGCTATAAGAAAATTGCATTTTGACGGC
GGAGATGAGTCCATGAAGAATACATCTGAAACAAAAGTGTACAGAAATACCTCGAGGGGA
CGTAGAAATACACAAAATAAATTTGACGATCTAAATTCGTCCCAAATATGGCACTGCTCA
TCGCTGTTAAATGACGTAGAAATGGATAAAGAGGAAATTCCACACACCACACACAAAGTG
GTTAATCTCCAAGAACCTCTGATGCAGAATAAAGAGATATGCTTTTCAACTGGACTCTAT
AGCAACAAGGCAGATTTTAATACACCATGTGTGGAAGTACACGAGAATTACCCGCCACAG
AAAACAGAAAAAGCAACCACAATGTTGCCAACTCTTGATGACAAGTATGAACCAATATCT
GATGATGAAGAACCCACGAAATCTTTCTCACAAGCTCGAAGTGAACAAGACTTAAGTCCT
GCCATAATAAGACAAGCAAATATTGACGATAGTCTTGAAGACTTGGGCAGGTTAATTTCG
CCGTCGCTACTAGATATAAAGAAGAGAGTAAACAGTGATGAATACATCTCATTAAAAGAT
TTTAATCATGACATGAAAGAGGTTATTGAGAGGACGGCAAGCATAGATCTGACTTCAATA
TACAAAGATCTATTCTCTGAAACATTTCCTTGGTTTGATTGTGAAAATAATTGCCTTCAG
CCAAACTTGGAAGTAGAGGGCGAACAGGAAGAGACTGATTCAATTAAAGATATTGAGATG
ACGGATACAAAGGGGATTAAGAATACTACGCAAGTGGAACGAACTCTTGATCAAATAGTT
CCTAAGTTGGAATCTGTCAGCCAGAAGGAAGAAACGGAAATATTAGAGTTTTATCCAATG
GTTGACTCTCGTATTTGCGTTTTGTGCAAAACTTGTGGCGATGGATCACCGGCTATGGAA
GGGCGACTATTGTATTGCGGTCAAAATGATTGGATTCATGCCAATTGTGCTTTATGGTCT
GCGGAAGTGTTTGAAGAAATTGATGGTTCTCTTCAAAATGTTCATTCAGCAATATCAAGA
GGAAAAATGATCAAGTGTGCAGAGTGTGAAGTTAAAGGGGCTAGCGTAGGTTGCTGTGCC
AAGAACTGTAGTGAAACATATCATTATGCATGCGCAAGAAAAGCAACATGTGCTTTTATG
GATGACAAAAGAGTATTCTGTCCCACTCATGGAAAAGATGTACCAAAGAAAAGTCTACAA
AAGAACGCAGATTTTGAATTGACTAGACCAGTTTATGTTGAACTAGATAAAAAGAAGAAA
AGGTACACGGAAATAAACAAAGTACAATTTTTAATGGGTTCATTAACAGTGCACAGCCTT
GGCAAGATTGTACCTTCTGTGTCTGATTATGAAGATTTCTTAATGCCAGTAGATTTTTCT
TGTAGCCGTCTGTTTTGGTCATGTAAAAAACCATCTAAAATTGTACGCTATACAATTAAG
ACTAAACTCATACTGGCTGAACCGATGCCTGGATTCGACTTTGGCGTCAACATAACTGTT
GATCATTCTATGGATGTGCAAGTTGTAGACCGTGTCATGGCAGAAATTGGCGCATGGCAT
GAAAGCATAGAAACGGGATCAAGTAAAACCGTTCTTATGTTAAAGAGTGATAAGTCGCCT
ATTAAATTCGGAAAAACTGTTACGCTAGAAGATCTGGCTGTAAAACAAGTCGTGGAATAT
TTGCTAGATAATCTTTGCAAGCAAGAACAACAAGACGAAGAGGAACCGCAAAACACTGCG
GATCTATTGCCCCCTGAGGTTAAAGATGCTATATTTGAGGATCTAAATCACGATCTTCTT
GATGGAATATCCATGCAAGATATTTTCCCTAAACTCATGTCCTATGAGGATCTGGCAGGT
ATAGATTTAAAGGGTGAGTTCTATGGCGCTATAAGTCAATCAGACGACAAGTCAGATTCT
CGTTCTAGTGATTTTATTGATGAATTGCTTAATGCTCGTTTGGAATCTGGTAATAAAGAG
CTTAAGCGGAGCAAATCCGAAATGTTACTACAAAATCACAATCTAAAGTCTCTGACAACA
GGCCGCGGACAACAAAGATCGTCAAGTTTAACCTGGAACAATAAATTTGACACAAGCTTA
CTTTCTTCTACCATAAAGAGATGCAAGATGGGAAAGACGTCTACCGTGACTGGAAAACTG
AGTCCTATTCAGCGAATTATAGAAAATCCATTAGAAACGATAAAAGAAATAGAAAGAAGC
AGTGTAGACAAGAAATCTAAACCAGAAACTAGTACTAGCATTTGCAGTGAAACAGGCCAT
TCTTCTGGTCTCAATTACAGAACAACATCGCCTAAACAAAAAGAAGAGTCTAATGCAGAC
AAAAACGATAAATCTGAAGGATATTATACGGACGTCATAGATTTTTATACTAAAATCCAA
TGCAGCCCTATATCTCAATTGGATGGTGCAGCTGATTGGTCAGGCTCTGAGAGCAATTGT
AGTTCAAGACCAAGCAGTCCTCAAGATGACTTTACGAATATTCAACAACTAGACGGGGCT
GAAGATCATCATCCCTGTGACAATACGTCCAAAAATCAAAACAATCAGTTTCTGTACCGA
GCTAGTGGTACTGAAAGCACCTATACGGTGACCATCGCTGGTAACTCTATTAGTGGACAG
CCGGAACTTGTCATGAGACCTCTTGACGACCCATCTGGTAATTCAGGACAAGGAGAACAA
TTAGTGAGATGCGATAGATGTCAATGTACATACAGAAACAAGGAATCTTACGACCGTCAT
ATTCCATCGTGTGACATGATGTCAACAAGTGAAAGTGAAAGCGAAAATCCTAAATCTCCT
GAAAACAGAACTCTGACTACTTTGCAAAATGCTTTCCAAGGAGCATTTGCCCAACCTATG
ATTATACATGCAGGCGTCGTAAGTAACACCGAAAATACTGTAAAAGCCGAAGGTATTTCA
GCGAGAAGAACCAGCATAAATACTAGACAAATCACTATTAATGGCACCATAGTAGAAACT
CCATCACCAGGCCCATCAAATGAAATGACAATGACAATTACAGAGCAAATGAAATCTGGA
ACGGTTATATTTGATCAAAGCAAGACTACTAATGTTCAGGTATCTGCTAACCAACAAGTT
TTATCATCTCCACAGATAGTCGTAACGCCTCAAATGCTACTACCACCTAAAACAAGTACA
TCTGCCGGACAAAAAATGATGGCCCCACAAATAATCACACAACCTGGACTTTTACCATAT
AATATTTGTGTTAAACCTGGATCAAGCAATGCAAATATACAGCAGATTCAAGGAACTGAT
ATGACACAATTACTAGCTGGTGGCGGTGGTATTCAAGTGCTGACTTCCAACTCCAATCAA
GTCTTGACTATACCAAATCAAGGGATGATCCAAGGAAAAAATCAAGTTACAAACTTTCCA
AATATGGCAAGTGCGTCTTCTATCACTTTACCTGTAAACTCTATTCAGGGCATGCCTAAT
ACGTCGATAATACAGAATATTCAACCCATGATTCGACCGCAAATACAGAGTAACCCCACT
ATTGTAGTGCCAGCTATGACGGCCCAAAGGCTTGCTTCACCTAATTCTCAAAAGCAGATT
TTAATACCGAGTGCACCACAAAAGTCTCCTAAAAAGCAACCTGCACAGGTTCAGCCCAAA
CCAATATTTCAAGCAAATCGTGGAAGAGGTAGACCGATGCCAAAACCAACAACAATTAAG
AGGCAAACTAAATTGGAGAAAACATTTACAACTATTAATCAAAGTCCTTTAGGACCTGGC
AATACTGTTATACAATTACAGACGAATAATCAAACAGGTCAGCCGTCTATTATTGTACAG
CCGGTTGCTAACCAAAACATCATGTCGGCTTATGTAGAAGCTTTATCTCAGCAGCAGAAT
CAAAATATGCAATACATTGCGACTATTGCACCTCAAAGTGATTTCAAAACTGGACCAACT
CAATTTATCTCACAAGGCGGACTAATGCCACAGACATTCCAATTACAACAAACAGAATCT
GGGGGTCTTGTTGCTGTACCCAGTGGAGGTATACCTGTTTTATTACCACAGGGGAATGTT
GGAATTTTACCTCAGACGATACAGCAAGGAACTATTCTTCCCCAAGGTGCAATTCAGACT
CAAGGAATATTATCCCAAGGACCTAATGGTCATACTATCTTACCGCAAGCAATACACACA
CAAAATGGTACTACTATCATACCTCAAGGGACTATTCAAGGACTTACAAATGGAAATATA
ACGTCACAAACTGCGACTTTAATACCAAATAATACATTACAAAGTGCCGGAGCCACAGTG
GTACCACAAAGTGGCATAAGTAATGGTCAAACTACCATAATTCCGAATGCGCTGTCATCG
CAAGGCAATACATTAATATCATCCGCGCCTGGCACAACAATACTTCCTCAAGGGACAATT
TTACCGCAGTTTTGCAATGATCAGGTGCTTCTAGGGTCAACACCTACCCTTGAAATGGTT
ACGGATCCATCTGGATGCATGTATTTAACGACTACGCAGCCCGTCTATTATGGTTTGGAG
ACAATTGTTCAAAATACAGTTATGTCCTCACAGCAATTTGTATCGACAGCTATGCAAGGA
GTATTGGCGCAGAATAGTAGCTTTTCTGCAACGACCACACAAGTATTCCAAGCGAGTAAA
ATAGAACCCATTGTTGAGGTACCCACTGGATATGTTGTTGTTAATAACGTTGGTGATGCT
GTAGCATCACAAGCTGTTTCATCAACACCGAATGTAGTAACTGCAGTCCAATCATCACCG
CCGGCACTGAAAAATGTGAAAACTTCAGTTCCTAACTTAACATTGCAACCGTTACCTGAC
AAGCAGTGTAACAATAGACAGGCGCCTAAAGTAATTCAAATGAATGCACCACCTCTTGCT
ATCGGATCACAATCAGGAAATATAATGAATGGAAAGCAAGGATTGAACCCAATAACATCC
CAAATACATAGCATGACGCTATCAAACAATACTCAAAGTATGCAAAGAGTGAACATTTCT
AAACACAATAACGTGTCCATGTCACCAAATGTTACTCTTGTTTCGTCAACAGGACAACCA
TTGATGGCAATTTCTCAGTCTGTACCTAATACTATGAATTCATTTTCAATCCAATCGATA
CCTTTAGCACAACCAATAGTTTCAAGTTCTATTGCGCCATCAAATAAAAACACAATGAAA
ATAGAAAACTCTCATGTCATAGAAATAAGTGGAAATTTGCACGAGAGTTCAAATTGTACA
AGCTTACCTACAATAAGTGTCACACAGAGCATGAAATCTCCGACTCCGTGGAGGCAACAG
GATAATAAAACTGAGCTTCATAACATTGGACTAAAGACTTCTCAAAGTTCGACCATGAAT
ATTCAAAATAATATGATTTCAGGAAGACAGATGTCACTCTCTGATTCAGGAAAGGACAAT
CAAAATTGCATGATGTCAGTAAATAACAATGCCAACTCCATGAACCATCAATTTGAATCG
ACTTTAAATGTATCTCATCACTCATCGGCCCATACGACTTCTAACAACATGATCCAAATA
ACAGCTGGTAATATCTATCCACCGTCATCATCAATGAGTAGTAATTTTGATGCTGACACA
ACAATGAAGCTAAGCAAAATTAACGCTCTGTCAAAAACTGAAGGAGGACATATGAATTTT
TCACAGGAAATAATGGCATCACATTCACATCAAAATTCTCACGACAAAAATTTCCAAAAT
ATGGGTGCCAACGATAAACACAATTCTGATAACATGCACAACCAAATTAATATACATAAA
CTAAGTAGTTGTCAAGTGAATGGAAACATGAACAACTCGCCATCAATTAGTATCATTCCA
CCAAACATGGTAAGACCACAAATGCAAAACAATCAAAACTCAATAACAGTTGGCAATAAC
CAGAATCAAATGTGCTCTATTTCAAATAATTCACATGGGCAGGTTCAACTTATGAATTCC
GTCAACGCATCAGCAATGCAAAATGTCAACATGTCATGTCAAGAGATGGTGAACAATAGA
ATGAACATGTCAGTATCAAGTGAAAGCAACCAACACGAAATGACTAACTCCATAACAACA
TCCAACATGTCCCACGCTAATATACAAATGATGTCCCATGATAACAACCAACTTTCAACT
ACAAATCAGATACAAATAATGAACGGACAAATACAAAATAATTGTTTGGTAGAGAATAAT
AGCCAGATGCAGCTGAATCAAGGGAATCCAAACAGATCGTTCCATGATAATATTATCGGT
AACCAACAGGGCCACAATCAAAACAACATGCTGTCTAATCGCAGTACACCGGATACTAGC
AATTTAATGAACCAAAGTATGAACCTACACAACATGAATAATAACCACACCATGAGTCGC
AATGATCTCGGAGAAATAAATCATATGCATGTCAATCAAACCTTACCTAATATGAATAAC
ATGCAAAACAACAGAATGATGATGGATAATATGCAATCCCATGTAATGAACAATGTTAAT
CACCAAATGCATACAAATAACAGGCCTATGGATGCCATGCATAACCAGCAAATGCATTCA
ATGCAACAAATGAACCACCATATGAACAGCAACAACAATCGTTCACAGATGGACAACAAC
ATGCATAATATGAGTCAACAAATTTCAATGCAACAAAATAGACAAATAGATAATGTTAGT
CACCATCACCAACATCAGATGTCTAATATGATGCAGTTACAAAACAATCGTCAACATAAC
ATTGATAACAATTTAATGGGTATTCATCAACATGTATCTAATCAAATGCACAACAGACAA
CAGATGGATAATAACATGCATATGCACCATAATAACGTAAATATGGGTTTGAGCGGCCAG
CTAGATAACACCGGTGTAATGGCAAACATGCATGGCAATAGGCACGACAATACTCCAATG
TCAATGCAGAATATAAATGCCATGAATCAGATGCAAAACAATAGAAATTCAATGGAACAT
AATCACATGCAACATCAGATCAACAATATGGGCTCTATGAATAATATGAATTTGCATAAT
AATATAAACAATTCGATGCAAATGCCGAACACCAATCACCAGTCTATAAATAACCCAATC
ATGCATATGCCAAATATACCAGATATTAAAAATGAAATTCAGAATTCATGCGGTGGAAGT
TGCGCAAACAGCACCCAACAGTTTTCTAATAACAATCAGTCAACTTTCGAAATGTGTGCA
AATATGAGTTCTAATAACATGAATATGACGAACACTGGAAACATGATGCGTGGAATGACT
ATAAACAACACAAACATGTCGAATAATATGATAATAAATAATGTTAACAATAACCCATAT
GGATCCAACAACCAGAACAACTTAACTGGCCATGATATGCTTATAAGTTGCAACAGTTCT
ACGGAACACAATGTATCAAATCTGATGTCTAGTAATAGCAACCATATGATGATAAATAAT
ACTCAAACACACAACAACGGCAACCAAAACCAGATTATGGGTAACCAAGTCCAAAATAAC
CCACAAATTAACATAAATAGTTGTAGCATGAACACCAATAATTCGACTCAATCAACAATA
TCCAATATTGATAATCAAATTAATAGAAACAATATGGCTGTTAACGAACAAGCCAAATTT
GTACAAGAACCAGTTAGAGGCAAGAATATTATGGGCCCTCCAACAAATATTCCTATAACA
AATATTTCCAATAATGATTGCGCTAAAGTTAATATGGGTAACAACGATAAACAAAATCAT
ACTATATCAGTTAGTAATCAACTAGGTTATATGCAAATGAATCCTCCGAATGGCAATAGA
ATTATCAATCTTCCGGAAAGAAGTAGGCTAAATAATTCAAACCTAGAAAATAATGTCAAT
CAACATGTACCACAGCAACATAACATTCGTGTTGTGACCAATAACAACACAAATAATATG
ACTGGAATTACAAATGATTCTATGACATACCAACAAAAAAGTGAAAGCAGCACCATGATA
AATGTGCCTTTTCAGGCCATACCTAACAGTGCCCCAATGAATATCAATGTCAATTCCAGC
AACAATACCGTCAACATAACAGTTCAAAACAATTTGGTCGATCCCAAGATTGCGTCTAAA
GCTGCAGCAGATTTAAATTTCCCATTGCAATCTACCGATAATATGATTTGCATACCTGTC
CAAAATAACACCATTAGCTTACCGACCCAAAACAGTTCAAGTATTACCCTACCTAAAGCA
CCATCTACTCAACAATCCGGTATTCCGACTAATGTCGTAAACCCTATGTCAATGCGGCCG
ATGAATAGAGTATTGCCTCTACATCGTGACGGACAAAGTAAATCTTCTAAGACTTCGGGA
ACTAAAACACCTACTGTTCCTAAACAAAGACCTGTAAACCAAGATATGCCAGATCTCAAC
ATGAAAGATGAAAACATGGATAGCGATCTTGAAAAAGCTATAGAGGAATCTAAGAAATTA
ATGGAACTAGAAAAGGCAAAAAAAGAAAAGGAAGAAAGAGATGCTGTTCGAGCTGTAAAA
TTATCTACTGAAATACAACAAGCAAACACTAGTTCCACATCATCTTCAGTTCTTGTTAAC
AGAATCGATACAGTAGTAAATGTACCAAAAATGACATCTCTGCCCAGTTTAGATGAAGAG
ATGGAACAGGAGCCAGCAAAAGTTTTGACAGAAAAAGATACAAACAAAACTGTCATTCCA
TCAAAACTAACGAATTCTACTACCAAAGTATTAAAAAGGCCTTTAGTTGACAAAAAGGCT
GATGCTGAAGTAACGGCTGCTACTAAAAAACAGAATAAGGTTTTGAAAGGTCAGGAAAAT
AAGTTCTCTCAACCTAAACAAGTGCCAACACCACCTAAAGACGACGGCCCAAAATTGATA
TACGAAGTATCTTCAGAAGATGGATTCAACTATTCTTCGTCCTCACTAACCGATTTATGG
GCAAAAGTAATCGACGCTGTGCAGAATGCCCGGAAGCAATCTGGCTTGCCTCTCATTCAG
TATAACCTGCCATCCACCCTGTCTGGCGCACATTTACTAGGATTAAATAATAACGCCTTA
AGATACCTCATTGAACAGCTGCCAGGAGCTAATCGTTGTTCCAAATACAAAATACGTCAA
GGTCGTCCTCTCAACAGCTGGGACAACGACTTGGACCTTAGCGAAGGTTTCAAAGAAAGT
CCTTGGGGCAGCGCAAGAACCGGCCCAATTTCAAGGAAACAGAATCACGATATGTTCAGC
TGGATGGCTTCCCCGTTCAGAGAGGAACCACCTCCTTTTGGCGGTCAAGAGGGAGAAAGT
ACCATCTCAAGACGCTTAGCAACTCTTCCACCGGCTATGAGATTCAGGCAGTTAAAAGAA
ACGTCAAAGGCCTCCGTCGGCGTCTACAGATCTCACATTCACGGTCGCGGACTTTTCTGC
AAGAGAGATATTGAAGAAGGCGATATGGTGATTGAATATGCTGGTGAAGTCATCCGAGCC
GTATTGGCCGATCAGCGTGAGAAGAAGTACGAGGCGATGAGCGGTCGCCGCGGCGTGGGA
GGTTGCTACATGTTCCGTATAGACGACAACCTGGTAGTGGACGCAACGCTCAAGGGAAAC
GCGGCCAGATTCATCAACCACTCGTGTGATCCAAACTGCTACTCTCGCGTTGTGGATATC
CACGGCCATAAACACATTCTCATCTTCGCTCTAAGGCGCATAACTATTGGAGAGGAACTT
ACCTACGACTACAAGTTCCCATTCGAAGAAGTCAAAATCCCATGTACATGCGGCGCCAAG
AAGTGCCGCAAGTATCTAAACTAA

Protein sequence:

MGRSRFPGKPLKFHNRKRISVLSGTVYHETAPENSSASRGSAANDFSGEKEEEEKKENNS
ENDSKINGDNKTNNNLDKSDNQDNAESSKSPVRTRSHNKMEPPKNDKPKPVKKTVTFGTV
ESCEDIFLPLKKVPIKHHAPLVPIIKKKSALKQTEYSTFNSILKPAKLTDLSSKSSLEPQ
EGYLRKNLNPLSKSTNKFSQFSDIRSTSPPPRITSPIEKKNSTNKFELPIRSSHSSRVIK
PNKRFIDGDEQSSLPSLSSGKVLKKPKLKRLVFNNLHDDEDSAEDEELDKTNDVEKTREK
SLPTSSLFKDKTTNSFSNLSNTGNRKVIRDFFSYDKEPECNQDENSNVESLDRIKSPDGN
KEERSGHAMRKLMDISDASSTSTSASESEESWDSDSGEATGEEGERQPPLSPEKDKTSAS
QLLSGKVIIREARLQLTSTATSTTGLDGPFSALTAVSSNNQPATVTCGVCGAVRFYKFVR
QARKFGIYSCESCRKFISKLLKKDKMVLRTSPVVTHCVRGEGNCHVPPVVRSQQWKLLKC
TNKSRCPACWLKLCLKAFQVPPHIRATLTAMLPSYMRSGSKPSGPLTQTHPLRIASSTSP
TQSTFSTLASESENTLFAVKPVKKPSDDTDKTEETKTAKGLQTNDESKEDVNASRKRRIT
RTKVRKKDKSESKQTEDTKRQKMELKGPRVKHVCRSASIVLGQPIATFPTQEEKDKDKTQ
QSDDDSTMPVIDKDRITPDSSCDSEHDVENKPVQVVPEAIMKESDQEILEPKKRKIEVAP
KPQTKNESSEDETVMDIVPRKPTLKPFTNITNVRGRAQKCGQNRPELICVDFWESYDPDE
VCNSGFGLIGTMPFTLAKLCFLCGSAGREKMLVCSSCCEWYHVWCAEEAGGGGSWTCARC
VWCAACARPAARLRCRSCARPYHAACLPSAPPDHRSDWPQICSSCLKCKSCDSNRVNKFV
GSLPFCRPCFKLRQKGNYCPLCQACYRDNDFDSKMMECGWCARWVHASCEGLSGEGYQLL
SALPPSIEYICCKCMPNDPPWRKMLTEHLKGRLLHLLKLLAKNKKACALLKLTPHKNTPV
PNKPYRLMSPQAIRKLHFDGGDESMKNTSETKVYRNTSRGRRNTQNKFDDLNSSQIWHCS
SLLNDVEMDKEEIPHTTHKVVNLQEPLMQNKEICFSTGLYSNKADFNTPCVEVHENYPPQ
KTEKATTMLPTLDDKYEPISDDEEPTKSFSQARSEQDLSPAIIRQANIDDSLEDLGRLIS
PSLLDIKKRVNSDEYISLKDFNHDMKEVIERTASIDLTSIYKDLFSETFPWFDCENNCLQ
PNLEVEGEQEETDSIKDIEMTDTKGIKNTTQVERTLDQIVPKLESVSQKEETEILEFYPM
VDSRICVLCKTCGDGSPAMEGRLLYCGQNDWIHANCALWSAEVFEEIDGSLQNVHSAISR
GKMIKCAECEVKGASVGCCAKNCSETYHYACARKATCAFMDDKRVFCPTHGKDVPKKSLQ
KNADFELTRPVYVELDKKKKRYTEINKVQFLMGSLTVHSLGKIVPSVSDYEDFLMPVDFS
CSRLFWSCKKPSKIVRYTIKTKLILAEPMPGFDFGVNITVDHSMDVQVVDRVMAEIGAWH
ESIETGSSKTVLMLKSDKSPIKFGKTVTLEDLAVKQVVEYLLDNLCKQEQQDEEEPQNTA
DLLPPEVKDAIFEDLNHDLLDGISMQDIFPKLMSYEDLAGIDLKGEFYGAISQSDDKSDS
RSSDFIDELLNARLESGNKELKRSKSEMLLQNHNLKSLTTGRGQQRSSSLTWNNKFDTSL
LSSTIKRCKMGKTSTVTGKLSPIQRIIENPLETIKEIERSSVDKKSKPETSTSICSETGH
SSGLNYRTTSPKQKEESNADKNDKSEGYYTDVIDFYTKIQCSPISQLDGAADWSGSESNC
SSRPSSPQDDFTNIQQLDGAEDHHPCDNTSKNQNNQFLYRASGTESTYTVTIAGNSISGQ
PELVMRPLDDPSGNSGQGEQLVRCDRCQCTYRNKESYDRHIPSCDMMSTSESESENPKSP
ENRTLTTLQNAFQGAFAQPMIIHAGVVSNTENTVKAEGISARRTSINTRQITINGTIVET
PSPGPSNEMTMTITEQMKSGTVIFDQSKTTNVQVSANQQVLSSPQIVVTPQMLLPPKTST
SAGQKMMAPQIITQPGLLPYNICVKPGSSNANIQQIQGTDMTQLLAGGGGIQVLTSNSNQ
VLTIPNQGMIQGKNQVTNFPNMASASSITLPVNSIQGMPNTSIIQNIQPMIRPQIQSNPT
IVVPAMTAQRLASPNSQKQILIPSAPQKSPKKQPAQVQPKPIFQANRGRGRPMPKPTTIK
RQTKLEKTFTTINQSPLGPGNTVIQLQTNNQTGQPSIIVQPVANQNIMSAYVEALSQQQN
QNMQYIATIAPQSDFKTGPTQFISQGGLMPQTFQLQQTESGGLVAVPSGGIPVLLPQGNV
GILPQTIQQGTILPQGAIQTQGILSQGPNGHTILPQAIHTQNGTTIIPQGTIQGLTNGNI
TSQTATLIPNNTLQSAGATVVPQSGISNGQTTIIPNALSSQGNTLISSAPGTTILPQGTI
LPQFCNDQVLLGSTPTLEMVTDPSGCMYLTTTQPVYYGLETIVQNTVMSSQQFVSTAMQG
VLAQNSSFSATTTQVFQASKIEPIVEVPTGYVVVNNVGDAVASQAVSSTPNVVTAVQSSP
PALKNVKTSVPNLTLQPLPDKQCNNRQAPKVIQMNAPPLAIGSQSGNIMNGKQGLNPITS
QIHSMTLSNNTQSMQRVNISKHNNVSMSPNVTLVSSTGQPLMAISQSVPNTMNSFSIQSI
PLAQPIVSSSIAPSNKNTMKIENSHVIEISGNLHESSNCTSLPTISVTQSMKSPTPWRQQ
DNKTELHNIGLKTSQSSTMNIQNNMISGRQMSLSDSGKDNQNCMMSVNNNANSMNHQFES
TLNVSHHSSAHTTSNNMIQITAGNIYPPSSSMSSNFDADTTMKLSKINALSKTEGGHMNF
SQEIMASHSHQNSHDKNFQNMGANDKHNSDNMHNQINIHKLSSCQVNGNMNNSPSISIIP
PNMVRPQMQNNQNSITVGNNQNQMCSISNNSHGQVQLMNSVNASAMQNVNMSCQEMVNNR
MNMSVSSESNQHEMTNSITTSNMSHANIQMMSHDNNQLSTTNQIQIMNGQIQNNCLVENN
SQMQLNQGNPNRSFHDNIIGNQQGHNQNNMLSNRSTPDTSNLMNQSMNLHNMNNNHTMSR
NDLGEINHMHVNQTLPNMNNMQNNRMMMDNMQSHVMNNVNHQMHTNNRPMDAMHNQQMHS
MQQMNHHMNSNNNRSQMDNNMHNMSQQISMQQNRQIDNVSHHHQHQMSNMMQLQNNRQHN
IDNNLMGIHQHVSNQMHNRQQMDNNMHMHHNNVNMGLSGQLDNTGVMANMHGNRHDNTPM
SMQNINAMNQMQNNRNSMEHNHMQHQINNMGSMNNMNLHNNINNSMQMPNTNHQSINNPI
MHMPNIPDIKNEIQNSCGGSCANSTQQFSNNNQSTFEMCANMSSNNMNMTNTGNMMRGMT
INNTNMSNNMIINNVNNNPYGSNNQNNLTGHDMLISCNSSTEHNVSNLMSSNSNHMMINN
TQTHNNGNQNQIMGNQVQNNPQININSCSMNTNNSTQSTISNIDNQINRNNMAVNEQAKF
VQEPVRGKNIMGPPTNIPITNISNNDCAKVNMGNNDKQNHTISVSNQLGYMQMNPPNGNR
IINLPERSRLNNSNLENNVNQHVPQQHNIRVVTNNNTNNMTGITNDSMTYQQKSESSTMI
NVPFQAIPNSAPMNINVNSSNNTVNITVQNNLVDPKIASKAAADLNFPLQSTDNMICIPV
QNNTISLPTQNSSSITLPKAPSTQQSGIPTNVVNPMSMRPMNRVLPLHRDGQSKSSKTSG
TKTPTVPKQRPVNQDMPDLNMKDENMDSDLEKAIEESKKLMELEKAKKEKEERDAVRAVK
LSTEIQQANTSSTSSSVLVNRIDTVVNVPKMTSLPSLDEEMEQEPAKVLTEKDTNKTVIP
SKLTNSTTKVLKRPLVDKKADAEVTAATKKQNKVLKGQENKFSQPKQVPTPPKDDGPKLI
YEVSSEDGFNYSSSSLTDLWAKVIDAVQNARKQSGLPLIQYNLPSTLSGAHLLGLNNNAL
RYLIEQLPGANRCSKYKIRQGRPLNSWDNDLDLSEGFKESPWGSARTGPISRKQNHDMFS
WMASPFREEPPPFGGQEGESTISRRLATLPPAMRFRQLKETSKASVGVYRSHIHGRGLFC
KRDIEEGDMVIEYAGEVIRAVLADQREKKYEAMSGRRGVGGCYMFRIDDNLVVDATLKGN
AARFINHSCDPNCYSRVVDIHGHKHILIFALRRITIGEELTYDYKFPFEEVKIPCTCGAK
KCRKYLN