New model in OGS2.0 | DPOGS212487  |
---|---|
Genomic Position | scaffold840:+ 43818-60181 |
See gene structure | |
CDS Length | 13164 |
Paired RNAseq reads   | 3121 |
Single RNAseq reads   | 7652 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010221 (5e-15) |
Best Drosophila hit   | trithorax, isoform A (3e-96) |
Best Human hit | histone-lysine N-methyltransferase MLL isoform 2 precursor (2e-77) |
Best NR hit (blastp)   | mixed-lineage leukemia protein, mll [Aedes aegypti] (2e-124) |
Best NR hit (blastx)   | mixed-lineage leukemia protein, mll [Aedes aegypti] (3e-118) |
GeneOntology terms    | GO:0008354 germ cell migration GO:0005634 nucleus GO:0003677 DNA binding GO:0048096 chromatin-mediated maintenance of transcription GO:0016571 histone methylation GO:0035097 histone methyltransferase complex GO:0051568 histone H3-K4 methylation GO:0042800 histone methyltransferase activity (H3-K4 specific) GO:0006355 regulation of transcription, DNA-dependent GO:0008270 zinc ion binding GO:0005515 protein binding GO:0003700 sequence-specific DNA binding transcription factor activity GO:0043565 sequence-specific DNA binding GO:0005875 microtubule associated complex GO:0007411 axon guidance |
InterPro families    | IPR001965 Zinc finger, PHD-type IPR018518 FY-rich, N-terminal subgroup IPR018516 FY-rich, C-terminal subgroup IPR001214 SET domain IPR003616 Post-SET domain IPR011011 Zinc finger, FYVE/PHD-type IPR013083 Zinc finger, RING/FYVE/PHD-type IPR003889 FY-rich, C-terminal IPR003888 FY-rich, N-terminal IPR019787 Zinc finger, PHD-finger IPR001628 Zinc finger, nuclear hormone receptor-type |
Orthology group | MCL16872 |
Nucleotide sequence:
ATGGGAAGGTCTAGGTTCCCCGGTAAACCACTAAAATTCCACAACAGAAAGCGCATCAGC
GTTCTATCGGGCACTGTTTACCATGAAACTGCTCCAGAGAATAGTTCAGCCAGCCGAGGC
TCGGCTGCCAACGACTTTTCTGGGGAGAAAGAGGAAGAAGAGAAGAAGGAGAATAATAGT
GAAAATGATTCAAAAATAAATGGTGATAATAAGACCAACAATAATTTAGATAAATCAGAC
AATCAGGATAACGCTGAATCATCAAAGTCTCCTGTCCGAACAAGGTCCCATAATAAAATG
GAGCCTCCAAAAAACGATAAACCTAAACCAGTGAAAAAGACAGTAACATTTGGTACAGTT
GAAAGTTGTGAGGATATATTCTTGCCATTAAAGAAGGTTCCAATAAAACATCATGCACCA
CTCGTGCCAATTATAAAGAAAAAATCAGCACTCAAACAAACTGAATATTCTACATTCAAT
AGTATTTTGAAGCCAGCCAAATTAACAGATCTCAGCTCCAAATCCTCGTTGGAACCTCAA
GAAGGTTATTTAAGGAAGAACCTAAATCCTTTATCAAAATCAACAAATAAATTTTCTCAA
TTTAGTGATATCAGAAGTACTTCACCGCCTCCAAGAATCACATCGCCTATTGAAAAGAAG
AATAGCACTAATAAGTTTGAATTGCCCATAAGGAGTAGTCATTCATCAAGAGTTATAAAA
CCTAATAAAAGGTTTATCGATGGTGATGAACAATCATCACTTCCTAGTCTCTCCTCTGGC
AAAGTTCTTAAGAAACCAAAATTGAAGAGGCTAGTTTTCAACAACTTACATGATGACGAG
GACAGTGCTGAAGATGAGGAATTAGATAAAACAAATGATGTTGAAAAAACTCGAGAAAAA
TCTTTACCTACATCAAGTTTGTTTAAGGATAAAACAACAAATTCATTTTCAAACCTCAGC
AACACTGGGAACAGAAAGGTTATCAGAGACTTCTTTTCATATGACAAAGAACCTGAGTGT
AATCAAGATGAAAATTCGAATGTGGAGAGCCTTGATAGGATAAAGTCACCAGATGGTAAT
AAAGAAGAAAGATCAGGACATGCTATGCGAAAGCTCATGGATATTTCTGATGCAAGCTCA
ACTAGTACTAGTGCCTCTGAGTCAGAAGAAAGCTGGGATAGTGACAGTGGGGAAGCTACT
GGTGAGGAAGGTGAACGACAACCACCATTGAGTCCTGAGAAAGATAAGACATCTGCCTCA
CAGTTATTAAGCGGGAAGGTGATTATAAGGGAAGCAAGGTTACAACTTACTTCTACAGCC
ACTTCCACAACTGGGCTTGATGGACCCTTCTCAGCTTTAACGGCAGTATCATCCAATAAC
CAACCAGCAACAGTCACTTGCGGGGTTTGTGGAGCAGTGAGATTCTACAAATTTGTGAGA
CAGGCAAGAAAATTTGGAATATATTCCTGTGAGTCATGCCGAAAATTTATTTCGAAATTA
CTGAAGAAAGACAAAATGGTTTTACGAACATCACCTGTGGTTACTCATTGTGTGCGGGGA
GAAGGCAATTGTCATGTTCCACCAGTAGTTCGCTCTCAGCAGTGGAAATTACTCAAGTGT
ACTAATAAATCCCGATGTCCTGCTTGTTGGCTAAAGCTATGTCTCAAAGCTTTCCAAGTT
CCACCTCACATTAGAGCCACTTTGACTGCTATGCTGCCGTCTTATATGAGAAGTGGATCC
AAACCCTCTGGGCCCTTGACTCAAACACATCCATTAAGAATAGCATCCAGTACATCACCA
ACCCAGTCAACATTCAGTACTTTGGCTTCTGAGAGTGAAAATACTCTCTTTGCCGTAAAA
CCAGTTAAAAAACCGTCAGATGACACCGATAAAACTGAGGAAACTAAAACAGCTAAAGGT
CTCCAAACTAACGACGAATCCAAGGAAGATGTTAATGCGAGCAGAAAACGTCGTATTACT
CGGACTAAAGTTAGAAAGAAAGATAAAAGCGAGTCCAAACAAACTGAGGATACGAAGCGA
CAAAAAATGGAACTCAAAGGACCTAGAGTTAAACATGTTTGTAGAAGTGCCTCCATCGTC
CTTGGACAGCCAATTGCAACATTCCCCACTCAAGAAGAGAAGGATAAAGACAAAACTCAG
CAATCAGATGACGATTCAACCATGCCAGTTATTGACAAGGATAGAATTACGCCAGATTCA
TCTTGTGATTCAGAACACGATGTTGAGAATAAACCAGTACAAGTGGTACCAGAAGCCATA
ATGAAGGAAAGTGATCAAGAAATATTAGAACCCAAGAAGAGAAAGATTGAAGTTGCTCCC
AAACCACAAACCAAAAATGAGTCCAGTGAAGATGAGACTGTCATGGATATTGTTCCAAGA
AAACCTACTTTGAAACCTTTTACCAACATCACTAATGTACGCGGGCGGGCCCAGAAGTGT
GGACAAAACAGACCTGAACTTATTTGTGTCGATTTTTGGGAGAGCTATGATCCTGATGAA
GTTTGCAATTCTGGCTTTGGTTTGATTGGGACTATGCCATTCACTCTCGCCAAACTTTGC
TTCTTATGCGGCAGTGCTGGTCGAGAAAAGATGTTAGTATGCAGCAGCTGTTGTGAGTGG
TATCATGTGTGGTGCGCTGAGGAGGCTGGCGGTGGAGGGTCGTGGACTTGCGCCCGTTGC
GTATGGTGCGCTGCATGCGCTCGACCTGCTGCAAGACTGCGCTGCCGCTCCTGTGCGAGA
CCATACCACGCCGCTTGTCTGCCGTCTGCGCCACCTGACCATCGAAGTGACTGGCCACAG
ATCTGCAGTTCATGTCTAAAATGCAAGAGTTGTGACAGCAACAGGGTGAACAAATTTGTT
GGCAGCTTACCTTTCTGCAGACCTTGCTTCAAGTTACGTCAAAAAGGCAATTATTGCCCT
CTATGTCAAGCTTGTTACAGAGATAATGATTTTGATAGCAAGATGATGGAGTGCGGTTGG
TGTGCTAGATGGGTTCATGCAAGCTGTGAAGGCCTGTCTGGTGAGGGATACCAGCTTTTA
TCCGCATTACCACCATCTATTGAATACATATGCTGTAAATGTATGCCTAATGACCCTCCC
TGGAGAAAAATGCTAACAGAACATCTCAAAGGAAGACTACTACATCTTCTTAAATTACTG
GCCAAGAACAAAAAGGCTTGCGCATTATTGAAATTGACGCCTCATAAGAATACTCCAGTT
CCAAATAAACCATATCGATTGATGTCACCCCAAGCTATAAGAAAATTGCATTTTGACGGC
GGAGATGAGTCCATGAAGAATACATCTGAAACAAAAGTGTACAGAAATACCTCGAGGGGA
CGTAGAAATACACAAAATAAATTTGACGATCTAAATTCGTCCCAAATATGGCACTGCTCA
TCGCTGTTAAATGACGTAGAAATGGATAAAGAGGAAATTCCACACACCACACACAAAGTG
GTTAATCTCCAAGAACCTCTGATGCAGAATAAAGAGATATGCTTTTCAACTGGACTCTAT
AGCAACAAGGCAGATTTTAATACACCATGTGTGGAAGTACACGAGAATTACCCGCCACAG
AAAACAGAAAAAGCAACCACAATGTTGCCAACTCTTGATGACAAGTATGAACCAATATCT
GATGATGAAGAACCCACGAAATCTTTCTCACAAGCTCGAAGTGAACAAGACTTAAGTCCT
GCCATAATAAGACAAGCAAATATTGACGATAGTCTTGAAGACTTGGGCAGGTTAATTTCG
CCGTCGCTACTAGATATAAAGAAGAGAGTAAACAGTGATGAATACATCTCATTAAAAGAT
TTTAATCATGACATGAAAGAGGTTATTGAGAGGACGGCAAGCATAGATCTGACTTCAATA
TACAAAGATCTATTCTCTGAAACATTTCCTTGGTTTGATTGTGAAAATAATTGCCTTCAG
CCAAACTTGGAAGTAGAGGGCGAACAGGAAGAGACTGATTCAATTAAAGATATTGAGATG
ACGGATACAAAGGGGATTAAGAATACTACGCAAGTGGAACGAACTCTTGATCAAATAGTT
CCTAAGTTGGAATCTGTCAGCCAGAAGGAAGAAACGGAAATATTAGAGTTTTATCCAATG
GTTGACTCTCGTATTTGCGTTTTGTGCAAAACTTGTGGCGATGGATCACCGGCTATGGAA
GGGCGACTATTGTATTGCGGTCAAAATGATTGGATTCATGCCAATTGTGCTTTATGGTCT
GCGGAAGTGTTTGAAGAAATTGATGGTTCTCTTCAAAATGTTCATTCAGCAATATCAAGA
GGAAAAATGATCAAGTGTGCAGAGTGTGAAGTTAAAGGGGCTAGCGTAGGTTGCTGTGCC
AAGAACTGTAGTGAAACATATCATTATGCATGCGCAAGAAAAGCAACATGTGCTTTTATG
GATGACAAAAGAGTATTCTGTCCCACTCATGGAAAAGATGTACCAAAGAAAAGTCTACAA
AAGAACGCAGATTTTGAATTGACTAGACCAGTTTATGTTGAACTAGATAAAAAGAAGAAA
AGGTACACGGAAATAAACAAAGTACAATTTTTAATGGGTTCATTAACAGTGCACAGCCTT
GGCAAGATTGTACCTTCTGTGTCTGATTATGAAGATTTCTTAATGCCAGTAGATTTTTCT
TGTAGCCGTCTGTTTTGGTCATGTAAAAAACCATCTAAAATTGTACGCTATACAATTAAG
ACTAAACTCATACTGGCTGAACCGATGCCTGGATTCGACTTTGGCGTCAACATAACTGTT
GATCATTCTATGGATGTGCAAGTTGTAGACCGTGTCATGGCAGAAATTGGCGCATGGCAT
GAAAGCATAGAAACGGGATCAAGTAAAACCGTTCTTATGTTAAAGAGTGATAAGTCGCCT
ATTAAATTCGGAAAAACTGTTACGCTAGAAGATCTGGCTGTAAAACAAGTCGTGGAATAT
TTGCTAGATAATCTTTGCAAGCAAGAACAACAAGACGAAGAGGAACCGCAAAACACTGCG
GATCTATTGCCCCCTGAGGTTAAAGATGCTATATTTGAGGATCTAAATCACGATCTTCTT
GATGGAATATCCATGCAAGATATTTTCCCTAAACTCATGTCCTATGAGGATCTGGCAGGT
ATAGATTTAAAGGGTGAGTTCTATGGCGCTATAAGTCAATCAGACGACAAGTCAGATTCT
CGTTCTAGTGATTTTATTGATGAATTGCTTAATGCTCGTTTGGAATCTGGTAATAAAGAG
CTTAAGCGGAGCAAATCCGAAATGTTACTACAAAATCACAATCTAAAGTCTCTGACAACA
GGCCGCGGACAACAAAGATCGTCAAGTTTAACCTGGAACAATAAATTTGACACAAGCTTA
CTTTCTTCTACCATAAAGAGATGCAAGATGGGAAAGACGTCTACCGTGACTGGAAAACTG
AGTCCTATTCAGCGAATTATAGAAAATCCATTAGAAACGATAAAAGAAATAGAAAGAAGC
AGTGTAGACAAGAAATCTAAACCAGAAACTAGTACTAGCATTTGCAGTGAAACAGGCCAT
TCTTCTGGTCTCAATTACAGAACAACATCGCCTAAACAAAAAGAAGAGTCTAATGCAGAC
AAAAACGATAAATCTGAAGGATATTATACGGACGTCATAGATTTTTATACTAAAATCCAA
TGCAGCCCTATATCTCAATTGGATGGTGCAGCTGATTGGTCAGGCTCTGAGAGCAATTGT
AGTTCAAGACCAAGCAGTCCTCAAGATGACTTTACGAATATTCAACAACTAGACGGGGCT
GAAGATCATCATCCCTGTGACAATACGTCCAAAAATCAAAACAATCAGTTTCTGTACCGA
GCTAGTGGTACTGAAAGCACCTATACGGTGACCATCGCTGGTAACTCTATTAGTGGACAG
CCGGAACTTGTCATGAGACCTCTTGACGACCCATCTGGTAATTCAGGACAAGGAGAACAA
TTAGTGAGATGCGATAGATGTCAATGTACATACAGAAACAAGGAATCTTACGACCGTCAT
ATTCCATCGTGTGACATGATGTCAACAAGTGAAAGTGAAAGCGAAAATCCTAAATCTCCT
GAAAACAGAACTCTGACTACTTTGCAAAATGCTTTCCAAGGAGCATTTGCCCAACCTATG
ATTATACATGCAGGCGTCGTAAGTAACACCGAAAATACTGTAAAAGCCGAAGGTATTTCA
GCGAGAAGAACCAGCATAAATACTAGACAAATCACTATTAATGGCACCATAGTAGAAACT
CCATCACCAGGCCCATCAAATGAAATGACAATGACAATTACAGAGCAAATGAAATCTGGA
ACGGTTATATTTGATCAAAGCAAGACTACTAATGTTCAGGTATCTGCTAACCAACAAGTT
TTATCATCTCCACAGATAGTCGTAACGCCTCAAATGCTACTACCACCTAAAACAAGTACA
TCTGCCGGACAAAAAATGATGGCCCCACAAATAATCACACAACCTGGACTTTTACCATAT
AATATTTGTGTTAAACCTGGATCAAGCAATGCAAATATACAGCAGATTCAAGGAACTGAT
ATGACACAATTACTAGCTGGTGGCGGTGGTATTCAAGTGCTGACTTCCAACTCCAATCAA
GTCTTGACTATACCAAATCAAGGGATGATCCAAGGAAAAAATCAAGTTACAAACTTTCCA
AATATGGCAAGTGCGTCTTCTATCACTTTACCTGTAAACTCTATTCAGGGCATGCCTAAT
ACGTCGATAATACAGAATATTCAACCCATGATTCGACCGCAAATACAGAGTAACCCCACT
ATTGTAGTGCCAGCTATGACGGCCCAAAGGCTTGCTTCACCTAATTCTCAAAAGCAGATT
TTAATACCGAGTGCACCACAAAAGTCTCCTAAAAAGCAACCTGCACAGGTTCAGCCCAAA
CCAATATTTCAAGCAAATCGTGGAAGAGGTAGACCGATGCCAAAACCAACAACAATTAAG
AGGCAAACTAAATTGGAGAAAACATTTACAACTATTAATCAAAGTCCTTTAGGACCTGGC
AATACTGTTATACAATTACAGACGAATAATCAAACAGGTCAGCCGTCTATTATTGTACAG
CCGGTTGCTAACCAAAACATCATGTCGGCTTATGTAGAAGCTTTATCTCAGCAGCAGAAT
CAAAATATGCAATACATTGCGACTATTGCACCTCAAAGTGATTTCAAAACTGGACCAACT
CAATTTATCTCACAAGGCGGACTAATGCCACAGACATTCCAATTACAACAAACAGAATCT
GGGGGTCTTGTTGCTGTACCCAGTGGAGGTATACCTGTTTTATTACCACAGGGGAATGTT
GGAATTTTACCTCAGACGATACAGCAAGGAACTATTCTTCCCCAAGGTGCAATTCAGACT
CAAGGAATATTATCCCAAGGACCTAATGGTCATACTATCTTACCGCAAGCAATACACACA
CAAAATGGTACTACTATCATACCTCAAGGGACTATTCAAGGACTTACAAATGGAAATATA
ACGTCACAAACTGCGACTTTAATACCAAATAATACATTACAAAGTGCCGGAGCCACAGTG
GTACCACAAAGTGGCATAAGTAATGGTCAAACTACCATAATTCCGAATGCGCTGTCATCG
CAAGGCAATACATTAATATCATCCGCGCCTGGCACAACAATACTTCCTCAAGGGACAATT
TTACCGCAGTTTTGCAATGATCAGGTGCTTCTAGGGTCAACACCTACCCTTGAAATGGTT
ACGGATCCATCTGGATGCATGTATTTAACGACTACGCAGCCCGTCTATTATGGTTTGGAG
ACAATTGTTCAAAATACAGTTATGTCCTCACAGCAATTTGTATCGACAGCTATGCAAGGA
GTATTGGCGCAGAATAGTAGCTTTTCTGCAACGACCACACAAGTATTCCAAGCGAGTAAA
ATAGAACCCATTGTTGAGGTACCCACTGGATATGTTGTTGTTAATAACGTTGGTGATGCT
GTAGCATCACAAGCTGTTTCATCAACACCGAATGTAGTAACTGCAGTCCAATCATCACCG
CCGGCACTGAAAAATGTGAAAACTTCAGTTCCTAACTTAACATTGCAACCGTTACCTGAC
AAGCAGTGTAACAATAGACAGGCGCCTAAAGTAATTCAAATGAATGCACCACCTCTTGCT
ATCGGATCACAATCAGGAAATATAATGAATGGAAAGCAAGGATTGAACCCAATAACATCC
CAAATACATAGCATGACGCTATCAAACAATACTCAAAGTATGCAAAGAGTGAACATTTCT
AAACACAATAACGTGTCCATGTCACCAAATGTTACTCTTGTTTCGTCAACAGGACAACCA
TTGATGGCAATTTCTCAGTCTGTACCTAATACTATGAATTCATTTTCAATCCAATCGATA
CCTTTAGCACAACCAATAGTTTCAAGTTCTATTGCGCCATCAAATAAAAACACAATGAAA
ATAGAAAACTCTCATGTCATAGAAATAAGTGGAAATTTGCACGAGAGTTCAAATTGTACA
AGCTTACCTACAATAAGTGTCACACAGAGCATGAAATCTCCGACTCCGTGGAGGCAACAG
GATAATAAAACTGAGCTTCATAACATTGGACTAAAGACTTCTCAAAGTTCGACCATGAAT
ATTCAAAATAATATGATTTCAGGAAGACAGATGTCACTCTCTGATTCAGGAAAGGACAAT
CAAAATTGCATGATGTCAGTAAATAACAATGCCAACTCCATGAACCATCAATTTGAATCG
ACTTTAAATGTATCTCATCACTCATCGGCCCATACGACTTCTAACAACATGATCCAAATA
ACAGCTGGTAATATCTATCCACCGTCATCATCAATGAGTAGTAATTTTGATGCTGACACA
ACAATGAAGCTAAGCAAAATTAACGCTCTGTCAAAAACTGAAGGAGGACATATGAATTTT
TCACAGGAAATAATGGCATCACATTCACATCAAAATTCTCACGACAAAAATTTCCAAAAT
ATGGGTGCCAACGATAAACACAATTCTGATAACATGCACAACCAAATTAATATACATAAA
CTAAGTAGTTGTCAAGTGAATGGAAACATGAACAACTCGCCATCAATTAGTATCATTCCA
CCAAACATGGTAAGACCACAAATGCAAAACAATCAAAACTCAATAACAGTTGGCAATAAC
CAGAATCAAATGTGCTCTATTTCAAATAATTCACATGGGCAGGTTCAACTTATGAATTCC
GTCAACGCATCAGCAATGCAAAATGTCAACATGTCATGTCAAGAGATGGTGAACAATAGA
ATGAACATGTCAGTATCAAGTGAAAGCAACCAACACGAAATGACTAACTCCATAACAACA
TCCAACATGTCCCACGCTAATATACAAATGATGTCCCATGATAACAACCAACTTTCAACT
ACAAATCAGATACAAATAATGAACGGACAAATACAAAATAATTGTTTGGTAGAGAATAAT
AGCCAGATGCAGCTGAATCAAGGGAATCCAAACAGATCGTTCCATGATAATATTATCGGT
AACCAACAGGGCCACAATCAAAACAACATGCTGTCTAATCGCAGTACACCGGATACTAGC
AATTTAATGAACCAAAGTATGAACCTACACAACATGAATAATAACCACACCATGAGTCGC
AATGATCTCGGAGAAATAAATCATATGCATGTCAATCAAACCTTACCTAATATGAATAAC
ATGCAAAACAACAGAATGATGATGGATAATATGCAATCCCATGTAATGAACAATGTTAAT
CACCAAATGCATACAAATAACAGGCCTATGGATGCCATGCATAACCAGCAAATGCATTCA
ATGCAACAAATGAACCACCATATGAACAGCAACAACAATCGTTCACAGATGGACAACAAC
ATGCATAATATGAGTCAACAAATTTCAATGCAACAAAATAGACAAATAGATAATGTTAGT
CACCATCACCAACATCAGATGTCTAATATGATGCAGTTACAAAACAATCGTCAACATAAC
ATTGATAACAATTTAATGGGTATTCATCAACATGTATCTAATCAAATGCACAACAGACAA
CAGATGGATAATAACATGCATATGCACCATAATAACGTAAATATGGGTTTGAGCGGCCAG
CTAGATAACACCGGTGTAATGGCAAACATGCATGGCAATAGGCACGACAATACTCCAATG
TCAATGCAGAATATAAATGCCATGAATCAGATGCAAAACAATAGAAATTCAATGGAACAT
AATCACATGCAACATCAGATCAACAATATGGGCTCTATGAATAATATGAATTTGCATAAT
AATATAAACAATTCGATGCAAATGCCGAACACCAATCACCAGTCTATAAATAACCCAATC
ATGCATATGCCAAATATACCAGATATTAAAAATGAAATTCAGAATTCATGCGGTGGAAGT
TGCGCAAACAGCACCCAACAGTTTTCTAATAACAATCAGTCAACTTTCGAAATGTGTGCA
AATATGAGTTCTAATAACATGAATATGACGAACACTGGAAACATGATGCGTGGAATGACT
ATAAACAACACAAACATGTCGAATAATATGATAATAAATAATGTTAACAATAACCCATAT
GGATCCAACAACCAGAACAACTTAACTGGCCATGATATGCTTATAAGTTGCAACAGTTCT
ACGGAACACAATGTATCAAATCTGATGTCTAGTAATAGCAACCATATGATGATAAATAAT
ACTCAAACACACAACAACGGCAACCAAAACCAGATTATGGGTAACCAAGTCCAAAATAAC
CCACAAATTAACATAAATAGTTGTAGCATGAACACCAATAATTCGACTCAATCAACAATA
TCCAATATTGATAATCAAATTAATAGAAACAATATGGCTGTTAACGAACAAGCCAAATTT
GTACAAGAACCAGTTAGAGGCAAGAATATTATGGGCCCTCCAACAAATATTCCTATAACA
AATATTTCCAATAATGATTGCGCTAAAGTTAATATGGGTAACAACGATAAACAAAATCAT
ACTATATCAGTTAGTAATCAACTAGGTTATATGCAAATGAATCCTCCGAATGGCAATAGA
ATTATCAATCTTCCGGAAAGAAGTAGGCTAAATAATTCAAACCTAGAAAATAATGTCAAT
CAACATGTACCACAGCAACATAACATTCGTGTTGTGACCAATAACAACACAAATAATATG
ACTGGAATTACAAATGATTCTATGACATACCAACAAAAAAGTGAAAGCAGCACCATGATA
AATGTGCCTTTTCAGGCCATACCTAACAGTGCCCCAATGAATATCAATGTCAATTCCAGC
AACAATACCGTCAACATAACAGTTCAAAACAATTTGGTCGATCCCAAGATTGCGTCTAAA
GCTGCAGCAGATTTAAATTTCCCATTGCAATCTACCGATAATATGATTTGCATACCTGTC
CAAAATAACACCATTAGCTTACCGACCCAAAACAGTTCAAGTATTACCCTACCTAAAGCA
CCATCTACTCAACAATCCGGTATTCCGACTAATGTCGTAAACCCTATGTCAATGCGGCCG
ATGAATAGAGTATTGCCTCTACATCGTGACGGACAAAGTAAATCTTCTAAGACTTCGGGA
ACTAAAACACCTACTGTTCCTAAACAAAGACCTGTAAACCAAGATATGCCAGATCTCAAC
ATGAAAGATGAAAACATGGATAGCGATCTTGAAAAAGCTATAGAGGAATCTAAGAAATTA
ATGGAACTAGAAAAGGCAAAAAAAGAAAAGGAAGAAAGAGATGCTGTTCGAGCTGTAAAA
TTATCTACTGAAATACAACAAGCAAACACTAGTTCCACATCATCTTCAGTTCTTGTTAAC
AGAATCGATACAGTAGTAAATGTACCAAAAATGACATCTCTGCCCAGTTTAGATGAAGAG
ATGGAACAGGAGCCAGCAAAAGTTTTGACAGAAAAAGATACAAACAAAACTGTCATTCCA
TCAAAACTAACGAATTCTACTACCAAAGTATTAAAAAGGCCTTTAGTTGACAAAAAGGCT
GATGCTGAAGTAACGGCTGCTACTAAAAAACAGAATAAGGTTTTGAAAGGTCAGGAAAAT
AAGTTCTCTCAACCTAAACAAGTGCCAACACCACCTAAAGACGACGGCCCAAAATTGATA
TACGAAGTATCTTCAGAAGATGGATTCAACTATTCTTCGTCCTCACTAACCGATTTATGG
GCAAAAGTAATCGACGCTGTGCAGAATGCCCGGAAGCAATCTGGCTTGCCTCTCATTCAG
TATAACCTGCCATCCACCCTGTCTGGCGCACATTTACTAGGATTAAATAATAACGCCTTA
AGATACCTCATTGAACAGCTGCCAGGAGCTAATCGTTGTTCCAAATACAAAATACGTCAA
GGTCGTCCTCTCAACAGCTGGGACAACGACTTGGACCTTAGCGAAGGTTTCAAAGAAAGT
CCTTGGGGCAGCGCAAGAACCGGCCCAATTTCAAGGAAACAGAATCACGATATGTTCAGC
TGGATGGCTTCCCCGTTCAGAGAGGAACCACCTCCTTTTGGCGGTCAAGAGGGAGAAAGT
ACCATCTCAAGACGCTTAGCAACTCTTCCACCGGCTATGAGATTCAGGCAGTTAAAAGAA
ACGTCAAAGGCCTCCGTCGGCGTCTACAGATCTCACATTCACGGTCGCGGACTTTTCTGC
AAGAGAGATATTGAAGAAGGCGATATGGTGATTGAATATGCTGGTGAAGTCATCCGAGCC
GTATTGGCCGATCAGCGTGAGAAGAAGTACGAGGCGATGAGCGGTCGCCGCGGCGTGGGA
GGTTGCTACATGTTCCGTATAGACGACAACCTGGTAGTGGACGCAACGCTCAAGGGAAAC
GCGGCCAGATTCATCAACCACTCGTGTGATCCAAACTGCTACTCTCGCGTTGTGGATATC
CACGGCCATAAACACATTCTCATCTTCGCTCTAAGGCGCATAACTATTGGAGAGGAACTT
ACCTACGACTACAAGTTCCCATTCGAAGAAGTCAAAATCCCATGTACATGCGGCGCCAAG
AAGTGCCGCAAGTATCTAAACTAA
Protein sequence:
MGRSRFPGKPLKFHNRKRISVLSGTVYHETAPENSSASRGSAANDFSGEKEEEEKKENNS
ENDSKINGDNKTNNNLDKSDNQDNAESSKSPVRTRSHNKMEPPKNDKPKPVKKTVTFGTV
ESCEDIFLPLKKVPIKHHAPLVPIIKKKSALKQTEYSTFNSILKPAKLTDLSSKSSLEPQ
EGYLRKNLNPLSKSTNKFSQFSDIRSTSPPPRITSPIEKKNSTNKFELPIRSSHSSRVIK
PNKRFIDGDEQSSLPSLSSGKVLKKPKLKRLVFNNLHDDEDSAEDEELDKTNDVEKTREK
SLPTSSLFKDKTTNSFSNLSNTGNRKVIRDFFSYDKEPECNQDENSNVESLDRIKSPDGN
KEERSGHAMRKLMDISDASSTSTSASESEESWDSDSGEATGEEGERQPPLSPEKDKTSAS
QLLSGKVIIREARLQLTSTATSTTGLDGPFSALTAVSSNNQPATVTCGVCGAVRFYKFVR
QARKFGIYSCESCRKFISKLLKKDKMVLRTSPVVTHCVRGEGNCHVPPVVRSQQWKLLKC
TNKSRCPACWLKLCLKAFQVPPHIRATLTAMLPSYMRSGSKPSGPLTQTHPLRIASSTSP
TQSTFSTLASESENTLFAVKPVKKPSDDTDKTEETKTAKGLQTNDESKEDVNASRKRRIT
RTKVRKKDKSESKQTEDTKRQKMELKGPRVKHVCRSASIVLGQPIATFPTQEEKDKDKTQ
QSDDDSTMPVIDKDRITPDSSCDSEHDVENKPVQVVPEAIMKESDQEILEPKKRKIEVAP
KPQTKNESSEDETVMDIVPRKPTLKPFTNITNVRGRAQKCGQNRPELICVDFWESYDPDE
VCNSGFGLIGTMPFTLAKLCFLCGSAGREKMLVCSSCCEWYHVWCAEEAGGGGSWTCARC
VWCAACARPAARLRCRSCARPYHAACLPSAPPDHRSDWPQICSSCLKCKSCDSNRVNKFV
GSLPFCRPCFKLRQKGNYCPLCQACYRDNDFDSKMMECGWCARWVHASCEGLSGEGYQLL
SALPPSIEYICCKCMPNDPPWRKMLTEHLKGRLLHLLKLLAKNKKACALLKLTPHKNTPV
PNKPYRLMSPQAIRKLHFDGGDESMKNTSETKVYRNTSRGRRNTQNKFDDLNSSQIWHCS
SLLNDVEMDKEEIPHTTHKVVNLQEPLMQNKEICFSTGLYSNKADFNTPCVEVHENYPPQ
KTEKATTMLPTLDDKYEPISDDEEPTKSFSQARSEQDLSPAIIRQANIDDSLEDLGRLIS
PSLLDIKKRVNSDEYISLKDFNHDMKEVIERTASIDLTSIYKDLFSETFPWFDCENNCLQ
PNLEVEGEQEETDSIKDIEMTDTKGIKNTTQVERTLDQIVPKLESVSQKEETEILEFYPM
VDSRICVLCKTCGDGSPAMEGRLLYCGQNDWIHANCALWSAEVFEEIDGSLQNVHSAISR
GKMIKCAECEVKGASVGCCAKNCSETYHYACARKATCAFMDDKRVFCPTHGKDVPKKSLQ
KNADFELTRPVYVELDKKKKRYTEINKVQFLMGSLTVHSLGKIVPSVSDYEDFLMPVDFS
CSRLFWSCKKPSKIVRYTIKTKLILAEPMPGFDFGVNITVDHSMDVQVVDRVMAEIGAWH
ESIETGSSKTVLMLKSDKSPIKFGKTVTLEDLAVKQVVEYLLDNLCKQEQQDEEEPQNTA
DLLPPEVKDAIFEDLNHDLLDGISMQDIFPKLMSYEDLAGIDLKGEFYGAISQSDDKSDS
RSSDFIDELLNARLESGNKELKRSKSEMLLQNHNLKSLTTGRGQQRSSSLTWNNKFDTSL
LSSTIKRCKMGKTSTVTGKLSPIQRIIENPLETIKEIERSSVDKKSKPETSTSICSETGH
SSGLNYRTTSPKQKEESNADKNDKSEGYYTDVIDFYTKIQCSPISQLDGAADWSGSESNC
SSRPSSPQDDFTNIQQLDGAEDHHPCDNTSKNQNNQFLYRASGTESTYTVTIAGNSISGQ
PELVMRPLDDPSGNSGQGEQLVRCDRCQCTYRNKESYDRHIPSCDMMSTSESESENPKSP
ENRTLTTLQNAFQGAFAQPMIIHAGVVSNTENTVKAEGISARRTSINTRQITINGTIVET
PSPGPSNEMTMTITEQMKSGTVIFDQSKTTNVQVSANQQVLSSPQIVVTPQMLLPPKTST
SAGQKMMAPQIITQPGLLPYNICVKPGSSNANIQQIQGTDMTQLLAGGGGIQVLTSNSNQ
VLTIPNQGMIQGKNQVTNFPNMASASSITLPVNSIQGMPNTSIIQNIQPMIRPQIQSNPT
IVVPAMTAQRLASPNSQKQILIPSAPQKSPKKQPAQVQPKPIFQANRGRGRPMPKPTTIK
RQTKLEKTFTTINQSPLGPGNTVIQLQTNNQTGQPSIIVQPVANQNIMSAYVEALSQQQN
QNMQYIATIAPQSDFKTGPTQFISQGGLMPQTFQLQQTESGGLVAVPSGGIPVLLPQGNV
GILPQTIQQGTILPQGAIQTQGILSQGPNGHTILPQAIHTQNGTTIIPQGTIQGLTNGNI
TSQTATLIPNNTLQSAGATVVPQSGISNGQTTIIPNALSSQGNTLISSAPGTTILPQGTI
LPQFCNDQVLLGSTPTLEMVTDPSGCMYLTTTQPVYYGLETIVQNTVMSSQQFVSTAMQG
VLAQNSSFSATTTQVFQASKIEPIVEVPTGYVVVNNVGDAVASQAVSSTPNVVTAVQSSP
PALKNVKTSVPNLTLQPLPDKQCNNRQAPKVIQMNAPPLAIGSQSGNIMNGKQGLNPITS
QIHSMTLSNNTQSMQRVNISKHNNVSMSPNVTLVSSTGQPLMAISQSVPNTMNSFSIQSI
PLAQPIVSSSIAPSNKNTMKIENSHVIEISGNLHESSNCTSLPTISVTQSMKSPTPWRQQ
DNKTELHNIGLKTSQSSTMNIQNNMISGRQMSLSDSGKDNQNCMMSVNNNANSMNHQFES
TLNVSHHSSAHTTSNNMIQITAGNIYPPSSSMSSNFDADTTMKLSKINALSKTEGGHMNF
SQEIMASHSHQNSHDKNFQNMGANDKHNSDNMHNQINIHKLSSCQVNGNMNNSPSISIIP
PNMVRPQMQNNQNSITVGNNQNQMCSISNNSHGQVQLMNSVNASAMQNVNMSCQEMVNNR
MNMSVSSESNQHEMTNSITTSNMSHANIQMMSHDNNQLSTTNQIQIMNGQIQNNCLVENN
SQMQLNQGNPNRSFHDNIIGNQQGHNQNNMLSNRSTPDTSNLMNQSMNLHNMNNNHTMSR
NDLGEINHMHVNQTLPNMNNMQNNRMMMDNMQSHVMNNVNHQMHTNNRPMDAMHNQQMHS
MQQMNHHMNSNNNRSQMDNNMHNMSQQISMQQNRQIDNVSHHHQHQMSNMMQLQNNRQHN
IDNNLMGIHQHVSNQMHNRQQMDNNMHMHHNNVNMGLSGQLDNTGVMANMHGNRHDNTPM
SMQNINAMNQMQNNRNSMEHNHMQHQINNMGSMNNMNLHNNINNSMQMPNTNHQSINNPI
MHMPNIPDIKNEIQNSCGGSCANSTQQFSNNNQSTFEMCANMSSNNMNMTNTGNMMRGMT
INNTNMSNNMIINNVNNNPYGSNNQNNLTGHDMLISCNSSTEHNVSNLMSSNSNHMMINN
TQTHNNGNQNQIMGNQVQNNPQININSCSMNTNNSTQSTISNIDNQINRNNMAVNEQAKF
VQEPVRGKNIMGPPTNIPITNISNNDCAKVNMGNNDKQNHTISVSNQLGYMQMNPPNGNR
IINLPERSRLNNSNLENNVNQHVPQQHNIRVVTNNNTNNMTGITNDSMTYQQKSESSTMI
NVPFQAIPNSAPMNINVNSSNNTVNITVQNNLVDPKIASKAAADLNFPLQSTDNMICIPV
QNNTISLPTQNSSSITLPKAPSTQQSGIPTNVVNPMSMRPMNRVLPLHRDGQSKSSKTSG
TKTPTVPKQRPVNQDMPDLNMKDENMDSDLEKAIEESKKLMELEKAKKEKEERDAVRAVK
LSTEIQQANTSSTSSSVLVNRIDTVVNVPKMTSLPSLDEEMEQEPAKVLTEKDTNKTVIP
SKLTNSTTKVLKRPLVDKKADAEVTAATKKQNKVLKGQENKFSQPKQVPTPPKDDGPKLI
YEVSSEDGFNYSSSSLTDLWAKVIDAVQNARKQSGLPLIQYNLPSTLSGAHLLGLNNNAL
RYLIEQLPGANRCSKYKIRQGRPLNSWDNDLDLSEGFKESPWGSARTGPISRKQNHDMFS
WMASPFREEPPPFGGQEGESTISRRLATLPPAMRFRQLKETSKASVGVYRSHIHGRGLFC
KRDIEEGDMVIEYAGEVIRAVLADQREKKYEAMSGRRGVGGCYMFRIDDNLVVDATLKGN
AARFINHSCDPNCYSRVVDIHGHKHILIFALRRITIGEELTYDYKFPFEEVKIPCTCGAK
KCRKYLN