New model in OGS2.0 | DPOGS207370  |
---|---|
Genomic Position | scaffold133:- 85377-101935 |
See gene structure | |
CDS Length | 12015 |
Paired RNAseq reads   | 7231 |
Single RNAseq reads   | 17512 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA009003 (3e-15) |
Best Drosophila hit   | split ends, isoform C (1e-76) |
Best Human hit | msx2-interacting protein (3e-59) |
Best NR hit (blastp)   | protein gar2, putative [Pediculus humanus corporis] (5e-134) |
Best NR hit (blastx)   | PREDICTED: similar to CG18497-PA, isoform A, partial [Apis mellifera] (3e-80) |
GeneOntology terms    | GO:0045449 regulation of transcription GO:0007379 segment specification GO:0005634 nucleus GO:0030528 transcription regulator activity GO:0007411 axon guidance GO:0007400 neuroblast fate determination GO:0008347 glial cell migration GO:0007403 glial cell fate determination GO:0007173 epidermal growth factor receptor signaling pathway GO:0016055 Wnt receptor signaling pathway GO:0000166 nucleotide binding GO:0003676 nucleic acid binding GO:0007474 imaginal disc-derived wing vein specification GO:0008407 bristle morphogenesis GO:0007422 peripheral nervous system development GO:0007163 establishment or maintenance of cell polarity |
InterPro families    | IPR012921 Spen paralogue and orthologue SPOC, C-terminal IPR000504 RNA recognition motif domain IPR016194 Spen Paralogue and Orthologue SPOC, C-terminal-like IPR012677 Nucleotide-binding, alpha-beta plait IPR010912 Spen paralogue/orthologue C-terminal, metazoa |
Orthology group | ND |
Nucleotide sequence:
ATGGCTGCATCAGCAGCCGCTCCGGCCACGTGGCGTGGCACCAACGACAGCGCCACCGAG
TATTGTCGTCGCGGTAACACACCCGCGGCATATGGCAGGACGACGCCGCACCACCGCTGG
TGTTCGTCTGTGGGCGGCGGCGCGGCCGGCGGCGAGAGCACCCCGAGCACGCCCGGCGGC
GGCACCGAGCGTCGGCGGCGGCTCTCCGAGTCCGGCTCCTCGCGCTCGGAGACCAGCTCG
CCCGAGCCCAGCGACACCTCGCGCGCCTCCACGCCGCCCGCCGACCATCACGCCGCGCAC
CACACGCCCAGGAGGACGCCGCCCACGCATACGCATCAGTGGCCGTCAACAGCGAACGGC
AGACCACTGGCGATATGCGTCCGGAACCTCCCCACTCGGTCGACCGACAGCTCCTTGAAG
GACGGTCTCTATCACGAGTACAAGAAACACGGGAAAGTGGTGTGGGTGAAGGTGGTCGGT
CAGAATGCCGATCGGTATGCGGTCGTCCGTTTTAAGAAGCCGTCCGACGTAGAGAAGGCA
CTCGAAGTGTCTCAAGATAAGCTTTTTTTTGGTTGCAAGATCTCAGTGGCACCTCACCAA
AGTTGTGACGACGATGCGGAATCGGCGAAACCTTATGAGACTGATATTGATGAATACCAT
CCGAAGGCGACCAGAACGTTGTTTATCGGTAACCTGGAAAAGGACGTTACACAACAGCAA
CTTAGGGATAAATTTAAGCATTTCGGGAGAATAATCGAGATCGATATTAAGAAGGGTAGC
GGCGGGGGCGCCGGCTACGCTTTCTGTCAATACGCGTCCATATCGAGCGTCGTGGAGGCG
ATAAGAGCCATGGACGGAGAGTACGTTGGCAACTCGCGCGTCAAACTCGGCTTCGGGAAA
CCGGTCGCCACCACCTGCGTCTGGGTCGACGGTCTCACCGAACACACGGAAAAACAGGTA
CTGGGCGCCGTGTCTCGTTGCGGCGCGGCGACGTCGGTGTGCGTGGACCGCGCGGCGGGC
GCGGCCCTCGTCCACTTCGAGCAGGCGGCGGCGGCGGGGGGCGCCGTGCGGGAACTCCGA
CGCGTCGCCGCACAGCTCTCCGCCGCCGAGCCCGACCACCCGAGGCTCTGTGTCGACTAC
GCCTCCAGGGAGTGCCAGGAAGCGTTTTACGAGCAATTGGAAAAACATGGTGGTTCTGCT
GCCCTCGCAGGAGCTGAACGGGTTGCGGGAGAACTGCCCGGCCGGTACGTAGCGCCACGG
CACGAGACTCTTCGATACGAGGCCGCTGGGACGAATCGCTCGAGAGCGCCGAGTTTTAGT
CGTGCCTCGTCACGGACTCCGAGATATAATGCAAACGATCATTACGATCCTGCTGAATAC
GCGGCCGATCGTAGATACAGGGTATTTGATGATTTAGGATCTAGTCCACAAACAGATGAT
GCAAACTATGAAGAACGTTTGCAGTCAGTAGTCGTTTCTCCTCACCGTGGGCATAAACAT
CGGAGGGATTCCAGTCCCGAAGGAAGAAAGCATTCAAAGGAGCGGCACCGTAGCGCTGGT
GGCGGATCTCGTCGATCCCGATCGGGGTCCCGCGGTCACTCATCCCGGCGGAGGCATCGC
AGACGACGCGATGGATCTGAATCGAGAGGGTCTCGGGCGTCACGTGCTGGTACTCCCTTG
AGAGACGAGCTAGATGCGCCCCCCACGGAGCCGCGTCGCCCACCGCGGGAGCGACCACCG
CTGCCCATGAGTCTCCCTCTGCCGAAGTTTGCCGTACAGTTGTTACGAGCCGCACCACCG
CAGCCACGCCTTCTGCCACCGGCCCCCGCATCGCCGCCGCGCCCGCCCTCCGCTTCTTCC
TCTAGTGGAGGCTCGGCGCCTCACTCACCCTCGCTGGAGGAGCGTATACGGTCTCTTGAC
GAAAAGTATGAGAGGTGGACGGGATCGCGAGCCCACATCGACGCTCCTGATAGATCCCGA
CTTCGTCATCGTTTACTGGAATTGGATATCAATGAAGTGAAGCCCTCGGATGTAGTTCGT
TCTCTTTTGGCTAAGCGCTCTGTGTTCGATGAGGATTCTGAACGGTTGGAGGGTGCCACC
CGCGCACCGAGCCCGGGAGGTAGCCCGCGTTCTTTACCGGTTTCGACGTCTCGTGTCCTA
CGATATCCTTTTCCCGTTCACGGAATGCATACTTCAGCTGTCAATACGCCATTAACGAAT
CCAGCGGGTCTCACGCAACCACCGGAACCAGAAGAATATGATAGAAGTAGACTATTAAAT
ATGGATAAAATAGCCAAGCATGTAGACATCGAGAACGGTCAAGAGATAAGAATGCGATTG
CGCACTCCTTCGATTGATAGGCCAATGTTGGAAAATTTTGAAAAGAAGCATTCTCCGACA
AGTGAAAAATCCGAAATCCGTTCACTGCCAGAAACAGAAACACAAGAAAATAGAAGAAGA
AACTCTTTCATATCAAATGTTGAGGACAGTAAGACTGCAACAATAACGGACAAGGCGACC
TTGGAAAGAATAAGTCTAGAAAAATCCATTGAAAAAAGTATTTTGGAAATCCAAGAAAAA
TATAATATTAGAGACAACAAAATTATTTGTGAAGATAAAATTACAACCTCCGCGGTAACA
TCTATCAAGAAGGAGGAAATCATTGAAGAGAAAGTTAAAGCTAGTGATTTCTTTAATAAG
TTCGAAAGAAAATGTTTCAGGGAAGATACTAAATCACGATGTCTAGAAACAAAAAGTGAA
TTAAAACAAGAGTTGAGCATGGACATGAACAAAAGTGATAAATATCTGTTAACAAGCGAT
TATTCTTCTAAGACTGAAGTAATGCTTATGGATATGAAACAAAAGCTAGACCACGATAAA
AAAAGTGAATTCAATTGTCAACCCAAGCAAAATGAAAGATTTAAAGACAAGTATCATAAT
GCCACTAACAAGAATGCTACCAACTTTAGAGTGCAATCAGAAAGTCATTTTCATTCATTA
AATAGTATTGATTTACCTAAAAATAAAACTGAAAAAAGTAACAAGGAGATGCTTTTGAAA
AGTATCACTTGTGCAAAAGATAATGTTCTTTTGAAACAAATTGATGAGACACACCACGTA
AAATGTGATAAATCTAATAACGTGAATGTATCAACTACTTCTGTACCGCCCGAAGAAAAG
GCTGTTGGTAAACTTCAGGAGGACTGTGCTAAAATTGAAAAAGATAAATTAAGACATATT
CGTGACAAAATAGACAAGGAAATAGAAAGAGATAAATCAAAAATCAAGCAAACTAGTTCT
AAGTTTGATAAACACGACAAAGAACGTAAACTTAAAGATGATTCTCTTAGTACAAAAATA
CATACAGATAAATCTGATAGAACTAGAAACGAGAAAGAAGGAGACAAAATCTTAGATAAA
GATCGCGATTTTGAAAGATCTAAAGTAATTGAAGATAAAACATCTCGTCATGACAATCAC
AAAAGAGATAAACAAGAAAGGGACAGGAAAAAAGATATCCCAGATAATGAAAGTATGGCT
AAGGCTTCAAAAAAAGAAGAAAAGCATAAAAATGATCGACCGAAAAAGGATAGTGATAAT
AAGCGTGATAATAAAAGGGAAGGGGACTCATCGAAAAGTAGAAAAAGTTCCAGAGATGAA
TCTATTAGAGATATTTGTCGAAAAGATTCGACAGACTCTACAACATCAAGAGCGTCCCAC
GATTCTTCAAAATCTAGGGAATCTGAAAACATTGAGCTAAAGGAAGACACGAAAACAAGA
ATTAAGAGTCTCTCGGATGTAGTGAATAAAGTAGATACAACTGAAAAAGACCCAACAATC
AAACATCACGACCACCTTAAATTGGATGGTAAAACAAAAATTAAAAGTGAAATTAAAGAT
GAAAAAGATCCTTCTGAGCAACATTTTAGGAATAAAATAGAAAGCTATGTTGATAATAAG
AAAATTAAAACAGAGAATTCTGAAAAGTCGAGACATTATTCACTAGACTCACCTAGCGTT
GACATTAAAAGAAAAGAGCGTTTAAACTCGTGTTCAAGTTTGCCGTCTAATATAGGCCAT
AAGCGAAGAATGTCTTCACAAGACAGCATAGATTGTCTCAATGAGGAAATAAAGAAAGGT
AAAAGTGAACGTCGCGAATCTAAAGACTGTAAATCTGTTGAAAGACACAAAACCACAAAG
TTTAGTAAGGGACATTTCGCTAAAATAATTGAGAGTAAAACTAAAGACGACAAGAAAAAT
CAAGTCAAACCACCTGATACAACTTTTACTAATCCTAAATGCTTGGACACAAAAGAAATC
GGTAGCGATAAACTAAAGACATCCAGAAAAAGTCCGATCGAAAGTATTGAAAGCAAAACA
ATATCTGCTCCAACAAACATTTCATCCGAACCCCTGCATAATAATTTTGATTTTCTGGCT
ACCTTAGAACTTCGGTCTAGCGAAGAAGACGAGAAACAAAAAGCTTTAAGGAAAGAGATG
AAGGAGAAGAAACGAATACAACAATTGCAACAAATTCAAGAACTTCAGATGCAACAAGAT
GCCTTACAGCAAGCAGAGATAGGTAACAAAATAAAAGATGATAGAAGAATTAAAAGTGAT
GACAAGAAAAAAGATATTTCCAGAGAGAAAAGGATGTCTACTGAAAGAAAAAGCAAGGAT
GAGAAAACTGATAACATCAAGCGACGAAATAGAAAACAATTACATAGTACTGATACCTCG
GATTCAGATGAACCCAAAAAGCATTCTATATTTGATATAGTTGATGACGAATATAACATA
TCTATGTATGATAAAGTGAAAGCTAGATCATGTAAAAATATGCAAAAACAGGAAGAAGAA
AAGCGACAAGAAAAAATAAAAGCCAAGTTTAGTCAATTGAAACAAAGTAGGGCTAAACGC
GAGGAAAAGAAAAGATCAAGTTGGGATGAAGACAGTGATTCTGATAGTGAAAGACGTAAA
TCTCGTAAAATATCAATGGATAGTTCATCCGATGAAGATCACATTACAATACACAGTAAA
AAACGTGAGAAATCACAAGGGTCGAGTCTTGAATATGATCGTAATAGAACTAATGATTAT
TTCGACACAATAAGCAATGAAGAGGATTCCTCTAATAAATTATCGCGTAAAAATTCAAGA
ACACGGATTATGTCTGACTCATCTGATGATGAGAACAGTCGGAGGAAAGCCAGTAAAAGT
CCATGTTTTATTGATAGAATTAAGAAAGAAGTTTTTTCAGATTCTGAATCTTCTACGAAA
TGCAAAGACATTGATGCAGATATACAGTTAAAACAACCATGCGAAGATAAAGTTAAGAAA
ACATCGCTTTTAAATTTGTTCGGTAAAAGCGATAGCGAAGATAATAGACTTAAACCAACA
TTTGAAAATGATAATGATTATAGAATGTCATTCACAAAAACTTTCCCAAATGACTTGTCT
TCAGAAAATGAATCAATACCGAATAGAATATCTTCTGAATTACGCAGAAAACATAAGAAA
AAACAAAAGAGATATAAGTTTACATATTCGGACGAGGAAAACAAAAGTAATCATGATGGT
TCTACAGAAATTAGAAACAAACATAAGAATTCTGATAAAAGTCGCCGACACAGTAATAAA
AAAGAAAAAAGAAGAGATAAGATACGTGATAGTATTGATGCTGACGAACCAAAGGAAGAA
AAGGATAAAATAAAAATTGAGAAGTGTTCTCCCAACCAATCATTTGACACAGTGTTGGAG
GCAATGTCAAATAATAACAAAAAGGAAGGCAAAATGGAAGACATATTTGGTCCTTTGTCA
GATGAATCTGACAGAGAGGTCAAGACTATAAGCAATAGAGTTGATGTTTTGCCACCAGAA
TGTGAGACATCGTTATGTACTGCAGCTGATAAATTAAGTAGTGATGAAATGAAGATCAGA
GACAAGGAGGAAATTAAAAGGCGAAAAGAAAAGAGACGAAAAGAAAAGAGACATTTCGTT
AAAGAAGATGATAACAGCTTGGATGTAGATGCTGTAAGTAAAGCTATTGAAGCTAGATTA
TTCGAAGAGTCAAATGTTGACAACAACCATTCTAGATCAGAAAAAACAGAAGCTAAAGTC
AATTCTGAAAGTGATTTGAAATATATGGGTATCAATTATAAAGGAGACTTAAATAAATAT
AATGAGAAGTCCAAAAAAGAATCTCGAGAGAGAAAAAAGAAAAAGAAGAGAAATAGAGAA
GAAAGACAAAGCCGTAAAGAACATCATCATGAGAAGTTACATAAAATGGATTCATTTTTA
GATAATCATATGGAACAATTATCACCAAAAACATTACTCGATATACCATTACCTACCGAT
ATTCCTAAAGTTGAAATAATAGATAAGTCGGATGACAGTAAATCTTTATCTGAATCTCCG
AGTTTACCAAGAATAACAGATAGCCCTACCTTTGTTTTAAATGCAGAAGATACCAATAAC
ACTACAAATGAGTATGATACTCCTAAAAGTCTCGTCGTTGACGATGAAGTCGAGATTGAT
TATGAAATGACAAGAAATATTGAAGTGACAGACATACCCATGCCTCCCCCTATAGACAAT
ATTGTTCAAGACATCAGTGAAGTGCCCCTACCGAAAGATCCTCCAATTGATCAAGTTTTA
GAAATATCATTACCTGATAGTGAGGATTCTAATGTTAACACAGATGTAAAAGAAAGTGAA
GTGGCGGTTAAAAGCATATCTAATTTAGAACCAGAAATTGAAAAACCTGTTGTGCAAAGT
GAAAAAAATATAAGTGACTGTAAATTAGACAAAAAATTAGAAGAAAAACCTCGGGCTATA
ATTTCACAAGAAGAAACTGAAGATGCTGTAGCCGCTTTACTTGGTGAAAGTTTTGGAGGT
AAAGTAAATACTTTTGATTGTTATGAAGAAGTAGAATGCAACACGTCCAATGAAATGGAG
ATAGAAAATACTACGATCGAAAGTGAACTTATCCCTGAAGAAGATGCAGAAGAGATGAGA
CAGGCTGTGCAAAATTTAAATGCCAGTGAAATGGAAATGAAACCTGATACACCAGTGTCT
GACGATAATCTTTTGTTGATTGACACGGATACCGAGGAAACAGAGGACACTACTCAGGAT
ACTATAGACAGATTACCGGTTGACCTCATAGCAACAAACCAATCCCTTAATAATAACACC
AAAATAACTGTTGCTCAAACTACCGAAGTTAGTAGCATTGTTATCACGAAGCCGAAACCA
ACAATGGCAAATTCTGTACAGAAAGAAATTAACGAATCGATTAATAATGTAAAGGAGGAA
CAGGTTAAACCCCCTGCTAAAAAACAAGAAACTGTGGAACAAATTACATCAACAGCGACT
CCAGTCATTACATCTTGGACTTTGTCAAACAATAAGTTAATAGAATCTCAAGTAGCAAAT
GTTCCTGTTAGTTCAGCTAGCAGCCGAGAAATTTCTGAAAATACCCAAACTCATATAAAA
ACAAATATTGTACAGATTAAAACATCACATAGTCCCAAAGTTCAGATAAATAACACTTTG
AGGCCTATTATAAATTCAAACAGGGTTAGTACAGCTCCTTACCAAGTGATAAATCAAATG
ATTAGACCTCAAGTATCTAACATTCAACCTCCAACAATCAAAATTCCTGAACCCCATATT
ATTTATCAAAAGCCAACTAACATTGTTATATCACCAAGGATAAATAGTGAACCTTTAATT
TTGAGTCCGAAATCTAGTGTACAAACAGAAGGTATGACTTCACCACGTTTACCTAATATG
GCTTTATTGAGTACTTCACCTCAAAATGTTGTTGGATCACCGAGCACAGCGCAGCAACGC
GCTCAGGGTCAACAAGTAACAGTGGTTCGAATGCAACAGTCACCACTAAGTCCTATACAC
AGTATGCATATTCCGCATGCGAGAGCTATGGTGTCGCCAAACAGACCCAATTCTGTGTTA
GTACAAACTCAGGGAACTCCAATACACTTCAATCGATTACCTGTAACACCAGTTCTTGCT
CCCATTTCAAAGCAAATAAATGCTAACATCATTCAGTCAACCAAAGGAAGCGGATCTTAC
GATTCTGTGTTAATTCAACAGCAAAAAATATGTCAGACAATGCCAGGAGACAGGGGAAAA
ATGGAGAGTCCACCAGATGCTTCGAAAATTATTTTGTCACCTAACAGACTGCAAAGCTCG
GTCTCAACTGTAATGGCACAAAATCGTCTCATACCTGTGCAAAATTCAATTCATGTGAGT
AATATAAATTCCGGCTTGAATTTACCAAACAAAGTCTTAATAAATAATAAAAGCCCATTC
AGTGACAAAAGAGATACTCACATAAGCAAATCTGAAAATATCATCGCCGCTACTTCTTCG
CCGATTATACATTTATCGGGTCCTTCTTATTCTTCTTCTAATATAATTCAAGCCAATACA
AAATCGGTGACGTGTAATCTTCAAGAAGTAACTACTGTGAGCAGAGGACACGGATCAAAT
GTCATCCATTCCATAAATACACAAAGACTTGTATCTTCGAGTCCTATAACAAGTGTTATT
CAACTAGAACCTGCAAAAGGCCCTCCTTCTGTTTTATCAATGGCGGCTGTACGAACGTCG
TCGCCTGGAGTTAAACAAGAGTTGTCGACCACAGTTGTTGTTACAACATCCAGCCTCGCT
AATGTTGTGATGACTCCATTGCTTTTGAAAACTAGTCCTAATCCAAAACCTACTTTAAGG
CTGCATACTAGAAGTGAAAGTGAGCAGGTTGACGCCCATTGCAATAACGAAATCCAAGCC
AGCGAGGATAGAACGGAAACATCTCATAAAGAACTTGGACAACAAAATAATATACAACAA
CCTTCACCTGTAACCCATCTATCACCCATAGATCGTGTTGGTAAATGTGAAACTGATTTC
GAAGAGCTCTTAAAAGATAACATAAATGAAATTAAAATAACAAATGAACATCACAAAAAG
ATTGACATTAATGTTAATAAAAATGTTGCAATTATTGGTCAAGTACAAAATAGAAGAAAC
AGCATTGATAATGATGTAGCTCTTATGAAACATGAAGTAGAAAATTTTATAAAAAACGAA
GCAGACAACGAAAACACTATTAAAAAATCTGACTCTGTAGATATGTCTTCAGTAAAAAAA
TGCGAAAATGAACTATCAACAAAGATTAAACCTGATAACAACATGCTATGTATAACAGAG
AAAATTCCTAAAACTGATGAGGATGACTTATCGAAACCAATGGAAAGTAACGCAACTATC
GCAAAAGAAAACCATGCTCCCGCAATACCAACTAGTGTCGATATAAACGATTCTATAAGA
CCTGAATCCGGTGTTAACAAAAATGATTTCTATAACCCACTTTGTATTCAAACGCAAGCA
CCTACAAACGATATAAAAAGTCTAGATGAAAGTGAGTATTGGTCAGGCAAAGATATAATT
AATATAGAATCTGTTATTAAGAAAGTTGATTCGTTATGTAGTGACAATGTCGAAGAAAAG
AAAATAGAAGAAAATACAACCAAACAGCCTTTAGTAATTGAAATGAAAAAGTCCGTTGAA
AGAAATGTTCAAAATGAGATTTTTATTGAAGAAAGTGAGACCGCTAAAACTGAAAAACCA
AATGAGAATAAAGTTGTTGAAAACGCAATAGATCCCTTTCAAGATAATAGTTTGATGGAA
GATTATGTTACAGAAAAAATTGGTACTGGTAAAAGAGGTGGCCGTAGTGGTAGGGGTAAG
AAGTCGGATAGAAATCCGGATAAGGTACATACCCGTCAAATAGCAAAAACTCCTAGAGGT
ACTTCGAAACGTGGCAGAGGTCGGGGGAAGGTAGACAAAAAAATTAAAAACATGATAAAT
ACCAATGCTAATAATATGCCTGGTGATGTGTACGATTTTCATGAAGATTCTGGGGATGAA
ACAGTCTCGTCACCTAATAAAACAGAAGTAAGACCACGTTTAATATTGACTATAAAAAGT
CCTCTTTCTGGCAATTCCAATACCGTTGTTTCAACCTTAAGTATAGCACAGAAGGACCCT
GTAAAAGTCACTGAAAAACATAAGGAAGAAAAATCTGATGTCTTTGCATCGCCCTCAATA
AATACAAGAAAGTCTCGACGGCTCCAAGAAAAAGATATACAAAGGAGTTCCGTCGATGAC
ATCATTGATGACGTCGTAAAAGGCACGGCCAATCAATTTAAAACTAATAACAAAGATTCG
AATAAAAAGAAATGTACAAGACAAACTGTCAATAAAACGGGCGATAAAACGCTCTCAAGT
GACACACGGAAATCTCCCAGAGGAGTTAAAAGAACAAGGGATAGAAGTTTGTCAGATGCT
TCCACGGAAAGCTGTGAAGAAAATATTAAAACTGACGAAATAATCAAAGAGTCCAAGATA
CCGAAATTAGAACCCACGATCCCACCCACCAAATGTACTCCCGAACCTGTCGTTAAGCCT
CTTATAGTTACCAGTCAGAGTGCCACCACCTGTCCACCTAATAACAACCCGATACCTATA
ATGAAACCACCCAAGAAGATGATATCCGAAATTAGCGCAAAGCTCGCCAGCGCTTTCGAG
GCTGCTGCTAACGCGACCAACCGTATTCAAGAACGATCCATCTTACCGTCGGAGCCGGAA
CGTCTCCCGCCACCGGCTGAGGGTGCCATTGGTGATGGCGACGCGGGCTACGGGCGACGA
CTCGTGGACAATGTACCCTCTAACATGATGCCCGTGGGAGCCACGGAGGCGACGGATGCG
AGGGTACAGTCCCCCGCCCTGCCGCATAGACCACCTTCCTCACATCACCTGCCCGATAGA
GCTACGCCCATCCTTGTCAGGGGTGGCGATGGAGAAGGAGGGCCGATCGCCGCCATGGGT
CCCTTGGGCGCGATGGGCGCTATGGGTGGAGCTATGGGTGGTATGGACGCGGGGGCAGCG
CTGTACCGGGGCGCGTACCCGGGCCCGGCCAGCTTGCCGCGGGGTGCACACCATCCCCCA
CTCGCCAAACAGGTCGCCGTCATCGGACAAAATCATCATCCAGGATCACGCGTTACTATC
ACCAGTGTACCCGCTGGCAGTCCACAAGGACCTATGGCAATTGGCGAAGGTGTATACCCG
CATTTTTCACATCATCATTACCAAATGTATCAGCAACATTTCCGTGCGACCCAACAAGAA
AACAGGGACACGCCAACATCCTTTACTAGAGGCGCCTTGGACGCCGAGGGCTCGGAGGCT
CCGACGCCTCCTCTGGAACTGCGTCGGCCTCCATCGGCTCGTGTGCCCCGTCCGGCGCAT
TCGCCCTCGCCGCACGACCGGCATATCATGTACGCGGTGCGATGCGGTCGGTCTCCGCCT
CCGGCTCACGGCACGTCTCGGCCGTCGTCCGCGCTGCCCCCGCCCGCCGCGCCCGGCCCG
CCACACGCCTCGCAGGTCCCCAGGGAGGCGGACTCCCTACAGATGCTTTTGCGGCGCTAC
CCGGTCATGTGGCAAGGTCTGCTGGCTCTCAAGAACGACTCGGCGGCCGTGCAGATGCAT
TTCGTGGGTGGCAACCCTGGTGTGGCGGCTGACGCGCTGTCGCGACACTCTGACGGCACG
GCCGCCTCTTTATTGCGCATCGCTCAGCGCATGAGGCTAGAACCAGCTCAGCTGGACCAA
GTACATCGCAAGATGAAGCTCGAAAATGAACACTGCATGCTGCTAGCGCTACCCTGTGGC
CGCGACCACATGGACGTGCTTCAGCAGTCCACCAACCTGACGGCCGGATTCATCACCTAC
TTGCAGCGCAAGCAGGCTGCCGGCATCGTTAATGTAGCGCCCACCCCGGGTCACCATCAG
TCTATGTACACGGTGCACATATTCCCGTCGTGCGAGTTCGCCAACGAGAACCTGACCCGC
ATCGCCCCCGACCTGATGCACCGCGTGTCCAACATAGCTCACCTGCTGATCGTCATCGCC
ACGGCCATCGGATGA
Protein sequence:
MAASAAAPATWRGTNDSATEYCRRGNTPAAYGRTTPHHRWCSSVGGGAAGGESTPSTPGG
GTERRRRLSESGSSRSETSSPEPSDTSRASTPPADHHAAHHTPRRTPPTHTHQWPSTANG
RPLAICVRNLPTRSTDSSLKDGLYHEYKKHGKVVWVKVVGQNADRYAVVRFKKPSDVEKA
LEVSQDKLFFGCKISVAPHQSCDDDAESAKPYETDIDEYHPKATRTLFIGNLEKDVTQQQ
LRDKFKHFGRIIEIDIKKGSGGGAGYAFCQYASISSVVEAIRAMDGEYVGNSRVKLGFGK
PVATTCVWVDGLTEHTEKQVLGAVSRCGAATSVCVDRAAGAALVHFEQAAAAGGAVRELR
RVAAQLSAAEPDHPRLCVDYASRECQEAFYEQLEKHGGSAALAGAERVAGELPGRYVAPR
HETLRYEAAGTNRSRAPSFSRASSRTPRYNANDHYDPAEYAADRRYRVFDDLGSSPQTDD
ANYEERLQSVVVSPHRGHKHRRDSSPEGRKHSKERHRSAGGGSRRSRSGSRGHSSRRRHR
RRRDGSESRGSRASRAGTPLRDELDAPPTEPRRPPRERPPLPMSLPLPKFAVQLLRAAPP
QPRLLPPAPASPPRPPSASSSSGGSAPHSPSLEERIRSLDEKYERWTGSRAHIDAPDRSR
LRHRLLELDINEVKPSDVVRSLLAKRSVFDEDSERLEGATRAPSPGGSPRSLPVSTSRVL
RYPFPVHGMHTSAVNTPLTNPAGLTQPPEPEEYDRSRLLNMDKIAKHVDIENGQEIRMRL
RTPSIDRPMLENFEKKHSPTSEKSEIRSLPETETQENRRRNSFISNVEDSKTATITDKAT
LERISLEKSIEKSILEIQEKYNIRDNKIICEDKITTSAVTSIKKEEIIEEKVKASDFFNK
FERKCFREDTKSRCLETKSELKQELSMDMNKSDKYLLTSDYSSKTEVMLMDMKQKLDHDK
KSEFNCQPKQNERFKDKYHNATNKNATNFRVQSESHFHSLNSIDLPKNKTEKSNKEMLLK
SITCAKDNVLLKQIDETHHVKCDKSNNVNVSTTSVPPEEKAVGKLQEDCAKIEKDKLRHI
RDKIDKEIERDKSKIKQTSSKFDKHDKERKLKDDSLSTKIHTDKSDRTRNEKEGDKILDK
DRDFERSKVIEDKTSRHDNHKRDKQERDRKKDIPDNESMAKASKKEEKHKNDRPKKDSDN
KRDNKREGDSSKSRKSSRDESIRDICRKDSTDSTTSRASHDSSKSRESENIELKEDTKTR
IKSLSDVVNKVDTTEKDPTIKHHDHLKLDGKTKIKSEIKDEKDPSEQHFRNKIESYVDNK
KIKTENSEKSRHYSLDSPSVDIKRKERLNSCSSLPSNIGHKRRMSSQDSIDCLNEEIKKG
KSERRESKDCKSVERHKTTKFSKGHFAKIIESKTKDDKKNQVKPPDTTFTNPKCLDTKEI
GSDKLKTSRKSPIESIESKTISAPTNISSEPLHNNFDFLATLELRSSEEDEKQKALRKEM
KEKKRIQQLQQIQELQMQQDALQQAEIGNKIKDDRRIKSDDKKKDISREKRMSTERKSKD
EKTDNIKRRNRKQLHSTDTSDSDEPKKHSIFDIVDDEYNISMYDKVKARSCKNMQKQEEE
KRQEKIKAKFSQLKQSRAKREEKKRSSWDEDSDSDSERRKSRKISMDSSSDEDHITIHSK
KREKSQGSSLEYDRNRTNDYFDTISNEEDSSNKLSRKNSRTRIMSDSSDDENSRRKASKS
PCFIDRIKKEVFSDSESSTKCKDIDADIQLKQPCEDKVKKTSLLNLFGKSDSEDNRLKPT
FENDNDYRMSFTKTFPNDLSSENESIPNRISSELRRKHKKKQKRYKFTYSDEENKSNHDG
STEIRNKHKNSDKSRRHSNKKEKRRDKIRDSIDADEPKEEKDKIKIEKCSPNQSFDTVLE
AMSNNNKKEGKMEDIFGPLSDESDREVKTISNRVDVLPPECETSLCTAADKLSSDEMKIR
DKEEIKRRKEKRRKEKRHFVKEDDNSLDVDAVSKAIEARLFEESNVDNNHSRSEKTEAKV
NSESDLKYMGINYKGDLNKYNEKSKKESRERKKKKKRNREERQSRKEHHHEKLHKMDSFL
DNHMEQLSPKTLLDIPLPTDIPKVEIIDKSDDSKSLSESPSLPRITDSPTFVLNAEDTNN
TTNEYDTPKSLVVDDEVEIDYEMTRNIEVTDIPMPPPIDNIVQDISEVPLPKDPPIDQVL
EISLPDSEDSNVNTDVKESEVAVKSISNLEPEIEKPVVQSEKNISDCKLDKKLEEKPRAI
ISQEETEDAVAALLGESFGGKVNTFDCYEEVECNTSNEMEIENTTIESELIPEEDAEEMR
QAVQNLNASEMEMKPDTPVSDDNLLLIDTDTEETEDTTQDTIDRLPVDLIATNQSLNNNT
KITVAQTTEVSSIVITKPKPTMANSVQKEINESINNVKEEQVKPPAKKQETVEQITSTAT
PVITSWTLSNNKLIESQVANVPVSSASSREISENTQTHIKTNIVQIKTSHSPKVQINNTL
RPIINSNRVSTAPYQVINQMIRPQVSNIQPPTIKIPEPHIIYQKPTNIVISPRINSEPLI
LSPKSSVQTEGMTSPRLPNMALLSTSPQNVVGSPSTAQQRAQGQQVTVVRMQQSPLSPIH
SMHIPHARAMVSPNRPNSVLVQTQGTPIHFNRLPVTPVLAPISKQINANIIQSTKGSGSY
DSVLIQQQKICQTMPGDRGKMESPPDASKIILSPNRLQSSVSTVMAQNRLIPVQNSIHVS
NINSGLNLPNKVLINNKSPFSDKRDTHISKSENIIAATSSPIIHLSGPSYSSSNIIQANT
KSVTCNLQEVTTVSRGHGSNVIHSINTQRLVSSSPITSVIQLEPAKGPPSVLSMAAVRTS
SPGVKQELSTTVVVTTSSLANVVMTPLLLKTSPNPKPTLRLHTRSESEQVDAHCNNEIQA
SEDRTETSHKELGQQNNIQQPSPVTHLSPIDRVGKCETDFEELLKDNINEIKITNEHHKK
IDINVNKNVAIIGQVQNRRNSIDNDVALMKHEVENFIKNEADNENTIKKSDSVDMSSVKK
CENELSTKIKPDNNMLCITEKIPKTDEDDLSKPMESNATIAKENHAPAIPTSVDINDSIR
PESGVNKNDFYNPLCIQTQAPTNDIKSLDESEYWSGKDIINIESVIKKVDSLCSDNVEEK
KIEENTTKQPLVIEMKKSVERNVQNEIFIEESETAKTEKPNENKVVENAIDPFQDNSLME
DYVTEKIGTGKRGGRSGRGKKSDRNPDKVHTRQIAKTPRGTSKRGRGRGKVDKKIKNMIN
TNANNMPGDVYDFHEDSGDETVSSPNKTEVRPRLILTIKSPLSGNSNTVVSTLSIAQKDP
VKVTEKHKEEKSDVFASPSINTRKSRRLQEKDIQRSSVDDIIDDVVKGTANQFKTNNKDS
NKKKCTRQTVNKTGDKTLSSDTRKSPRGVKRTRDRSLSDASTESCEENIKTDEIIKESKI
PKLEPTIPPTKCTPEPVVKPLIVTSQSATTCPPNNNPIPIMKPPKKMISEISAKLASAFE
AAANATNRIQERSILPSEPERLPPPAEGAIGDGDAGYGRRLVDNVPSNMMPVGATEATDA
RVQSPALPHRPPSSHHLPDRATPILVRGGDGEGGPIAAMGPLGAMGAMGGAMGGMDAGAA
LYRGAYPGPASLPRGAHHPPLAKQVAVIGQNHHPGSRVTITSVPAGSPQGPMAIGEGVYP
HFSHHHYQMYQQHFRATQQENRDTPTSFTRGALDAEGSEAPTPPLELRRPPSARVPRPAH
SPSPHDRHIMYAVRCGRSPPPAHGTSRPSSALPPPAAPGPPHASQVPREADSLQMLLRRY
PVMWQGLLALKNDSAAVQMHFVGGNPGVAADALSRHSDGTAASLLRIAQRMRLEPAQLDQ
VHRKMKLENEHCMLLALPCGRDHMDVLQQSTNLTAGFITYLQRKQAAGIVNVAPTPGHHQ
SMYTVHIFPSCEFANENLTRIAPDLMHRVSNIAHLLIVIATAIG