DPGLEAN20839 in OGS1.0

New model in OGS2.0DPOGS207370 
Genomic Positionscaffold133:- 85377-101935
See gene structure
CDS Length12015
Paired RNAseq reads  7231
Single RNAseq reads  17512
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009003 (3e-15)
Best Drosophila hit  split ends, isoform C (1e-76)
Best Human hitmsx2-interacting protein (3e-59)
Best NR hit (blastp)  protein gar2, putative [Pediculus humanus corporis] (5e-134)
Best NR hit (blastx)  PREDICTED: similar to CG18497-PA, isoform A, partial [Apis mellifera] (3e-80)
GeneOntology terms














  
GO:0045449 regulation of transcription
GO:0007379 segment specification
GO:0005634 nucleus
GO:0030528 transcription regulator activity
GO:0007411 axon guidance
GO:0007400 neuroblast fate determination
GO:0008347 glial cell migration
GO:0007403 glial cell fate determination
GO:0007173 epidermal growth factor receptor signaling pathway
GO:0016055 Wnt receptor signaling pathway
GO:0000166 nucleotide binding
GO:0003676 nucleic acid binding
GO:0007474 imaginal disc-derived wing vein specification
GO:0008407 bristle morphogenesis
GO:0007422 peripheral nervous system development
GO:0007163 establishment or maintenance of cell polarity
InterPro families



  
IPR012921 Spen paralogue and orthologue SPOC, C-terminal
IPR000504 RNA recognition motif domain
IPR016194 Spen Paralogue and Orthologue SPOC, C-terminal-like
IPR012677 Nucleotide-binding, alpha-beta plait
IPR010912 Spen paralogue/orthologue C-terminal, metazoa
Orthology groupND

Nucleotide sequence:

ATGGCTGCATCAGCAGCCGCTCCGGCCACGTGGCGTGGCACCAACGACAGCGCCACCGAG
TATTGTCGTCGCGGTAACACACCCGCGGCATATGGCAGGACGACGCCGCACCACCGCTGG
TGTTCGTCTGTGGGCGGCGGCGCGGCCGGCGGCGAGAGCACCCCGAGCACGCCCGGCGGC
GGCACCGAGCGTCGGCGGCGGCTCTCCGAGTCCGGCTCCTCGCGCTCGGAGACCAGCTCG
CCCGAGCCCAGCGACACCTCGCGCGCCTCCACGCCGCCCGCCGACCATCACGCCGCGCAC
CACACGCCCAGGAGGACGCCGCCCACGCATACGCATCAGTGGCCGTCAACAGCGAACGGC
AGACCACTGGCGATATGCGTCCGGAACCTCCCCACTCGGTCGACCGACAGCTCCTTGAAG
GACGGTCTCTATCACGAGTACAAGAAACACGGGAAAGTGGTGTGGGTGAAGGTGGTCGGT
CAGAATGCCGATCGGTATGCGGTCGTCCGTTTTAAGAAGCCGTCCGACGTAGAGAAGGCA
CTCGAAGTGTCTCAAGATAAGCTTTTTTTTGGTTGCAAGATCTCAGTGGCACCTCACCAA
AGTTGTGACGACGATGCGGAATCGGCGAAACCTTATGAGACTGATATTGATGAATACCAT
CCGAAGGCGACCAGAACGTTGTTTATCGGTAACCTGGAAAAGGACGTTACACAACAGCAA
CTTAGGGATAAATTTAAGCATTTCGGGAGAATAATCGAGATCGATATTAAGAAGGGTAGC
GGCGGGGGCGCCGGCTACGCTTTCTGTCAATACGCGTCCATATCGAGCGTCGTGGAGGCG
ATAAGAGCCATGGACGGAGAGTACGTTGGCAACTCGCGCGTCAAACTCGGCTTCGGGAAA
CCGGTCGCCACCACCTGCGTCTGGGTCGACGGTCTCACCGAACACACGGAAAAACAGGTA
CTGGGCGCCGTGTCTCGTTGCGGCGCGGCGACGTCGGTGTGCGTGGACCGCGCGGCGGGC
GCGGCCCTCGTCCACTTCGAGCAGGCGGCGGCGGCGGGGGGCGCCGTGCGGGAACTCCGA
CGCGTCGCCGCACAGCTCTCCGCCGCCGAGCCCGACCACCCGAGGCTCTGTGTCGACTAC
GCCTCCAGGGAGTGCCAGGAAGCGTTTTACGAGCAATTGGAAAAACATGGTGGTTCTGCT
GCCCTCGCAGGAGCTGAACGGGTTGCGGGAGAACTGCCCGGCCGGTACGTAGCGCCACGG
CACGAGACTCTTCGATACGAGGCCGCTGGGACGAATCGCTCGAGAGCGCCGAGTTTTAGT
CGTGCCTCGTCACGGACTCCGAGATATAATGCAAACGATCATTACGATCCTGCTGAATAC
GCGGCCGATCGTAGATACAGGGTATTTGATGATTTAGGATCTAGTCCACAAACAGATGAT
GCAAACTATGAAGAACGTTTGCAGTCAGTAGTCGTTTCTCCTCACCGTGGGCATAAACAT
CGGAGGGATTCCAGTCCCGAAGGAAGAAAGCATTCAAAGGAGCGGCACCGTAGCGCTGGT
GGCGGATCTCGTCGATCCCGATCGGGGTCCCGCGGTCACTCATCCCGGCGGAGGCATCGC
AGACGACGCGATGGATCTGAATCGAGAGGGTCTCGGGCGTCACGTGCTGGTACTCCCTTG
AGAGACGAGCTAGATGCGCCCCCCACGGAGCCGCGTCGCCCACCGCGGGAGCGACCACCG
CTGCCCATGAGTCTCCCTCTGCCGAAGTTTGCCGTACAGTTGTTACGAGCCGCACCACCG
CAGCCACGCCTTCTGCCACCGGCCCCCGCATCGCCGCCGCGCCCGCCCTCCGCTTCTTCC
TCTAGTGGAGGCTCGGCGCCTCACTCACCCTCGCTGGAGGAGCGTATACGGTCTCTTGAC
GAAAAGTATGAGAGGTGGACGGGATCGCGAGCCCACATCGACGCTCCTGATAGATCCCGA
CTTCGTCATCGTTTACTGGAATTGGATATCAATGAAGTGAAGCCCTCGGATGTAGTTCGT
TCTCTTTTGGCTAAGCGCTCTGTGTTCGATGAGGATTCTGAACGGTTGGAGGGTGCCACC
CGCGCACCGAGCCCGGGAGGTAGCCCGCGTTCTTTACCGGTTTCGACGTCTCGTGTCCTA
CGATATCCTTTTCCCGTTCACGGAATGCATACTTCAGCTGTCAATACGCCATTAACGAAT
CCAGCGGGTCTCACGCAACCACCGGAACCAGAAGAATATGATAGAAGTAGACTATTAAAT
ATGGATAAAATAGCCAAGCATGTAGACATCGAGAACGGTCAAGAGATAAGAATGCGATTG
CGCACTCCTTCGATTGATAGGCCAATGTTGGAAAATTTTGAAAAGAAGCATTCTCCGACA
AGTGAAAAATCCGAAATCCGTTCACTGCCAGAAACAGAAACACAAGAAAATAGAAGAAGA
AACTCTTTCATATCAAATGTTGAGGACAGTAAGACTGCAACAATAACGGACAAGGCGACC
TTGGAAAGAATAAGTCTAGAAAAATCCATTGAAAAAAGTATTTTGGAAATCCAAGAAAAA
TATAATATTAGAGACAACAAAATTATTTGTGAAGATAAAATTACAACCTCCGCGGTAACA
TCTATCAAGAAGGAGGAAATCATTGAAGAGAAAGTTAAAGCTAGTGATTTCTTTAATAAG
TTCGAAAGAAAATGTTTCAGGGAAGATACTAAATCACGATGTCTAGAAACAAAAAGTGAA
TTAAAACAAGAGTTGAGCATGGACATGAACAAAAGTGATAAATATCTGTTAACAAGCGAT
TATTCTTCTAAGACTGAAGTAATGCTTATGGATATGAAACAAAAGCTAGACCACGATAAA
AAAAGTGAATTCAATTGTCAACCCAAGCAAAATGAAAGATTTAAAGACAAGTATCATAAT
GCCACTAACAAGAATGCTACCAACTTTAGAGTGCAATCAGAAAGTCATTTTCATTCATTA
AATAGTATTGATTTACCTAAAAATAAAACTGAAAAAAGTAACAAGGAGATGCTTTTGAAA
AGTATCACTTGTGCAAAAGATAATGTTCTTTTGAAACAAATTGATGAGACACACCACGTA
AAATGTGATAAATCTAATAACGTGAATGTATCAACTACTTCTGTACCGCCCGAAGAAAAG
GCTGTTGGTAAACTTCAGGAGGACTGTGCTAAAATTGAAAAAGATAAATTAAGACATATT
CGTGACAAAATAGACAAGGAAATAGAAAGAGATAAATCAAAAATCAAGCAAACTAGTTCT
AAGTTTGATAAACACGACAAAGAACGTAAACTTAAAGATGATTCTCTTAGTACAAAAATA
CATACAGATAAATCTGATAGAACTAGAAACGAGAAAGAAGGAGACAAAATCTTAGATAAA
GATCGCGATTTTGAAAGATCTAAAGTAATTGAAGATAAAACATCTCGTCATGACAATCAC
AAAAGAGATAAACAAGAAAGGGACAGGAAAAAAGATATCCCAGATAATGAAAGTATGGCT
AAGGCTTCAAAAAAAGAAGAAAAGCATAAAAATGATCGACCGAAAAAGGATAGTGATAAT
AAGCGTGATAATAAAAGGGAAGGGGACTCATCGAAAAGTAGAAAAAGTTCCAGAGATGAA
TCTATTAGAGATATTTGTCGAAAAGATTCGACAGACTCTACAACATCAAGAGCGTCCCAC
GATTCTTCAAAATCTAGGGAATCTGAAAACATTGAGCTAAAGGAAGACACGAAAACAAGA
ATTAAGAGTCTCTCGGATGTAGTGAATAAAGTAGATACAACTGAAAAAGACCCAACAATC
AAACATCACGACCACCTTAAATTGGATGGTAAAACAAAAATTAAAAGTGAAATTAAAGAT
GAAAAAGATCCTTCTGAGCAACATTTTAGGAATAAAATAGAAAGCTATGTTGATAATAAG
AAAATTAAAACAGAGAATTCTGAAAAGTCGAGACATTATTCACTAGACTCACCTAGCGTT
GACATTAAAAGAAAAGAGCGTTTAAACTCGTGTTCAAGTTTGCCGTCTAATATAGGCCAT
AAGCGAAGAATGTCTTCACAAGACAGCATAGATTGTCTCAATGAGGAAATAAAGAAAGGT
AAAAGTGAACGTCGCGAATCTAAAGACTGTAAATCTGTTGAAAGACACAAAACCACAAAG
TTTAGTAAGGGACATTTCGCTAAAATAATTGAGAGTAAAACTAAAGACGACAAGAAAAAT
CAAGTCAAACCACCTGATACAACTTTTACTAATCCTAAATGCTTGGACACAAAAGAAATC
GGTAGCGATAAACTAAAGACATCCAGAAAAAGTCCGATCGAAAGTATTGAAAGCAAAACA
ATATCTGCTCCAACAAACATTTCATCCGAACCCCTGCATAATAATTTTGATTTTCTGGCT
ACCTTAGAACTTCGGTCTAGCGAAGAAGACGAGAAACAAAAAGCTTTAAGGAAAGAGATG
AAGGAGAAGAAACGAATACAACAATTGCAACAAATTCAAGAACTTCAGATGCAACAAGAT
GCCTTACAGCAAGCAGAGATAGGTAACAAAATAAAAGATGATAGAAGAATTAAAAGTGAT
GACAAGAAAAAAGATATTTCCAGAGAGAAAAGGATGTCTACTGAAAGAAAAAGCAAGGAT
GAGAAAACTGATAACATCAAGCGACGAAATAGAAAACAATTACATAGTACTGATACCTCG
GATTCAGATGAACCCAAAAAGCATTCTATATTTGATATAGTTGATGACGAATATAACATA
TCTATGTATGATAAAGTGAAAGCTAGATCATGTAAAAATATGCAAAAACAGGAAGAAGAA
AAGCGACAAGAAAAAATAAAAGCCAAGTTTAGTCAATTGAAACAAAGTAGGGCTAAACGC
GAGGAAAAGAAAAGATCAAGTTGGGATGAAGACAGTGATTCTGATAGTGAAAGACGTAAA
TCTCGTAAAATATCAATGGATAGTTCATCCGATGAAGATCACATTACAATACACAGTAAA
AAACGTGAGAAATCACAAGGGTCGAGTCTTGAATATGATCGTAATAGAACTAATGATTAT
TTCGACACAATAAGCAATGAAGAGGATTCCTCTAATAAATTATCGCGTAAAAATTCAAGA
ACACGGATTATGTCTGACTCATCTGATGATGAGAACAGTCGGAGGAAAGCCAGTAAAAGT
CCATGTTTTATTGATAGAATTAAGAAAGAAGTTTTTTCAGATTCTGAATCTTCTACGAAA
TGCAAAGACATTGATGCAGATATACAGTTAAAACAACCATGCGAAGATAAAGTTAAGAAA
ACATCGCTTTTAAATTTGTTCGGTAAAAGCGATAGCGAAGATAATAGACTTAAACCAACA
TTTGAAAATGATAATGATTATAGAATGTCATTCACAAAAACTTTCCCAAATGACTTGTCT
TCAGAAAATGAATCAATACCGAATAGAATATCTTCTGAATTACGCAGAAAACATAAGAAA
AAACAAAAGAGATATAAGTTTACATATTCGGACGAGGAAAACAAAAGTAATCATGATGGT
TCTACAGAAATTAGAAACAAACATAAGAATTCTGATAAAAGTCGCCGACACAGTAATAAA
AAAGAAAAAAGAAGAGATAAGATACGTGATAGTATTGATGCTGACGAACCAAAGGAAGAA
AAGGATAAAATAAAAATTGAGAAGTGTTCTCCCAACCAATCATTTGACACAGTGTTGGAG
GCAATGTCAAATAATAACAAAAAGGAAGGCAAAATGGAAGACATATTTGGTCCTTTGTCA
GATGAATCTGACAGAGAGGTCAAGACTATAAGCAATAGAGTTGATGTTTTGCCACCAGAA
TGTGAGACATCGTTATGTACTGCAGCTGATAAATTAAGTAGTGATGAAATGAAGATCAGA
GACAAGGAGGAAATTAAAAGGCGAAAAGAAAAGAGACGAAAAGAAAAGAGACATTTCGTT
AAAGAAGATGATAACAGCTTGGATGTAGATGCTGTAAGTAAAGCTATTGAAGCTAGATTA
TTCGAAGAGTCAAATGTTGACAACAACCATTCTAGATCAGAAAAAACAGAAGCTAAAGTC
AATTCTGAAAGTGATTTGAAATATATGGGTATCAATTATAAAGGAGACTTAAATAAATAT
AATGAGAAGTCCAAAAAAGAATCTCGAGAGAGAAAAAAGAAAAAGAAGAGAAATAGAGAA
GAAAGACAAAGCCGTAAAGAACATCATCATGAGAAGTTACATAAAATGGATTCATTTTTA
GATAATCATATGGAACAATTATCACCAAAAACATTACTCGATATACCATTACCTACCGAT
ATTCCTAAAGTTGAAATAATAGATAAGTCGGATGACAGTAAATCTTTATCTGAATCTCCG
AGTTTACCAAGAATAACAGATAGCCCTACCTTTGTTTTAAATGCAGAAGATACCAATAAC
ACTACAAATGAGTATGATACTCCTAAAAGTCTCGTCGTTGACGATGAAGTCGAGATTGAT
TATGAAATGACAAGAAATATTGAAGTGACAGACATACCCATGCCTCCCCCTATAGACAAT
ATTGTTCAAGACATCAGTGAAGTGCCCCTACCGAAAGATCCTCCAATTGATCAAGTTTTA
GAAATATCATTACCTGATAGTGAGGATTCTAATGTTAACACAGATGTAAAAGAAAGTGAA
GTGGCGGTTAAAAGCATATCTAATTTAGAACCAGAAATTGAAAAACCTGTTGTGCAAAGT
GAAAAAAATATAAGTGACTGTAAATTAGACAAAAAATTAGAAGAAAAACCTCGGGCTATA
ATTTCACAAGAAGAAACTGAAGATGCTGTAGCCGCTTTACTTGGTGAAAGTTTTGGAGGT
AAAGTAAATACTTTTGATTGTTATGAAGAAGTAGAATGCAACACGTCCAATGAAATGGAG
ATAGAAAATACTACGATCGAAAGTGAACTTATCCCTGAAGAAGATGCAGAAGAGATGAGA
CAGGCTGTGCAAAATTTAAATGCCAGTGAAATGGAAATGAAACCTGATACACCAGTGTCT
GACGATAATCTTTTGTTGATTGACACGGATACCGAGGAAACAGAGGACACTACTCAGGAT
ACTATAGACAGATTACCGGTTGACCTCATAGCAACAAACCAATCCCTTAATAATAACACC
AAAATAACTGTTGCTCAAACTACCGAAGTTAGTAGCATTGTTATCACGAAGCCGAAACCA
ACAATGGCAAATTCTGTACAGAAAGAAATTAACGAATCGATTAATAATGTAAAGGAGGAA
CAGGTTAAACCCCCTGCTAAAAAACAAGAAACTGTGGAACAAATTACATCAACAGCGACT
CCAGTCATTACATCTTGGACTTTGTCAAACAATAAGTTAATAGAATCTCAAGTAGCAAAT
GTTCCTGTTAGTTCAGCTAGCAGCCGAGAAATTTCTGAAAATACCCAAACTCATATAAAA
ACAAATATTGTACAGATTAAAACATCACATAGTCCCAAAGTTCAGATAAATAACACTTTG
AGGCCTATTATAAATTCAAACAGGGTTAGTACAGCTCCTTACCAAGTGATAAATCAAATG
ATTAGACCTCAAGTATCTAACATTCAACCTCCAACAATCAAAATTCCTGAACCCCATATT
ATTTATCAAAAGCCAACTAACATTGTTATATCACCAAGGATAAATAGTGAACCTTTAATT
TTGAGTCCGAAATCTAGTGTACAAACAGAAGGTATGACTTCACCACGTTTACCTAATATG
GCTTTATTGAGTACTTCACCTCAAAATGTTGTTGGATCACCGAGCACAGCGCAGCAACGC
GCTCAGGGTCAACAAGTAACAGTGGTTCGAATGCAACAGTCACCACTAAGTCCTATACAC
AGTATGCATATTCCGCATGCGAGAGCTATGGTGTCGCCAAACAGACCCAATTCTGTGTTA
GTACAAACTCAGGGAACTCCAATACACTTCAATCGATTACCTGTAACACCAGTTCTTGCT
CCCATTTCAAAGCAAATAAATGCTAACATCATTCAGTCAACCAAAGGAAGCGGATCTTAC
GATTCTGTGTTAATTCAACAGCAAAAAATATGTCAGACAATGCCAGGAGACAGGGGAAAA
ATGGAGAGTCCACCAGATGCTTCGAAAATTATTTTGTCACCTAACAGACTGCAAAGCTCG
GTCTCAACTGTAATGGCACAAAATCGTCTCATACCTGTGCAAAATTCAATTCATGTGAGT
AATATAAATTCCGGCTTGAATTTACCAAACAAAGTCTTAATAAATAATAAAAGCCCATTC
AGTGACAAAAGAGATACTCACATAAGCAAATCTGAAAATATCATCGCCGCTACTTCTTCG
CCGATTATACATTTATCGGGTCCTTCTTATTCTTCTTCTAATATAATTCAAGCCAATACA
AAATCGGTGACGTGTAATCTTCAAGAAGTAACTACTGTGAGCAGAGGACACGGATCAAAT
GTCATCCATTCCATAAATACACAAAGACTTGTATCTTCGAGTCCTATAACAAGTGTTATT
CAACTAGAACCTGCAAAAGGCCCTCCTTCTGTTTTATCAATGGCGGCTGTACGAACGTCG
TCGCCTGGAGTTAAACAAGAGTTGTCGACCACAGTTGTTGTTACAACATCCAGCCTCGCT
AATGTTGTGATGACTCCATTGCTTTTGAAAACTAGTCCTAATCCAAAACCTACTTTAAGG
CTGCATACTAGAAGTGAAAGTGAGCAGGTTGACGCCCATTGCAATAACGAAATCCAAGCC
AGCGAGGATAGAACGGAAACATCTCATAAAGAACTTGGACAACAAAATAATATACAACAA
CCTTCACCTGTAACCCATCTATCACCCATAGATCGTGTTGGTAAATGTGAAACTGATTTC
GAAGAGCTCTTAAAAGATAACATAAATGAAATTAAAATAACAAATGAACATCACAAAAAG
ATTGACATTAATGTTAATAAAAATGTTGCAATTATTGGTCAAGTACAAAATAGAAGAAAC
AGCATTGATAATGATGTAGCTCTTATGAAACATGAAGTAGAAAATTTTATAAAAAACGAA
GCAGACAACGAAAACACTATTAAAAAATCTGACTCTGTAGATATGTCTTCAGTAAAAAAA
TGCGAAAATGAACTATCAACAAAGATTAAACCTGATAACAACATGCTATGTATAACAGAG
AAAATTCCTAAAACTGATGAGGATGACTTATCGAAACCAATGGAAAGTAACGCAACTATC
GCAAAAGAAAACCATGCTCCCGCAATACCAACTAGTGTCGATATAAACGATTCTATAAGA
CCTGAATCCGGTGTTAACAAAAATGATTTCTATAACCCACTTTGTATTCAAACGCAAGCA
CCTACAAACGATATAAAAAGTCTAGATGAAAGTGAGTATTGGTCAGGCAAAGATATAATT
AATATAGAATCTGTTATTAAGAAAGTTGATTCGTTATGTAGTGACAATGTCGAAGAAAAG
AAAATAGAAGAAAATACAACCAAACAGCCTTTAGTAATTGAAATGAAAAAGTCCGTTGAA
AGAAATGTTCAAAATGAGATTTTTATTGAAGAAAGTGAGACCGCTAAAACTGAAAAACCA
AATGAGAATAAAGTTGTTGAAAACGCAATAGATCCCTTTCAAGATAATAGTTTGATGGAA
GATTATGTTACAGAAAAAATTGGTACTGGTAAAAGAGGTGGCCGTAGTGGTAGGGGTAAG
AAGTCGGATAGAAATCCGGATAAGGTACATACCCGTCAAATAGCAAAAACTCCTAGAGGT
ACTTCGAAACGTGGCAGAGGTCGGGGGAAGGTAGACAAAAAAATTAAAAACATGATAAAT
ACCAATGCTAATAATATGCCTGGTGATGTGTACGATTTTCATGAAGATTCTGGGGATGAA
ACAGTCTCGTCACCTAATAAAACAGAAGTAAGACCACGTTTAATATTGACTATAAAAAGT
CCTCTTTCTGGCAATTCCAATACCGTTGTTTCAACCTTAAGTATAGCACAGAAGGACCCT
GTAAAAGTCACTGAAAAACATAAGGAAGAAAAATCTGATGTCTTTGCATCGCCCTCAATA
AATACAAGAAAGTCTCGACGGCTCCAAGAAAAAGATATACAAAGGAGTTCCGTCGATGAC
ATCATTGATGACGTCGTAAAAGGCACGGCCAATCAATTTAAAACTAATAACAAAGATTCG
AATAAAAAGAAATGTACAAGACAAACTGTCAATAAAACGGGCGATAAAACGCTCTCAAGT
GACACACGGAAATCTCCCAGAGGAGTTAAAAGAACAAGGGATAGAAGTTTGTCAGATGCT
TCCACGGAAAGCTGTGAAGAAAATATTAAAACTGACGAAATAATCAAAGAGTCCAAGATA
CCGAAATTAGAACCCACGATCCCACCCACCAAATGTACTCCCGAACCTGTCGTTAAGCCT
CTTATAGTTACCAGTCAGAGTGCCACCACCTGTCCACCTAATAACAACCCGATACCTATA
ATGAAACCACCCAAGAAGATGATATCCGAAATTAGCGCAAAGCTCGCCAGCGCTTTCGAG
GCTGCTGCTAACGCGACCAACCGTATTCAAGAACGATCCATCTTACCGTCGGAGCCGGAA
CGTCTCCCGCCACCGGCTGAGGGTGCCATTGGTGATGGCGACGCGGGCTACGGGCGACGA
CTCGTGGACAATGTACCCTCTAACATGATGCCCGTGGGAGCCACGGAGGCGACGGATGCG
AGGGTACAGTCCCCCGCCCTGCCGCATAGACCACCTTCCTCACATCACCTGCCCGATAGA
GCTACGCCCATCCTTGTCAGGGGTGGCGATGGAGAAGGAGGGCCGATCGCCGCCATGGGT
CCCTTGGGCGCGATGGGCGCTATGGGTGGAGCTATGGGTGGTATGGACGCGGGGGCAGCG
CTGTACCGGGGCGCGTACCCGGGCCCGGCCAGCTTGCCGCGGGGTGCACACCATCCCCCA
CTCGCCAAACAGGTCGCCGTCATCGGACAAAATCATCATCCAGGATCACGCGTTACTATC
ACCAGTGTACCCGCTGGCAGTCCACAAGGACCTATGGCAATTGGCGAAGGTGTATACCCG
CATTTTTCACATCATCATTACCAAATGTATCAGCAACATTTCCGTGCGACCCAACAAGAA
AACAGGGACACGCCAACATCCTTTACTAGAGGCGCCTTGGACGCCGAGGGCTCGGAGGCT
CCGACGCCTCCTCTGGAACTGCGTCGGCCTCCATCGGCTCGTGTGCCCCGTCCGGCGCAT
TCGCCCTCGCCGCACGACCGGCATATCATGTACGCGGTGCGATGCGGTCGGTCTCCGCCT
CCGGCTCACGGCACGTCTCGGCCGTCGTCCGCGCTGCCCCCGCCCGCCGCGCCCGGCCCG
CCACACGCCTCGCAGGTCCCCAGGGAGGCGGACTCCCTACAGATGCTTTTGCGGCGCTAC
CCGGTCATGTGGCAAGGTCTGCTGGCTCTCAAGAACGACTCGGCGGCCGTGCAGATGCAT
TTCGTGGGTGGCAACCCTGGTGTGGCGGCTGACGCGCTGTCGCGACACTCTGACGGCACG
GCCGCCTCTTTATTGCGCATCGCTCAGCGCATGAGGCTAGAACCAGCTCAGCTGGACCAA
GTACATCGCAAGATGAAGCTCGAAAATGAACACTGCATGCTGCTAGCGCTACCCTGTGGC
CGCGACCACATGGACGTGCTTCAGCAGTCCACCAACCTGACGGCCGGATTCATCACCTAC
TTGCAGCGCAAGCAGGCTGCCGGCATCGTTAATGTAGCGCCCACCCCGGGTCACCATCAG
TCTATGTACACGGTGCACATATTCCCGTCGTGCGAGTTCGCCAACGAGAACCTGACCCGC
ATCGCCCCCGACCTGATGCACCGCGTGTCCAACATAGCTCACCTGCTGATCGTCATCGCC
ACGGCCATCGGATGA

Protein sequence:

MAASAAAPATWRGTNDSATEYCRRGNTPAAYGRTTPHHRWCSSVGGGAAGGESTPSTPGG
GTERRRRLSESGSSRSETSSPEPSDTSRASTPPADHHAAHHTPRRTPPTHTHQWPSTANG
RPLAICVRNLPTRSTDSSLKDGLYHEYKKHGKVVWVKVVGQNADRYAVVRFKKPSDVEKA
LEVSQDKLFFGCKISVAPHQSCDDDAESAKPYETDIDEYHPKATRTLFIGNLEKDVTQQQ
LRDKFKHFGRIIEIDIKKGSGGGAGYAFCQYASISSVVEAIRAMDGEYVGNSRVKLGFGK
PVATTCVWVDGLTEHTEKQVLGAVSRCGAATSVCVDRAAGAALVHFEQAAAAGGAVRELR
RVAAQLSAAEPDHPRLCVDYASRECQEAFYEQLEKHGGSAALAGAERVAGELPGRYVAPR
HETLRYEAAGTNRSRAPSFSRASSRTPRYNANDHYDPAEYAADRRYRVFDDLGSSPQTDD
ANYEERLQSVVVSPHRGHKHRRDSSPEGRKHSKERHRSAGGGSRRSRSGSRGHSSRRRHR
RRRDGSESRGSRASRAGTPLRDELDAPPTEPRRPPRERPPLPMSLPLPKFAVQLLRAAPP
QPRLLPPAPASPPRPPSASSSSGGSAPHSPSLEERIRSLDEKYERWTGSRAHIDAPDRSR
LRHRLLELDINEVKPSDVVRSLLAKRSVFDEDSERLEGATRAPSPGGSPRSLPVSTSRVL
RYPFPVHGMHTSAVNTPLTNPAGLTQPPEPEEYDRSRLLNMDKIAKHVDIENGQEIRMRL
RTPSIDRPMLENFEKKHSPTSEKSEIRSLPETETQENRRRNSFISNVEDSKTATITDKAT
LERISLEKSIEKSILEIQEKYNIRDNKIICEDKITTSAVTSIKKEEIIEEKVKASDFFNK
FERKCFREDTKSRCLETKSELKQELSMDMNKSDKYLLTSDYSSKTEVMLMDMKQKLDHDK
KSEFNCQPKQNERFKDKYHNATNKNATNFRVQSESHFHSLNSIDLPKNKTEKSNKEMLLK
SITCAKDNVLLKQIDETHHVKCDKSNNVNVSTTSVPPEEKAVGKLQEDCAKIEKDKLRHI
RDKIDKEIERDKSKIKQTSSKFDKHDKERKLKDDSLSTKIHTDKSDRTRNEKEGDKILDK
DRDFERSKVIEDKTSRHDNHKRDKQERDRKKDIPDNESMAKASKKEEKHKNDRPKKDSDN
KRDNKREGDSSKSRKSSRDESIRDICRKDSTDSTTSRASHDSSKSRESENIELKEDTKTR
IKSLSDVVNKVDTTEKDPTIKHHDHLKLDGKTKIKSEIKDEKDPSEQHFRNKIESYVDNK
KIKTENSEKSRHYSLDSPSVDIKRKERLNSCSSLPSNIGHKRRMSSQDSIDCLNEEIKKG
KSERRESKDCKSVERHKTTKFSKGHFAKIIESKTKDDKKNQVKPPDTTFTNPKCLDTKEI
GSDKLKTSRKSPIESIESKTISAPTNISSEPLHNNFDFLATLELRSSEEDEKQKALRKEM
KEKKRIQQLQQIQELQMQQDALQQAEIGNKIKDDRRIKSDDKKKDISREKRMSTERKSKD
EKTDNIKRRNRKQLHSTDTSDSDEPKKHSIFDIVDDEYNISMYDKVKARSCKNMQKQEEE
KRQEKIKAKFSQLKQSRAKREEKKRSSWDEDSDSDSERRKSRKISMDSSSDEDHITIHSK
KREKSQGSSLEYDRNRTNDYFDTISNEEDSSNKLSRKNSRTRIMSDSSDDENSRRKASKS
PCFIDRIKKEVFSDSESSTKCKDIDADIQLKQPCEDKVKKTSLLNLFGKSDSEDNRLKPT
FENDNDYRMSFTKTFPNDLSSENESIPNRISSELRRKHKKKQKRYKFTYSDEENKSNHDG
STEIRNKHKNSDKSRRHSNKKEKRRDKIRDSIDADEPKEEKDKIKIEKCSPNQSFDTVLE
AMSNNNKKEGKMEDIFGPLSDESDREVKTISNRVDVLPPECETSLCTAADKLSSDEMKIR
DKEEIKRRKEKRRKEKRHFVKEDDNSLDVDAVSKAIEARLFEESNVDNNHSRSEKTEAKV
NSESDLKYMGINYKGDLNKYNEKSKKESRERKKKKKRNREERQSRKEHHHEKLHKMDSFL
DNHMEQLSPKTLLDIPLPTDIPKVEIIDKSDDSKSLSESPSLPRITDSPTFVLNAEDTNN
TTNEYDTPKSLVVDDEVEIDYEMTRNIEVTDIPMPPPIDNIVQDISEVPLPKDPPIDQVL
EISLPDSEDSNVNTDVKESEVAVKSISNLEPEIEKPVVQSEKNISDCKLDKKLEEKPRAI
ISQEETEDAVAALLGESFGGKVNTFDCYEEVECNTSNEMEIENTTIESELIPEEDAEEMR
QAVQNLNASEMEMKPDTPVSDDNLLLIDTDTEETEDTTQDTIDRLPVDLIATNQSLNNNT
KITVAQTTEVSSIVITKPKPTMANSVQKEINESINNVKEEQVKPPAKKQETVEQITSTAT
PVITSWTLSNNKLIESQVANVPVSSASSREISENTQTHIKTNIVQIKTSHSPKVQINNTL
RPIINSNRVSTAPYQVINQMIRPQVSNIQPPTIKIPEPHIIYQKPTNIVISPRINSEPLI
LSPKSSVQTEGMTSPRLPNMALLSTSPQNVVGSPSTAQQRAQGQQVTVVRMQQSPLSPIH
SMHIPHARAMVSPNRPNSVLVQTQGTPIHFNRLPVTPVLAPISKQINANIIQSTKGSGSY
DSVLIQQQKICQTMPGDRGKMESPPDASKIILSPNRLQSSVSTVMAQNRLIPVQNSIHVS
NINSGLNLPNKVLINNKSPFSDKRDTHISKSENIIAATSSPIIHLSGPSYSSSNIIQANT
KSVTCNLQEVTTVSRGHGSNVIHSINTQRLVSSSPITSVIQLEPAKGPPSVLSMAAVRTS
SPGVKQELSTTVVVTTSSLANVVMTPLLLKTSPNPKPTLRLHTRSESEQVDAHCNNEIQA
SEDRTETSHKELGQQNNIQQPSPVTHLSPIDRVGKCETDFEELLKDNINEIKITNEHHKK
IDINVNKNVAIIGQVQNRRNSIDNDVALMKHEVENFIKNEADNENTIKKSDSVDMSSVKK
CENELSTKIKPDNNMLCITEKIPKTDEDDLSKPMESNATIAKENHAPAIPTSVDINDSIR
PESGVNKNDFYNPLCIQTQAPTNDIKSLDESEYWSGKDIINIESVIKKVDSLCSDNVEEK
KIEENTTKQPLVIEMKKSVERNVQNEIFIEESETAKTEKPNENKVVENAIDPFQDNSLME
DYVTEKIGTGKRGGRSGRGKKSDRNPDKVHTRQIAKTPRGTSKRGRGRGKVDKKIKNMIN
TNANNMPGDVYDFHEDSGDETVSSPNKTEVRPRLILTIKSPLSGNSNTVVSTLSIAQKDP
VKVTEKHKEEKSDVFASPSINTRKSRRLQEKDIQRSSVDDIIDDVVKGTANQFKTNNKDS
NKKKCTRQTVNKTGDKTLSSDTRKSPRGVKRTRDRSLSDASTESCEENIKTDEIIKESKI
PKLEPTIPPTKCTPEPVVKPLIVTSQSATTCPPNNNPIPIMKPPKKMISEISAKLASAFE
AAANATNRIQERSILPSEPERLPPPAEGAIGDGDAGYGRRLVDNVPSNMMPVGATEATDA
RVQSPALPHRPPSSHHLPDRATPILVRGGDGEGGPIAAMGPLGAMGAMGGAMGGMDAGAA
LYRGAYPGPASLPRGAHHPPLAKQVAVIGQNHHPGSRVTITSVPAGSPQGPMAIGEGVYP
HFSHHHYQMYQQHFRATQQENRDTPTSFTRGALDAEGSEAPTPPLELRRPPSARVPRPAH
SPSPHDRHIMYAVRCGRSPPPAHGTSRPSSALPPPAAPGPPHASQVPREADSLQMLLRRY
PVMWQGLLALKNDSAAVQMHFVGGNPGVAADALSRHSDGTAASLLRIAQRMRLEPAQLDQ
VHRKMKLENEHCMLLALPCGRDHMDVLQQSTNLTAGFITYLQRKQAAGIVNVAPTPGHHQ
SMYTVHIFPSCEFANENLTRIAPDLMHRVSNIAHLLIVIATAIG