DPGLEAN17418 in OGS1.0

New model in OGS2.0DPOGS202304 
Genomic Positionscaffold664:+ 136423-148403
See gene structure
CDS Length5064
Paired RNAseq reads  2299
Single RNAseq reads  5352
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004976 (5e-32)
Best Drosophila hit  CG34422, isoform A (1e-122)
Best Human hitAT-rich interactive domain-containing protein 4B isoform 2 (3e-66)
Best NR hit (blastp)  AGAP007503-PA [Anopheles gambiae str. PEST] (2e-150)
Best NR hit (blastx)  AGAP007503-PB [Anopheles gambiae str. PEST] (1e-135)
GeneOntology terms




  
GO:0006333 chromatin assembly or disassembly
GO:0000785 chromatin
GO:0005634 nucleus
GO:0003677 DNA binding
GO:0003682 chromatin binding
GO:0006911 phagocytosis, engulfment
InterPro families

  
IPR001606 ARID/BRIGHT DNA-binding domain
IPR016197 Chromo domain-like
IPR012603 RBB1NT
Orthology groupMCL11362

Nucleotide sequence:

ATGCAGGGTGATGATCCTCCTTTCCTACCAGTGGGTACGGATGTGAGCGCAAAGTACAAA
GGTGCATTCTGTGAGGCCAAAATTAAGAAAGTTGTGCGTAACATTAAATGCAAGGTGACA
TTAAAAGCTGGTGGTATTACCACTGTCAATGATGATGTCATCAAGGGCACTCTGAGGGTC
GGGAGCACTGTGGAAGTCAAACAGGACCCCAAGAAAGAAGCCATGGAAGCCGTTATTACG
AAAATACAGGACTGTAGCCAATACACCGTCGTATTTGATGATGGCGACATTACAACATTA
CGACGTTCAGCGCTGTGCCTTAAGAGCGGGAGACATTTTAATGAAAGTGAAACCTTGGAC
CAACTGCCTCTTACACATCCAGAACATTTCTCTACACCAGTTATAGCCGGGAGGAGAGGT
CGGAGAGGAAGAGCGCAGTCAGATGAAAGTGACGGTGAGGGTACGACCCCAGTGAAAGCG
GATAGTGCTGAACGTGAACCCCACGTGGGCCGTGTGGTGCTGGTGGAAGCGGCGAGCGGT
GCTGAGAGACGTCGACCTCACCAGCCGGCCTTCCCAGCACTTGTGGTGGCCCCGACGGCA
CAGATCAAAGTCAAAGAGGACTACCTCGTCAGATCCTTCAAGGATGGAAGATACTACACG
GTGCCGAAGAAGGAGGCTCGTGAATTCCGCAAGGGCGCAGCACCCCTGGAGTGGTGCGGA
GTGGAGGCGGCGCTGCAGTACTTGGCCCACGGCGACCTACCGCCTCACTGGGACCGGGAC
GCCCTCTTCAACGAGCCGAGGAACACCTCTGATGATAGCTCAGACGACGAACCGCGTGAA
GAGAAGGATCACTTTGTAGCGCAGCTGTACAAGTTCATGGACGACCGAGGCACACCACTC
AACAGGAACCCCACCATCGCCAACAGAGACATTGATCTATATAGGCTGTTCAGAGTGGTT
CAAAAACTAGGCGGCTACAACCGCGTCACGAACCAAAACCAATGGAAGACTATAGCAGAC
AAAATGGGTTTCCACCCGGTCACCACCAGCATCACTAACCTGTGCAAGCAGGCTTATAAA
AAGTTTCTCCATAGCTTCGAAGATTTCTACCGCAAGCTGGGTGTGACGCTAGTGGCTCAC
CCTCGCGGCGCGCGCACGCCTCCCGCCGGCCGATCCCTCATCAGGGACCGCGACAAACTA
CCGCCCTCCGCCGCCTCGCCCGCCTCGACTACTTCATCCACGCCCAGCACGCCCAGCCAG
CGCAAAGACAAAGACTCGGACAAAAGCGAGACGGAAAAGAGCGATAAGAGCGACAAGAGC
GACAAGAGTGAGAAGAGTGAGAGGGAAGATAAAGTAGAGAAACAGGAGAAGAAGGAGAAA
CCTCGCGCGAGCGACGAGGATGACAGCGCGGATAACCAGCCGCTGATAACCACTACACCC
AAAATCGAAAAGGACAAGGAGAAGGAGAAGGAAAAAGAGAAAGAGAAAGAAAAGGATCGA
GAAAAAGAAAAAGATATAGAAAAGGATAAAGAAAAAGAGAAAAGTGTACCAAGTGAAGAT
AAGAGCACAGTGAAGCCCAGATCACAATCCAAGACGCGGAGTCTGCCGCCGGTCAAGAGC
GAGTCCCACGAAAAGAGAACTACGAAGAGAAAAACTATATCCTCTAAATGTGAGAGCAGC
GGCAATACATTAAGGGCCTCTCGCAGACCTCACGTGTCCACCGACAGCGACAGTTCCGGG
CGAGCCTCAAGATGTGGCCCAACAAAGAAGATGCAAAGTCGTCGCAGTCAGAGCGCCAAT
TCAGCGAGCAGCGGCAACACTATCGCATCAAACAGCAGCAAGAGACCCCGGAAAAGGAAG
AACACTGAATCATCTAACAACGAACCAGCCAGATCGGTTGGTGCCAGTGTGAAAGCTCAA
GTCGGTGACAAACTTAAAGTATACTACGGTCCCACGCAATCTGAATCAAAGGTAACCTAT
GAAGCTAAGGTCATAGAAATATCATCTGAGGGCATGCTCCGCGTTCACTACACGGGCTGG
AACACCCGCTATGATGAATGGATCAAACCGCAGAGGATTGCTCTGAACGTCACACAACAT
GATCAGAGAAACAAGAAGGGAACTAATCTAAGCAGACGCTCTCGAAGCAAACGAACAGAG
GAATCATCCGCGCGCTCCGACAGCGACACGGATTCTGACAGCGACGAGAGCGTTAAGAGA
CCATCAAAGAAATCCGAAGACAAATCTATAACTAAAACACCTTCGAGAACCAAAGACACG
AAATCAAGCGACAGCAGCAGTTCCAGCAAACCGAGGAAGAGACCGATGAGGACTGTATCG
ACTCCAGTGATCACGTCTCCCGCTAAGAAACCAAGGATCGGCGTCTCCAGCCAGCACCAG
GGACGAGACTACGACCTGAACGAGATAAGATCAGAACTGAAAGGACTTCATTCAGTGAAA
CAAGAAGCAGATGACGCTGGTAAAGCTGATATAGCGCAGGACTCTATAATGAATCCGATA
ACTCAGCCGCCAGAGGTCCCCGAGAAGCAGGCGGAGGACGTCTACGAGTTCAAAGAACCG
GAACCCTTCGAGCTCGAGTTGCACGACGAGAAGAAGAAGCGAACTCATCGCATTTTCGAT
GACATCTCGCCCAGTAAATACACATCCACGCTGTCGAAGTCGCTGAGCGAGGAAATATCT
GAGGAGCCGTTGAGGGCCAGGCCGTCCTCGTTCAGATCACCGTCCTTATCGCCGTTTAGA
GATTTCGGGTCGAGTCGAGACGTTCCCAGCAGGCAGAGCCCGGAAGATGATTCCAATAAT
GCTCTATTCTCCCTCGACGATGATTCCTTCCCTGGGGAAGGCAGTTCGGGGCCGATCTTC
GAAGGTTTCACCCCGGCGAAGAACCAAGAGACGTATTCTAAGAAGAGCAAGGTGTCCAAA
CTTCGGCAATTGATTGACGACTCACCGGACAGCCCGGCCGACGACGAGCAGTCCTCAGAC
GATGAACCCGAACCGGTCGTCAAGGAAGAGAGGCAGCCCAGCCCTGTCCTGAAAGTAACC
GAAACAGTTAAACAGACGGAAGCGAACAAGGTGATTAAAGAGGAAATAAAAACTCCAGAA
ACAAAAAAAGAACAAGTGACAGTTGTGAAAGAAATCCCACTAGCACAGAGTATCCCAGAA
CCGCCAGGTACACCACCGCCGAAACCCAAACCAGAAGCTGCCAAACCAAAATTGGAACTT
CCAAGCCTGATCATAACGGCTGCCACATCAAAAGATAAAGAAGATAAGAAGATTGAGAAA
ATAATAAAGGAGGAAGTAAACGAAAAAATTGTGAAGGAAACGATCATGAATATACCGTTA
CCGGAACCCAAGGAGCTGCCGGAAATAAAAGTGGATCCTGAATTATCCTCAATCATGGAA
CCACCCTCGAGCCCTCTGATAGATACGGAGGAAGACAAGTCCGAACCAGACAGTCCGGCG
AGGATCGACGTTCTTCCTGAACCACCTCCGGGATTCCTGCTACAATCTGAAGGACCTAAA
ATAGCAGAGAAACTACTTAAAGCCATCAACAGCGCCAAAAGACTATCGATCTCGCCGCCT
CCTGTGGACGACAGACCCGACACGCCCAAGAAAGATGTTGTTATAGAAGATAAAATATCA
CCCATACTTGAGAAACGGCCTCCAAGCAAACCGGAGCTAATGAAACCCTTGAAGCTTGAT
CCCGTCAAACGATCGTCTCCGGCCGAAGCTACCGACTCTATATTCGGCGAGCCGTCCAAC
CTGACGGACTTGAAACGAGATCTATCAGACATCAAGAAGATCAAACCCAAGGAAAACACG
CCGCCTCGCTTACAGAGTCCACTTAATATATTGGAAAGGAGGAAAAGCGTCGCCGACCTG
CCGTTGAGCGCTCCCGGGAAGAATAAGGTTCTCAGCGACACTATACAGAAACTCTCGAGT
CAAATCAACCAGTCGGTGGCTGCGGCCAGCATACCGCTACCGCCGTTCCCACCCGAGGAT
AGAAGCGAGTCCAGCGACTCCGACGACTCCGACAGAAGGTTGATAATCGACAAGCTGTCG
GTGGAGGAGTGGGCTGGTTCTAGTGGCGGCGGGGGCGGCAGCGGCGTCACCACCAACGTA
CCACTGGCGAGGACCCAGACCGCCATGAGGGCGCTTCACGCGGGGAAGTCCCCGGGCGAG
TGGAGTGCCGGCGAGTCGCTGCTTATGCTCGAAGACGCTTGTAAAAACGAGCGGAAACAC
AGCGCAAGCGTGGTGGTGGCGGGTGGTACGCGGCCCTCGGGTTCCGCGTCCACCCCGGTG
GTGGGCCCGGAGGAAGACAGCTGTGCCTTACTACTCTGCGAGGAGACCATCCCCGGGTCA
CCCGCGCCGGACACCGAGCCTGCGCCCCCAACACGAGCCCTCCACCTACCCTTCGCCTGC
ACCCCTCAACACCACCCGCAGAATACACACTCCCATAAAGCGGAGGAGCGCCGAGGGTCG
GGCGGGTCAGGGTCTAGTGGTGCTGGTGTGTCAGGAGTGTCCGGTGTGTCAGGTGTGTCG
GGAGTGTCGGGAGTGTCAGGTCCCGCCGGGGACGAATGGTCCCGCCGACGAGCGCTCCTC
GACAACACGCCCCCCACCACGCCGGATAGCAGCCTCGACCTGTCGCCGAGGGAGCGACGC
ATTTCGGAGACGAGTCCGTCTGACAGAAAGGAGGACGACGAAGACGCCCCGGTACAAGAC
CCCTGCGCCGCGGACATCGACAAACCCCATAGCAGTGGTCGCTGTCGCAAGGCGTCGGAG
TCGTCGGGCCGCACGAGGACGAGGCGGAGACGCGACACAGACGACGCCCACGCGCCGCCG
GCACTCAAATACAACTTCTATGTGGACCTCGATCCGTCGTGGGATTGTCAGACCCGTATA
AACGTTCTGTCGACGCGGCTGTCCGACCTGCGCAAGGCTTACCACTCGGTGAAGGCGGAG
CTGGCGGCCATCGACAGGCGGAGGAAGAAACTACGGCGGAAGGAACGGGAAGCCATAAAA
GCAGCCAAAGCTGCATGTTCCTGA

Protein sequence:

MQGDDPPFLPVGTDVSAKYKGAFCEAKIKKVVRNIKCKVTLKAGGITTVNDDVIKGTLRV
GSTVEVKQDPKKEAMEAVITKIQDCSQYTVVFDDGDITTLRRSALCLKSGRHFNESETLD
QLPLTHPEHFSTPVIAGRRGRRGRAQSDESDGEGTTPVKADSAEREPHVGRVVLVEAASG
AERRRPHQPAFPALVVAPTAQIKVKEDYLVRSFKDGRYYTVPKKEAREFRKGAAPLEWCG
VEAALQYLAHGDLPPHWDRDALFNEPRNTSDDSSDDEPREEKDHFVAQLYKFMDDRGTPL
NRNPTIANRDIDLYRLFRVVQKLGGYNRVTNQNQWKTIADKMGFHPVTTSITNLCKQAYK
KFLHSFEDFYRKLGVTLVAHPRGARTPPAGRSLIRDRDKLPPSAASPASTTSSTPSTPSQ
RKDKDSDKSETEKSDKSDKSDKSEKSEREDKVEKQEKKEKPRASDEDDSADNQPLITTTP
KIEKDKEKEKEKEKEKEKDREKEKDIEKDKEKEKSVPSEDKSTVKPRSQSKTRSLPPVKS
ESHEKRTTKRKTISSKCESSGNTLRASRRPHVSTDSDSSGRASRCGPTKKMQSRRSQSAN
SASSGNTIASNSSKRPRKRKNTESSNNEPARSVGASVKAQVGDKLKVYYGPTQSESKVTY
EAKVIEISSEGMLRVHYTGWNTRYDEWIKPQRIALNVTQHDQRNKKGTNLSRRSRSKRTE
ESSARSDSDTDSDSDESVKRPSKKSEDKSITKTPSRTKDTKSSDSSSSSKPRKRPMRTVS
TPVITSPAKKPRIGVSSQHQGRDYDLNEIRSELKGLHSVKQEADDAGKADIAQDSIMNPI
TQPPEVPEKQAEDVYEFKEPEPFELELHDEKKKRTHRIFDDISPSKYTSTLSKSLSEEIS
EEPLRARPSSFRSPSLSPFRDFGSSRDVPSRQSPEDDSNNALFSLDDDSFPGEGSSGPIF
EGFTPAKNQETYSKKSKVSKLRQLIDDSPDSPADDEQSSDDEPEPVVKEERQPSPVLKVT
ETVKQTEANKVIKEEIKTPETKKEQVTVVKEIPLAQSIPEPPGTPPPKPKPEAAKPKLEL
PSLIITAATSKDKEDKKIEKIIKEEVNEKIVKETIMNIPLPEPKELPEIKVDPELSSIME
PPSSPLIDTEEDKSEPDSPARIDVLPEPPPGFLLQSEGPKIAEKLLKAINSAKRLSISPP
PVDDRPDTPKKDVVIEDKISPILEKRPPSKPELMKPLKLDPVKRSSPAEATDSIFGEPSN
LTDLKRDLSDIKKIKPKENTPPRLQSPLNILERRKSVADLPLSAPGKNKVLSDTIQKLSS
QINQSVAAASIPLPPFPPEDRSESSDSDDSDRRLIIDKLSVEEWAGSSGGGGGSGVTTNV
PLARTQTAMRALHAGKSPGEWSAGESLLMLEDACKNERKHSASVVVAGGTRPSGSASTPV
VGPEEDSCALLLCEETIPGSPAPDTEPAPPTRALHLPFACTPQHHPQNTHSHKAEERRGS
GGSGSSGAGVSGVSGVSGVSGVSGVSGPAGDEWSRRRALLDNTPPTTPDSSLDLSPRERR
ISETSPSDRKEDDEDAPVQDPCAADIDKPHSSGRCRKASESSGRTRTRRRRDTDDAHAPP
ALKYNFYVDLDPSWDCQTRINVLSTRLSDLRKAYHSVKAELAAIDRRRKKLRRKEREAIK
AAKAACS