New model in OGS2.0 | DPOGS202304 |
---|---|
Genomic Position | scaffold664:+ 136423-148403 |
See gene structure | |
CDS Length | 5064 |
Paired RNAseq reads | 2299 |
Single RNAseq reads | 5352 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004976 (5e-32) |
Best Drosophila hit | CG34422, isoform A (1e-122) |
Best Human hit | AT-rich interactive domain-containing protein 4B isoform 2 (3e-66) |
Best NR hit (blastp) | AGAP007503-PA [Anopheles gambiae str. PEST] (2e-150) |
Best NR hit (blastx) | AGAP007503-PB [Anopheles gambiae str. PEST] (1e-135) |
GeneOntology terms | GO:0006333 chromatin assembly or disassembly GO:0000785 chromatin GO:0005634 nucleus GO:0003677 DNA binding GO:0003682 chromatin binding GO:0006911 phagocytosis, engulfment |
InterPro families | IPR001606 ARID/BRIGHT DNA-binding domain IPR016197 Chromo domain-like IPR012603 RBB1NT |
Orthology group | MCL11362 |
Nucleotide sequence:
ATGCAGGGTGATGATCCTCCTTTCCTACCAGTGGGTACGGATGTGAGCGCAAAGTACAAA
GGTGCATTCTGTGAGGCCAAAATTAAGAAAGTTGTGCGTAACATTAAATGCAAGGTGACA
TTAAAAGCTGGTGGTATTACCACTGTCAATGATGATGTCATCAAGGGCACTCTGAGGGTC
GGGAGCACTGTGGAAGTCAAACAGGACCCCAAGAAAGAAGCCATGGAAGCCGTTATTACG
AAAATACAGGACTGTAGCCAATACACCGTCGTATTTGATGATGGCGACATTACAACATTA
CGACGTTCAGCGCTGTGCCTTAAGAGCGGGAGACATTTTAATGAAAGTGAAACCTTGGAC
CAACTGCCTCTTACACATCCAGAACATTTCTCTACACCAGTTATAGCCGGGAGGAGAGGT
CGGAGAGGAAGAGCGCAGTCAGATGAAAGTGACGGTGAGGGTACGACCCCAGTGAAAGCG
GATAGTGCTGAACGTGAACCCCACGTGGGCCGTGTGGTGCTGGTGGAAGCGGCGAGCGGT
GCTGAGAGACGTCGACCTCACCAGCCGGCCTTCCCAGCACTTGTGGTGGCCCCGACGGCA
CAGATCAAAGTCAAAGAGGACTACCTCGTCAGATCCTTCAAGGATGGAAGATACTACACG
GTGCCGAAGAAGGAGGCTCGTGAATTCCGCAAGGGCGCAGCACCCCTGGAGTGGTGCGGA
GTGGAGGCGGCGCTGCAGTACTTGGCCCACGGCGACCTACCGCCTCACTGGGACCGGGAC
GCCCTCTTCAACGAGCCGAGGAACACCTCTGATGATAGCTCAGACGACGAACCGCGTGAA
GAGAAGGATCACTTTGTAGCGCAGCTGTACAAGTTCATGGACGACCGAGGCACACCACTC
AACAGGAACCCCACCATCGCCAACAGAGACATTGATCTATATAGGCTGTTCAGAGTGGTT
CAAAAACTAGGCGGCTACAACCGCGTCACGAACCAAAACCAATGGAAGACTATAGCAGAC
AAAATGGGTTTCCACCCGGTCACCACCAGCATCACTAACCTGTGCAAGCAGGCTTATAAA
AAGTTTCTCCATAGCTTCGAAGATTTCTACCGCAAGCTGGGTGTGACGCTAGTGGCTCAC
CCTCGCGGCGCGCGCACGCCTCCCGCCGGCCGATCCCTCATCAGGGACCGCGACAAACTA
CCGCCCTCCGCCGCCTCGCCCGCCTCGACTACTTCATCCACGCCCAGCACGCCCAGCCAG
CGCAAAGACAAAGACTCGGACAAAAGCGAGACGGAAAAGAGCGATAAGAGCGACAAGAGC
GACAAGAGTGAGAAGAGTGAGAGGGAAGATAAAGTAGAGAAACAGGAGAAGAAGGAGAAA
CCTCGCGCGAGCGACGAGGATGACAGCGCGGATAACCAGCCGCTGATAACCACTACACCC
AAAATCGAAAAGGACAAGGAGAAGGAGAAGGAAAAAGAGAAAGAGAAAGAAAAGGATCGA
GAAAAAGAAAAAGATATAGAAAAGGATAAAGAAAAAGAGAAAAGTGTACCAAGTGAAGAT
AAGAGCACAGTGAAGCCCAGATCACAATCCAAGACGCGGAGTCTGCCGCCGGTCAAGAGC
GAGTCCCACGAAAAGAGAACTACGAAGAGAAAAACTATATCCTCTAAATGTGAGAGCAGC
GGCAATACATTAAGGGCCTCTCGCAGACCTCACGTGTCCACCGACAGCGACAGTTCCGGG
CGAGCCTCAAGATGTGGCCCAACAAAGAAGATGCAAAGTCGTCGCAGTCAGAGCGCCAAT
TCAGCGAGCAGCGGCAACACTATCGCATCAAACAGCAGCAAGAGACCCCGGAAAAGGAAG
AACACTGAATCATCTAACAACGAACCAGCCAGATCGGTTGGTGCCAGTGTGAAAGCTCAA
GTCGGTGACAAACTTAAAGTATACTACGGTCCCACGCAATCTGAATCAAAGGTAACCTAT
GAAGCTAAGGTCATAGAAATATCATCTGAGGGCATGCTCCGCGTTCACTACACGGGCTGG
AACACCCGCTATGATGAATGGATCAAACCGCAGAGGATTGCTCTGAACGTCACACAACAT
GATCAGAGAAACAAGAAGGGAACTAATCTAAGCAGACGCTCTCGAAGCAAACGAACAGAG
GAATCATCCGCGCGCTCCGACAGCGACACGGATTCTGACAGCGACGAGAGCGTTAAGAGA
CCATCAAAGAAATCCGAAGACAAATCTATAACTAAAACACCTTCGAGAACCAAAGACACG
AAATCAAGCGACAGCAGCAGTTCCAGCAAACCGAGGAAGAGACCGATGAGGACTGTATCG
ACTCCAGTGATCACGTCTCCCGCTAAGAAACCAAGGATCGGCGTCTCCAGCCAGCACCAG
GGACGAGACTACGACCTGAACGAGATAAGATCAGAACTGAAAGGACTTCATTCAGTGAAA
CAAGAAGCAGATGACGCTGGTAAAGCTGATATAGCGCAGGACTCTATAATGAATCCGATA
ACTCAGCCGCCAGAGGTCCCCGAGAAGCAGGCGGAGGACGTCTACGAGTTCAAAGAACCG
GAACCCTTCGAGCTCGAGTTGCACGACGAGAAGAAGAAGCGAACTCATCGCATTTTCGAT
GACATCTCGCCCAGTAAATACACATCCACGCTGTCGAAGTCGCTGAGCGAGGAAATATCT
GAGGAGCCGTTGAGGGCCAGGCCGTCCTCGTTCAGATCACCGTCCTTATCGCCGTTTAGA
GATTTCGGGTCGAGTCGAGACGTTCCCAGCAGGCAGAGCCCGGAAGATGATTCCAATAAT
GCTCTATTCTCCCTCGACGATGATTCCTTCCCTGGGGAAGGCAGTTCGGGGCCGATCTTC
GAAGGTTTCACCCCGGCGAAGAACCAAGAGACGTATTCTAAGAAGAGCAAGGTGTCCAAA
CTTCGGCAATTGATTGACGACTCACCGGACAGCCCGGCCGACGACGAGCAGTCCTCAGAC
GATGAACCCGAACCGGTCGTCAAGGAAGAGAGGCAGCCCAGCCCTGTCCTGAAAGTAACC
GAAACAGTTAAACAGACGGAAGCGAACAAGGTGATTAAAGAGGAAATAAAAACTCCAGAA
ACAAAAAAAGAACAAGTGACAGTTGTGAAAGAAATCCCACTAGCACAGAGTATCCCAGAA
CCGCCAGGTACACCACCGCCGAAACCCAAACCAGAAGCTGCCAAACCAAAATTGGAACTT
CCAAGCCTGATCATAACGGCTGCCACATCAAAAGATAAAGAAGATAAGAAGATTGAGAAA
ATAATAAAGGAGGAAGTAAACGAAAAAATTGTGAAGGAAACGATCATGAATATACCGTTA
CCGGAACCCAAGGAGCTGCCGGAAATAAAAGTGGATCCTGAATTATCCTCAATCATGGAA
CCACCCTCGAGCCCTCTGATAGATACGGAGGAAGACAAGTCCGAACCAGACAGTCCGGCG
AGGATCGACGTTCTTCCTGAACCACCTCCGGGATTCCTGCTACAATCTGAAGGACCTAAA
ATAGCAGAGAAACTACTTAAAGCCATCAACAGCGCCAAAAGACTATCGATCTCGCCGCCT
CCTGTGGACGACAGACCCGACACGCCCAAGAAAGATGTTGTTATAGAAGATAAAATATCA
CCCATACTTGAGAAACGGCCTCCAAGCAAACCGGAGCTAATGAAACCCTTGAAGCTTGAT
CCCGTCAAACGATCGTCTCCGGCCGAAGCTACCGACTCTATATTCGGCGAGCCGTCCAAC
CTGACGGACTTGAAACGAGATCTATCAGACATCAAGAAGATCAAACCCAAGGAAAACACG
CCGCCTCGCTTACAGAGTCCACTTAATATATTGGAAAGGAGGAAAAGCGTCGCCGACCTG
CCGTTGAGCGCTCCCGGGAAGAATAAGGTTCTCAGCGACACTATACAGAAACTCTCGAGT
CAAATCAACCAGTCGGTGGCTGCGGCCAGCATACCGCTACCGCCGTTCCCACCCGAGGAT
AGAAGCGAGTCCAGCGACTCCGACGACTCCGACAGAAGGTTGATAATCGACAAGCTGTCG
GTGGAGGAGTGGGCTGGTTCTAGTGGCGGCGGGGGCGGCAGCGGCGTCACCACCAACGTA
CCACTGGCGAGGACCCAGACCGCCATGAGGGCGCTTCACGCGGGGAAGTCCCCGGGCGAG
TGGAGTGCCGGCGAGTCGCTGCTTATGCTCGAAGACGCTTGTAAAAACGAGCGGAAACAC
AGCGCAAGCGTGGTGGTGGCGGGTGGTACGCGGCCCTCGGGTTCCGCGTCCACCCCGGTG
GTGGGCCCGGAGGAAGACAGCTGTGCCTTACTACTCTGCGAGGAGACCATCCCCGGGTCA
CCCGCGCCGGACACCGAGCCTGCGCCCCCAACACGAGCCCTCCACCTACCCTTCGCCTGC
ACCCCTCAACACCACCCGCAGAATACACACTCCCATAAAGCGGAGGAGCGCCGAGGGTCG
GGCGGGTCAGGGTCTAGTGGTGCTGGTGTGTCAGGAGTGTCCGGTGTGTCAGGTGTGTCG
GGAGTGTCGGGAGTGTCAGGTCCCGCCGGGGACGAATGGTCCCGCCGACGAGCGCTCCTC
GACAACACGCCCCCCACCACGCCGGATAGCAGCCTCGACCTGTCGCCGAGGGAGCGACGC
ATTTCGGAGACGAGTCCGTCTGACAGAAAGGAGGACGACGAAGACGCCCCGGTACAAGAC
CCCTGCGCCGCGGACATCGACAAACCCCATAGCAGTGGTCGCTGTCGCAAGGCGTCGGAG
TCGTCGGGCCGCACGAGGACGAGGCGGAGACGCGACACAGACGACGCCCACGCGCCGCCG
GCACTCAAATACAACTTCTATGTGGACCTCGATCCGTCGTGGGATTGTCAGACCCGTATA
AACGTTCTGTCGACGCGGCTGTCCGACCTGCGCAAGGCTTACCACTCGGTGAAGGCGGAG
CTGGCGGCCATCGACAGGCGGAGGAAGAAACTACGGCGGAAGGAACGGGAAGCCATAAAA
GCAGCCAAAGCTGCATGTTCCTGA
Protein sequence:
MQGDDPPFLPVGTDVSAKYKGAFCEAKIKKVVRNIKCKVTLKAGGITTVNDDVIKGTLRV
GSTVEVKQDPKKEAMEAVITKIQDCSQYTVVFDDGDITTLRRSALCLKSGRHFNESETLD
QLPLTHPEHFSTPVIAGRRGRRGRAQSDESDGEGTTPVKADSAEREPHVGRVVLVEAASG
AERRRPHQPAFPALVVAPTAQIKVKEDYLVRSFKDGRYYTVPKKEAREFRKGAAPLEWCG
VEAALQYLAHGDLPPHWDRDALFNEPRNTSDDSSDDEPREEKDHFVAQLYKFMDDRGTPL
NRNPTIANRDIDLYRLFRVVQKLGGYNRVTNQNQWKTIADKMGFHPVTTSITNLCKQAYK
KFLHSFEDFYRKLGVTLVAHPRGARTPPAGRSLIRDRDKLPPSAASPASTTSSTPSTPSQ
RKDKDSDKSETEKSDKSDKSDKSEKSEREDKVEKQEKKEKPRASDEDDSADNQPLITTTP
KIEKDKEKEKEKEKEKEKDREKEKDIEKDKEKEKSVPSEDKSTVKPRSQSKTRSLPPVKS
ESHEKRTTKRKTISSKCESSGNTLRASRRPHVSTDSDSSGRASRCGPTKKMQSRRSQSAN
SASSGNTIASNSSKRPRKRKNTESSNNEPARSVGASVKAQVGDKLKVYYGPTQSESKVTY
EAKVIEISSEGMLRVHYTGWNTRYDEWIKPQRIALNVTQHDQRNKKGTNLSRRSRSKRTE
ESSARSDSDTDSDSDESVKRPSKKSEDKSITKTPSRTKDTKSSDSSSSSKPRKRPMRTVS
TPVITSPAKKPRIGVSSQHQGRDYDLNEIRSELKGLHSVKQEADDAGKADIAQDSIMNPI
TQPPEVPEKQAEDVYEFKEPEPFELELHDEKKKRTHRIFDDISPSKYTSTLSKSLSEEIS
EEPLRARPSSFRSPSLSPFRDFGSSRDVPSRQSPEDDSNNALFSLDDDSFPGEGSSGPIF
EGFTPAKNQETYSKKSKVSKLRQLIDDSPDSPADDEQSSDDEPEPVVKEERQPSPVLKVT
ETVKQTEANKVIKEEIKTPETKKEQVTVVKEIPLAQSIPEPPGTPPPKPKPEAAKPKLEL
PSLIITAATSKDKEDKKIEKIIKEEVNEKIVKETIMNIPLPEPKELPEIKVDPELSSIME
PPSSPLIDTEEDKSEPDSPARIDVLPEPPPGFLLQSEGPKIAEKLLKAINSAKRLSISPP
PVDDRPDTPKKDVVIEDKISPILEKRPPSKPELMKPLKLDPVKRSSPAEATDSIFGEPSN
LTDLKRDLSDIKKIKPKENTPPRLQSPLNILERRKSVADLPLSAPGKNKVLSDTIQKLSS
QINQSVAAASIPLPPFPPEDRSESSDSDDSDRRLIIDKLSVEEWAGSSGGGGGSGVTTNV
PLARTQTAMRALHAGKSPGEWSAGESLLMLEDACKNERKHSASVVVAGGTRPSGSASTPV
VGPEEDSCALLLCEETIPGSPAPDTEPAPPTRALHLPFACTPQHHPQNTHSHKAEERRGS
GGSGSSGAGVSGVSGVSGVSGVSGVSGPAGDEWSRRRALLDNTPPTTPDSSLDLSPRERR
ISETSPSDRKEDDEDAPVQDPCAADIDKPHSSGRCRKASESSGRTRTRRRRDTDDAHAPP
ALKYNFYVDLDPSWDCQTRINVLSTRLSDLRKAYHSVKAELAAIDRRRKKLRRKEREAIK
AAKAACS