New model in OGS2.0 | DPOGS201152  |
---|---|
Genomic Position | scaffold575:- 25436-40435 |
See gene structure | |
CDS Length | 4743 |
Paired RNAseq reads   | 7309 |
Single RNAseq reads   | 18043 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003928 (0.0) |
Best Drosophila hit   | winged eye, isoform B (7e-57) |
Best Human hit | BAH and coiled-coil domain-containing protein 1 (1e-38) |
Best NR hit (blastp)   | phd finger transcription factor [Aedes aegypti] (3e-122) |
Best NR hit (blastx)   | PREDICTED: similar to phd finger transcription factor [Tribolium castaneum] (3e-70) |
GeneOntology terms   | GO:0003677 DNA binding |
InterPro families   | IPR001025 Bromo adjacent homology (BAH) domain |
Orthology group | MCL23954 |
Nucleotide sequence:
ATGTCTGTCCTTCCCTCAGGTGTAACTCGGTTCCCGTTGTACTCCCTGTTCCCGACACAA
GGCAACAGATCAACTCAGATACATAAAGCTGTAGTCGCCTCCGTCTATCACTCTCGTTAT
AAAGCCTGTCTTGCGAGGTGTGTACTCTCCTCTCACCACCTGTCCTTGTGTCCGTTAGTG
AGTGAGAGCGTGTTCTCGTCCTCCGTGTCGTCGCTGGCAGCTCCTGCAGCTCCTCCAGCT
CCTCCGCCGCCGACTCACTCCCCTTACCCTCCGCTACACCTGGAGCTGCTCAACTGCTCG
CACCGGCTGCTGCTCGCAGACAAGGAGACCAGTAAGATGGAAGACTCGAAGATGGGTAAA
TGTATCCCGGAGACTCTTTGCTGTCCGGAACAGAGACAGCTGCTGGACGGAGTGATGATG
GAGCCGCTCACCATCAAGATGGAGCAGGCTAAAAGGACCTGCAACACCTGTCTCACCACC
TCGCCCAAGAAGGTGATCCACACGGACGGCGGCTGCAGTCGGACCAACCCCGTCACCTGG
CTGGGTGTGGGTCATGTCGGGGGACACGTCGGGGGTCAGAGCGTGGTCCAGGTGAAACGC
GAGCCACAACACATGCACGTTTCCGAGCTGGTAGCCGTCAAGCTGGAGCAGGCCTCGCCC
GGACACAAGCCGGACCCGCCGCCCATACCGCCCTCACTCAATAACGGTTCTATCCCGGTC
GGTATAGCGGTGGCCAGGCAACGAGTGGGCGACGGAGGCCTGCTGGCCGCGCTGTCACAG
AAAGACAACCAGCGACTACATGACATCGCGCAAGTGTCGATGGAGAGTATGATGGTGGGC
GGCATGGGTGGCGTGGGCGTTGGGTCCGTGGGCAGTGGCATGTCGTCCGTGGGCGGTTTT
CAGCTGGTCCGAGAGCCGTCCTCGGGAGCTCTGCTGCTCCTACCCGCTCCCCCCGACCTC
CCTCACGCCGTGGTGTGGGGCGGAGTGCCTTATCCGTCGACACCCTTGCTGCTGCCGCCC
GCCCCGCATCCGTCTCACCACCTCCAGCTGCTCCCGGGCGATCTGCTGGCGTCTACTGCT
ACGCTACAACACACCCACACTCATTCCACGCGGCTGGTCACCCTCGCCCCCGCTCCCCCC
GCACAGCCTCACCCTCACCCGGTCGATAAGAGGAAACCCCTCATGCAGCCGATAATAACC
CCTCACACGCTCATCAAAATAGAGCCGGAGCCGCCGCAGGAGAAGCCCCAGGCCTTCGCT
CCCGAACCCGTCCAGCAGCCGATACTCACCACGCATCTCTACTACCAGCCGGATTTCCAA
GAGCAGGCGTGCAGGCCTCAGGGTCCCCCGGCGGCCCCTCAGACTCCGCCGCCCGAAGTC
CCCGCACACAAGGACGCCTCTAACCAGACAGATCACATTGACGAGGACGACGACTCTCCT
ATCAACGCGGACGAGGACGAGCGTGAGGTCGCGTGTGTCGGGGTCGCGCACTACCCGGAG
GGCTCGAACATTATACAGATACAACCATTGCATCAGTCCATAGACGGCGCGGAGGAAACC
ACTCTGGAGGGACTCGTGGCGGCCACCGTGGTGGGGGCGGTGGAGAAGGCGATCAATGCC
ATGGAGAGGGAAGAGGCCGCCGATAACTCCAGCAGCAGAGGCTCCCACGACCTGGTCATC
GACACAGCCGGCAGCATGACGCCCCTGCAGGCGCAGCAACGCCCGCCCGTCGATGTCAGC
GGCCTGGAACTCCTCTCGAACAGTATCGAGCAGTTTGAGCGGACGACTCCCAGCAACCAC
GCGGCCGGCTCCGACCAGGCGCCGCTTACGATCGACACGAGGCCGTCTACTAAAATAAAC
ATACTCATAAAAACCTCACCGAGGTCTCCGTCGCAAGACGATGATGTAGTAGAGACACAT
AAGATAAGGTTCCAGTTCCCTCTAGTGGAGACAGGTGACAGCGACAAGCCGTCCCTGGAT
GGGTTGGGGTTGCTGTGTGCTTTGGCGGAGCAGAGATTCATGGAGGAGGTTGAGGAGAGC
CCCTCCCCAAATATGCCCTCGACCTCGCGAACTTTTATTAAGACGGAGCTGCTGAGTCCC
ACAGAAAGGGAAAGGGAGAGGGAGATATCGTCCGAATTGAAAAAAGAAAGACACAGACAC
AGAGATGACGCTTCGGGCGAGAGAAGGAGAAAGAGAAGTACAGATAAAGAAGAGAGATTA
CGACACAAGTTGGAAAAGATGCGTCGACACAAAAGGGAGAGAAAGGACACCGACAAATCT
GAAGGTGGCGAGTTGGAGGCGAGTTTACGGCGAGTGACGGCATGTTCCTGTGGGATGGTG
AATTGTTCCCACACGTCGAGCGTCCCGTCCGCGCAGGCGCTGGTGAACGCCATGGAGAAA
GATATGAGGGAACGTCTACAAGACCTGCAGCGGCAGTGTGACGAGAAGCGCGCTCAGCTG
GACGCCCTTACTCCGCCACTGCCGGCGCTCGTCACGCCCGCACCGTGCCTACAGCTCAGT
GCGAGTCCAGCTCTCTCACCGGACTCCGATAGAGGCTCATCCAAGAAACGTAAAGTGGGA
AGACCGAGGAAAGTCTCCAGTCCCGACTCCACGGAAACCATCGTAGCCAAGAAACCGAAG
TCGAAAAACACTCTCGTCGGTTACTTACTGGCGAAAGGAAAACTTAAAGGGAACATATTG
TACTCAAAGGGCGAACCCTCGAGAGACGACGGGAGTAGAACTAGTAAGGTCCGGCCGAAA
CTAAAAGCGGAGCCTGTCGTGAAGATGTACTCCGAGGAAGACGAAAACGATTGGGGGCTC
AACAGATCTGCGAGCTCCTCCATGGAAAGTCTCAACGAGGTGAGGCAGAAGAACAGGGAG
AAGGTAGACCGGTTAGCAAGGAAGCATTCGAGAGACGACATGTCAGAAATAGAGCTAGCC
CTGCGGAGAGCAAGCGCCTCCAGTGATAGTGACAAGGAAAGAAGACGTGCAAGAAAGAAG
CGGAAGAGTACTAAGTCGAAAGAGAGAGCAGAGGCGAACGCGGAGCAAACACAGCACGAA
CCAGGGACTCAGACGAAGATATCGAAATGTACTTTGACAGAAGAAAAAATAGACACATCG
CCGAGAGTGCTCACGGCGAGAGGAGGACTCTTCTACGCGGGGAAGTTGAGCGCTGTACAG
GCCCCTGACGTGTACGCAATAACATTAGACGGAGAGAGAGGAAATAAACCGCACATACTG
TCGAGGGAGGAAACATTGAGAGATGCCATCCTGGAGGTGAGTCCGTCTAGTGTCCGGGAG
CTTCCATCCGGTACCCGTGTATGTGCGTACTGGTCCCAGCAGTACCGCTGTTTGTATCCG
GGAACAGTGGCCGTGTCTTCCCCTGACCCACACCATGACAAATTTGTCGCGGTGGAATTC
GATGACGGTGACTCCGGTAGGATAGCCATCGAAGATATCAGATTCTTGGAACCCAATTAT
CCTATCGTGGAATACGAAAACACATTGTTTACTCTGAGCAAACGGCGGCGTAATACGAGC
GTGACGGAAGATAAAAAACACTCGACGGCTTCCACGAGCAATGACGTCAAGAACGAAGCA
CAGAACGATGGACAGAAGGAGGAAGAAGATCGACACAGAGATAGGAAGAAGTTAAAGAAA
CATCGCAAAGAAAAGATGAGGCGACTGACGAGCGAGGACGGACCCGGCTCAGAATATATA
AAGAAGAAGAAAAAGAAACACAAGTGCTGCGAGGAACACGGGAAGCATCGCAGGCATCAC
AAGAAACATCACCGGAAACACAAGAAGAGACATCACTCAATATGCAAGGAACATTCCAGT
TCCTCGGGCGACGATCACAGACAGAAATCATCCTCGGACTACATGGACTCCAACAAGTCC
AACGAGGACTCGCTCGACTCCAACGACCGCCTCTCCACTCTCATAGCGGTCGAGAAGTCT
CCGGAAGAAAACATGAAAACTGTCATCAAGAAGGCCGTGCTGTCCAAGAAGAGTCTAGTG
AAGAATGCGGTGCTGGATTTCAGTAATTTGAATAAAATAACACTCAAGAAGGAAGATGCG
AAGGATAACCTGAAGGATAGCGGGATCGGCCTGGAGGAGCCGCTCCCAGAGACCGCGGCA
GCGTCCACGTCCACCGATACTTCTAAAAAGAAGGCGAAGAAGCGCACGGTGTCCTCTACA
TCCTCGGACGGCGGCGGCGGTGTCAGCAAGATGGCGGCCTTCCTCCCGGGGGGAGCGTTG
TGGAGGTGGCACGGGCCCGCCTACAGGAGGACCACCAGGCCTCGGCACAGGAAACTATTC
TACAAGGCCATACAACGCGGGGAGGAAATACTACATGTGGGTGAGGCGGCGGTGTTCCTG
TCCACCGGCCGCGCCGACCGCCCCTACATAGGACGGATCGCGGCCCTTTGGCAGGCCCGA
GGTGCCATGGCGGTCAGGGTACACTGGTTCTACCACCCTGAGGAGACGGCCGGCTGCCGA
GACTTGAAGTACCCGGGCGGGCTGTTCGAGTCCCCGCACACCGACGAGAACGACGTCCAG
ACGATATCCCACAAGTGTGAGGTCCTGCCCCTGGCACAGTACCAGGAGCGGCTGGGGGAC
GACCCGGCCCGGTACAGCACCGTGTACGACAACAACGACGTGTACTACCTCGCGGGTCAC
TACGACCCCACCCAGCAGGCCCTCACCATGGAGCCGCACATACCGCTGCAGGACAACTCC
TAG
Protein sequence:
MSVLPSGVTRFPLYSLFPTQGNRSTQIHKAVVASVYHSRYKACLARCVLSSHHLSLCPLV
SESVFSSSVSSLAAPAAPPAPPPPTHSPYPPLHLELLNCSHRLLLADKETSKMEDSKMGK
CIPETLCCPEQRQLLDGVMMEPLTIKMEQAKRTCNTCLTTSPKKVIHTDGGCSRTNPVTW
LGVGHVGGHVGGQSVVQVKREPQHMHVSELVAVKLEQASPGHKPDPPPIPPSLNNGSIPV
GIAVARQRVGDGGLLAALSQKDNQRLHDIAQVSMESMMVGGMGGVGVGSVGSGMSSVGGF
QLVREPSSGALLLLPAPPDLPHAVVWGGVPYPSTPLLLPPAPHPSHHLQLLPGDLLASTA
TLQHTHTHSTRLVTLAPAPPAQPHPHPVDKRKPLMQPIITPHTLIKIEPEPPQEKPQAFA
PEPVQQPILTTHLYYQPDFQEQACRPQGPPAAPQTPPPEVPAHKDASNQTDHIDEDDDSP
INADEDEREVACVGVAHYPEGSNIIQIQPLHQSIDGAEETTLEGLVAATVVGAVEKAINA
MEREEAADNSSSRGSHDLVIDTAGSMTPLQAQQRPPVDVSGLELLSNSIEQFERTTPSNH
AAGSDQAPLTIDTRPSTKINILIKTSPRSPSQDDDVVETHKIRFQFPLVETGDSDKPSLD
GLGLLCALAEQRFMEEVEESPSPNMPSTSRTFIKTELLSPTEREREREISSELKKERHRH
RDDASGERRRKRSTDKEERLRHKLEKMRRHKRERKDTDKSEGGELEASLRRVTACSCGMV
NCSHTSSVPSAQALVNAMEKDMRERLQDLQRQCDEKRAQLDALTPPLPALVTPAPCLQLS
ASPALSPDSDRGSSKKRKVGRPRKVSSPDSTETIVAKKPKSKNTLVGYLLAKGKLKGNIL
YSKGEPSRDDGSRTSKVRPKLKAEPVVKMYSEEDENDWGLNRSASSSMESLNEVRQKNRE
KVDRLARKHSRDDMSEIELALRRASASSDSDKERRRARKKRKSTKSKERAEANAEQTQHE
PGTQTKISKCTLTEEKIDTSPRVLTARGGLFYAGKLSAVQAPDVYAITLDGERGNKPHIL
SREETLRDAILEVSPSSVRELPSGTRVCAYWSQQYRCLYPGTVAVSSPDPHHDKFVAVEF
DDGDSGRIAIEDIRFLEPNYPIVEYENTLFTLSKRRRNTSVTEDKKHSTASTSNDVKNEA
QNDGQKEEEDRHRDRKKLKKHRKEKMRRLTSEDGPGSEYIKKKKKKHKCCEEHGKHRRHH
KKHHRKHKKRHHSICKEHSSSSGDDHRQKSSSDYMDSNKSNEDSLDSNDRLSTLIAVEKS
PEENMKTVIKKAVLSKKSLVKNAVLDFSNLNKITLKKEDAKDNLKDSGIGLEEPLPETAA
ASTSTDTSKKKAKKRTVSSTSSDGGGGVSKMAAFLPGGALWRWHGPAYRRTTRPRHRKLF
YKAIQRGEEILHVGEAAVFLSTGRADRPYIGRIAALWQARGAMAVRVHWFYHPEETAGCR
DLKYPGGLFESPHTDENDVQTISHKCEVLPLAQYQERLGDDPARYSTVYDNNDVYYLAGH
YDPTQQALTMEPHIPLQDNS