DPGLEAN17843 in OGS1.0

New model in OGS2.0  DPOGS201152
Genomic Position  scaffold575:- 25436-40435
CDS Length  4743
Paired RNAseq reads  7309
Single RNAseq reads  18043
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA003928 (0.0)
Best Drosophila hit  winged eye, isoform B (7e-57)
Best Human hit  BAH and coiled-coil domain-containing protein 1 (1e-38)
Best NR hit (blastp)  phd finger transcription factor [Aedes aegypti] (3e-122)
Best NR hit (blastx)  PREDICTED: similar to phd finger transcription factor [Tribolium castaneum] (3e-70)
GeneOntology terms  GO:0003677 DNA binding
InterPro families  IPR001025 Bromo adjacent homology (BAH) domain
Orthology group  MCL23954

Nucleotide sequence:

ATGTCTGTCCTTCCCTCAGGTGTAACTCGGTTCCCGTTGTACTCCCTGTTCCCGACACAA
GGCAACAGATCAACTCAGATACATAAAGCTGTAGTCGCCTCCGTCTATCACTCTCGTTAT
AAAGCCTGTCTTGCGAGGTGTGTACTCTCCTCTCACCACCTGTCCTTGTGTCCGTTAGTG
AGTGAGAGCGTGTTCTCGTCCTCCGTGTCGTCGCTGGCAGCTCCTGCAGCTCCTCCAGCT
CCTCCGCCGCCGACTCACTCCCCTTACCCTCCGCTACACCTGGAGCTGCTCAACTGCTCG
CACCGGCTGCTGCTCGCAGACAAGGAGACCAGTAAGATGGAAGACTCGAAGATGGGTAAA
TGTATCCCGGAGACTCTTTGCTGTCCGGAACAGAGACAGCTGCTGGACGGAGTGATGATG
GAGCCGCTCACCATCAAGATGGAGCAGGCTAAAAGGACCTGCAACACCTGTCTCACCACC
TCGCCCAAGAAGGTGATCCACACGGACGGCGGCTGCAGTCGGACCAACCCCGTCACCTGG
CTGGGTGTGGGTCATGTCGGGGGACACGTCGGGGGTCAGAGCGTGGTCCAGGTGAAACGC
GAGCCACAACACATGCACGTTTCCGAGCTGGTAGCCGTCAAGCTGGAGCAGGCCTCGCCC
GGACACAAGCCGGACCCGCCGCCCATACCGCCCTCACTCAATAACGGTTCTATCCCGGTC
GGTATAGCGGTGGCCAGGCAACGAGTGGGCGACGGAGGCCTGCTGGCCGCGCTGTCACAG
AAAGACAACCAGCGACTACATGACATCGCGCAAGTGTCGATGGAGAGTATGATGGTGGGC
GGCATGGGTGGCGTGGGCGTTGGGTCCGTGGGCAGTGGCATGTCGTCCGTGGGCGGTTTT
CAGCTGGTCCGAGAGCCGTCCTCGGGAGCTCTGCTGCTCCTACCCGCTCCCCCCGACCTC
CCTCACGCCGTGGTGTGGGGCGGAGTGCCTTATCCGTCGACACCCTTGCTGCTGCCGCCC
GCCCCGCATCCGTCTCACCACCTCCAGCTGCTCCCGGGCGATCTGCTGGCGTCTACTGCT
ACGCTACAACACACCCACACTCATTCCACGCGGCTGGTCACCCTCGCCCCCGCTCCCCCC
GCACAGCCTCACCCTCACCCGGTCGATAAGAGGAAACCCCTCATGCAGCCGATAATAACC
CCTCACACGCTCATCAAAATAGAGCCGGAGCCGCCGCAGGAGAAGCCCCAGGCCTTCGCT
CCCGAACCCGTCCAGCAGCCGATACTCACCACGCATCTCTACTACCAGCCGGATTTCCAA
GAGCAGGCGTGCAGGCCTCAGGGTCCCCCGGCGGCCCCTCAGACTCCGCCGCCCGAAGTC
CCCGCACACAAGGACGCCTCTAACCAGACAGATCACATTGACGAGGACGACGACTCTCCT
ATCAACGCGGACGAGGACGAGCGTGAGGTCGCGTGTGTCGGGGTCGCGCACTACCCGGAG
GGCTCGAACATTATACAGATACAACCATTGCATCAGTCCATAGACGGCGCGGAGGAAACC
ACTCTGGAGGGACTCGTGGCGGCCACCGTGGTGGGGGCGGTGGAGAAGGCGATCAATGCC
ATGGAGAGGGAAGAGGCCGCCGATAACTCCAGCAGCAGAGGCTCCCACGACCTGGTCATC
GACACAGCCGGCAGCATGACGCCCCTGCAGGCGCAGCAACGCCCGCCCGTCGATGTCAGC
GGCCTGGAACTCCTCTCGAACAGTATCGAGCAGTTTGAGCGGACGACTCCCAGCAACCAC
GCGGCCGGCTCCGACCAGGCGCCGCTTACGATCGACACGAGGCCGTCTACTAAAATAAAC
ATACTCATAAAAACCTCACCGAGGTCTCCGTCGCAAGACGATGATGTAGTAGAGACACAT
AAGATAAGGTTCCAGTTCCCTCTAGTGGAGACAGGTGACAGCGACAAGCCGTCCCTGGAT
GGGTTGGGGTTGCTGTGTGCTTTGGCGGAGCAGAGATTCATGGAGGAGGTTGAGGAGAGC
CCCTCCCCAAATATGCCCTCGACCTCGCGAACTTTTATTAAGACGGAGCTGCTGAGTCCC
ACAGAAAGGGAAAGGGAGAGGGAGATATCGTCCGAATTGAAAAAAGAAAGACACAGACAC
AGAGATGACGCTTCGGGCGAGAGAAGGAGAAAGAGAAGTACAGATAAAGAAGAGAGATTA
CGACACAAGTTGGAAAAGATGCGTCGACACAAAAGGGAGAGAAAGGACACCGACAAATCT
GAAGGTGGCGAGTTGGAGGCGAGTTTACGGCGAGTGACGGCATGTTCCTGTGGGATGGTG
AATTGTTCCCACACGTCGAGCGTCCCGTCCGCGCAGGCGCTGGTGAACGCCATGGAGAAA
GATATGAGGGAACGTCTACAAGACCTGCAGCGGCAGTGTGACGAGAAGCGCGCTCAGCTG
GACGCCCTTACTCCGCCACTGCCGGCGCTCGTCACGCCCGCACCGTGCCTACAGCTCAGT
GCGAGTCCAGCTCTCTCACCGGACTCCGATAGAGGCTCATCCAAGAAACGTAAAGTGGGA
AGACCGAGGAAAGTCTCCAGTCCCGACTCCACGGAAACCATCGTAGCCAAGAAACCGAAG
TCGAAAAACACTCTCGTCGGTTACTTACTGGCGAAAGGAAAACTTAAAGGGAACATATTG
TACTCAAAGGGCGAACCCTCGAGAGACGACGGGAGTAGAACTAGTAAGGTCCGGCCGAAA
CTAAAAGCGGAGCCTGTCGTGAAGATGTACTCCGAGGAAGACGAAAACGATTGGGGGCTC
AACAGATCTGCGAGCTCCTCCATGGAAAGTCTCAACGAGGTGAGGCAGAAGAACAGGGAG
AAGGTAGACCGGTTAGCAAGGAAGCATTCGAGAGACGACATGTCAGAAATAGAGCTAGCC
CTGCGGAGAGCAAGCGCCTCCAGTGATAGTGACAAGGAAAGAAGACGTGCAAGAAAGAAG
CGGAAGAGTACTAAGTCGAAAGAGAGAGCAGAGGCGAACGCGGAGCAAACACAGCACGAA
CCAGGGACTCAGACGAAGATATCGAAATGTACTTTGACAGAAGAAAAAATAGACACATCG
CCGAGAGTGCTCACGGCGAGAGGAGGACTCTTCTACGCGGGGAAGTTGAGCGCTGTACAG
GCCCCTGACGTGTACGCAATAACATTAGACGGAGAGAGAGGAAATAAACCGCACATACTG
TCGAGGGAGGAAACATTGAGAGATGCCATCCTGGAGGTGAGTCCGTCTAGTGTCCGGGAG
CTTCCATCCGGTACCCGTGTATGTGCGTACTGGTCCCAGCAGTACCGCTGTTTGTATCCG
GGAACAGTGGCCGTGTCTTCCCCTGACCCACACCATGACAAATTTGTCGCGGTGGAATTC
GATGACGGTGACTCCGGTAGGATAGCCATCGAAGATATCAGATTCTTGGAACCCAATTAT
CCTATCGTGGAATACGAAAACACATTGTTTACTCTGAGCAAACGGCGGCGTAATACGAGC
GTGACGGAAGATAAAAAACACTCGACGGCTTCCACGAGCAATGACGTCAAGAACGAAGCA
CAGAACGATGGACAGAAGGAGGAAGAAGATCGACACAGAGATAGGAAGAAGTTAAAGAAA
CATCGCAAAGAAAAGATGAGGCGACTGACGAGCGAGGACGGACCCGGCTCAGAATATATA
AAGAAGAAGAAAAAGAAACACAAGTGCTGCGAGGAACACGGGAAGCATCGCAGGCATCAC
AAGAAACATCACCGGAAACACAAGAAGAGACATCACTCAATATGCAAGGAACATTCCAGT
TCCTCGGGCGACGATCACAGACAGAAATCATCCTCGGACTACATGGACTCCAACAAGTCC
AACGAGGACTCGCTCGACTCCAACGACCGCCTCTCCACTCTCATAGCGGTCGAGAAGTCT
CCGGAAGAAAACATGAAAACTGTCATCAAGAAGGCCGTGCTGTCCAAGAAGAGTCTAGTG
AAGAATGCGGTGCTGGATTTCAGTAATTTGAATAAAATAACACTCAAGAAGGAAGATGCG
AAGGATAACCTGAAGGATAGCGGGATCGGCCTGGAGGAGCCGCTCCCAGAGACCGCGGCA
GCGTCCACGTCCACCGATACTTCTAAAAAGAAGGCGAAGAAGCGCACGGTGTCCTCTACA
TCCTCGGACGGCGGCGGCGGTGTCAGCAAGATGGCGGCCTTCCTCCCGGGGGGAGCGTTG
TGGAGGTGGCACGGGCCCGCCTACAGGAGGACCACCAGGCCTCGGCACAGGAAACTATTC
TACAAGGCCATACAACGCGGGGAGGAAATACTACATGTGGGTGAGGCGGCGGTGTTCCTG
TCCACCGGCCGCGCCGACCGCCCCTACATAGGACGGATCGCGGCCCTTTGGCAGGCCCGA
GGTGCCATGGCGGTCAGGGTACACTGGTTCTACCACCCTGAGGAGACGGCCGGCTGCCGA
GACTTGAAGTACCCGGGCGGGCTGTTCGAGTCCCCGCACACCGACGAGAACGACGTCCAG
ACGATATCCCACAAGTGTGAGGTCCTGCCCCTGGCACAGTACCAGGAGCGGCTGGGGGAC
GACCCGGCCCGGTACAGCACCGTGTACGACAACAACGACGTGTACTACCTCGCGGGTCAC
TACGACCCCACCCAGCAGGCCCTCACCATGGAGCCGCACATACCGCTGCAGGACAACTCC
TAG

Protein sequence:

MSVLPSGVTRFPLYSLFPTQGNRSTQIHKAVVASVYHSRYKACLARCVLSSHHLSLCPLV
SESVFSSSVSSLAAPAAPPAPPPPTHSPYPPLHLELLNCSHRLLLADKETSKMEDSKMGK
CIPETLCCPEQRQLLDGVMMEPLTIKMEQAKRTCNTCLTTSPKKVIHTDGGCSRTNPVTW
LGVGHVGGHVGGQSVVQVKREPQHMHVSELVAVKLEQASPGHKPDPPPIPPSLNNGSIPV
GIAVARQRVGDGGLLAALSQKDNQRLHDIAQVSMESMMVGGMGGVGVGSVGSGMSSVGGF
QLVREPSSGALLLLPAPPDLPHAVVWGGVPYPSTPLLLPPAPHPSHHLQLLPGDLLASTA
TLQHTHTHSTRLVTLAPAPPAQPHPHPVDKRKPLMQPIITPHTLIKIEPEPPQEKPQAFA
PEPVQQPILTTHLYYQPDFQEQACRPQGPPAAPQTPPPEVPAHKDASNQTDHIDEDDDSP
INADEDEREVACVGVAHYPEGSNIIQIQPLHQSIDGAEETTLEGLVAATVVGAVEKAINA
MEREEAADNSSSRGSHDLVIDTAGSMTPLQAQQRPPVDVSGLELLSNSIEQFERTTPSNH
AAGSDQAPLTIDTRPSTKINILIKTSPRSPSQDDDVVETHKIRFQFPLVETGDSDKPSLD
GLGLLCALAEQRFMEEVEESPSPNMPSTSRTFIKTELLSPTEREREREISSELKKERHRH
RDDASGERRRKRSTDKEERLRHKLEKMRRHKRERKDTDKSEGGELEASLRRVTACSCGMV
NCSHTSSVPSAQALVNAMEKDMRERLQDLQRQCDEKRAQLDALTPPLPALVTPAPCLQLS
ASPALSPDSDRGSSKKRKVGRPRKVSSPDSTETIVAKKPKSKNTLVGYLLAKGKLKGNIL
YSKGEPSRDDGSRTSKVRPKLKAEPVVKMYSEEDENDWGLNRSASSSMESLNEVRQKNRE
KVDRLARKHSRDDMSEIELALRRASASSDSDKERRRARKKRKSTKSKERAEANAEQTQHE
PGTQTKISKCTLTEEKIDTSPRVLTARGGLFYAGKLSAVQAPDVYAITLDGERGNKPHIL
SREETLRDAILEVSPSSVRELPSGTRVCAYWSQQYRCLYPGTVAVSSPDPHHDKFVAVEF
DDGDSGRIAIEDIRFLEPNYPIVEYENTLFTLSKRRRNTSVTEDKKHSTASTSNDVKNEA
QNDGQKEEEDRHRDRKKLKKHRKEKMRRLTSEDGPGSEYIKKKKKKHKCCEEHGKHRRHH
KKHHRKHKKRHHSICKEHSSSSGDDHRQKSSSDYMDSNKSNEDSLDSNDRLSTLIAVEKS
PEENMKTVIKKAVLSKKSLVKNAVLDFSNLNKITLKKEDAKDNLKDSGIGLEEPLPETAA
ASTSTDTSKKKAKKRTVSSTSSDGGGGVSKMAAFLPGGALWRWHGPAYRRTTRPRHRKLF
YKAIQRGEEILHVGEAAVFLSTGRADRPYIGRIAALWQARGAMAVRVHWFYHPEETAGCR
DLKYPGGLFESPHTDENDVQTISHKCEVLPLAQYQERLGDDPARYSTVYDNNDVYYLAGH
YDPTQQALTMEPHIPLQDNS