DPGLEAN21255 in OGS1.0

New model in OGS2.0  DPOGS205052
Genomic Position  scaffold616:- 22955-27751
CDS Length  3909
Paired RNAseq reads  411
Single RNAseq reads  1109
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006927 (0.0)
Best Drosophila hit  ND
Best Human hit  ND
Best NR hit (blastp)  ND
Best NR hit (blastx)  GK19503 [Drosophila willistoni] (3e-07)
GeneOntology terms  ND
InterPro families  IPR011598 Helix-loop-helix DNA-binding
Orthology group  MCL40114
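
The genomic position and CDS length above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names `parse_position` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record does not state. Under that assumption the span covers 4797 bp, which is 888 bp longer than the 3909 bp CDS; this would be consistent with non-coding sequence (for example introns) inside the gene model, but the record itself does not say so.

```python
import re


def parse_position(text):
    """Parse a position string such as 'scaffold616:- 22955-27751'.

    Returns (scaffold, strand, start, end). The layout
    'scaffold:strand start-end' is inferred from this record only.
    """
    match = re.fullmatch(r"([^:\s]+):([+-])\s*(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)


def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


scaffold, strand, start, end = parse_position("scaffold616:- 22955-27751")
genomic_span = span_length(start, end)   # 4797
cds_length = 3909                        # "CDS Length" field of this record
print(scaffold, strand, genomic_span, genomic_span - cds_length)
# prints: scaffold616 - 4797 888
```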

Nucleotide sequence:

ATGGATCCACCTTCTAGTAATAAAAAACGTACTCCAAGCCGTTGTAGAGAATGGGAAAAG
CAAAGACGTATAAAATTTAATGATGCTATATCAAAACTTGGAGATATCGTCAAATCTATA
CACAAAGCAAACAGTCACATGAATGGTGAAGAGCTTGACAATGCCAACTATCCAAAAATT
GAAATCGTTCAAAAAGCAATTATATGCTTGACAAATATAATGCATGAAAAAACACAATTA
AAGGCTGAAATTCTAGCTTTAGAAGTAAAATTAGAAGCCATTGAGAAACAAAAGCAAAAT
AAAAAAGATGTATCACTTCAAGTCACAATTGGTTTAAACAAAAAAAGTCAAAATAACAAA
TATGTAAAATTATTAATGCTTCAAAAATCAAAAGAAAACAATGGAAAGGAAAAGGAAAAA
ACAATAACCCAAAAAACAAAACCTAAGATAGATACAAATACATTAAACAAAAATCCACCA
AAGTTACCAAAATTACTGCCACTAACAAATATGAAAAAAGAAAATACTATTGTTATGTTG
CCAGCTACACCATATATATTTCCCCAACGGCCAGTATTATTTCCTCAACCACCAACTATA
GTTTTGGTAGATACAAATCTTCAAACACTTAACAAAACAACAACAATACCAGTTATCAAT
AGAAATACTAATGACATCACTAGAACAACAATGGTCAATGTGTTACCAATATCAGCATAT
TCACGTCCATTGTCTGCCCTGAAATCAAAAAAGGGGAGTAAGGCGAAAAATAACTCGCCT
AAAAAAGGGGGTAAAAAAACAAAATCCAATGATACTCCTGCAGAAAATAATAAAAATGAA
GAAAAGCCTGATGATAAACTCAATCCAGGTAACAATGAAAAAGCCGTTGGTGACCTGAAA
GCCTCAGATACAGAAACTACTGCCGAAATAAATGAGCAATCAAATTCAAATAAAGAACTA
AGCCCGAAAGAAGCACCAAACTTAAACGATACCAGCATTGTTGCCTGTGAGACAAACAAT
AATCTGCCTTCTGAAGAAACTGTCGATTTAAATAAAATTACAGAAACGCCCAAAATAGTC
GAACCATTAATGGATAAGCCAAATAATATTAAAATAGCCCCAACAAATCTTGCATGTCAT
CCAGAAAATGTACCAAAAACAGTGTCAGTGGATAATGTCAAATGTGACAAAATTCCTGTT
GAGAAAACTTTAGGCGCTGAATACAAAAACAAAGATAATAAGTTACCACCTATAATTGAC
CCAACTATTTGTGAGAATGTCGTCGATGGTGGTAATGCAAGATTGGAATTAGCAGAAGAA
TTCTTAGCCGCTTCACCAACAGCTGCTTTTTTGATGTCATTCCCATTAGTGAGTGGCAAC
AGAGCGGACAGCCCTGCTGAAGAAGCTAATAATACAAATGCAAAGGACAATCGACGCGTT
GAAATAGCGCAGCCGGTGTCATACTTTGATAAATCTAACACATCCGATAGTAAAACGAAA
GCTTCCATCAAACAGAATGTCACCACAGTTCAAAATACCAATAAGGTAGCCGAACAACAA
AAATATGAAAGCAATCTAAAAAACACAGAGGTAAAAGTCACTGCGCCTATATCAAGTGTG
ACAACCGCTAATGATAATCCATTTCTTAACTTACCTCTGCCATCAATAATATCATCCAGT
TGCAATCTAGCCGATGCTACATTTGGTATTGATTTTGATTGCCACGTCACCAAAGCTGGT
ACCACAATAACAACATCACACAGTAATTCTAACAATTTTATGTACAAAAGTGAACCATTC
AATGCAGTTAAAAGCACTATTTATAGTACCAGCAGTATATCGTCTGGACATGAATTTAAT
AGCTTGGGATTGTATCCGTGCGCTATGGATAATTATTCAAACAAAAATAAGCCTGACCTG
ACCAACGTTGAGGACAATTTAATGAAGATAAATTCATCGAGGCTGACATATGACATTGAT
TTAGGATGGTCTCACAAAAGTTTCGATTTCGTCAATTGTACAACCAGCGCGAATACGTTT
CACAAAGATACTATATTAACTACTGTGTCCACGCCATTCTCTACAAACTATAATCCGTTC
AATCCAGACTTCCACGTCCCGTTAGTTCCTAATTCTAATAAGAAAAACCTTATCAGTAAG
ACAACTACATTCCCCGATCAGATCACAAACTTTTATTCCCAAGGTGGTAACTTATGGTCT
GACGAAGTATCATCTATATACACAAATAGTAATGTTTCGAAGAATTTTATATCGAAGCAA
CAAAATTATTTTCCCGTGGAACATTTACAACCGAATGTCCATACGAAAACGAGCACAGCC
AAACAGTTTGATACGAAACATATATCAGAAAGCGCTACTGAAACTAACTTAAAACCGGCG
ACTGCTGTGGGACAAGTGGTTGAAAAATATACCAAAAAGTCTCCAAGCAAAATGCATATT
AATTGGATGACGTCAGAGATAAGACCAATGCAAAATAATTGCAATCAAACAACAGCGGAA
ATGAAAGAAACCAAATTACCATATTCCCACGTGGAGCAACTTCCAAAAAAACAAATTCCA
CAAAATGAAAGCAATTATTTTCCTATCAATATGCACCATTTCCCAACTCAAGCTAATCAC
GAAGATGTTCAAGTATGGCCGACAGCTCGACCCGCTGGCACTACAGAAATAAGTATCGAT
CCGCCGCCGATAAATTTGCCAACTTTAATTGGAGATCTAGCACTTGGTCCACATGACAAG
AAGAAGGCTGATATTCTAAACAGATCCGTTCCTCATTCCGATTTACAAAACTGTGGAAAT
TTCCTATCTGTCACCCAGTTAATGAATCGCACCACAGAAAATATGCCACAGCGATCTAAT
GTGCTAATGGCTGATCAGAAATCCCTAGCGGCGAAACAAAATTTACCTCATATCGTTAAT
GATAATAGAAAAACGATGACTACGTCTCAGACGAATGTGGGTTACGGTTTTAACGACTCG
AAAGCACTTAACTCTTATGAAAATATAAATCAATTTCTACAAAATAAATCAAAAACTTCC
CTGAAACCCGAAAAAAATGCAAAAGCGCATAAAAATAATTATTCCGCCGAAGCACTGATA
CGTGGTGGAACAAATTACAATCAAAAACTACAAGACCATTCCAGTAACAAATTTATGATG
CCCGCTCAGAAATATAACGATTTTAACATCCAAGATTCGGGAGTCGCCCAAGTGTCTCAT
TTTCCATCTATCATTGACTATTCCGACAACAGCTACACTGGACAGCAATTCACAGGGACA
GCATTATACAATTCAACTACAAACACGATATCAAATTCTTTTTACTCCAATTTCATGCCG
GGAAGCAGTAATTTGATGTCGGGAAATTACACGGCAGCGCCTTTCAGCAGCGAGTTTGTT
GATTACAACCAGACGATGGAATGTAACTATACGAACCACAAATATAACGAGGTCAAAATG
AGAAACAACACAACCGCGTTCCAACAGGATAAAGAAACAACTAATTACAAGAGTTCAAGA
AGAGAATCTGCAGCCAAACATAAATTGGAATGTTCGAAGAAAGATTCCAATAAAAAATAT
CAAAGTAAAAGACCAAAATTAACTAACGAAGTCGAAGAATGGAACGATAGTTCCCATTTG
CTCTGGCAGAATAAAACGCCATCCAAACGGCATCAAAACTTAATGTCAGATGAAATTCCA
TTTCCGAACTATGTGGGAAATCAAATGCCAACTCAGTATCAGCCAGATTTCTTTAATAGC
CATATAATGCCATCCAACATGCAGGGCGTGGCTAATGCTGATCGTTCCTTGGCAAGCTTC
CCAGTAGCATCTCGAGCTAACTTTAACCTAAGCGCTTTATTTCCGGAGATAACTATGAAA
GTGCAATGA

Protein sequence:

MDPPSSNKKRTPSRCREWEKQRRIKFNDAISKLGDIVKSIHKANSHMNGEELDNANYPKI
EIVQKAIICLTNIMHEKTQLKAEILALEVKLEAIEKQKQNKKDVSLQVTIGLNKKSQNNK
YVKLLMLQKSKENNGKEKEKTITQKTKPKIDTNTLNKNPPKLPKLLPLTNMKKENTIVML
PATPYIFPQRPVLFPQPPTIVLVDTNLQTLNKTTTIPVINRNTNDITRTTMVNVLPISAY
SRPLSALKSKKGSKAKNNSPKKGGKKTKSNDTPAENNKNEEKPDDKLNPGNNEKAVGDLK
ASDTETTAEINEQSNSNKELSPKEAPNLNDTSIVACETNNNLPSEETVDLNKITETPKIV
EPLMDKPNNIKIAPTNLACHPENVPKTVSVDNVKCDKIPVEKTLGAEYKNKDNKLPPIID
PTICENVVDGGNARLELAEEFLAASPTAAFLMSFPLVSGNRADSPAEEANNTNAKDNRRV
EIAQPVSYFDKSNTSDSKTKASIKQNVTTVQNTNKVAEQQKYESNLKNTEVKVTAPISSV
TTANDNPFLNLPLPSIISSSCNLADATFGIDFDCHVTKAGTTITTSHSNSNNFMYKSEPF
NAVKSTIYSTSSISSGHEFNSLGLYPCAMDNYSNKNKPDLTNVEDNLMKINSSRLTYDID
LGWSHKSFDFVNCTTSANTFHKDTILTTVSTPFSTNYNPFNPDFHVPLVPNSNKKNLISK
TTTFPDQITNFYSQGGNLWSDEVSSIYTNSNVSKNFISKQQNYFPVEHLQPNVHTKTSTA
KQFDTKHISESATETNLKPATAVGQVVEKYTKKSPSKMHINWMTSEIRPMQNNCNQTTAE
MKETKLPYSHVEQLPKKQIPQNESNYFPINMHHFPTQANHEDVQVWPTARPAGTTEISID
PPPINLPTLIGDLALGPHDKKKADILNRSVPHSDLQNCGNFLSVTQLMNRTTENMPQRSN
VLMADQKSLAAKQNLPHIVNDNRKTMTTSQTNVGYGFNDSKALNSYENINQFLQNKSKTS
LKPEKNAKAHKNNYSAEALIRGGTNYNQKLQDHSSNKFMMPAQKYNDFNIQDSGVAQVSH
FPSIIDYSDNSYTGQQFTGTALYNSTTNTISNSFYSNFMPGSSNLMSGNYTAAPFSSEFV
DYNQTMECNYTNHKYNEVKMRNNTTAFQQDKETTNYKSSRRESAAKHKLECSKKDSNKKY
QSKRPKLTNEVEEWNDSSHLLWQNKTPSKRHQNLMSDEIPFPNYVGNQMPTQYQPDFFNS
HIMPSNMQGVANADRSLASFPVASRANFNLSALFPEITMKVQ