New model in OGS2.0 | DPOGS205052  |
---|---|
Genomic Position | scaffold616:- 22955-27751 |
See gene structure | |
CDS Length | 3909 |
Paired RNAseq reads   | 411 |
Single RNAseq reads   | 1109 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006927 (0.0) |
Best Drosophila hit   | ND |
Best Human hit | ND |
Best NR hit (blastp)   | ND |
Best NR hit (blastx)   | GK19503 [Drosophila willistoni] (3e-07) |
GeneOntology terms   | ND |
InterPro families   | IPR011598 Helix-loop-helix DNA-binding |
Orthology group | MCL40114 |
Nucleotide sequence:
ATGGATCCACCTTCTAGTAATAAAAAACGTACTCCAAGCCGTTGTAGAGAATGGGAAAAG
CAAAGACGTATAAAATTTAATGATGCTATATCAAAACTTGGAGATATCGTCAAATCTATA
CACAAAGCAAACAGTCACATGAATGGTGAAGAGCTTGACAATGCCAACTATCCAAAAATT
GAAATCGTTCAAAAAGCAATTATATGCTTGACAAATATAATGCATGAAAAAACACAATTA
AAGGCTGAAATTCTAGCTTTAGAAGTAAAATTAGAAGCCATTGAGAAACAAAAGCAAAAT
AAAAAAGATGTATCACTTCAAGTCACAATTGGTTTAAACAAAAAAAGTCAAAATAACAAA
TATGTAAAATTATTAATGCTTCAAAAATCAAAAGAAAACAATGGAAAGGAAAAGGAAAAA
ACAATAACCCAAAAAACAAAACCTAAGATAGATACAAATACATTAAACAAAAATCCACCA
AAGTTACCAAAATTACTGCCACTAACAAATATGAAAAAAGAAAATACTATTGTTATGTTG
CCAGCTACACCATATATATTTCCCCAACGGCCAGTATTATTTCCTCAACCACCAACTATA
GTTTTGGTAGATACAAATCTTCAAACACTTAACAAAACAACAACAATACCAGTTATCAAT
AGAAATACTAATGACATCACTAGAACAACAATGGTCAATGTGTTACCAATATCAGCATAT
TCACGTCCATTGTCTGCCCTGAAATCAAAAAAGGGGAGTAAGGCGAAAAATAACTCGCCT
AAAAAAGGGGGTAAAAAAACAAAATCCAATGATACTCCTGCAGAAAATAATAAAAATGAA
GAAAAGCCTGATGATAAACTCAATCCAGGTAACAATGAAAAAGCCGTTGGTGACCTGAAA
GCCTCAGATACAGAAACTACTGCCGAAATAAATGAGCAATCAAATTCAAATAAAGAACTA
AGCCCGAAAGAAGCACCAAACTTAAACGATACCAGCATTGTTGCCTGTGAGACAAACAAT
AATCTGCCTTCTGAAGAAACTGTCGATTTAAATAAAATTACAGAAACGCCCAAAATAGTC
GAACCATTAATGGATAAGCCAAATAATATTAAAATAGCCCCAACAAATCTTGCATGTCAT
CCAGAAAATGTACCAAAAACAGTGTCAGTGGATAATGTCAAATGTGACAAAATTCCTGTT
GAGAAAACTTTAGGCGCTGAATACAAAAACAAAGATAATAAGTTACCACCTATAATTGAC
CCAACTATTTGTGAGAATGTCGTCGATGGTGGTAATGCAAGATTGGAATTAGCAGAAGAA
TTCTTAGCCGCTTCACCAACAGCTGCTTTTTTGATGTCATTCCCATTAGTGAGTGGCAAC
AGAGCGGACAGCCCTGCTGAAGAAGCTAATAATACAAATGCAAAGGACAATCGACGCGTT
GAAATAGCGCAGCCGGTGTCATACTTTGATAAATCTAACACATCCGATAGTAAAACGAAA
GCTTCCATCAAACAGAATGTCACCACAGTTCAAAATACCAATAAGGTAGCCGAACAACAA
AAATATGAAAGCAATCTAAAAAACACAGAGGTAAAAGTCACTGCGCCTATATCAAGTGTG
ACAACCGCTAATGATAATCCATTTCTTAACTTACCTCTGCCATCAATAATATCATCCAGT
TGCAATCTAGCCGATGCTACATTTGGTATTGATTTTGATTGCCACGTCACCAAAGCTGGT
ACCACAATAACAACATCACACAGTAATTCTAACAATTTTATGTACAAAAGTGAACCATTC
AATGCAGTTAAAAGCACTATTTATAGTACCAGCAGTATATCGTCTGGACATGAATTTAAT
AGCTTGGGATTGTATCCGTGCGCTATGGATAATTATTCAAACAAAAATAAGCCTGACCTG
ACCAACGTTGAGGACAATTTAATGAAGATAAATTCATCGAGGCTGACATATGACATTGAT
TTAGGATGGTCTCACAAAAGTTTCGATTTCGTCAATTGTACAACCAGCGCGAATACGTTT
CACAAAGATACTATATTAACTACTGTGTCCACGCCATTCTCTACAAACTATAATCCGTTC
AATCCAGACTTCCACGTCCCGTTAGTTCCTAATTCTAATAAGAAAAACCTTATCAGTAAG
ACAACTACATTCCCCGATCAGATCACAAACTTTTATTCCCAAGGTGGTAACTTATGGTCT
GACGAAGTATCATCTATATACACAAATAGTAATGTTTCGAAGAATTTTATATCGAAGCAA
CAAAATTATTTTCCCGTGGAACATTTACAACCGAATGTCCATACGAAAACGAGCACAGCC
AAACAGTTTGATACGAAACATATATCAGAAAGCGCTACTGAAACTAACTTAAAACCGGCG
ACTGCTGTGGGACAAGTGGTTGAAAAATATACCAAAAAGTCTCCAAGCAAAATGCATATT
AATTGGATGACGTCAGAGATAAGACCAATGCAAAATAATTGCAATCAAACAACAGCGGAA
ATGAAAGAAACCAAATTACCATATTCCCACGTGGAGCAACTTCCAAAAAAACAAATTCCA
CAAAATGAAAGCAATTATTTTCCTATCAATATGCACCATTTCCCAACTCAAGCTAATCAC
GAAGATGTTCAAGTATGGCCGACAGCTCGACCCGCTGGCACTACAGAAATAAGTATCGAT
CCGCCGCCGATAAATTTGCCAACTTTAATTGGAGATCTAGCACTTGGTCCACATGACAAG
AAGAAGGCTGATATTCTAAACAGATCCGTTCCTCATTCCGATTTACAAAACTGTGGAAAT
TTCCTATCTGTCACCCAGTTAATGAATCGCACCACAGAAAATATGCCACAGCGATCTAAT
GTGCTAATGGCTGATCAGAAATCCCTAGCGGCGAAACAAAATTTACCTCATATCGTTAAT
GATAATAGAAAAACGATGACTACGTCTCAGACGAATGTGGGTTACGGTTTTAACGACTCG
AAAGCACTTAACTCTTATGAAAATATAAATCAATTTCTACAAAATAAATCAAAAACTTCC
CTGAAACCCGAAAAAAATGCAAAAGCGCATAAAAATAATTATTCCGCCGAAGCACTGATA
CGTGGTGGAACAAATTACAATCAAAAACTACAAGACCATTCCAGTAACAAATTTATGATG
CCCGCTCAGAAATATAACGATTTTAACATCCAAGATTCGGGAGTCGCCCAAGTGTCTCAT
TTTCCATCTATCATTGACTATTCCGACAACAGCTACACTGGACAGCAATTCACAGGGACA
GCATTATACAATTCAACTACAAACACGATATCAAATTCTTTTTACTCCAATTTCATGCCG
GGAAGCAGTAATTTGATGTCGGGAAATTACACGGCAGCGCCTTTCAGCAGCGAGTTTGTT
GATTACAACCAGACGATGGAATGTAACTATACGAACCACAAATATAACGAGGTCAAAATG
AGAAACAACACAACCGCGTTCCAACAGGATAAAGAAACAACTAATTACAAGAGTTCAAGA
AGAGAATCTGCAGCCAAACATAAATTGGAATGTTCGAAGAAAGATTCCAATAAAAAATAT
CAAAGTAAAAGACCAAAATTAACTAACGAAGTCGAAGAATGGAACGATAGTTCCCATTTG
CTCTGGCAGAATAAAACGCCATCCAAACGGCATCAAAACTTAATGTCAGATGAAATTCCA
TTTCCGAACTATGTGGGAAATCAAATGCCAACTCAGTATCAGCCAGATTTCTTTAATAGC
CATATAATGCCATCCAACATGCAGGGCGTGGCTAATGCTGATCGTTCCTTGGCAAGCTTC
CCAGTAGCATCTCGAGCTAACTTTAACCTAAGCGCTTTATTTCCGGAGATAACTATGAAA
GTGCAATGA
Protein sequence:
MDPPSSNKKRTPSRCREWEKQRRIKFNDAISKLGDIVKSIHKANSHMNGEELDNANYPKI
EIVQKAIICLTNIMHEKTQLKAEILALEVKLEAIEKQKQNKKDVSLQVTIGLNKKSQNNK
YVKLLMLQKSKENNGKEKEKTITQKTKPKIDTNTLNKNPPKLPKLLPLTNMKKENTIVML
PATPYIFPQRPVLFPQPPTIVLVDTNLQTLNKTTTIPVINRNTNDITRTTMVNVLPISAY
SRPLSALKSKKGSKAKNNSPKKGGKKTKSNDTPAENNKNEEKPDDKLNPGNNEKAVGDLK
ASDTETTAEINEQSNSNKELSPKEAPNLNDTSIVACETNNNLPSEETVDLNKITETPKIV
EPLMDKPNNIKIAPTNLACHPENVPKTVSVDNVKCDKIPVEKTLGAEYKNKDNKLPPIID
PTICENVVDGGNARLELAEEFLAASPTAAFLMSFPLVSGNRADSPAEEANNTNAKDNRRV
EIAQPVSYFDKSNTSDSKTKASIKQNVTTVQNTNKVAEQQKYESNLKNTEVKVTAPISSV
TTANDNPFLNLPLPSIISSSCNLADATFGIDFDCHVTKAGTTITTSHSNSNNFMYKSEPF
NAVKSTIYSTSSISSGHEFNSLGLYPCAMDNYSNKNKPDLTNVEDNLMKINSSRLTYDID
LGWSHKSFDFVNCTTSANTFHKDTILTTVSTPFSTNYNPFNPDFHVPLVPNSNKKNLISK
TTTFPDQITNFYSQGGNLWSDEVSSIYTNSNVSKNFISKQQNYFPVEHLQPNVHTKTSTA
KQFDTKHISESATETNLKPATAVGQVVEKYTKKSPSKMHINWMTSEIRPMQNNCNQTTAE
MKETKLPYSHVEQLPKKQIPQNESNYFPINMHHFPTQANHEDVQVWPTARPAGTTEISID
PPPINLPTLIGDLALGPHDKKKADILNRSVPHSDLQNCGNFLSVTQLMNRTTENMPQRSN
VLMADQKSLAAKQNLPHIVNDNRKTMTTSQTNVGYGFNDSKALNSYENINQFLQNKSKTS
LKPEKNAKAHKNNYSAEALIRGGTNYNQKLQDHSSNKFMMPAQKYNDFNIQDSGVAQVSH
FPSIIDYSDNSYTGQQFTGTALYNSTTNTISNSFYSNFMPGSSNLMSGNYTAAPFSSEFV
DYNQTMECNYTNHKYNEVKMRNNTTAFQQDKETTNYKSSRRESAAKHKLECSKKDSNKKY
QSKRPKLTNEVEEWNDSSHLLWQNKTPSKRHQNLMSDEIPFPNYVGNQMPTQYQPDFFNS
HIMPSNMQGVANADRSLASFPVASRANFNLSALFPEITMKVQ