New model in OGS2.0 | DPOGS211010  |
---|---|
Genomic Position | scaffold711:+ 23380-33028 |
See gene structure | |
CDS Length | 1518 |
Paired RNAseq reads   | 772 |
Single RNAseq reads   | 2526 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA006492 (7e-48) |
Best Drosophila hit   | fruitless, isoform A (6e-61) |
Best Human hit | kelch-like protein 20 (3e-10) |
Best NR hit (blastp)   | fruitless [Tribolium castaneum] (3e-70) |
Best NR hit (blastx)   | fru [Drosophila virilis] (5e-59) |
GeneOntology terms    | GO:0016545 male courtship behavior, veined wing vibration GO:0008049 male courtship behavior GO:0007618 mating GO:0003700 sequence-specific DNA binding transcription factor activity GO:0045433 male courtship behavior, veined wing generated song production GO:0007517 muscle organ development GO:0007530 sex determination GO:0007275 multicellular organismal development GO:0005634 nucleus GO:0003702 RNA polymerase II transcription factor activity GO:0007617 mating behavior GO:0016543 male courtship behavior, orientation prior to leg tapping and wing vibration GO:0007620 copulation GO:0007417 central nervous system development GO:0046661 male sex differentiation GO:0048047 mating behavior, sex discrimination GO:0005515 protein binding GO:0008270 zinc ion binding GO:0005622 intracellular GO:0002118 aggressive behavior |
InterPro families    | IPR000210 BTB/POZ-like IPR015880 Zinc finger, C2H2-like IPR011333 BTB/POZ fold IPR007087 Zinc finger, C2H2-type IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding IPR013069 BTB/POZ |
Orthology group | MCL19040 |
Nucleotide sequence:
ATGGACCAGCAATTTTGTTTGCGCTGGAACAATCATCCAACCAACCTGACAGATGTGCTT
GCAAGCCTATTACAGAGAGAGGCACTATGTGATGTTACACTAGCATGCGATGGGGAAACA
GTCAAGGCACACCAGACAATACTATCAGCGTGTTCCCCGTATTTTGAAAGTATATTCTTA
CAAAATTCACACCCGCATCCCATTATATTCCTTAAAGATGTGAGGTTCTCAGAGATGAAA
TCTCTGTTAGATTTTATGTATAAGGGAGAGGTGAATGTTGGCCAAAATATGCTACCAATG
TTCCTAAAGACTGCCGAAAGTTTACAAGTTAGAGGTTTGACAGAGAATAATACGTTGAAT
ACTAAGTCAGAGGAGCGGTCGACTCCCAGCGTGAGTGCTGAGAATTTATCCCGCGGTGAG
TTCGCCACACCGCCTGCTGCTCATGCGCTCGCAGCTCTAACACCGCTGCCGCAGTCACAG
TCACTGCCGCAGTCGCTGCCGCAATCGCTGCCGCAGCCGCTGCCGTTGCCGTTGCCGCCG
CACGCGCCGCTCGAGAAGCGACGCAGGAAGAACTCCACCGCGCCAAGGGACGATATCGAT
CTGTCCTACCGACATTATGAGGGGCACGTGAAGGCTAGCAAAGGTTCAACCGGCTCTGGT
TCCGAGCCGTCGACTCCTCCACCAGCTCACGGCCGCGCCGCTCGCTCCCCAGCATTGCTC
GTTAAACAAGAGCCAGACTACACGCAACACCACTCCTACGACCAGACTCACCTCACGGGA
ATGGGAGTGAATGATATGGCATCAATGATAACCCAGCACTCGATGAACAACGATTGCAAC
GAGAGCGAACCCGTAATGCCGCCTCACCCCGACCAGACGGACACCATTGACGGTGATAAA
TATGCGGACGAGAATGATATACCTCAAGAGCATTTCGGCCAAAATATAACAAATATTGAG
AATATAGTAAAATCTTTTAGGATAGCATTAAATCATAGATCACATAGCCCAATGACCTGT
CAGATATGTGGTAAAACTGTAAGCAATATCAAGAAGCACATGAAATCACACAATCCAGAA
CAACACAAATGCCCTCTCTGTCCGATTATACTAACAAGGGCTGACAATCTAAAACGCCAT
CTAAGAATGAAACATTGTTCAGTACTAGGCTCGAAGGGCTGGCACATGAGGCTGACGTTC
GAGCGTGTGGCGGGCGCCCTCAACCTGCACCGCTGCAAGCTGTGCGGGAAGGTGGTCACT
CACATCAGGAACCACTATCACGTGCACTTCCCTGGACGGTTCGAGTGCCCGCTATGCCGA
GCCACCTACACGCGCTCGGACAACCTGCGCACGCACTGCAAGTTCAAGCATCCGGCTTAC
AACCCCGACACGCGCAAGTTCGAGGGCGCGCCGGTGGCCGTGGGCGCGGGCGTGGGCGTG
GGTGGCGCTCACGGGCCTCACGCAGCTCATGCGCCGCCGCCGTTGTTCGCGAACCACCTG
GACGCGGGCTTCGACTGA
Protein sequence:
MDQQFCLRWNNHPTNLTDVLASLLQREALCDVTLACDGETVKAHQTILSACSPYFESIFL
QNSHPHPIIFLKDVRFSEMKSLLDFMYKGEVNVGQNMLPMFLKTAESLQVRGLTENNTLN
TKSEERSTPSVSAENLSRGEFATPPAAHALAALTPLPQSQSLPQSLPQSLPQPLPLPLPP
HAPLEKRRRKNSTAPRDDIDLSYRHYEGHVKASKGSTGSGSEPSTPPPAHGRAARSPALL
VKQEPDYTQHHSYDQTHLTGMGVNDMASMITQHSMNNDCNESEPVMPPHPDQTDTIDGDK
YADENDIPQEHFGQNITNIENIVKSFRIALNHRSHSPMTCQICGKTVSNIKKHMKSHNPE
QHKCPLCPIILTRADNLKRHLRMKHCSVLGSKGWHMRLTFERVAGALNLHRCKLCGKVVT
HIRNHYHVHFPGRFECPLCRATYTRSDNLRTHCKFKHPAYNPDTRKFEGAPVAVGAGVGV
GGAHGPHAAHAPPPLFANHLDAGFD