New model in OGS2.0 | DPOGS215007  |
---|---|
Genomic Position | scaffold361:+ 48506-69388 |
See gene structure | |
CDS Length | 2637 |
Paired RNAseq reads   | 1479 |
Single RNAseq reads   | 3808 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012158 (4e-70) |
Best Drosophila hit   | ND |
Best Human hit | AT-rich interactive domain-containing protein 5B (3e-51) |
Best NR hit (blastp)   | PREDICTED: similar to AT-rich interactive domain-containing protein 5B (ARID domain-containing protein 5B) (Mrf1-like) (Modulator recognition factor 2) (MRF-2) [Tribolium castaneum] (6e-91) |
Best NR hit (blastx)   | PREDICTED: similar to AT-rich interactive domain-containing protein 5B (ARID domain-containing protein 5B) (Mrf1-like) (Modulator recognition factor 2) (MRF-2) [Tribolium castaneum] (6e-78) |
GeneOntology terms    | GO:0001822 kidney development GO:0048705 skeletal system morphogenesis GO:0045449 regulation of transcription GO:0010761 fibroblast migration GO:0048644 muscle organ morphogenesis GO:0045892 negative regulation of transcription, DNA-dependent GO:0016564 transcription repressor activity GO:0005622 intracellular GO:0006807 nitrogen compound metabolic process GO:0009791 post-embryonic development GO:0035264 multicellular organism growth GO:0048008 platelet-derived growth factor receptor signaling pathway GO:0060021 palate development GO:0005634 nucleus GO:0003677 DNA binding GO:0060325 face morphogenesis |
InterPro families   | IPR001606 ARID/BRIGHT DNA-binding domain |
Orthology group | MCL20420 |
Nucleotide sequence:
ATGGAGAAGCCGTCTTACAAGCTGGTGGGTGCTCCGTGTGGACATCACGGACAGTACACC
TTCTACAAGGCGATCCGTCTGACGGGACCCCGGGACAGGATCGTCGCTATCGGGGACTTC
TTCTTCGTCAGGATCTGGCAGGACTCAGAACTGGTCTCCATAGGCGAGCTCCAGTTGTTG
TGGACGGACCGCGTGTCGGACCAGACCCTGGTGTCTCTGAGGCTGTACTTCCTCCCGGAG
AACACGCCCGACGGAAGAAACACACATGGGGAGGACGAGGTGCTCGCTATCAATGATAAG
GTGGTCTTGAGGGCGGAGGAGCTCCTCAGCTGGGTGTGCAGCGGCGCGGGCTGGCGGTGG
GGGCTGCGGGCCGTGTGGCGAGGGGCCTGCGCGCCGCCCGCGGAGCCCCGGCACTCCGCC
CCCCTGCACCACACCAAGCTGGACTTCAGCGACGTCGAGAGAGAGAAGAGCTCCATCACG
GTGGACGTGGACGAGCCTGGCGTGGTGGTGTTCTCGTACCCCCGGTACTGCCGGTACCGG
GCCCTGGTCGCTCGCCTGGAGGGCATCCAGGCCGCCTGGCTCAGAGACTCGCTGGTGGCT
GCGCTGGGCGGATACGCCGCGCCCACCAGAAACACTAGGATACTGTACTGTAAGGACACA
TTCGAGTACCCGGAACTGGAAGGTCATGAGTTCGTCTGTAACCACCTAGCTCCTAAACTG
AAAGGTCGTCCGCGGGGTCGGCGGAGGCGGGCGGCGCGGTCGCGCGACCGGTCGCCCGAC
CAGTCGCCCGACTCGCGCTCCAGTGACAGTGACGCTGTCGAGACCCGGACACCTCGGAGG
ATATCACTCCGGAACGGTCCAGAGAAGATCAGCGAAGACGAGGACTCGGAGAAGGACCAG
CAGTTCATGAAACAACTGAAGGAGTTCCTCAAGGAGAAGAATGAAACTGTGAAAGTCCCA
CACAGCTATAAAAATGTATCTCTCCGGTCGCTGTACTCCTGGGTGTGGTCATCCGGGGGG
TTTGCGGCGGCGTGTCGCGCGGGCGCGTGGCGGGAAAGATACCGGGAAAACGCGCCCGCA
CTACGACGGATATACGAGAGATATCTCCTCCAATACGAAAACCACGAGCGGTGGAACGGC
AGGAAGTATCCGAAGATGAACGGCATCATAGACGGAGCTAAGACCATAGACACCATCGAC
GTCACGGACTCCCCGGCGAGGGACACGCCCGGGCACACGGAGCTGCCCGCCAAGACGCTC
CGCACACCCTCGCCCAGACCAGAGAACCTGGTGTTGGACAACGAAACTGGAGAGATCACT
AAAGAACTGAACATTACGTCCAAACCCGCCGAGGAGCTGAACAGGGAGTTCCTGGACTCG
CTGCCCAAAGAAGAGAAACCGGCCAAGATCTTCGTCAAGCCCGTGGAGAAGCTGATAGAG
CCTGGACTGCAGAACAAGATGGGCCTGGACAATGACGGCGTGGGGTCCGCGTTCTTTAAC
GAGCTGGCGCAGAAGTTAAACTTGGGTAACTCCGACACCCGCTTCCTGCAACAGCTGTCG
GCGCCGGACAGCCTCACGAGCCTGTCCTCGCTAGGAGATAAATACACGAACGGGCACATC
AACAGCGACCAGAAACCGCGCAGTTCCCTCCGCGCTGTTCGCGTGAAGACCACCCGAGCA
CCCCCCACCGCCCCCACCACCCCCAACCCCGTGCCCGCGGAGAGCTCGTCGCCGCCCTCC
ATCACCAGCGTCGTCAACAACTTCGGGATCCACCATCCGCCCACGCCGACCGCCAACGAC
GACGACATCGTAGAGGTGCCCTACAAGCCCAAGAGTCCGGAGATCATAGACCTGGACGAG
TACCCGGAGAGTCCCCAGGCCATCAAGAACAAGAAGCTGGACATCCTCAAGGAGCGCGGC
CTGGAGGTGACGGCCGTGCCCCCGGGGCCCGCCTGGCCGCCCGCGCCGCTGCTCCTCAAC
CCCGTGCAGCAGATCATGGGCCAGGCGTCGCTGTTCCAGATGTACAACATCATCCCCAGC
TACCCCAACGGCGCGCCCGCGCCGAAGGTCATCCAGGCGTCCTCGGCGTTCGGCTCGTGC
GGGCCGGAGAAGACGGTGTACGGCAACCCCAAGGACCCCTTCATGCCGCCGCCGCACGTG
CTGCAGGGCACGCCCGTCAAGCCGCAGCGCAGCGTGGCGCCCGCGCCCGCCGCGCCGCTG
GACGTGCTGGACCTCACCTGCAAGACGCGGCCCGGACACAAGCCGGCCGTGGAGATCGTG
CGCCTGCCGCCCGCCCCCCGCCCGCAGAGCCTCGCCAGCAGCTACTCGCTGGTGGACGGC
AAGGCGGTCGTGGGCTCCAACCTGGAGATCACGCTCGTCAACAAGTCGCACACGCCGCCC
CGGAGGCCGCAGAAGAGGTCCTCCAACGGCAAGTTCGTGTCGTCGAAGACTCCGCCGCAG
GAGTCGCCGCCCAAGAAGCCGTCGCCGGCGCGGCCCGAGCCGCCGGTGGAGCCCTACAGC
CTGTTCCTGCGGGGCGCGCCGGGCCTGGACCCGCGCCAGCTGGCGCTGTACCGCGACCTC
GTGGCCGGCCAGCTGCGCTACCCGGGCCTGCTCAGCACGCCCACCACCAAGAACTAA
Protein sequence:
MEKPSYKLVGAPCGHHGQYTFYKAIRLTGPRDRIVAIGDFFFVRIWQDSELVSIGELQLL
WTDRVSDQTLVSLRLYFLPENTPDGRNTHGEDEVLAINDKVVLRAEELLSWVCSGAGWRW
GLRAVWRGACAPPAEPRHSAPLHHTKLDFSDVEREKSSITVDVDEPGVVVFSYPRYCRYR
ALVARLEGIQAAWLRDSLVAALGGYAAPTRNTRILYCKDTFEYPELEGHEFVCNHLAPKL
KGRPRGRRRRAARSRDRSPDQSPDSRSSDSDAVETRTPRRISLRNGPEKISEDEDSEKDQ
QFMKQLKEFLKEKNETVKVPHSYKNVSLRSLYSWVWSSGGFAAACRAGAWRERYRENAPA
LRRIYERYLLQYENHERWNGRKYPKMNGIIDGAKTIDTIDVTDSPARDTPGHTELPAKTL
RTPSPRPENLVLDNETGEITKELNITSKPAEELNREFLDSLPKEEKPAKIFVKPVEKLIE
PGLQNKMGLDNDGVGSAFFNELAQKLNLGNSDTRFLQQLSAPDSLTSLSSLGDKYTNGHI
NSDQKPRSSLRAVRVKTTRAPPTAPTTPNPVPAESSSPPSITSVVNNFGIHHPPTPTAND
DDIVEVPYKPKSPEIIDLDEYPESPQAIKNKKLDILKERGLEVTAVPPGPAWPPAPLLLN
PVQQIMGQASLFQMYNIIPSYPNGAPAPKVIQASSAFGSCGPEKTVYGNPKDPFMPPPHV
LQGTPVKPQRSVAPAPAAPLDVLDLTCKTRPGHKPAVEIVRLPPAPRPQSLASSYSLVDG
KAVVGSNLEITLVNKSHTPPRRPQKRSSNGKFVSSKTPPQESPPKKPSPARPEPPVEPYS
LFLRGAPGLDPRQLALYRDLVAGQLRYPGLLSTPTTKN