DPGLEAN17193 in OGS1.0

New model in OGS2.0DPOGS215007 
Genomic Positionscaffold361:+ 48506-69388
See gene structure
CDS Length2637
Paired RNAseq reads  1479
Single RNAseq reads  3808
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012158 (4e-70)
Best Drosophila hit  ND
Best Human hitAT-rich interactive domain-containing protein 5B (3e-51)
Best NR hit (blastp)  PREDICTED: similar to AT-rich interactive domain-containing protein 5B (ARID domain-containing protein 5B) (Mrf1-like) (Modulator recognition factor 2) (MRF-2) [Tribolium castaneum] (6e-91)
Best NR hit (blastx)  PREDICTED: similar to AT-rich interactive domain-containing protein 5B (ARID domain-containing protein 5B) (Mrf1-like) (Modulator recognition factor 2) (MRF-2) [Tribolium castaneum] (6e-78)
GeneOntology terms














  
GO:0001822 kidney development
GO:0048705 skeletal system morphogenesis
GO:0045449 regulation of transcription
GO:0010761 fibroblast migration
GO:0048644 muscle organ morphogenesis
GO:0045892 negative regulation of transcription, DNA-dependent
GO:0016564 transcription repressor activity
GO:0005622 intracellular
GO:0006807 nitrogen compound metabolic process
GO:0009791 post-embryonic development
GO:0035264 multicellular organism growth
GO:0048008 platelet-derived growth factor receptor signaling pathway
GO:0060021 palate development
GO:0005634 nucleus
GO:0003677 DNA binding
GO:0060325 face morphogenesis
InterPro families  IPR001606 ARID/BRIGHT DNA-binding domain
Orthology groupMCL20420

Nucleotide sequence:

ATGGAGAAGCCGTCTTACAAGCTGGTGGGTGCTCCGTGTGGACATCACGGACAGTACACC
TTCTACAAGGCGATCCGTCTGACGGGACCCCGGGACAGGATCGTCGCTATCGGGGACTTC
TTCTTCGTCAGGATCTGGCAGGACTCAGAACTGGTCTCCATAGGCGAGCTCCAGTTGTTG
TGGACGGACCGCGTGTCGGACCAGACCCTGGTGTCTCTGAGGCTGTACTTCCTCCCGGAG
AACACGCCCGACGGAAGAAACACACATGGGGAGGACGAGGTGCTCGCTATCAATGATAAG
GTGGTCTTGAGGGCGGAGGAGCTCCTCAGCTGGGTGTGCAGCGGCGCGGGCTGGCGGTGG
GGGCTGCGGGCCGTGTGGCGAGGGGCCTGCGCGCCGCCCGCGGAGCCCCGGCACTCCGCC
CCCCTGCACCACACCAAGCTGGACTTCAGCGACGTCGAGAGAGAGAAGAGCTCCATCACG
GTGGACGTGGACGAGCCTGGCGTGGTGGTGTTCTCGTACCCCCGGTACTGCCGGTACCGG
GCCCTGGTCGCTCGCCTGGAGGGCATCCAGGCCGCCTGGCTCAGAGACTCGCTGGTGGCT
GCGCTGGGCGGATACGCCGCGCCCACCAGAAACACTAGGATACTGTACTGTAAGGACACA
TTCGAGTACCCGGAACTGGAAGGTCATGAGTTCGTCTGTAACCACCTAGCTCCTAAACTG
AAAGGTCGTCCGCGGGGTCGGCGGAGGCGGGCGGCGCGGTCGCGCGACCGGTCGCCCGAC
CAGTCGCCCGACTCGCGCTCCAGTGACAGTGACGCTGTCGAGACCCGGACACCTCGGAGG
ATATCACTCCGGAACGGTCCAGAGAAGATCAGCGAAGACGAGGACTCGGAGAAGGACCAG
CAGTTCATGAAACAACTGAAGGAGTTCCTCAAGGAGAAGAATGAAACTGTGAAAGTCCCA
CACAGCTATAAAAATGTATCTCTCCGGTCGCTGTACTCCTGGGTGTGGTCATCCGGGGGG
TTTGCGGCGGCGTGTCGCGCGGGCGCGTGGCGGGAAAGATACCGGGAAAACGCGCCCGCA
CTACGACGGATATACGAGAGATATCTCCTCCAATACGAAAACCACGAGCGGTGGAACGGC
AGGAAGTATCCGAAGATGAACGGCATCATAGACGGAGCTAAGACCATAGACACCATCGAC
GTCACGGACTCCCCGGCGAGGGACACGCCCGGGCACACGGAGCTGCCCGCCAAGACGCTC
CGCACACCCTCGCCCAGACCAGAGAACCTGGTGTTGGACAACGAAACTGGAGAGATCACT
AAAGAACTGAACATTACGTCCAAACCCGCCGAGGAGCTGAACAGGGAGTTCCTGGACTCG
CTGCCCAAAGAAGAGAAACCGGCCAAGATCTTCGTCAAGCCCGTGGAGAAGCTGATAGAG
CCTGGACTGCAGAACAAGATGGGCCTGGACAATGACGGCGTGGGGTCCGCGTTCTTTAAC
GAGCTGGCGCAGAAGTTAAACTTGGGTAACTCCGACACCCGCTTCCTGCAACAGCTGTCG
GCGCCGGACAGCCTCACGAGCCTGTCCTCGCTAGGAGATAAATACACGAACGGGCACATC
AACAGCGACCAGAAACCGCGCAGTTCCCTCCGCGCTGTTCGCGTGAAGACCACCCGAGCA
CCCCCCACCGCCCCCACCACCCCCAACCCCGTGCCCGCGGAGAGCTCGTCGCCGCCCTCC
ATCACCAGCGTCGTCAACAACTTCGGGATCCACCATCCGCCCACGCCGACCGCCAACGAC
GACGACATCGTAGAGGTGCCCTACAAGCCCAAGAGTCCGGAGATCATAGACCTGGACGAG
TACCCGGAGAGTCCCCAGGCCATCAAGAACAAGAAGCTGGACATCCTCAAGGAGCGCGGC
CTGGAGGTGACGGCCGTGCCCCCGGGGCCCGCCTGGCCGCCCGCGCCGCTGCTCCTCAAC
CCCGTGCAGCAGATCATGGGCCAGGCGTCGCTGTTCCAGATGTACAACATCATCCCCAGC
TACCCCAACGGCGCGCCCGCGCCGAAGGTCATCCAGGCGTCCTCGGCGTTCGGCTCGTGC
GGGCCGGAGAAGACGGTGTACGGCAACCCCAAGGACCCCTTCATGCCGCCGCCGCACGTG
CTGCAGGGCACGCCCGTCAAGCCGCAGCGCAGCGTGGCGCCCGCGCCCGCCGCGCCGCTG
GACGTGCTGGACCTCACCTGCAAGACGCGGCCCGGACACAAGCCGGCCGTGGAGATCGTG
CGCCTGCCGCCCGCCCCCCGCCCGCAGAGCCTCGCCAGCAGCTACTCGCTGGTGGACGGC
AAGGCGGTCGTGGGCTCCAACCTGGAGATCACGCTCGTCAACAAGTCGCACACGCCGCCC
CGGAGGCCGCAGAAGAGGTCCTCCAACGGCAAGTTCGTGTCGTCGAAGACTCCGCCGCAG
GAGTCGCCGCCCAAGAAGCCGTCGCCGGCGCGGCCCGAGCCGCCGGTGGAGCCCTACAGC
CTGTTCCTGCGGGGCGCGCCGGGCCTGGACCCGCGCCAGCTGGCGCTGTACCGCGACCTC
GTGGCCGGCCAGCTGCGCTACCCGGGCCTGCTCAGCACGCCCACCACCAAGAACTAA

Protein sequence:

MEKPSYKLVGAPCGHHGQYTFYKAIRLTGPRDRIVAIGDFFFVRIWQDSELVSIGELQLL
WTDRVSDQTLVSLRLYFLPENTPDGRNTHGEDEVLAINDKVVLRAEELLSWVCSGAGWRW
GLRAVWRGACAPPAEPRHSAPLHHTKLDFSDVEREKSSITVDVDEPGVVVFSYPRYCRYR
ALVARLEGIQAAWLRDSLVAALGGYAAPTRNTRILYCKDTFEYPELEGHEFVCNHLAPKL
KGRPRGRRRRAARSRDRSPDQSPDSRSSDSDAVETRTPRRISLRNGPEKISEDEDSEKDQ
QFMKQLKEFLKEKNETVKVPHSYKNVSLRSLYSWVWSSGGFAAACRAGAWRERYRENAPA
LRRIYERYLLQYENHERWNGRKYPKMNGIIDGAKTIDTIDVTDSPARDTPGHTELPAKTL
RTPSPRPENLVLDNETGEITKELNITSKPAEELNREFLDSLPKEEKPAKIFVKPVEKLIE
PGLQNKMGLDNDGVGSAFFNELAQKLNLGNSDTRFLQQLSAPDSLTSLSSLGDKYTNGHI
NSDQKPRSSLRAVRVKTTRAPPTAPTTPNPVPAESSSPPSITSVVNNFGIHHPPTPTAND
DDIVEVPYKPKSPEIIDLDEYPESPQAIKNKKLDILKERGLEVTAVPPGPAWPPAPLLLN
PVQQIMGQASLFQMYNIIPSYPNGAPAPKVIQASSAFGSCGPEKTVYGNPKDPFMPPPHV
LQGTPVKPQRSVAPAPAAPLDVLDLTCKTRPGHKPAVEIVRLPPAPRPQSLASSYSLVDG
KAVVGSNLEITLVNKSHTPPRRPQKRSSNGKFVSSKTPPQESPPKKPSPARPEPPVEPYS
LFLRGAPGLDPRQLALYRDLVAGQLRYPGLLSTPTTKN