DPGLEAN21443 in OGS1.0

New model in OGS2.0DPOGS213467 
Genomic Positionscaffold738:- 25141-57161
See gene structure
CDS Length1704
Paired RNAseq reads  134
Single RNAseq reads  332
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004491 (2e-43)
Best Drosophila hit  AP-2, isoform A (1e-97)
Best Human hittranscription factor AP-2-alpha isoform c (7e-84)
Best NR hit (blastp)  transcription factor ap-2 [Aedes aegypti] (8e-128)
Best NR hit (blastx)  AP-2 [Bombyx mori] (4e-116)
GeneOntology terms










  
GO:0007422 peripheral nervous system development
GO:0030902 hindbrain development
GO:0030182 neuron differentiation
GO:0001501 skeletal system development
GO:0030318 melanocyte differentiation
GO:0050935 iridophore differentiation
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005634 nucleus
GO:0006355 regulation of transcription, DNA-dependent
GO:0014036 neural crest cell fate specification
GO:0014032 neural crest cell development
GO:0051216 cartilage development
InterPro families
  
IPR013854 Transcription factor AP-2, C-terminal
IPR004979 Transcription factor AP-2
Orthology groupMCL11946

Nucleotide sequence:

ATGAACGTTTACATGCAGTGCCATCGTGAGTCTGTAAGTCTCCAAGCGCCCACTCCATGT
GATAGCCTCGAGCGACGCGCGCTCACGTGCTGCGCGGGAGATTTGAATTTGGAACACGAC
AAAGACATCAAACGTAATATAAATATAAGTTTAGAGAAAGACAGCAGCAAAACAATAAGA
TTGGAGCTACGTCACCCGGATCATGTTGTAATGGATTATAAGATTGACGGACAAATGGAC
GTCGCATTGGTTGGAATTAATGAAACTGAAGACAAACACGAGGACACTAACCAAAACGGA
AATGAGGAAAAAAAGAGACGAAAGAGGAAACCTCTCTTCATTCCATGGATTCACCATGCG
GGTGCCGGGTCCAGGTTGAAGGGAGAGAAGCTTCAACAGCCCGGGACTTGCGACCCAGGT
GTAGCTTTAAGGGGCCCCTATGTAATAGTAGGTGGCTTTCACATTTCACAAGTGTTGTAT
TACATTGAAGACATCGAAACAGAGCTGGCGTACGTTGATTTATCTAGCGTGATAGAGATC
CAGGAGCGTCTGGGCGGCGGCGGGCTGGGTTTGGGCGGCGGAGGCTTCCGCGGCGCTCAA
CCCTCGCTCGCCGACTTCCAGCCGCCATACTTCCCGCCGCCCTTCGCCCCTAGCGCACAT
CCTGCGAGTCCGCACCACCAACAACAGAGCCATGGCATGGAGTATTCTGGAGGTCCGGAG
TACGGGCAGCACTACGCGCCGCAGCAGCTCCTACCACGACACCACGGACACGAGCCTCCC
CATCTCAGACATCATAGAGACCATCACGACGTACACTCCCACCACCTACCTCACGGTGGA
TTCAGTTACGACAGGAGGACGGACTACGGAGCCCGGGAGCAACACGACCTCGCCCTTCAT
CACGCGTTACACACGGACGAGACACAGAATGCAGGCATGGACGATACGACGGGCTTCATG
ACCGACCTTCCTTTATTAAAAACAATGAAAGCCCGCGATGTAGGGACAGGTGCCTGCGCC
CCCAGCGACGTGTTCTGCTCTGTACCAGGGAGACTCTCTCTCCTGTCTTCGACCAGCAAG
TACAAGGTCACCGTCGCCGAGGTCCAGCGAAGGCTCTCACCACCAGAGTGCCTGAACGCG
TCACTACTCGGAGGTGTACTGAGAAGAGCAAAAAGCAAAAATGGCGGTAGGTTACTTAGG
GAAAAACTAGAGAAAATCGGTCTGAACCTTCCAGCGGGGCGACGGAAAGCGGCTAACGTG
ACGCTACTCACGTCATTAGTAGAAGCCGAGGCTGTTCATTTGGCGCGTGATTTTGGTTAC
GTCTGCGAGACTGAGTTCCCGGCCCGAGCGCTCGCGGAATACCTCGCGAGACAATACGCT
GAACACGACGCCAGACGACGCAGGGACCTGTTACACGCCACCAAACAGGTGGTGAAGGAG
GTGATGGACCTATTGAACCAGGACCGTTCTCCTCTGTGTAACACGCGACCCCCTCACCTC
TTGGAGCCGGCCATACAGCGGCACCTCACACACTTCTCTCTCATATCACACGGTTTCGGT
GGACCGGCCATCGTCGCCGCACTGACAGCCATACAGAATTTCCTAAACGAGTCGTTAAAG
CATTTAGACAAGTTATATCCACAGAGCGGGATGGTGTCGTCGACAATGGACAAGACAAAA
ATGGATCCCGACATCAAAAAGTAG

Protein sequence:

MNVYMQCHRESVSLQAPTPCDSLERRALTCCAGDLNLEHDKDIKRNINISLEKDSSKTIR
LELRHPDHVVMDYKIDGQMDVALVGINETEDKHEDTNQNGNEEKKRRKRKPLFIPWIHHA
GAGSRLKGEKLQQPGTCDPGVALRGPYVIVGGFHISQVLYYIEDIETELAYVDLSSVIEI
QERLGGGGLGLGGGGFRGAQPSLADFQPPYFPPPFAPSAHPASPHHQQQSHGMEYSGGPE
YGQHYAPQQLLPRHHGHEPPHLRHHRDHHDVHSHHLPHGGFSYDRRTDYGAREQHDLALH
HALHTDETQNAGMDDTTGFMTDLPLLKTMKARDVGTGACAPSDVFCSVPGRLSLLSSTSK
YKVTVAEVQRRLSPPECLNASLLGGVLRRAKSKNGGRLLREKLEKIGLNLPAGRRKAANV
TLLTSLVEAEAVHLARDFGYVCETEFPARALAEYLARQYAEHDARRRRDLLHATKQVVKE
VMDLLNQDRSPLCNTRPPHLLEPAIQRHLTHFSLISHGFGGPAIVAALTAIQNFLNESLK
HLDKLYPQSGMVSSTMDKTKMDPDIKK