DPGLEAN14979 in OGS1.0

Genomic Position  scaffold4772:+ 888-5977
CDS Length  1812
Paired RNAseq reads  491
Single RNAseq reads  1206
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA001917 (7e-153)
Best Drosophila hit  another B-box affiliate, isoform C (9e-30)
Best Human hit  tripartite motif-containing protein 71 (5e-22)
Best NR hit (blastp)  PREDICTED: similar to CG15105 CG15105-PA [Tribolium castaneum] (2e-97)
Best NR hit (blastx)  PREDICTED: similar to CG15105 CG15105-PA [Tribolium castaneum] (7e-95)
GeneOntology terms
GO:0005515 protein binding
GO:0008270 zinc ion binding
InterPro families
IPR001258 NHL repeat
IPR018957 Zinc finger, C3HC4 RING-type
IPR017907 Zinc finger, RING-type, conserved site
IPR001841 Zinc finger, RING-type
IPR013017 NHL repeat, subgroup
IPR013083 Zinc finger, RING/FYVE/PHD-type
IPR011042 Six-bladed beta-propeller, TolB-like
Orthology group  MCL18006

Nucleotide sequence:

ATGGCAGAAAAGGTTAAGAAGAAACCATCGTTTTTTAGTTTGAAACGTTTCAACAGTCAA
AAGAGTATAGATGAACCAGTAACACCAACTCAACCAAGTCCGCCAGCGTTAAAACCAGCG
CAGAGAAGAATGAGTAATACAGACACGAAACCGAGGAAAGAGTCTGCTGCTGGGATTAGA
GAACTAGTGCAATGTTCGTTGTGCTTGGAAATGTTGAGTAATCCCAAAATGTTGCCCTGT
CAACACACCTTCTGTTGTGCGTGTCTCAAAGTATATGTCGCAGATCAACCGGTGATAACT
TGTCCCATATGCTGCACGCGAATACAAGTACTTAGACAAAACTTCGTAGAGGAACTGCCG
TCTAATCTTTACATTGACACATTATTAACCATGTTTGGTGTCGAAAAGAAATCTGATACT
CATACTCCACCGGTGACTCCATATAATCTACAAAGTGTCGATCTGTTTGCTGCTGGAGTT
AGGTGCTCTCAATGTCGGACCATGTGTGATAATTCTGACGTTTCAGCTTGTCAACATTGT
AAATTGAACTTTTGCGGCGTTTGCTGGTCACAACACTTGAAAGACATGAGGTCACAGATT
AGTTCAATTTTGAAGGAATTAGACGCTGCCTCCAGTCGACTCGAACACAAAGTAGAACAT
TATAAGGATCGCTGTGAACGAATCGTTGAACAAATTAACATAGCCGCGGAGGAAAAAATT
AACACAATTTTGGAAAGCAAAGTAAATCTACTAGAAGAAGCCAACAGATTAACTAAATCC
GGAGATCTATCCGCATTAGCTCTAAAATCATCTCTTGAAGAGGCTAGAGATGCATCTATT
CAAACAATGACAAGCGAAGACGATAATGACGTCGAAATGATTAACAAATTTATAAACCTA
CATCAAAACACAATCCAATTGTTATCCGAAGTATCGAAATGGGATGCTGAAAAATTTGTA
TTTGACAAAGAAAACTTTAGCATAGAATCGGATTCGTCCGTGCCTTGTGATGCTGAATCA
GATGATCCTTTGCCCGAAACAGTTAAACAGAATAATCCTATGGAAAATGAGGCCAGCCTT
ATTTTGCACTATAGATTATCATGTGAAGGAATGCTGTGCCCGGTACACATCGCCTTCATG
AAACCTTTAGGAGAGATTTATGTTACAGATAAATGGAAACACTGCATCCACGTGTTCTCT
AAGGACGGTGATTACTTAAGGTCGTTGGGACAGAAAGGCAGTCGTGTGGGCATGTTGAGA
TCTCCTGAAGGCATTGCTACTGATAACATCAGTAATCAGATATATGTCGTGGATACTGGC
AATGATAGGGTCCAGGTATTAGATACGGAAGGTAAATTCATCGACCAATACGGTGTGGCC
ACGAGAGCGCAAACTAGTAACACGGCCAACGTTTGGACACAACAAGAAACTTTGTGTACG
GAATTCAACGCGCCTACAGCTGTAGCAGTGACCAAAGATCGAGTGATTGTATTGGACAGT
GGGAATCGAAGAGTGAAAATATATAACAAACAGGATAAGAATAAAATCACAGAGTTCGGG
TCTTTGGGACACAGGAAGGGACAATTCAGACAGCCAGAAGTATTGACGGTCGATCCCCTG
GGTTTTATACTAGTTGGTGATTCAGGCAACTGTCGAGTCCAAATCTTCAAACCGAACGGA
CAATTGGTCCGGGTTTTCGGAGGTCTAGGTGCTGATCCCGGCAAATTTGGATGGATATCA
GGAATATACGTCACCAAACAGCTAGACATTATAGTCAGCGATACCAAAAATCACAACGTC
AACTTCTTCTAA

Protein sequence:

MAEKVKKKPSFFSLKRFNSQKSIDEPVTPTQPSPPALKPAQRRMSNTDTKPRKESAAGIR
ELVQCSLCLEMLSNPKMLPCQHTFCCACLKVYVADQPVITCPICCTRIQVLRQNFVEELP
SNLYIDTLLTMFGVEKKSDTHTPPVTPYNLQSVDLFAAGVRCSQCRTMCDNSDVSACQHC
KLNFCGVCWSQHLKDMRSQISSILKELDAASSRLEHKVEHYKDRCERIVEQINIAAEEKI
NTILESKVNLLEEANRLTKSGDLSALALKSSLEEARDASIQTMTSEDDNDVEMINKFINL
HQNTIQLLSEVSKWDAEKFVFDKENFSIESDSSVPCDAESDDPLPETVKQNNPMENEASL
ILHYRLSCEGMLCPVHIAFMKPLGEIYVTDKWKHCIHVFSKDGDYLRSLGQKGSRVGMLR
SPEGIATDNISNQIYVVDTGNDRVQVLDTEGKFIDQYGVATRAQTSNTANVWTQQETLCT
EFNAPTAVAVTKDRVIVLDSGNRRVKIYNKQDKNKITEFGSLGHRKGQFRQPEVLTVDPL
GFILVGDSGNCRVQIFKPNGQLVRVFGGLGADPGKFGWISGIYVTKQLDIIVSDTKNHNV
NFF
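
Consistency check (illustrative sketch, not part of the original record):

The nucleotide and protein sequences in this record can be cross-checked by translating the CDS with the standard genetic code. The sketch below assumes the standard code (NCBI translation table 1) applies to this nuclear gene; the function name `translate` is hypothetical and chosen only for this example. A CDS of 1812 nt corresponds to 604 codons, that is 603 residues plus one stop codon, which matches the protein listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the standard code is appropriate for this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks and lower case
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 18 nt and last 12 nt of the CDS listed in this record.
print(translate("ATGGCAGAAAAGGTTAAG"))  # MAEKVK
print(translate("AACTTCTTCTAA"))        # NFF
```

Pasting the full nucleotide sequence into `translate` should reproduce the full protein sequence if the record is internally consistent; only the two ends are shown here.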