DPGLEAN04330 in OGS1.0

New model in OGS2.0  DPOGS201259
Genomic Position  scaffold36:+ 136735-145405
CDS Length  3057
Paired RNAseq reads  3895
Single RNAseq reads  9652
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA012479 (0.0)
Best Drosophila hit  tonalli, isoform D (1e-109)
Best Human hit  zinc finger MIZ domain-containing protein 1 (3e-132)
Best NR hit (blastp)  PREDICTED: similar to sumo ligase [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to sumo ligase [Tribolium castaneum] (0.0)
GeneOntology terms
GO:0007296 vitellogenesis
GO:0005737 cytoplasm
GO:0007569 cell aging
GO:0048146 positive regulation of fibroblast proliferation
GO:0016607 nuclear speck
GO:0048589 developmental growth
GO:0048844 artery morphogenesis
GO:0045449 regulation of transcription
GO:0046872 metal ion binding
GO:0003007 heart morphogenesis
GO:0005515 protein binding
GO:0005634 nucleus
GO:0001701 in utero embryonic development
GO:0045944 positive regulation of transcription from RNA polymerase II promoter
GO:0008270 zinc ion binding
GO:0001570 vasculogenesis
InterPro families
IPR013083 Zinc finger, RING/FYVE/PHD-type
IPR004181 Zinc finger, MIZ-type
Orthology group  MCL11617
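
The Genomic Position field in this record packs scaffold, strand, and coordinates into one string. A minimal sketch of splitting it into parts follows. The helper name parse_location and the assumption that the coordinates are 1-based and inclusive are illustrative only and are not stated by the record.

```python
import re

# Hypothetical helper, not part of the source record.
# Expected form: "<scaffold>:<strand> <start>-<end>", e.g. "scaffold36:+ 136735-145405".
LOCATION_RE = re.compile(r"^([^:\s]+):([+-])\s*(\d+)-(\d+)$")


def parse_location(text):
    """Split a genomic position string into scaffold, strand, start, end, span."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    scaffold, strand, start, end = match.groups()
    start, end = int(start), int(end)
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return {
        "scaffold": scaffold,
        "strand": strand,
        "start": start,
        "end": end,
        # Assumption: coordinates are 1-based and inclusive.
        "span": end - start + 1,
    }


loc = parse_location("scaffold36:+ 136735-145405")
print(loc["scaffold"], loc["strand"], loc["span"])
```

Under that assumption the gene model spans 8671 bp on the scaffold, which is longer than the 3057 nt CDS listed above.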

Nucleotide sequence:

ATGTTGAGCGAGCGAGCGTGGCGGGAGAGGGGAAGGGGGTACGGAGCGAGCGGCAACGGG
ACGACAAGACCCGAGCACCGCCGAGGGCCGAATGCCGCGACGTGGTCGTACGGAGGGCGA
GCGAGGAGCGGCGGCGGTGGCGGGCGGGGTGGAGGAGGGGTGCGAGCGGGGGTGCGCCAC
TTTATCGCGGCGAGCTGGCAACGCCGCGCGGGGCGGTTGGTGGAAGGGAGGGGGGAGAGG
TCCGCCAGCCGCCAGCCGCCGGCCGCCAGCCGCCAGCCAGTGTTACCTTTGACGGTGGTT
GGAGCGGTCGCCGGACGGTGGACCGTAGTGATGTCGCCGTCGGCACTCGCGCTGCACGTC
TGTCATGTTGAAGTCGACTGTCGACTGTCGGGCTCGTTTTGTTACCATTTGGATTTTTTT
CACGGTGATTTAGAGTGTGGATTTTGCGACAAAATAAATGCCAGTGACTTATCGAGTTTA
CAGAGTGGATTTGTGTTAGTTACCGAGGGGGTGCGTTCAGACGACAGGACGGCGTACACG
GCTAACACGACTGGCTACCCGGGACAGGGCTACGGTTACTCTGACAGACATCACCAACAT
AACATAGGACACAGAGGAAACGCGCCAAGCGGCTATGGGTGCCCCGGCGGAGCGGGCGCG
ATGAGCGGCGGAGGCGAGAACGCGCAGTTCGGCGCCACCGCCGCGATGGTCGCCGCGGCC
ACCACGGCCGTGATGCAGGACTCGCAGCCCTTCTCACAGATGCAAAACAACATGACGATG
GGAAATCCCCAGTACAGTGCGATGAATGGCTACGGTCAACAACGCAGCCACAATCCCGCG
ATGACGGGAATGGGGATGGGTGGCAACGGCGGGATGAACGGAATGACCGGCATGGGACAG
ATGGGGAACGCCATGACCGGCATGAACCCCATGGCTCAGATGGCCAACATGGGGATGCAC
GCGAATATGATCTCCTCTCAGATGGGACCGGGGCAGATGGGGAACTCCGCGAAGATGGGC
CCGGGGTATCAGAGGCGGCACACGCCATACCCCTCGGGGACGATGATTATGGGTTCCCGA
AAGACCCAATACATGGGCGGCCAACCCGGCTTCGGCCCGGCTCCCGCCCAGTACCAGACG
GTCAGTACCCGAACGGGTTACGGCGGACGGCCGGGCTTCCAGAGTCAGTACCCGCCGCAG
CAGCCGCTCGGAGCCAGTGGGAACTTCGGATCGGGTATGAGAGGCACCATGAGACAATCG
ACTCCGCCGTACTCTAACCAGGGACAGTATTTTAACGGAGGAGTTCCTAATCAATTCCCA
CAGCATCAGGCCGGTAACGGCCAGTATGGAGCGCAGTACGGCGGACAGTTCGCGCAGGAA
GTGGCCATGAGATCTAACATGAGCTACCAGCACAGTCCGGTCCCCGGTAACCCGACGCCT
CCTCTGACGCCGGCCAGTAGTATGCCGCCCTACATCAGCCCCAACGCTGACGTCAAACCT
CACTTTAACGAGCTCAAACCACCGATGGGCATGCAGAATGACGAGCTCCGGTTAACATTC
CCCGTAAGAGATGGTATCATATTACCACCTTTTAGATTAGAACATAATTTAGCAGTTAGC
AATCACGTATTCCAATTAAAGCCTACGGTACATTCAACATTAATATGGAGATCTGATCTG
GAGCTGCAGCTCAAGTGCTTTCATCACGAAGACAGACAGATGAACACCAACTGGCCGGCG
AGCGTGCAGGTGTCCGTGAACGCCACGCCGCTCGTCATCGACCGAGGAGAGAACAAGACG
TCACACAAACCGCTGTACCTGAAGGAGGTCTGCCAGCCCGGGCGGAACACCATACAGATC
ACCGTCTCCGCCTGCTGCTGTTCGCACCTCTTCGTATTACAATTAGTCCACCGGCCGAGT
GTGAGGAGCGTGCTGCAAGGATTGCTGAGGAAGCGGCTGCTGACGGCCGACCACTGTATC
GCTAAGATCAAGATGAACTTCAACCAGTCGCCGGCTCAGAACAACAGCTCGAGCGCGCCC
AGCGACAGGGACGGCGTGGAACAGACGGCGCTTAAAGTGTCGTTGAAATGTCCGATAACC
TTCAAGAAGATCACCCTCCCGGCCCGGGGCCACGAGTGTAAACACATACAGTGCTTCGAC
TTAGAGTCGTATCTGCAACTAAACTGCGAGAGGGGCTCGTGGCGATGTCCGGTCTGCAAT
AAGCCAGCTCAGCTGGAAGGGTTAGAAGTGGATCAGTACATGTGGGGCATCCTGAACACC
TTGAACACGTCTGACGTCGACGAAGTGACCATCGACAGCGGGGCCAACTGGAAAGCTACC
AAGATATCCGCCAACCCCGGCATCAAGCAAGAAGACGACAGCAACGACAACAGTGGGAAG
AGAAGCAAAGCGGTGTCCCCGGGCTCCATGAACATGCCCACCATGAACAACTGGGACATG
ACCCAGGCTCTGTCGCCCTACCTGCCGCCCGACATGAACACCATCGCCAGCGGGTCCATG
ATCTCATACAACCAGGGAGGACAGAACAGGAACTCGGGCTCCAGCAACAACAACTACGAC
TTCGGCATCAACAGCGGACCCAACAGCAACGAGTTCGCCGGCAACGGACCGCTCGCACAC
CTCAACGACAGCGTTAACTCGCTCGACCCCTTGAACGCCATGGAGAAGACCCTCAACGAA
CAGATGCCCCACACACCCCACACCCCCCACACGCCGGGGTCCGCTCACACCCCGGGCGGA
GGAGCGACCCCGGGCTCCAGTCACACGGGGCCCCCCTCGGTGGACCGGCACCCCCTCACG
GACGTCGACATCCCGGCCGACCTGAACTTCGACCCCGCAGCGGTCATCGACGGAGAGGGC
ACCGACAACTTGAATCTGTTGCCGGAGACCAGCGTGGACCCCATGGAGCTGCTGTCGTAC
CTGGAGGCGCCGGCGCTGGGCGAGCTGCTGGCCACTCCGCCGTCGTCGTCGTCGTCCGCG
GGCAGCCTGGCGCCGCGCGCGCCGTCCTCCGACGACCTCCTCGCTCTGTTCGAGTGA

Protein sequence:

MLSERAWRERGRGYGASGNGTTRPEHRRGPNAATWSYGGRARSGGGGGRGGGGVRAGVRH
FIAASWQRRAGRLVEGRGERSASRQPPAASRQPVLPLTVVGAVAGRWTVVMSPSALALHV
CHVEVDCRLSGSFCYHLDFFHGDLECGFCDKINASDLSSLQSGFVLVTEGVRSDDRTAYT
ANTTGYPGQGYGYSDRHHQHNIGHRGNAPSGYGCPGGAGAMSGGGENAQFGATAAMVAAA
TTAVMQDSQPFSQMQNNMTMGNPQYSAMNGYGQQRSHNPAMTGMGMGGNGGMNGMTGMGQ
MGNAMTGMNPMAQMANMGMHANMISSQMGPGQMGNSAKMGPGYQRRHTPYPSGTMIMGSR
KTQYMGGQPGFGPAPAQYQTVSTRTGYGGRPGFQSQYPPQQPLGASGNFGSGMRGTMRQS
TPPYSNQGQYFNGGVPNQFPQHQAGNGQYGAQYGGQFAQEVAMRSNMSYQHSPVPGNPTP
PLTPASSMPPYISPNADVKPHFNELKPPMGMQNDELRLTFPVRDGIILPPFRLEHNLAVS
NHVFQLKPTVHSTLIWRSDLELQLKCFHHEDRQMNTNWPASVQVSVNATPLVIDRGENKT
SHKPLYLKEVCQPGRNTIQITVSACCCSHLFVLQLVHRPSVRSVLQGLLRKRLLTADHCI
AKIKMNFNQSPAQNNSSSAPSDRDGVEQTALKVSLKCPITFKKITLPARGHECKHIQCFD
LESYLQLNCERGSWRCPVCNKPAQLEGLEVDQYMWGILNTLNTSDVDEVTIDSGANWKAT
KISANPGIKQEDDSNDNSGKRSKAVSPGSMNMPTMNNWDMTQALSPYLPPDMNTIASGSM
ISYNQGGQNRNSGSSNNNYDFGINSGPNSNEFAGNGPLAHLNDSVNSLDPLNAMEKTLNE
QMPHTPHTPHTPGSAHTPGGGATPGSSHTGPPSVDRHPLTDVDIPADLNFDPAAVIDGEG
TDNLNLLPETSVDPMELLSYLEAPALGELLATPPSSSSSAGSLAPRAPSSDDLLALFE
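
The CDS length and the two sequence listings can be cross-checked with simple arithmetic. The sketch below uses only the CDS Length value (3057) and the first and last nine nucleotides copied from the nucleotide sequence above. The function name expected_protein_length is hypothetical, and the check assumes the standard stop codons.

```python
# Standard stop codons (assumption: the standard genetic code applies).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def expected_protein_length(cds_length, includes_stop=True):
    """Return the number of residues encoded by a CDS of the given length."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    # A terminal stop codon is counted in the CDS but encodes no residue.
    return codons - 1 if includes_stop else codons


CDS_LENGTH = 3057        # "CDS Length" field of this record
CDS_HEAD = "ATGTTGAGC"   # first 9 nt of the nucleotide sequence
CDS_TAIL = "TTCGAGTGA"   # last 9 nt of the nucleotide sequence

starts_with_atg = CDS_HEAD[:3] == "ATG"
ends_with_stop = CDS_TAIL[-3:] in STOP_CODONS
protein_length = expected_protein_length(CDS_LENGTH, includes_stop=ends_with_stop)

print(starts_with_atg, ends_with_stop, protein_length)
```

Because the listing begins with ATG and ends with the stop codon TGA, 3057 nt corresponds to 1019 codons and an expected protein of 1018 residues.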