New model in OGS2.0 | DPOGS201259  |
---|---|
Genomic Position | scaffold36:+ 136735-145405 |
See gene structure | |
CDS Length | 3057 |
Paired RNAseq reads   | 3895 |
Single RNAseq reads   | 9652 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012479 (0.0) |
Best Drosophila hit   | tonalli, isoform D (1e-109) |
Best Human hit | zinc finger MIZ domain-containing protein 1 (3e-132) |
Best NR hit (blastp)   | PREDICTED: similar to sumo ligase [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to sumo ligase [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0007296 vitellogenesis GO:0005737 cytoplasm GO:0007569 cell aging GO:0048146 positive regulation of fibroblast proliferation GO:0016607 nuclear speck GO:0048589 developmental growth GO:0048844 artery morphogenesis GO:0045449 regulation of transcription GO:0046872 metal ion binding GO:0003007 heart morphogenesis GO:0005515 protein binding GO:0005634 nucleus GO:0001701 in utero embryonic development GO:0045944 positive regulation of transcription from RNA polymerase II promoter GO:0008270 zinc ion binding GO:0001570 vasculogenesis |
InterPro families    | IPR013083 Zinc finger, RING/FYVE/PHD-type IPR004181 Zinc finger, MIZ-type |
Orthology group | MCL11617 |
Nucleotide sequence:
ATGTTGAGCGAGCGAGCGTGGCGGGAGAGGGGAAGGGGGTACGGAGCGAGCGGCAACGGG
ACGACAAGACCCGAGCACCGCCGAGGGCCGAATGCCGCGACGTGGTCGTACGGAGGGCGA
GCGAGGAGCGGCGGCGGTGGCGGGCGGGGTGGAGGAGGGGTGCGAGCGGGGGTGCGCCAC
TTTATCGCGGCGAGCTGGCAACGCCGCGCGGGGCGGTTGGTGGAAGGGAGGGGGGAGAGG
TCCGCCAGCCGCCAGCCGCCGGCCGCCAGCCGCCAGCCAGTGTTACCTTTGACGGTGGTT
GGAGCGGTCGCCGGACGGTGGACCGTAGTGATGTCGCCGTCGGCACTCGCGCTGCACGTC
TGTCATGTTGAAGTCGACTGTCGACTGTCGGGCTCGTTTTGTTACCATTTGGATTTTTTT
CACGGTGATTTAGAGTGTGGATTTTGCGACAAAATAAATGCCAGTGACTTATCGAGTTTA
CAGAGTGGATTTGTGTTAGTTACCGAGGGGGTGCGTTCAGACGACAGGACGGCGTACACG
GCTAACACGACTGGCTACCCGGGACAGGGCTACGGTTACTCTGACAGACATCACCAACAT
AACATAGGACACAGAGGAAACGCGCCAAGCGGCTATGGGTGCCCCGGCGGAGCGGGCGCG
ATGAGCGGCGGAGGCGAGAACGCGCAGTTCGGCGCCACCGCCGCGATGGTCGCCGCGGCC
ACCACGGCCGTGATGCAGGACTCGCAGCCCTTCTCACAGATGCAAAACAACATGACGATG
GGAAATCCCCAGTACAGTGCGATGAATGGCTACGGTCAACAACGCAGCCACAATCCCGCG
ATGACGGGAATGGGGATGGGTGGCAACGGCGGGATGAACGGAATGACCGGCATGGGACAG
ATGGGGAACGCCATGACCGGCATGAACCCCATGGCTCAGATGGCCAACATGGGGATGCAC
GCGAATATGATCTCCTCTCAGATGGGACCGGGGCAGATGGGGAACTCCGCGAAGATGGGC
CCGGGGTATCAGAGGCGGCACACGCCATACCCCTCGGGGACGATGATTATGGGTTCCCGA
AAGACCCAATACATGGGCGGCCAACCCGGCTTCGGCCCGGCTCCCGCCCAGTACCAGACG
GTCAGTACCCGAACGGGTTACGGCGGACGGCCGGGCTTCCAGAGTCAGTACCCGCCGCAG
CAGCCGCTCGGAGCCAGTGGGAACTTCGGATCGGGTATGAGAGGCACCATGAGACAATCG
ACTCCGCCGTACTCTAACCAGGGACAGTATTTTAACGGAGGAGTTCCTAATCAATTCCCA
CAGCATCAGGCCGGTAACGGCCAGTATGGAGCGCAGTACGGCGGACAGTTCGCGCAGGAA
GTGGCCATGAGATCTAACATGAGCTACCAGCACAGTCCGGTCCCCGGTAACCCGACGCCT
CCTCTGACGCCGGCCAGTAGTATGCCGCCCTACATCAGCCCCAACGCTGACGTCAAACCT
CACTTTAACGAGCTCAAACCACCGATGGGCATGCAGAATGACGAGCTCCGGTTAACATTC
CCCGTAAGAGATGGTATCATATTACCACCTTTTAGATTAGAACATAATTTAGCAGTTAGC
AATCACGTATTCCAATTAAAGCCTACGGTACATTCAACATTAATATGGAGATCTGATCTG
GAGCTGCAGCTCAAGTGCTTTCATCACGAAGACAGACAGATGAACACCAACTGGCCGGCG
AGCGTGCAGGTGTCCGTGAACGCCACGCCGCTCGTCATCGACCGAGGAGAGAACAAGACG
TCACACAAACCGCTGTACCTGAAGGAGGTCTGCCAGCCCGGGCGGAACACCATACAGATC
ACCGTCTCCGCCTGCTGCTGTTCGCACCTCTTCGTATTACAATTAGTCCACCGGCCGAGT
GTGAGGAGCGTGCTGCAAGGATTGCTGAGGAAGCGGCTGCTGACGGCCGACCACTGTATC
GCTAAGATCAAGATGAACTTCAACCAGTCGCCGGCTCAGAACAACAGCTCGAGCGCGCCC
AGCGACAGGGACGGCGTGGAACAGACGGCGCTTAAAGTGTCGTTGAAATGTCCGATAACC
TTCAAGAAGATCACCCTCCCGGCCCGGGGCCACGAGTGTAAACACATACAGTGCTTCGAC
TTAGAGTCGTATCTGCAACTAAACTGCGAGAGGGGCTCGTGGCGATGTCCGGTCTGCAAT
AAGCCAGCTCAGCTGGAAGGGTTAGAAGTGGATCAGTACATGTGGGGCATCCTGAACACC
TTGAACACGTCTGACGTCGACGAAGTGACCATCGACAGCGGGGCCAACTGGAAAGCTACC
AAGATATCCGCCAACCCCGGCATCAAGCAAGAAGACGACAGCAACGACAACAGTGGGAAG
AGAAGCAAAGCGGTGTCCCCGGGCTCCATGAACATGCCCACCATGAACAACTGGGACATG
ACCCAGGCTCTGTCGCCCTACCTGCCGCCCGACATGAACACCATCGCCAGCGGGTCCATG
ATCTCATACAACCAGGGAGGACAGAACAGGAACTCGGGCTCCAGCAACAACAACTACGAC
TTCGGCATCAACAGCGGACCCAACAGCAACGAGTTCGCCGGCAACGGACCGCTCGCACAC
CTCAACGACAGCGTTAACTCGCTCGACCCCTTGAACGCCATGGAGAAGACCCTCAACGAA
CAGATGCCCCACACACCCCACACCCCCCACACGCCGGGGTCCGCTCACACCCCGGGCGGA
GGAGCGACCCCGGGCTCCAGTCACACGGGGCCCCCCTCGGTGGACCGGCACCCCCTCACG
GACGTCGACATCCCGGCCGACCTGAACTTCGACCCCGCAGCGGTCATCGACGGAGAGGGC
ACCGACAACTTGAATCTGTTGCCGGAGACCAGCGTGGACCCCATGGAGCTGCTGTCGTAC
CTGGAGGCGCCGGCGCTGGGCGAGCTGCTGGCCACTCCGCCGTCGTCGTCGTCGTCCGCG
GGCAGCCTGGCGCCGCGCGCGCCGTCCTCCGACGACCTCCTCGCTCTGTTCGAGTGA
Protein sequence:
MLSERAWRERGRGYGASGNGTTRPEHRRGPNAATWSYGGRARSGGGGGRGGGGVRAGVRH
FIAASWQRRAGRLVEGRGERSASRQPPAASRQPVLPLTVVGAVAGRWTVVMSPSALALHV
CHVEVDCRLSGSFCYHLDFFHGDLECGFCDKINASDLSSLQSGFVLVTEGVRSDDRTAYT
ANTTGYPGQGYGYSDRHHQHNIGHRGNAPSGYGCPGGAGAMSGGGENAQFGATAAMVAAA
TTAVMQDSQPFSQMQNNMTMGNPQYSAMNGYGQQRSHNPAMTGMGMGGNGGMNGMTGMGQ
MGNAMTGMNPMAQMANMGMHANMISSQMGPGQMGNSAKMGPGYQRRHTPYPSGTMIMGSR
KTQYMGGQPGFGPAPAQYQTVSTRTGYGGRPGFQSQYPPQQPLGASGNFGSGMRGTMRQS
TPPYSNQGQYFNGGVPNQFPQHQAGNGQYGAQYGGQFAQEVAMRSNMSYQHSPVPGNPTP
PLTPASSMPPYISPNADVKPHFNELKPPMGMQNDELRLTFPVRDGIILPPFRLEHNLAVS
NHVFQLKPTVHSTLIWRSDLELQLKCFHHEDRQMNTNWPASVQVSVNATPLVIDRGENKT
SHKPLYLKEVCQPGRNTIQITVSACCCSHLFVLQLVHRPSVRSVLQGLLRKRLLTADHCI
AKIKMNFNQSPAQNNSSSAPSDRDGVEQTALKVSLKCPITFKKITLPARGHECKHIQCFD
LESYLQLNCERGSWRCPVCNKPAQLEGLEVDQYMWGILNTLNTSDVDEVTIDSGANWKAT
KISANPGIKQEDDSNDNSGKRSKAVSPGSMNMPTMNNWDMTQALSPYLPPDMNTIASGSM
ISYNQGGQNRNSGSSNNNYDFGINSGPNSNEFAGNGPLAHLNDSVNSLDPLNAMEKTLNE
QMPHTPHTPHTPGSAHTPGGGATPGSSHTGPPSVDRHPLTDVDIPADLNFDPAAVIDGEG
TDNLNLLPETSVDPMELLSYLEAPALGELLATPPSSSSSAGSLAPRAPSSDDLLALFE