New model in OGS2.0 | DPOGS201164  |
---|---|
Genomic Position | scaffold73:+ 170131-174879 |
See gene structure | |
CDS Length | 1827 |
Paired RNAseq reads   | 1545 |
Single RNAseq reads   | 3654 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003961 (0.0) |
Best Drosophila hit   | CG4673, isoform D (3e-149) |
Best Human hit | nuclear protein localization protein 4 homolog (2e-122) |
Best NR hit (blastp)   | PREDICTED: similar to nuclear protein localization [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to nuclear protein localization [Tribolium castaneum] (5e-174) |
GeneOntology terms    | GO:0017056 structural constituent of nuclear pore GO:0005643 nuclear pore GO:0008270 zinc ion binding |
InterPro families    | IPR007717 NPL4 IPR007716 NPL4, zinc-binding putative |
Orthology group | MCL13432 |
Nucleotide sequence:
ATGTCAGGAACAAAAAAAATGACGCTACGGGTCCAGTCGTCGGAGGGCACAGCCCGCGTG
GAGATGCTGGACACCGAGGTCACATCGCGCCTCTACGAGCGAGTCCACGACACTCTCAAC
TTGAACTCATTCGGCTTCGCTTTACATAAAGACCGCGCGCGTAAACAAGAAATTTCGTCT
AGTAAATCTCGTCAACTCCGAGAGTACGGTCTGCAACATGGAGACATGCTCTATTTGAGT
CCTGTCAATGGAACAGTTCTCTTTGACCAGCCTTCTACTAGTTCTGAGCCACTCAACAAA
CCTTTGACAGAGCTATCAACAGAGGCAGGTCCTTCCACAGTGATTCCCTCAAATGCTGTC
AGTAAGGGACCAATAGAACATGAGGTAGATTTACAACTGTACCGTCTCTCGGGCAGCATT
CACCGACAGAGAGATGAAAAATTATGTCGTCACAATTCCAAAGGATGTTGTGTGCACTGT
TCGGCACTGGAGCCCTGGGATGAGGGCTATCTTAAAGAACACAACATCAAACATATGTCA
TTCCACGCCTACCTTCGCAAGATGACATCAGGGAAGTTCATTACACTGGATGAACTGTCA
TGTAAAATAAAGCCAGGCTGCAAGGAACACCCTCCCTGGCCCCGCGGCATCTGCTCGTCG
TGTCAGCCGGGCGCTGTGACGCTCACGAGGCAGCCCTACCGCCATGTGGACAACGTGCTA
CTGGAGCACGCCGCGCCCGTTGAGCGCTTCCTTTCCTACTGGCGCGCCACGGGTCACCAG
CGCGTGGGCTTCCTGTACGGCCGCTACGAGCTCCACCCCGACGTGCCGCTGGGTATTCGC
GCCCGCGTGGCCGCCGTTTACGAGCCGCCTCAGGAGTGCAGCCGGGACGCCGTCCGCCTG
GCGTCGGACGACCACGCCGCGCTCCTCGACCGCCTCGCCGCCCGTCTCGGCCTCGAGCGT
GTCGGCTGGATCTTCACCGACCTGCTACCGTTGGATCTAGTCAGCGGCACGGTGCAGTGT
CTGAGGGGTGTGGACACGCACTTCCTCTCCGCTCAGGAATGTATCACGGCAGGACATTTC
CAGAACGAGCATCCGAACGCGTGTAGGCACGCGTCCTCCGGCTACTTTGGCTCTAAATTC
GTGACGGTGTGCGTGACAGGCGACGCCGACAACCACATCCACTTGGAGGGCTATCAGGTG
TCGGGTCAGTGCGCGGCGCTGGTGAGGGACGGCATCCTACTGCCCACCAGGGACGCTCCC
GAACTCGGATACATTCGGGATTGCTCGCCCGAACAGTACGTGCCTGACGTTTACTATAAG
GAAAAGGATGCGTACGGCAACGAAGTAGGCGTGTCGGCGAAGAGGCTACCGGTGGCTTAT
TTGCTGGTGGACGTGCCGGTGGGCGTGGCGCCCGCAGCAGGCGAGCCCACCTTCGACCCC
CGGGCGTCGTTTCCTCCCGCGCACCGGCCCCTGCAGCAGCACGTGCAGTCCCTGAGCGGC
CTCCACGCGCACGTGGAGCGCGCCGAGTCGTTCCTGGCAGCGGCCTCCGACTTGCACGTG
CTGCTGTTCCTGGCTACCAACGACGCCGCGCCGCTGAGCCTGGAGCAGCTGGCGCCGCTG
CTGGACGCCGTCCGCCGCCGCGACGCGTCCGCGGCCGAGGCGTGGCGCGCGTCGCCCGCG
GCCGCCGCGCTGCTGGCCCCCCGTCTCTTTCTGTGTCAGGTGACTCGTGTCAGTACAGCT
CTTTCTCTCGTGTTTCAGGAACGCCATGTAACGAGCCCGCCGTCGGCCGCCGGTCCTTCC
TCGCCGACGGAATATAGAAATTACTAG
Protein sequence:
MSGTKKMTLRVQSSEGTARVEMLDTEVTSRLYERVHDTLNLNSFGFALHKDRARKQEISS
SKSRQLREYGLQHGDMLYLSPVNGTVLFDQPSTSSEPLNKPLTELSTEAGPSTVIPSNAV
SKGPIEHEVDLQLYRLSGSIHRQRDEKLCRHNSKGCCVHCSALEPWDEGYLKEHNIKHMS
FHAYLRKMTSGKFITLDELSCKIKPGCKEHPPWPRGICSSCQPGAVTLTRQPYRHVDNVL
LEHAAPVERFLSYWRATGHQRVGFLYGRYELHPDVPLGIRARVAAVYEPPQECSRDAVRL
ASDDHAALLDRLAARLGLERVGWIFTDLLPLDLVSGTVQCLRGVDTHFLSAQECITAGHF
QNEHPNACRHASSGYFGSKFVTVCVTGDADNHIHLEGYQVSGQCAALVRDGILLPTRDAP
ELGYIRDCSPEQYVPDVYYKEKDAYGNEVGVSAKRLPVAYLLVDVPVGVAPAAGEPTFDP
RASFPPAHRPLQQHVQSLSGLHAHVERAESFLAAASDLHVLLFLATNDAAPLSLEQLAPL
LDAVRRRDASAAEAWRASPAAAALLAPRLFLCQVTRVSTALSLVFQERHVTSPPSAAGPS
SPTEYRNY