DPGLEAN04992 in OGS1.0

New model in OGS2.0  DPOGS201164
Genomic Position  scaffold73:+ 170131-174879
CDS Length  1827
Paired RNAseq reads  1545
Single RNAseq reads  3654
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA003961 (0.0)
Best Drosophila hit  CG4673, isoform D (3e-149)
Best Human hit  nuclear protein localization protein 4 homolog (2e-122)
Best NR hit (blastp)  PREDICTED: similar to nuclear protein localization [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to nuclear protein localization [Tribolium castaneum] (5e-174)
GeneOntology terms
GO:0017056 structural constituent of nuclear pore
GO:0005643 nuclear pore
GO:0008270 zinc ion binding
InterPro families
IPR007717 NPL4
IPR007716 NPL4, zinc-binding putative
Orthology group  MCL13432

Nucleotide sequence:

ATGTCAGGAACAAAAAAAATGACGCTACGGGTCCAGTCGTCGGAGGGCACAGCCCGCGTG
GAGATGCTGGACACCGAGGTCACATCGCGCCTCTACGAGCGAGTCCACGACACTCTCAAC
TTGAACTCATTCGGCTTCGCTTTACATAAAGACCGCGCGCGTAAACAAGAAATTTCGTCT
AGTAAATCTCGTCAACTCCGAGAGTACGGTCTGCAACATGGAGACATGCTCTATTTGAGT
CCTGTCAATGGAACAGTTCTCTTTGACCAGCCTTCTACTAGTTCTGAGCCACTCAACAAA
CCTTTGACAGAGCTATCAACAGAGGCAGGTCCTTCCACAGTGATTCCCTCAAATGCTGTC
AGTAAGGGACCAATAGAACATGAGGTAGATTTACAACTGTACCGTCTCTCGGGCAGCATT
CACCGACAGAGAGATGAAAAATTATGTCGTCACAATTCCAAAGGATGTTGTGTGCACTGT
TCGGCACTGGAGCCCTGGGATGAGGGCTATCTTAAAGAACACAACATCAAACATATGTCA
TTCCACGCCTACCTTCGCAAGATGACATCAGGGAAGTTCATTACACTGGATGAACTGTCA
TGTAAAATAAAGCCAGGCTGCAAGGAACACCCTCCCTGGCCCCGCGGCATCTGCTCGTCG
TGTCAGCCGGGCGCTGTGACGCTCACGAGGCAGCCCTACCGCCATGTGGACAACGTGCTA
CTGGAGCACGCCGCGCCCGTTGAGCGCTTCCTTTCCTACTGGCGCGCCACGGGTCACCAG
CGCGTGGGCTTCCTGTACGGCCGCTACGAGCTCCACCCCGACGTGCCGCTGGGTATTCGC
GCCCGCGTGGCCGCCGTTTACGAGCCGCCTCAGGAGTGCAGCCGGGACGCCGTCCGCCTG
GCGTCGGACGACCACGCCGCGCTCCTCGACCGCCTCGCCGCCCGTCTCGGCCTCGAGCGT
GTCGGCTGGATCTTCACCGACCTGCTACCGTTGGATCTAGTCAGCGGCACGGTGCAGTGT
CTGAGGGGTGTGGACACGCACTTCCTCTCCGCTCAGGAATGTATCACGGCAGGACATTTC
CAGAACGAGCATCCGAACGCGTGTAGGCACGCGTCCTCCGGCTACTTTGGCTCTAAATTC
GTGACGGTGTGCGTGACAGGCGACGCCGACAACCACATCCACTTGGAGGGCTATCAGGTG
TCGGGTCAGTGCGCGGCGCTGGTGAGGGACGGCATCCTACTGCCCACCAGGGACGCTCCC
GAACTCGGATACATTCGGGATTGCTCGCCCGAACAGTACGTGCCTGACGTTTACTATAAG
GAAAAGGATGCGTACGGCAACGAAGTAGGCGTGTCGGCGAAGAGGCTACCGGTGGCTTAT
TTGCTGGTGGACGTGCCGGTGGGCGTGGCGCCCGCAGCAGGCGAGCCCACCTTCGACCCC
CGGGCGTCGTTTCCTCCCGCGCACCGGCCCCTGCAGCAGCACGTGCAGTCCCTGAGCGGC
CTCCACGCGCACGTGGAGCGCGCCGAGTCGTTCCTGGCAGCGGCCTCCGACTTGCACGTG
CTGCTGTTCCTGGCTACCAACGACGCCGCGCCGCTGAGCCTGGAGCAGCTGGCGCCGCTG
CTGGACGCCGTCCGCCGCCGCGACGCGTCCGCGGCCGAGGCGTGGCGCGCGTCGCCCGCG
GCCGCCGCGCTGCTGGCCCCCCGTCTCTTTCTGTGTCAGGTGACTCGTGTCAGTACAGCT
CTTTCTCTCGTGTTTCAGGAACGCCATGTAACGAGCCCGCCGTCGGCCGCCGGTCCTTCC
TCGCCGACGGAATATAGAAATTACTAG

Protein sequence:

MSGTKKMTLRVQSSEGTARVEMLDTEVTSRLYERVHDTLNLNSFGFALHKDRARKQEISS
SKSRQLREYGLQHGDMLYLSPVNGTVLFDQPSTSSEPLNKPLTELSTEAGPSTVIPSNAV
SKGPIEHEVDLQLYRLSGSIHRQRDEKLCRHNSKGCCVHCSALEPWDEGYLKEHNIKHMS
FHAYLRKMTSGKFITLDELSCKIKPGCKEHPPWPRGICSSCQPGAVTLTRQPYRHVDNVL
LEHAAPVERFLSYWRATGHQRVGFLYGRYELHPDVPLGIRARVAAVYEPPQECSRDAVRL
ASDDHAALLDRLAARLGLERVGWIFTDLLPLDLVSGTVQCLRGVDTHFLSAQECITAGHF
QNEHPNACRHASSGYFGSKFVTVCVTGDADNHIHLEGYQVSGQCAALVRDGILLPTRDAP
ELGYIRDCSPEQYVPDVYYKEKDAYGNEVGVSAKRLPVAYLLVDVPVGVAPAAGEPTFDP
RASFPPAHRPLQQHVQSLSGLHAHVERAESFLAAASDLHVLLFLATNDAAPLSLEQLAPL
LDAVRRRDASAAEAWRASPAAAALLAPRLFLCQVTRVSTALSLVFQERHVTSPPSAAGPS
SPTEYRNY