DPGLEAN11922 in OGS1.0

New model in OGS2.0DPOGS215991 
Genomic Positionscaffold3158:- 7585-15704
See gene structure
CDS Length3855
Paired RNAseq reads  12784
Single RNAseq reads  36026
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001182 (0.0)
Best Drosophila hit  CG8086, isoform E (4e-101)
Best Human hitouter dense fiber protein 3 (2e-10)
Best NR hit (blastp)  GJ20592 [Drosophila virilis] (3e-117)
Best NR hit (blastx)  GH10184 [Drosophila grimshawi] (6e-137)
GeneOntology terms  GO:0005643 nuclear pore
InterPro families  IPR010736 Protein of unknown function DUF1309
Orthology groupMCL16516

Nucleotide sequence:

ATGTATTTATTTGAAATTTGGCGCATTGTTTATAGAGCTGACATCACCGGAGGTAAACCT
CCGCCGGAATCTAGAAAGACTAGAGCACCAGCGTTCACGTTTGGACAGAAATTGGAAGCT
GCGGGTAAGGATAAGAGTGGCCCCGGTCCCGCTTCTTACAACACGGAAGGCATGACAGCT
AAAGGACGGGCGGGGGGCCCGGCGGCGTCTTTGCATGGTCGATGGCCACCACCTCGAGTA
ACGCCTACACCGGCTCCTTGCGACTACGAGCCCAGCAAGGCTGCCCGAGCTGTTCTTGAT
CATGCTCCAGCATTCTCGATAGGTCTTCGCGTTTCTCCCCCACAGGCTGGAAATAAAACA
CCAGCGCCTAACGTTTATTCTATGCCACCTGTTCTTGGAGAAGCGAGAGAAGGCAGTAAG
CGAGCGGCACCGGCATTCAGTATCACGGGTCGTGGCAAAACGATCGAGTCGAAGACACCG
ATGCCTGGTCCTGGCACTTACACGACGGATAAAGCGGCATCTGTTATTACTAAACGTCCT
CCAGCCTACACGATGGCACCTAGACGGGAGCTGAAGCCCCCAACCGCAGCTGTTCCTGGT
CCAGGAGTCTATTGCCCGGAAAAAGTTAAGTCTCATCGTACACAATCATCAAAGCTCACA
GTAAAGTCATCTAGCAAAATTCAAAAAATAGAGCAGATACCAGCGCCTAATGCTTATAAT
CCCGAAAAAGCAGATAGGATTTTAAGAGAAAAATCTCCAGCGTTTTCTTTTAGAACTAAA
TCAGAAATAATAAAAATTCAGGACGCTCCTGCACCGAATGTATATTCTCCGGAAAAATCT
TTACATGCTTTGAAAAACGGTCCTAAATATACTCTTTCTGGAAAAGGAACTGCGGAGAAA
CATGATGTTACCCCTGCGCCAAATTCTTATAACCCCCAAAAAGCTGATAAATTGTTACGT
GAAAGTTCACCAGCTTATACACTGAGATCAAAGGAAATACTTGAAAAAATTGATGACACA
CCAGCTCCTAATGTATATGCTCCAGAAAAATCTTTGCACATGTTAAATGGTGGCCCAAAA
TTTACAATTCTGCCTGCTCCTAACAGTTACAATCCCGAAAAAGCTGATAAGATTTTACAT
GAAAGCACTCCAGCATATTCTTTTAGGGTAAAAGACCATCCAAATGTAAGGAGTGAATCA
CCAGCTCCGAATGTTTATTCGCCTGAAAAGTCTATGTATTCATTAGACAGTGCTCCAAAA
TTTTCAATAAGTGGAAAAGGTTACTCTGAAAAAATCGCGGATACTCCAAGCCCAAATGCC
TATAATCCTAATAAGGCGGATAAATTATTACACGAATCCTCACCTGCTTACACGTTTCGA
GCAAAGGATAAAATATTAAAAACTGATAATTTTCCTGCACCTAATGTATATTCGCCAGAA
AAGTCAATACATTCATTAGATAGTACACCAAAATTTACCATGGCAGGCAGAGGTTCTTCT
CCGAAAATTGAAGATGTGCCGGCTCCTAATGCATACTGCCCTGATAAAGCTGACAAACTT
CTTCACGACTCTTCTCCTGCTTATACGTTAAGACCTAAAATTTTGGAGGGGAAACTCAGT
GACACTCCAGCACCTAATGCATATGAACCTCGTCTTAAAGATGATGCTCCAAAATATAGC
TTGTATGGAAAAGGACATGATATTAAGCCATCTGATACTCCTGGACCTAACGTCTATGAG
CCGCGCTTACTTGATAATACTCCTAAATACTCTTTAACAGGGAAAGGTCACGATGCCAAA
ATATTCAATACACCTGGACCTAATTGTTACGATCCTCATTTACCTTCAAACTCGCCAAGA
TTCACTATGTCAGGGAAAGGTCCAGATGAAAAATTTCCTGATGTTCCTGCTCCAAACTCC
TATAATGCTTCTTTACCTAATAATGCACCTAAATTCACGATAAGTGGAAAAGGTTATGAT
CCCAAAATGTTTATTACTCCTGGACCTGATTGTTATGATCCACATTTACCTCAAAATAGT
CCAAGATATACAATGGGTGGTAAAAGTAATGACCCAAAATGTTTTGAAGTCCCAGCACCT
AATGCATACGATCCACATATTATAAATGAATCTCCTAAATATACAATGTGTGGAAAGGGA
CACCCCGATAAAATAATTGACACACCTGCTCCTAATGCCTATGATCCAGATAAATATCCA
CGCAGTGGCGAGCCAAAGTACAGTTTTGGTATCAAAAGACCACCACTAAAAACTGAAAAT
TATCCGGCTCCTAATGCTTATTATGCTGATCGGGCTGATAAAGTTTTACATGAAACTTCA
CCGGCATATACATTCCGACCTAAAATTGAAGACAACAAAAAACCAGATACACCGGGTCCC
AATGCATACAATATAGAGAAGGCTGATAAAGTCATTTTAGAACATACCCCTTCATATAGC
TTATCACCGAAAGGAAAGGATGCCAAAATAAATGATACCCCGGCACCAAATGTTTATAAC
CCAGAAAAAGCTGACAAGCTCTTATTAGATAACGCACCACGATACTCGTTCAGAATGAAG
ACAAATCCACATAAATCAGATAATAATCCAGCCCCCAACAATTACAACCCTGATAAGGCC
GATAAACTTTTACACAGTGCTCCACAGTATACATTTAGAATCAAACCTGATGACATAAAA
GCTATAGATACTCCTGCACCTAACTCTTACACCATCCCAAATCTTCAAAAAACTCCACTA
TACACGATTTCTGGAAGACATAAAGAGCCGATAGATGAACGTCTTAAAGTTCCCGCTCCC
GGGGCTTATAACCCAGAAAAAGGCTATAAATTTGTTTTGACGTACTCACCGCAATACACT
TTTGGCGTTAAAATTCACACTGACAAATATGCTGATACGCCAGCTCCCAATAGTTATCGT
ATTCCGTCTGTACTGGAGAGTCCCGTCTACACTATGGTAGGTCGTCCGAAAGAGCCTAAG
GATGATCGTTGTAGAATACCCGCACCAGGAACATATTCTCCGGAGAAAGTACAGATAAAT
AAAACCCCGCAAATCACGTTTGGAATAAAACATTCTCCTCTTCTGGGTCAACTTAAGCCA
ATTGAACCTCCTCGTCATGGTATGCAAACAATGAAAAAACCTGTTGAGAAAGAAGTGCAC
GACGATAATTACAGAAACTTGTCCCAAACTTGGGAAAAAGAAAGTATAGTGATCAAAACA
AATGGCGATGTCAACCAACCCAGAACACCTGAAACAAATTCACGACAATCTATGTATGAG
TCTATGGATTCCAATAATGATACTCGCAATATGCACACACATGTGACACAGGTTAGAAAT
GAAATAAGAAGTTCTACAGCTACACCGGAGCCTGTTCAAGAAAGGCTCACCCAAGAAATA
GTTTGGGTCCCTGAAACCCAGCCTCGACGAGGTTCTTATACAATAGAAAAATCTGATGGC
AATGGATTTATTGAACGTTATGAGAATAGTGAAGTCATTCCGGTTGAAAATGGAGCTGTT
CATATATCTGGTAGCGGAGTAAGAGGGGCGTCGTGTACTGAGGAGCATAGTAGCGAAGTG
GTTAAAAAGGATGGCTTCCTGCAAAATGTTAATAAAAGAGTAAACAATTCCAGCGCTCAT
GAGCAGAGTCAGAAATCTAGCGAGGAAGTTCGTACTGGAAGTGATATTCAGCACTTACCA
GACGGTGGTATTGCGCAGACCACTACAACAACAACCATAAAAAAAATTGGAAAATCAGCC
AAAACAGCGAATGCTACGACCACAGTCACTCGAACCAATACTGTTGTAACTGCACGCGAT
GTCGGCGCTAAATGA

Protein sequence:

MYLFEIWRIVYRADITGGKPPPESRKTRAPAFTFGQKLEAAGKDKSGPGPASYNTEGMTA
KGRAGGPAASLHGRWPPPRVTPTPAPCDYEPSKAARAVLDHAPAFSIGLRVSPPQAGNKT
PAPNVYSMPPVLGEAREGSKRAAPAFSITGRGKTIESKTPMPGPGTYTTDKAASVITKRP
PAYTMAPRRELKPPTAAVPGPGVYCPEKVKSHRTQSSKLTVKSSSKIQKIEQIPAPNAYN
PEKADRILREKSPAFSFRTKSEIIKIQDAPAPNVYSPEKSLHALKNGPKYTLSGKGTAEK
HDVTPAPNSYNPQKADKLLRESSPAYTLRSKEILEKIDDTPAPNVYAPEKSLHMLNGGPK
FTILPAPNSYNPEKADKILHESTPAYSFRVKDHPNVRSESPAPNVYSPEKSMYSLDSAPK
FSISGKGYSEKIADTPSPNAYNPNKADKLLHESSPAYTFRAKDKILKTDNFPAPNVYSPE
KSIHSLDSTPKFTMAGRGSSPKIEDVPAPNAYCPDKADKLLHDSSPAYTLRPKILEGKLS
DTPAPNAYEPRLKDDAPKYSLYGKGHDIKPSDTPGPNVYEPRLLDNTPKYSLTGKGHDAK
IFNTPGPNCYDPHLPSNSPRFTMSGKGPDEKFPDVPAPNSYNASLPNNAPKFTISGKGYD
PKMFITPGPDCYDPHLPQNSPRYTMGGKSNDPKCFEVPAPNAYDPHIINESPKYTMCGKG
HPDKIIDTPAPNAYDPDKYPRSGEPKYSFGIKRPPLKTENYPAPNAYYADRADKVLHETS
PAYTFRPKIEDNKKPDTPGPNAYNIEKADKVILEHTPSYSLSPKGKDAKINDTPAPNVYN
PEKADKLLLDNAPRYSFRMKTNPHKSDNNPAPNNYNPDKADKLLHSAPQYTFRIKPDDIK
AIDTPAPNSYTIPNLQKTPLYTISGRHKEPIDERLKVPAPGAYNPEKGYKFVLTYSPQYT
FGVKIHTDKYADTPAPNSYRIPSVLESPVYTMVGRPKEPKDDRCRIPAPGTYSPEKVQIN
KTPQITFGIKHSPLLGQLKPIEPPRHGMQTMKKPVEKEVHDDNYRNLSQTWEKESIVIKT
NGDVNQPRTPETNSRQSMYESMDSNNDTRNMHTHVTQVRNEIRSSTATPEPVQERLTQEI
VWVPETQPRRGSYTIEKSDGNGFIERYENSEVIPVENGAVHISGSGVRGASCTEEHSSEV
VKKDGFLQNVNKRVNNSSAHEQSQKSSEEVRTGSDIQHLPDGGIAQTTTTTTIKKIGKSA
KTANATTTVTRTNTVVTARDVGAK