New model in OGS2.0 | DPOGS215991  |
---|---|
Genomic Position | scaffold3158:7585-15704 (- strand) |
CDS Length | 3855 |
Paired RNAseq reads   | 12784 |
Single RNAseq reads   | 36026 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA001182 (0.0) |
Best Drosophila hit   | CG8086, isoform E (4e-101) |
Best Human hit | outer dense fiber protein 3 (2e-10) |
Best NR hit (blastp)   | GJ20592 [Drosophila virilis] (3e-117) |
Best NR hit (blastx)   | GH10184 [Drosophila grimshawi] (6e-137) |
GeneOntology terms   | GO:0005643 nuclear pore |
InterPro families   | IPR010736 Protein of unknown function DUF1309 |
Orthology group | MCL16516 |
Nucleotide sequence:
ATGTATTTATTTGAAATTTGGCGCATTGTTTATAGAGCTGACATCACCGGAGGTAAACCT
CCGCCGGAATCTAGAAAGACTAGAGCACCAGCGTTCACGTTTGGACAGAAATTGGAAGCT
GCGGGTAAGGATAAGAGTGGCCCCGGTCCCGCTTCTTACAACACGGAAGGCATGACAGCT
AAAGGACGGGCGGGGGGCCCGGCGGCGTCTTTGCATGGTCGATGGCCACCACCTCGAGTA
ACGCCTACACCGGCTCCTTGCGACTACGAGCCCAGCAAGGCTGCCCGAGCTGTTCTTGAT
CATGCTCCAGCATTCTCGATAGGTCTTCGCGTTTCTCCCCCACAGGCTGGAAATAAAACA
CCAGCGCCTAACGTTTATTCTATGCCACCTGTTCTTGGAGAAGCGAGAGAAGGCAGTAAG
CGAGCGGCACCGGCATTCAGTATCACGGGTCGTGGCAAAACGATCGAGTCGAAGACACCG
ATGCCTGGTCCTGGCACTTACACGACGGATAAAGCGGCATCTGTTATTACTAAACGTCCT
CCAGCCTACACGATGGCACCTAGACGGGAGCTGAAGCCCCCAACCGCAGCTGTTCCTGGT
CCAGGAGTCTATTGCCCGGAAAAAGTTAAGTCTCATCGTACACAATCATCAAAGCTCACA
GTAAAGTCATCTAGCAAAATTCAAAAAATAGAGCAGATACCAGCGCCTAATGCTTATAAT
CCCGAAAAAGCAGATAGGATTTTAAGAGAAAAATCTCCAGCGTTTTCTTTTAGAACTAAA
TCAGAAATAATAAAAATTCAGGACGCTCCTGCACCGAATGTATATTCTCCGGAAAAATCT
TTACATGCTTTGAAAAACGGTCCTAAATATACTCTTTCTGGAAAAGGAACTGCGGAGAAA
CATGATGTTACCCCTGCGCCAAATTCTTATAACCCCCAAAAAGCTGATAAATTGTTACGT
GAAAGTTCACCAGCTTATACACTGAGATCAAAGGAAATACTTGAAAAAATTGATGACACA
CCAGCTCCTAATGTATATGCTCCAGAAAAATCTTTGCACATGTTAAATGGTGGCCCAAAA
TTTACAATTCTGCCTGCTCCTAACAGTTACAATCCCGAAAAAGCTGATAAGATTTTACAT
GAAAGCACTCCAGCATATTCTTTTAGGGTAAAAGACCATCCAAATGTAAGGAGTGAATCA
CCAGCTCCGAATGTTTATTCGCCTGAAAAGTCTATGTATTCATTAGACAGTGCTCCAAAA
TTTTCAATAAGTGGAAAAGGTTACTCTGAAAAAATCGCGGATACTCCAAGCCCAAATGCC
TATAATCCTAATAAGGCGGATAAATTATTACACGAATCCTCACCTGCTTACACGTTTCGA
GCAAAGGATAAAATATTAAAAACTGATAATTTTCCTGCACCTAATGTATATTCGCCAGAA
AAGTCAATACATTCATTAGATAGTACACCAAAATTTACCATGGCAGGCAGAGGTTCTTCT
CCGAAAATTGAAGATGTGCCGGCTCCTAATGCATACTGCCCTGATAAAGCTGACAAACTT
CTTCACGACTCTTCTCCTGCTTATACGTTAAGACCTAAAATTTTGGAGGGGAAACTCAGT
GACACTCCAGCACCTAATGCATATGAACCTCGTCTTAAAGATGATGCTCCAAAATATAGC
TTGTATGGAAAAGGACATGATATTAAGCCATCTGATACTCCTGGACCTAACGTCTATGAG
CCGCGCTTACTTGATAATACTCCTAAATACTCTTTAACAGGGAAAGGTCACGATGCCAAA
ATATTCAATACACCTGGACCTAATTGTTACGATCCTCATTTACCTTCAAACTCGCCAAGA
TTCACTATGTCAGGGAAAGGTCCAGATGAAAAATTTCCTGATGTTCCTGCTCCAAACTCC
TATAATGCTTCTTTACCTAATAATGCACCTAAATTCACGATAAGTGGAAAAGGTTATGAT
CCCAAAATGTTTATTACTCCTGGACCTGATTGTTATGATCCACATTTACCTCAAAATAGT
CCAAGATATACAATGGGTGGTAAAAGTAATGACCCAAAATGTTTTGAAGTCCCAGCACCT
AATGCATACGATCCACATATTATAAATGAATCTCCTAAATATACAATGTGTGGAAAGGGA
CACCCCGATAAAATAATTGACACACCTGCTCCTAATGCCTATGATCCAGATAAATATCCA
CGCAGTGGCGAGCCAAAGTACAGTTTTGGTATCAAAAGACCACCACTAAAAACTGAAAAT
TATCCGGCTCCTAATGCTTATTATGCTGATCGGGCTGATAAAGTTTTACATGAAACTTCA
CCGGCATATACATTCCGACCTAAAATTGAAGACAACAAAAAACCAGATACACCGGGTCCC
AATGCATACAATATAGAGAAGGCTGATAAAGTCATTTTAGAACATACCCCTTCATATAGC
TTATCACCGAAAGGAAAGGATGCCAAAATAAATGATACCCCGGCACCAAATGTTTATAAC
CCAGAAAAAGCTGACAAGCTCTTATTAGATAACGCACCACGATACTCGTTCAGAATGAAG
ACAAATCCACATAAATCAGATAATAATCCAGCCCCCAACAATTACAACCCTGATAAGGCC
GATAAACTTTTACACAGTGCTCCACAGTATACATTTAGAATCAAACCTGATGACATAAAA
GCTATAGATACTCCTGCACCTAACTCTTACACCATCCCAAATCTTCAAAAAACTCCACTA
TACACGATTTCTGGAAGACATAAAGAGCCGATAGATGAACGTCTTAAAGTTCCCGCTCCC
GGGGCTTATAACCCAGAAAAAGGCTATAAATTTGTTTTGACGTACTCACCGCAATACACT
TTTGGCGTTAAAATTCACACTGACAAATATGCTGATACGCCAGCTCCCAATAGTTATCGT
ATTCCGTCTGTACTGGAGAGTCCCGTCTACACTATGGTAGGTCGTCCGAAAGAGCCTAAG
GATGATCGTTGTAGAATACCCGCACCAGGAACATATTCTCCGGAGAAAGTACAGATAAAT
AAAACCCCGCAAATCACGTTTGGAATAAAACATTCTCCTCTTCTGGGTCAACTTAAGCCA
ATTGAACCTCCTCGTCATGGTATGCAAACAATGAAAAAACCTGTTGAGAAAGAAGTGCAC
GACGATAATTACAGAAACTTGTCCCAAACTTGGGAAAAAGAAAGTATAGTGATCAAAACA
AATGGCGATGTCAACCAACCCAGAACACCTGAAACAAATTCACGACAATCTATGTATGAG
TCTATGGATTCCAATAATGATACTCGCAATATGCACACACATGTGACACAGGTTAGAAAT
GAAATAAGAAGTTCTACAGCTACACCGGAGCCTGTTCAAGAAAGGCTCACCCAAGAAATA
GTTTGGGTCCCTGAAACCCAGCCTCGACGAGGTTCTTATACAATAGAAAAATCTGATGGC
AATGGATTTATTGAACGTTATGAGAATAGTGAAGTCATTCCGGTTGAAAATGGAGCTGTT
CATATATCTGGTAGCGGAGTAAGAGGGGCGTCGTGTACTGAGGAGCATAGTAGCGAAGTG
GTTAAAAAGGATGGCTTCCTGCAAAATGTTAATAAAAGAGTAAACAATTCCAGCGCTCAT
GAGCAGAGTCAGAAATCTAGCGAGGAAGTTCGTACTGGAAGTGATATTCAGCACTTACCA
GACGGTGGTATTGCGCAGACCACTACAACAACAACCATAAAAAAAATTGGAAAATCAGCC
AAAACAGCGAATGCTACGACCACAGTCACTCGAACCAATACTGTTGTAACTGCACGCGAT
GTCGGCGCTAAATGA
Protein sequence:
MYLFEIWRIVYRADITGGKPPPESRKTRAPAFTFGQKLEAAGKDKSGPGPASYNTEGMTA
KGRAGGPAASLHGRWPPPRVTPTPAPCDYEPSKAARAVLDHAPAFSIGLRVSPPQAGNKT
PAPNVYSMPPVLGEAREGSKRAAPAFSITGRGKTIESKTPMPGPGTYTTDKAASVITKRP
PAYTMAPRRELKPPTAAVPGPGVYCPEKVKSHRTQSSKLTVKSSSKIQKIEQIPAPNAYN
PEKADRILREKSPAFSFRTKSEIIKIQDAPAPNVYSPEKSLHALKNGPKYTLSGKGTAEK
HDVTPAPNSYNPQKADKLLRESSPAYTLRSKEILEKIDDTPAPNVYAPEKSLHMLNGGPK
FTILPAPNSYNPEKADKILHESTPAYSFRVKDHPNVRSESPAPNVYSPEKSMYSLDSAPK
FSISGKGYSEKIADTPSPNAYNPNKADKLLHESSPAYTFRAKDKILKTDNFPAPNVYSPE
KSIHSLDSTPKFTMAGRGSSPKIEDVPAPNAYCPDKADKLLHDSSPAYTLRPKILEGKLS
DTPAPNAYEPRLKDDAPKYSLYGKGHDIKPSDTPGPNVYEPRLLDNTPKYSLTGKGHDAK
IFNTPGPNCYDPHLPSNSPRFTMSGKGPDEKFPDVPAPNSYNASLPNNAPKFTISGKGYD
PKMFITPGPDCYDPHLPQNSPRYTMGGKSNDPKCFEVPAPNAYDPHIINESPKYTMCGKG
HPDKIIDTPAPNAYDPDKYPRSGEPKYSFGIKRPPLKTENYPAPNAYYADRADKVLHETS
PAYTFRPKIEDNKKPDTPGPNAYNIEKADKVILEHTPSYSLSPKGKDAKINDTPAPNVYN
PEKADKLLLDNAPRYSFRMKTNPHKSDNNPAPNNYNPDKADKLLHSAPQYTFRIKPDDIK
AIDTPAPNSYTIPNLQKTPLYTISGRHKEPIDERLKVPAPGAYNPEKGYKFVLTYSPQYT
FGVKIHTDKYADTPAPNSYRIPSVLESPVYTMVGRPKEPKDDRCRIPAPGTYSPEKVQIN
KTPQITFGIKHSPLLGQLKPIEPPRHGMQTMKKPVEKEVHDDNYRNLSQTWEKESIVIKT
NGDVNQPRTPETNSRQSMYESMDSNNDTRNMHTHVTQVRNEIRSSTATPEPVQERLTQEI
VWVPETQPRRGSYTIEKSDGNGFIERYENSEVIPVENGAVHISGSGVRGASCTEEHSSEV
VKKDGFLQNVNKRVNNSSAHEQSQKSSEEVRTGSDIQHLPDGGIAQTTTTTTIKKIGKSA
KTANATTTVTRTNTVVTARDVGAK
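
Consistency check: the CDS length of 3855 nt corresponds to 1285 codons, i.e. 1284 residues plus the terminal stop codon (TGA), which matches the protein sequence above. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1). The names `translate` and `expected_protein_length` are hypothetical helpers introduced here for illustration and are not part of the record or of any database tooling.

```python
# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; stop codons are rendered as '*'.

    Whitespace and line breaks are ignored. Ambiguous bases (e.g. N)
    are not handled in this sketch and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    # First and last lines of the nucleotide sequence listed in this record.
    first_line = "ATGTATTTATTTGAAATTTGGCGCATTGTTTATAGAGCTGACATCACCGGAGGTAAACCT"
    last_line = "GTCGGCGCTAAATGA"
    print(translate(first_line))          # MYLFEIWRIVYRADITGGKP
    print(translate(last_line))           # VGAK*
    print(expected_protein_length(3855))  # 1284
```

Passing the full nucleotide block to `translate` and stripping the trailing `*` should reproduce the protein sequence listed in this record; any mismatch would point to a copy or frame error.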