DPGLEAN00885 in OGS1.0

New model in OGS2.0DPOGS201341 
Genomic Positionscaffold1500:+ 32998-37801
See gene structure
CDS Length3348
Paired RNAseq reads  2356
Single RNAseq reads  5355
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003125 (2e-122)
Best Drosophila hit  Upf2 (0.0)
Best Human hitregulator of nonsense transcripts 2 (0.0)
Best NR hit (blastp)  PREDICTED: similar to UPF2 regulator of nonsense transcripts homolog [Apis mellifera] (0.0)
Best NR hit (blastx)  PREDICTED: similar to UPF2 regulator of nonsense transcripts homolog [Apis mellifera] (0.0)
GeneOntology terms




  
GO:0000184 nuclear-transcribed mRNA catabolic process, nonsense-mediated decay
GO:0005515 protein binding
GO:0005634 nucleus
GO:0005737 cytoplasm
GO:0016070 RNA metabolic process
GO:0042802 identical protein binding
InterPro families


  
IPR016024 Armadillo-type fold
IPR016021 MIF4-like, type 1/2/3
IPR003890 MIF4G-like, type 3
IPR007193 Up-frameshift suppressor 2
Orthology groupMCL14646

Nucleotide sequence:

ATGACTTCCAATGAAAAGGAAGCTGATAAAAGTGAGGACGCGTCTGGTATTGATGATGTT
TCTACAGAAAATACAGATGAAAATGACAAACTTGCTCTTACTGAATATATAGCTAACCTG
GAAGCAAGAATCAAACACAAAACAGATCTACGAAATCAAAACCTTAATTGTGTCAGACCG
CCAGACAATCATTTTTCAAAACTTGATTCAGGTTTAAAGAAAAATACGACCTTCGTTAAA
AAGCTTAAATCTTTTAGTGCCACACAACTAGATGCTCTTTCTAAAGATTTAAGTACTCTC
AATCTTACGAAGTATATTTCTGAAGTCGCTTCCGCTATCGCAGAAGCAAAATTAAAGATG
TCAGATATATCAGCTGCAATCACTCTGTGTTCTACATTGCATCAAACATATTCGGAATTT
AGTTCGTTCTTTTTTGAAAACTGGCAAAAGATACTATCCTTCAAAGCAACAGATAAGATA
ACCAATTCTTCAAAATTAAGAGTGGACTTAAGATTCTATGCAGAACTTATTGCTGTTGGT
ATATTTACCAACAAGACTGGTTTACCACTGTTGGGAATTGTGCTCACAGTTTTAATCAAT
ATGGATAAAGAAGAGCATAACAATATTCCAATTTTGTTGTCATTTTGTAAACATTGTGGT
GAAGACTATGCAGGATTAGTTCCAAAAAAAATTAAGGATACTGCTAAGAAATTTAATGTG
ACAGTACCAAGAAATACTTTTATACCAGCGGAGAAGCAAACAGCTGTGAGAAGTTTATTG
AAAGACTATTTCTTATCATTGACAAAGCATTTACTGGCTGAACGAGCTCAGCTTCAGGCT
CTGCATGCAGCCAATCAGAGAACCCTTCATACAAGAGGAGAGTTATCACAAGAGAGGAAA
GACCAATTGGAACAACATCAGGCCACATATGATAAACTACTTGTAGGAGCTCAGAACTTT
GCTGAAGTGTTAGGAGAAGACCTAGGTGAGGCTGGAGAACCTTTGACTTTGTCCATGACA
GTTATTGAGACTCAGGGTACAGTAACCATTGGTGGGAATGATGAGATCATTATGCAAGCT
GGTACTGACCCTTGGCAGGACGAAGACACGAGAACTTTCTATACCAGTCTTCCCGATTTG
AAAGTTTTCATGCCTAATTACCAATTGAAAGAGGCTGTGAAAAATAAAACGGAAACTGTT
ACCGAAGAGATGTTAGACGAGGATCTCAAAGAGGATGAGCTCAGTGACAATGAGGAACCG
GCACCTGTTGTTGCTGATGTGGAGCAAGAAGAGGCACAACCAGCTAATGTGTCTAATAAA
TACGCCCTTGATGCTTTCTTAAATGAATTGCCAAATTGTATCAATAGAGAGTTAATTGAC
AATGCGGCTGTAGACTTTGTTTTGAATTTGAACACTAAAAATAATAGAAAAAAATTAACA
CGGGTCCTATTTAGTGTTGCCAGAACAAGATTAGATCTATTACCATTTTACTCGAGATTC
GCGTCCATACTGTATCCAGTTTTACCCGACGTGTGTGTTGATTTGTGTCAAATGTTAAAA
CAAGATTTTAAGTATCATGTCAGGAAGAAGGATCAAATTAACATTGAATCAAAGATAAAA
GTGGTGAGGTTTATTGGAGAACTTGTTAAATTTGGTCTCTACTCCAAAATGGAAGCTTTG
TACTGTCTGAAAGTTCTGTTACACGATTTTAAACATCATCACATTGAAATGGCTTGTAAC
TTATTGGAGACTTGCGGAAGGTATTTGTACTGCAATCCTGATACACACCAAAGAACGATG
ATATATTTACAGCAGATGATGAGAAAAAAGACTGTTTCTGCTCTCGATTCACGTTACGTG
ACCCAAATCGAAAATGCATTTTATTACGTATGTCCACCCGAAGCACCGGCACAACCGAAA
GAGGAAGAGCCTCCCATGCACCAGTTCATTAGAAAAATTCTTCACGAAGATCTACAAAAA
AGTAACGAAGAAAAAATTTTGAGGCTTATGAGGAAACTTAATTGGGATGACCCTGAAGTA
GCGGCAGTGGCAATCCAACATCTGGCTGGCGGGTGGAGAGTCAGGGCGAGTGCGAGAAGG
GCATTGGCTCGCTTAACAGCTGAACTGGCTGCCTGGCAAGAAAACGTTGCCCCCGCTGTT
GTTGACACCATACTGGAGGAAATTAGAGTTACTATGGAAGACCCTCATCCAAAGTACAAT
CAGAGGAGAATAGCTAGTGTCCGATATCTTGGAGAACTCTATAATTACAAGCTCCTGGAT
TCCCGAGACGTTTTCACGGTTCTCTACTCTTTTATTACATTCGGTGTATCGAACGACCAT
TCTAACGTATCTCCACTAGATCCGTCCGACAATGTCTTCAGGATAAGATTAGTTTGTGCT
CTACTAGAGACTTGTGGCGCATATTTTAATAGTGGATCTAGTAAGAAACGACTGGATTAC
TTTTTGGTTTTCTTCCAAAATTACTATTGGTTTAAATACAGTGATCCTTACTGGACCGAG
GAGAATAAATTTCCGATATACGTCAAATACATATATCAGGAATGTTTGAGCAGTTTGCGG
CCCAAACTGACATTGTTTACTAGCTGGCAACAGTGTAAGGACGCTATAGAGGAGATAAGA
CAGACATTATACCCGGATTTGGGGGAAGACGAACACTTTGACAATGATGACCAGGGCGAG
GATAGTGTTGCTGATGGTTTAGACACCATCATAGAGACGGATGATGAAACAGATAATCCA
CACATGCCAGAAGAAAGCTCTGACGAAGACCCCATCACGGAAAGTGCTGGAAATGACGAG
AACGACGTGCAGACAGAAGACCTTCCCATCGAGCCGAGGCGTCCAGCTGTAAAACCTGTG
GAAGATGTGGAGTTTGAATCGGCATTCGAGAAGATGGTTATGGAGAACATTGCGGAAAGA
CAACGTGAGAATAGACCACAGCAAAGAGATATAGCTGTGCCAATGACATGTAGACAAACT
ACTAAAAAAACTTATGAACAGTTACTGCAAGGTAAGGAAGGAGTAGAATTTGTGTTGATG
GTGAGAAAAGGTATGAAACCACAGTACAAGTCGTTCAACGCGCCACCGGAGCTCGCGAGC
AATTTACAACAACAAGCCCTAGCGGATAAACAGGAAATGGAAAGAGTTAAACGTTTAACA
TTAAACATTTCTGAACGCCAAGAAGAGGAAGAATATAGCGCGGAGAGTGGGGGAGGTTCT
GGAGGAGGTGGCAACCCCAATAGAGGGCAGCACGTTCGACAAAAGTATCAACACCCTAAA
GGGGCACCGGATGCAGATCTTATATTTGGACCTAAGAAATTCAAATAA

Protein sequence:

MTSNEKEADKSEDASGIDDVSTENTDENDKLALTEYIANLEARIKHKTDLRNQNLNCVRP
PDNHFSKLDSGLKKNTTFVKKLKSFSATQLDALSKDLSTLNLTKYISEVASAIAEAKLKM
SDISAAITLCSTLHQTYSEFSSFFFENWQKILSFKATDKITNSSKLRVDLRFYAELIAVG
IFTNKTGLPLLGIVLTVLINMDKEEHNNIPILLSFCKHCGEDYAGLVPKKIKDTAKKFNV
TVPRNTFIPAEKQTAVRSLLKDYFLSLTKHLLAERAQLQALHAANQRTLHTRGELSQERK
DQLEQHQATYDKLLVGAQNFAEVLGEDLGEAGEPLTLSMTVIETQGTVTIGGNDEIIMQA
GTDPWQDEDTRTFYTSLPDLKVFMPNYQLKEAVKNKTETVTEEMLDEDLKEDELSDNEEP
APVVADVEQEEAQPANVSNKYALDAFLNELPNCINRELIDNAAVDFVLNLNTKNNRKKLT
RVLFSVARTRLDLLPFYSRFASILYPVLPDVCVDLCQMLKQDFKYHVRKKDQINIESKIK
VVRFIGELVKFGLYSKMEALYCLKVLLHDFKHHHIEMACNLLETCGRYLYCNPDTHQRTM
IYLQQMMRKKTVSALDSRYVTQIENAFYYVCPPEAPAQPKEEEPPMHQFIRKILHEDLQK
SNEEKILRLMRKLNWDDPEVAAVAIQHLAGGWRVRASARRALARLTAELAAWQENVAPAV
VDTILEEIRVTMEDPHPKYNQRRIASVRYLGELYNYKLLDSRDVFTVLYSFITFGVSNDH
SNVSPLDPSDNVFRIRLVCALLETCGAYFNSGSSKKRLDYFLVFFQNYYWFKYSDPYWTE
ENKFPIYVKYIYQECLSSLRPKLTLFTSWQQCKDAIEEIRQTLYPDLGEDEHFDNDDQGE
DSVADGLDTIIETDDETDNPHMPEESSDEDPITESAGNDENDVQTEDLPIEPRRPAVKPV
EDVEFESAFEKMVMENIAERQRENRPQQRDIAVPMTCRQTTKKTYEQLLQGKEGVEFVLM
VRKGMKPQYKSFNAPPELASNLQQQALADKQEMERVKRLTLNISERQEEEEYSAESGGGS
GGGGNPNRGQHVRQKYQHPKGAPDADLIFGPKKFK