New model in OGS2.0 | DPOGS201341  |
---|---|
Genomic Position | scaffold1500:+ 32998-37801 |
See gene structure | |
CDS Length | 3348 |
Paired RNAseq reads   | 2356 |
Single RNAseq reads   | 5355 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003125 (2e-122) |
Best Drosophila hit   | Upf2 (0.0) |
Best Human hit | regulator of nonsense transcripts 2 (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to UPF2 regulator of nonsense transcripts homolog [Apis mellifera] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to UPF2 regulator of nonsense transcripts homolog [Apis mellifera] (0.0) |
GeneOntology terms    | GO:0000184 nuclear-transcribed mRNA catabolic process, nonsense-mediated decay GO:0005515 protein binding GO:0005634 nucleus GO:0005737 cytoplasm GO:0016070 RNA metabolic process GO:0042802 identical protein binding |
InterPro families    | IPR016024 Armadillo-type fold IPR016021 MIF4-like, type 1/2/3 IPR003890 MIF4G-like, type 3 IPR007193 Up-frameshift suppressor 2 |
Orthology group | MCL14646 |
Nucleotide sequence:
ATGACTTCCAATGAAAAGGAAGCTGATAAAAGTGAGGACGCGTCTGGTATTGATGATGTT
TCTACAGAAAATACAGATGAAAATGACAAACTTGCTCTTACTGAATATATAGCTAACCTG
GAAGCAAGAATCAAACACAAAACAGATCTACGAAATCAAAACCTTAATTGTGTCAGACCG
CCAGACAATCATTTTTCAAAACTTGATTCAGGTTTAAAGAAAAATACGACCTTCGTTAAA
AAGCTTAAATCTTTTAGTGCCACACAACTAGATGCTCTTTCTAAAGATTTAAGTACTCTC
AATCTTACGAAGTATATTTCTGAAGTCGCTTCCGCTATCGCAGAAGCAAAATTAAAGATG
TCAGATATATCAGCTGCAATCACTCTGTGTTCTACATTGCATCAAACATATTCGGAATTT
AGTTCGTTCTTTTTTGAAAACTGGCAAAAGATACTATCCTTCAAAGCAACAGATAAGATA
ACCAATTCTTCAAAATTAAGAGTGGACTTAAGATTCTATGCAGAACTTATTGCTGTTGGT
ATATTTACCAACAAGACTGGTTTACCACTGTTGGGAATTGTGCTCACAGTTTTAATCAAT
ATGGATAAAGAAGAGCATAACAATATTCCAATTTTGTTGTCATTTTGTAAACATTGTGGT
GAAGACTATGCAGGATTAGTTCCAAAAAAAATTAAGGATACTGCTAAGAAATTTAATGTG
ACAGTACCAAGAAATACTTTTATACCAGCGGAGAAGCAAACAGCTGTGAGAAGTTTATTG
AAAGACTATTTCTTATCATTGACAAAGCATTTACTGGCTGAACGAGCTCAGCTTCAGGCT
CTGCATGCAGCCAATCAGAGAACCCTTCATACAAGAGGAGAGTTATCACAAGAGAGGAAA
GACCAATTGGAACAACATCAGGCCACATATGATAAACTACTTGTAGGAGCTCAGAACTTT
GCTGAAGTGTTAGGAGAAGACCTAGGTGAGGCTGGAGAACCTTTGACTTTGTCCATGACA
GTTATTGAGACTCAGGGTACAGTAACCATTGGTGGGAATGATGAGATCATTATGCAAGCT
GGTACTGACCCTTGGCAGGACGAAGACACGAGAACTTTCTATACCAGTCTTCCCGATTTG
AAAGTTTTCATGCCTAATTACCAATTGAAAGAGGCTGTGAAAAATAAAACGGAAACTGTT
ACCGAAGAGATGTTAGACGAGGATCTCAAAGAGGATGAGCTCAGTGACAATGAGGAACCG
GCACCTGTTGTTGCTGATGTGGAGCAAGAAGAGGCACAACCAGCTAATGTGTCTAATAAA
TACGCCCTTGATGCTTTCTTAAATGAATTGCCAAATTGTATCAATAGAGAGTTAATTGAC
AATGCGGCTGTAGACTTTGTTTTGAATTTGAACACTAAAAATAATAGAAAAAAATTAACA
CGGGTCCTATTTAGTGTTGCCAGAACAAGATTAGATCTATTACCATTTTACTCGAGATTC
GCGTCCATACTGTATCCAGTTTTACCCGACGTGTGTGTTGATTTGTGTCAAATGTTAAAA
CAAGATTTTAAGTATCATGTCAGGAAGAAGGATCAAATTAACATTGAATCAAAGATAAAA
GTGGTGAGGTTTATTGGAGAACTTGTTAAATTTGGTCTCTACTCCAAAATGGAAGCTTTG
TACTGTCTGAAAGTTCTGTTACACGATTTTAAACATCATCACATTGAAATGGCTTGTAAC
TTATTGGAGACTTGCGGAAGGTATTTGTACTGCAATCCTGATACACACCAAAGAACGATG
ATATATTTACAGCAGATGATGAGAAAAAAGACTGTTTCTGCTCTCGATTCACGTTACGTG
ACCCAAATCGAAAATGCATTTTATTACGTATGTCCACCCGAAGCACCGGCACAACCGAAA
GAGGAAGAGCCTCCCATGCACCAGTTCATTAGAAAAATTCTTCACGAAGATCTACAAAAA
AGTAACGAAGAAAAAATTTTGAGGCTTATGAGGAAACTTAATTGGGATGACCCTGAAGTA
GCGGCAGTGGCAATCCAACATCTGGCTGGCGGGTGGAGAGTCAGGGCGAGTGCGAGAAGG
GCATTGGCTCGCTTAACAGCTGAACTGGCTGCCTGGCAAGAAAACGTTGCCCCCGCTGTT
GTTGACACCATACTGGAGGAAATTAGAGTTACTATGGAAGACCCTCATCCAAAGTACAAT
CAGAGGAGAATAGCTAGTGTCCGATATCTTGGAGAACTCTATAATTACAAGCTCCTGGAT
TCCCGAGACGTTTTCACGGTTCTCTACTCTTTTATTACATTCGGTGTATCGAACGACCAT
TCTAACGTATCTCCACTAGATCCGTCCGACAATGTCTTCAGGATAAGATTAGTTTGTGCT
CTACTAGAGACTTGTGGCGCATATTTTAATAGTGGATCTAGTAAGAAACGACTGGATTAC
TTTTTGGTTTTCTTCCAAAATTACTATTGGTTTAAATACAGTGATCCTTACTGGACCGAG
GAGAATAAATTTCCGATATACGTCAAATACATATATCAGGAATGTTTGAGCAGTTTGCGG
CCCAAACTGACATTGTTTACTAGCTGGCAACAGTGTAAGGACGCTATAGAGGAGATAAGA
CAGACATTATACCCGGATTTGGGGGAAGACGAACACTTTGACAATGATGACCAGGGCGAG
GATAGTGTTGCTGATGGTTTAGACACCATCATAGAGACGGATGATGAAACAGATAATCCA
CACATGCCAGAAGAAAGCTCTGACGAAGACCCCATCACGGAAAGTGCTGGAAATGACGAG
AACGACGTGCAGACAGAAGACCTTCCCATCGAGCCGAGGCGTCCAGCTGTAAAACCTGTG
GAAGATGTGGAGTTTGAATCGGCATTCGAGAAGATGGTTATGGAGAACATTGCGGAAAGA
CAACGTGAGAATAGACCACAGCAAAGAGATATAGCTGTGCCAATGACATGTAGACAAACT
ACTAAAAAAACTTATGAACAGTTACTGCAAGGTAAGGAAGGAGTAGAATTTGTGTTGATG
GTGAGAAAAGGTATGAAACCACAGTACAAGTCGTTCAACGCGCCACCGGAGCTCGCGAGC
AATTTACAACAACAAGCCCTAGCGGATAAACAGGAAATGGAAAGAGTTAAACGTTTAACA
TTAAACATTTCTGAACGCCAAGAAGAGGAAGAATATAGCGCGGAGAGTGGGGGAGGTTCT
GGAGGAGGTGGCAACCCCAATAGAGGGCAGCACGTTCGACAAAAGTATCAACACCCTAAA
GGGGCACCGGATGCAGATCTTATATTTGGACCTAAGAAATTCAAATAA
Protein sequence:
MTSNEKEADKSEDASGIDDVSTENTDENDKLALTEYIANLEARIKHKTDLRNQNLNCVRP
PDNHFSKLDSGLKKNTTFVKKLKSFSATQLDALSKDLSTLNLTKYISEVASAIAEAKLKM
SDISAAITLCSTLHQTYSEFSSFFFENWQKILSFKATDKITNSSKLRVDLRFYAELIAVG
IFTNKTGLPLLGIVLTVLINMDKEEHNNIPILLSFCKHCGEDYAGLVPKKIKDTAKKFNV
TVPRNTFIPAEKQTAVRSLLKDYFLSLTKHLLAERAQLQALHAANQRTLHTRGELSQERK
DQLEQHQATYDKLLVGAQNFAEVLGEDLGEAGEPLTLSMTVIETQGTVTIGGNDEIIMQA
GTDPWQDEDTRTFYTSLPDLKVFMPNYQLKEAVKNKTETVTEEMLDEDLKEDELSDNEEP
APVVADVEQEEAQPANVSNKYALDAFLNELPNCINRELIDNAAVDFVLNLNTKNNRKKLT
RVLFSVARTRLDLLPFYSRFASILYPVLPDVCVDLCQMLKQDFKYHVRKKDQINIESKIK
VVRFIGELVKFGLYSKMEALYCLKVLLHDFKHHHIEMACNLLETCGRYLYCNPDTHQRTM
IYLQQMMRKKTVSALDSRYVTQIENAFYYVCPPEAPAQPKEEEPPMHQFIRKILHEDLQK
SNEEKILRLMRKLNWDDPEVAAVAIQHLAGGWRVRASARRALARLTAELAAWQENVAPAV
VDTILEEIRVTMEDPHPKYNQRRIASVRYLGELYNYKLLDSRDVFTVLYSFITFGVSNDH
SNVSPLDPSDNVFRIRLVCALLETCGAYFNSGSSKKRLDYFLVFFQNYYWFKYSDPYWTE
ENKFPIYVKYIYQECLSSLRPKLTLFTSWQQCKDAIEEIRQTLYPDLGEDEHFDNDDQGE
DSVADGLDTIIETDDETDNPHMPEESSDEDPITESAGNDENDVQTEDLPIEPRRPAVKPV
EDVEFESAFEKMVMENIAERQRENRPQQRDIAVPMTCRQTTKKTYEQLLQGKEGVEFVLM
VRKGMKPQYKSFNAPPELASNLQQQALADKQEMERVKRLTLNISERQEEEEYSAESGGGS
GGGGNPNRGQHVRQKYQHPKGAPDADLIFGPKKFK