DPGLEAN14892 in OGS1.0

New model in OGS2.0  DPOGS216071
Genomic Position  scaffold2314:+ 7093-10597
CDS Length  1533
Paired RNAseq reads  642
Single RNAseq reads  1535
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA008870 (9e-137)
Best Drosophila hit  Ate1, isoform C (1e-86)
Best Human hit  arginyl-tRNA--protein transferase 1 isoform 1 (2e-112)
Best NR hit (blastp)  PREDICTED: arginyltransferase 1 isoform 1 [Oryctolagus cuniculus] (2e-119)
Best NR hit (blastx)  PREDICTED: arginyltransferase 1 isoform 1 [Oryctolagus cuniculus] (5e-121)
Gene Ontology terms
GO:0004057 arginyltransferase activity
GO:0008415 acyltransferase activity
GO:0016598 protein arginylation
GO:0016740 transferase activity
InterPro families
IPR016181 Acyl-CoA N-acyltransferase
IPR007472 Arginine-tRNA-protein transferase, C-terminal
IPR007471 Arginine-tRNA-protein transferase, N-terminal
IPR017137 Arginine-tRNA-protein transferase 1, eukaryotic
Orthology group  MCL13370

Nucleotide sequence:

ATGAATCATAGTTTCATTAAATATTATTCTGAGCATGAGGGCTACAAATGTGGGTACTGT
AAACGTCCAGATACAAATTACAGTCACGTTATGTGGGCGCATGCAATGACGGTTACTGAC
TACCAGGATTTAATAGATAGAGGTTGGAGGAGATCTGGAAAACAATGCTACAAGCCAACA
TTAGAAGTCATCTGCTGTCCCATGTATACCATACGATGTAGGGCATTAGAATTTAAGGCT
AGTAAATCTCAAAAGAAAGTTTTGAAGAGTTTCAATAAGTTTTTAATCGGAGAAGAAATA
AGTGATATAAGTGCACAAGAAAGTCGAGAAGATGTTGCAATGGAACAAGTTGAAGGGCAG
GAGCAGTTTCTAGAATCTAAAAGGCCACATGAAGATGTAAACATTGCCGGAATGGATATT
CCATTTATAGAAGAAGCAGATGACTCAAGAAAACTTGAACTATCTGAAATAGAGACAAAA
GATGACAGTCATATGCAGAAATTTGATTCTCATCAAGATTTACAGCAAGCGAGTTCATCA
ACAGCCAGTAGTTCATGCCTGTTGGGAAACACATCAAAAGAAAAATCTAACAAAGTGACC
GGTGCAGATCCCACAAAAGCTCCATGCAAAAAAGCGAAACAGGCAAGGCGAGAGAGAATG
TTAGAAAAGCTCCAAAGGAAGGGTATCAATGTTACCACTTTAGATAATACCGGCAAAAAT
ACACCAAAAACCATTGAAGACATTATTAATGAACTACCAGATAATGTCAAGAGTAAACTT
GAGATAAAATTGGTGAGAACAGAACCACCGAGTCCAGAGTGGCTGGCTACGAAATCAGAA
AGTCATGAAGTTTATGTGAAATACCAAACTATTGTTCATGGAGATAAACCTGAGAAATGT
ACTGAACCCAAGTTCCATGATTTTTTGGTCCACAGTCCATTACTGGAAGAATATTCCGAA
GTGGGTCCCCCATGTGGATATGGTTCATTCCACCAACAGTATTGGCTGGACGGAAAGATT
ATAGCCGTTGGCGTTATAGACATACTGCCAAAATGTATATCGTCCGTATACTTTTTTTAT
GATCCCCAATATTTATGCCTGAGTTTAGGAACTTATGGAGCTTTAAGAGAAATAGCATTC
ACAAGACAGTTACAAAAGATTTGTCCTAATCTGAAATATTACAACATGGGATTCTACATA
CATACTTGTACTAAGATGAGATACAAGGGAAAGTTCCACCCATCGGACCTATTGTGCCCT
GAGACTTTCAAGTGGTTTCCCATCAAGGAATGTATAGCAAAGTTGGAAATATCAAAGTAT
TCAAGATTTGATCCTGATCTAGATGGTGTGGATGAAAATTATCCCACAGATAATGACGTG
AACAATATAAAAGTTTTATCAAACGGGCAAGTGCAAATTTACAAAGTTTTCAAACGGAAA
GCAGGAAGGAAGTACAATGAAGAATTTGAAGTTCTTGAATATGCGAGGCTGGTTGGAGGC
AAGACCGCTAGAAGTATTATAATGGTCATGTAG

Protein sequence:

MNHSFIKYYSEHEGYKCGYCKRPDTNYSHVMWAHAMTVTDYQDLIDRGWRRSGKQCYKPT
LEVICCPMYTIRCRALEFKASKSQKKVLKSFNKFLIGEEISDISAQESREDVAMEQVEGQ
EQFLESKRPHEDVNIAGMDIPFIEEADDSRKLELSEIETKDDSHMQKFDSHQDLQQASSS
TASSSCLLGNTSKEKSNKVTGADPTKAPCKKAKQARRERMLEKLQRKGINVTTLDNTGKN
TPKTIEDIINELPDNVKSKLEIKLVRTEPPSPEWLATKSESHEVYVKYQTIVHGDKPEKC
TEPKFHDFLVHSPLLEEYSEVGPPCGYGSFHQQYWLDGKIIAVGVIDILPKCISSVYFFY
DPQYLCLSLGTYGALREIAFTRQLQKICPNLKYYNMGFYIHTCTKMRYKGKFHPSDLLCP
ETFKWFPIKECIAKLEISKYSRFDPDLDGVDENYPTDNDVNNIKVLSNGQVQIYKVFKRK
AGRKYNEEFEVLEYARLVGGKTARSIIMVM
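
Consistency check (illustrative sketch):

The nucleotide and protein sequences above should agree with each other and with the reported CDS length. The Python sketch below shows one way to check this, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The helper names `translate` and `expected_protein_length` are hypothetical and are not part of the database or of any library.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence
    block from the record can be pasted in directly.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose length includes the terminal stop codon."""
    return cds_length // 3 - 1


# First six codons of the record's CDS
print(translate("ATGAATCATAGTTTCATT"))   # MNHSFI
# Residues encoded by a 1533 nt CDS that ends in a stop codon
print(expected_protein_length(1533))     # 510
```

Pasting the full nucleotide block into `translate` and comparing the result with the protein block is a quick way to catch copy or truncation errors. The reported CDS length of 1533 corresponds to 510 residues plus a stop codon, which matches the length of the protein sequence listed above.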