DPGLEAN13537 in OGS1.0

New model in OGS2.0DPOGS205954 
Genomic Positionscaffold20:+ 267440-271770
See gene structure
CDS Length1956
Paired RNAseq reads  3440
Single RNAseq reads  8109
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002838 (4e-64)
Best Drosophila hit  Protein-L-isoaspartate (D-aspartate) O-methyltransferase (4e-07)
Best Human hitprotein-L-isoaspartate O-methyltransferase domain-containing protein 2 isoform 1 (1e-74)
Best NR hit (blastp)  PREDICTED: protein-L-isoaspartate O-methyltransferase domain-containing protein 1-like [Saccoglossus kowalevskii] (6e-86)
Best NR hit (blastx)  PREDICTED: protein-L-isoaspartate O-methyltransferase domain-containing protein 1-like [Saccoglossus kowalevskii] (2e-77)
GeneOntology terms

  
GO:0006464 protein modification process
GO:0004719 protein-L-isoaspartate (D-aspartate) O-methyltransferase activity
GO:0005737 cytoplasm
InterPro families  IPR000682 Protein-L-isoaspartate(D-aspartate) O-methyltransferase
Orthology groupMCL24200

Nucleotide sequence:

ATGGGTGGTGCAGTTAGCTCCGGTCGAGATAACAATGAACTTATAGATAATTTGATGAGC
GGTAACTATATCCGCACCAGGCAAATAGAGATGGTGTTTCGAGCACTGGACAGGGCCGAC
TACATGACACCAGAGGCTCGGGATCAAGCCTACAAGGATTTGGCTTGGAAAAATGGGCCC
TTGCATCTATCTTCGCCTTGTATTTATAGTGAGGTGATGGAGGGGTTAGAGCTGAGACCG
GGCTTGTCCTTCTTGAACATTGGTTCGGGGACAGGCTACCTCAGTAGCTTGGTGGGCTTG
ATCCTCGGCACTTCGGGAATCAGTCACGGTGTGGAGGTCCATCCCGCTGTTGTGGAGTAC
GCCACCAAGAAGATTGGACAGTTCATTGAAAATTCACCAACCTTGGATGAGTTTGATTTC
TGTGAGCCCAAGTACTACCATGGGAACGGTCTATGTTTGAGCCCACCCGCTGTTGGATAT
GATCGTGTGTACTGCGGTGCCGCCTGTCCTGCTCAGTACGAGATGTACTTTAAACAACTA
ATAAAGGTGGGGGGGCTGCTGGTGATGCCGCTCAACGACACCCTAGTCCAGGTGCGACGA
CTGGGTGAGAACGAGTGGGTGTCGCGATGTTTGCTCAACGTGTCCTTCGCCACACTGAGG
GTACCCACCGCCGAGGAAGCCACCCAGCTCGTCAAACTAGACGAGTTGCGGCCCGTGCGC
CTCCAGCTACTGTCCCGCGCGGTGATCCGGTCCGCGATGCGTGGCGGAGTGCTGCGGCGA
CACCCCGAGCTCCGTCTGTCACCGCGCCCTCCTCCGCCCTCCGCCTGTCCACGACGCATC
TGCATTCCAATCGAGCCCGGCGGCTCCGTCGAGGGCCTGAACGTGCTCCACGACCTGGAC
AACGAGAGCGGAGCTAACGAAATGAACGCGCTGCTGAGTCTCGTGATCAGTATGGGACAG
AACAGGGTGGCCGGGGCGCTGCGGTTCGACCGCGTCGACTCCGGGACCGATGACGACGAA
CACGACGAGGACGATCAGGATGGAGACGAGCCCGAGACGAACCATGACGAGAACTCGGAA
GAGTCTGCAGAGGGGGCTCCAGGGGAGAACGCGGCCGACGAGTCGAAGAACGTCGACTTA
CCCACCTCGGACTCCGACGACAAACCAAATATGAACGGAGACACGAGCGACCTCGACGAC
TCCCCGCCGAGACCTCATCGCAGGCCCACCAAGAGATCGGATAGTAGGCAGAGAGAGAGC
CACAGTCACAGTCTGCGTCGCCGGCCGAGGCAGGGCTCGGACTCCAAGGACGATTCCCCG
CCCGACGACGCCATGCGAGATTTTTTGCGAAAGAGACGAGACGCTAAACCCTCGACCTCC
ACTTCACAGGGGTTAGGCTCCATGGCCGACGTGCTGAAGAATACCAAGGAAATGACCAAG
AAGGAGACCGTCCTCGACGACCACCAGAAAACGGACACCGTGACCGGACCCAGGCGCGAC
CTGCCCGGCGTGCCCTCGGTGGTGGAGATCGACCTGGGGCAGACGTCGGACATGGAGTGG
GACTCGGAGAACGACGAGCTGGACGCCGCCGGGGACCGGCTGGGGAGGCCCGAGCACAGC
GACGACGACACCCTCAAGAACAAGCCCAAGAGACAGAAGCTGGACAGCGGCATCGGAGAC
ACGCCCAGCAGCTCCTCGCCCGACAAGACCAAGAGCGACGACTCGGAGCGCTCCGACAGC
CCCGTCGCCGGAGGGAGCAGTGCCGTGGACAGCCGCAGCTGGTCTCCGGGTCCCGAGGAG
CGCGCCCACACGGAGCGCCGCAGACAGCGCGAGGCTCGCGGAGGAGACGCGCGCCGAGTG
CGCCTCTCGCTGCTCATGAAGCAGGCCGTGCGCGAGCTGCCGCTGCCGCACGCGCTCAAG
AAGTACGTGAACCTGGGACGGTGCTTCGAATTCTAA

Protein sequence:

MGGAVSSGRDNNELIDNLMSGNYIRTRQIEMVFRALDRADYMTPEARDQAYKDLAWKNGP
LHLSSPCIYSEVMEGLELRPGLSFLNIGSGTGYLSSLVGLILGTSGISHGVEVHPAVVEY
ATKKIGQFIENSPTLDEFDFCEPKYYHGNGLCLSPPAVGYDRVYCGAACPAQYEMYFKQL
IKVGGLLVMPLNDTLVQVRRLGENEWVSRCLLNVSFATLRVPTAEEATQLVKLDELRPVR
LQLLSRAVIRSAMRGGVLRRHPELRLSPRPPPPSACPRRICIPIEPGGSVEGLNVLHDLD
NESGANEMNALLSLVISMGQNRVAGALRFDRVDSGTDDDEHDEDDQDGDEPETNHDENSE
ESAEGAPGENAADESKNVDLPTSDSDDKPNMNGDTSDLDDSPPRPHRRPTKRSDSRQRES
HSHSLRRRPRQGSDSKDDSPPDDAMRDFLRKRRDAKPSTSTSQGLGSMADVLKNTKEMTK
KETVLDDHQKTDTVTGPRRDLPGVPSVVEIDLGQTSDMEWDSENDELDAAGDRLGRPEHS
DDDTLKNKPKRQKLDSGIGDTPSSSSPDKTKSDDSERSDSPVAGGSSAVDSRSWSPGPEE
RAHTERRRQREARGGDARRVRLSLLMKQAVRELPLPHALKKYVNLGRCFEF