DPGLEAN08022 in OGS1.0

New model in OGS2.0: DPOGS212027
Genomic position: scaffold221:- 143395-154672
CDS length: 1686
Paired RNAseq reads: 813
Single RNAseq reads: 2796
Migratory profiles: Query via corresponding ESTs
Best Bombyx hit: BGIBMGA010182 (5e-96)
Best Drosophila hit: CG16833, isoform H (2e-40)
Best Human hit: tubulin polyglutamylase TTLL4 (3e-36)
Best NR hit (blastp): hypothetical protein AaeL_AAEL001898 [Aedes aegypti] (2e-52)
Best NR hit (blastx): hypothetical protein AaeL_AAEL001898 [Aedes aegypti] (1e-50)
Gene Ontology terms:
  GO:0004835 tubulin-tyrosine ligase activity
  GO:0006464 protein modification process
InterPro families: IPR004344 Tubulin-tyrosine ligase
Orthology group: ND

Nucleotide sequence:

ATGGCACACGAAAGTAACTTACGCCATGGTTACCAGCCAGGAAGAACGAAATATCAAAGA
ACTAGTCCCATTCGTCCATTGTTAGTCACCACATTATCAAACAATAAGCTAGATGACAGC
CGTCTACCTAAAGTTGAGGAATTGCTTGAGAAAGTGGACCAGGAAATGATAGAGGTTCCT
ATGACCAAACAGCCTTTAAAGTCAATACTCAATAATCCTTTGCCTAAGACACAGCCGCAA
AGTATATTAAAGAAACCTAAAAATAGATCATATCAGGAATCCAAATCTGCATTAGAAGGA
CGCAGGAGACAGAGAGCATCCAGTGAATCTGATAATTTTTATTGTAGTTTTCATCTCGGT
TCATCCTTACCGGGATATGATGGTCACCTTTCATATAGATCTAGTAACATGAAGCGGGCT
AAAGATAATTCTGGTACTGGATATTCATATGGAACTGCTATAGCCGCTCTCGGTAATACC
CCTAGGGCAAAATCTCCAACCAACTATCCTAAGAGCAGATGCAATTGTCAAAACCAAGTC
ATAGTTACACCTAAAGATGAAGCATACAGAGTACATTTGTTAAGGACGTCCGCAATAAAC
AGTGAAACTGTCAATGATGTCCCTGTTGTTGATATGCCTCGTCAAGAGACGCATACTCCT
ACGAGACCGGCTCACAGCCCCGCGCTGCCAGTGAGTTCTGTGACACCAACCACCGTCACC
AAGAAGAAGAAAAAAAAGAAAACTAAGACGAAGACCACGATCAGCACTCAAGAGGAAGCT
GATGATTCTCACAGCCGCTCGCCATCCCCGGATCCGACAGTGGACACCTGGAAAATGGAA
ACCGAAACGGAAGTAGACAAGACGGACAAACAGCTGCATAATGACGCAGACAAAAACCCC
GTGGTGAAGCTGGCGACTAAACTCAACGGCACCCTCGTGTCTCCCAAGAAGGCTGCGTCT
AAAGAGAAAGTTATTAACAAGGATATGTTCTTGACCACTTTACATACGACAAATCCCTTC
AAAAATATTCCCACTCGGAAAGAATCACCCCTCCACCTCCCCGAGTCGATGTCTAACTGT
CTCCGGCCGTCTCTGTTCCCGCGAGTGCCTCCTTACCTGAAGTTTATAAGTCACGATGAC
ACCGCACCACTCAAAATGCCGATGGCCATACAGAAACACCTCAAGTGGAAGTTGACCACA
ATCACACCGATAGTTGTCAAGAAGACGTTAACCAACTCGGGGTTCAGATTGGTCAAGAGC
GAGTGCGACACATCCGAGTGCCCCCAAGAAGAAACCGTGGATTGGATCGGTATATGGGGA
AAACACATGAAATCTATCATGTTCCGCGCCATCAAAGACGGGCAGAAGATGAACCACTTC
CCAGGAACCTTCCAGATAGGACGCAAGGACCGGTTGTGGAGGAACCTACAGAAGTTGGCG
TCTAGACACGGGGTCAGCGAGTTCGGCATCATGCCCAAAACATACGTCCTGCCTCACGAT
CTGAAAATACTGAAACACGACTGGGAGAAGTACGCCGCCAACAACGAGAAGTGGATCATA
AAGCCGGTAACAGATTTGTTTATGCTTAGTGTTGACTCGTTTCCTCATCAGACTCACGGC
GTCCTGTACGTCACTATATCGGAAATCATATTCATCACAAAATTCAGTCACAACTGTCGT
AACTAA

Protein sequence:

MAHESNLRHGYQPGRTKYQRTSPIRPLLVTTLSNNKLDDSRLPKVEELLEKVDQEMIEVP
MTKQPLKSILNNPLPKTQPQSILKKPKNRSYQESKSALEGRRRQRASSESDNFYCSFHLG
SSLPGYDGHLSYRSSNMKRAKDNSGTGYSYGTAIAALGNTPRAKSPTNYPKSRCNCQNQV
IVTPKDEAYRVHLLRTSAINSETVNDVPVVDMPRQETHTPTRPAHSPALPVSSVTPTTVT
KKKKKKKTKTKTTISTQEEADDSHSRSPSPDPTVDTWKMETETEVDKTDKQLHNDADKNP
VVKLATKLNGTLVSPKKAASKEKVINKDMFLTTLHTTNPFKNIPTRKESPLHLPESMSNC
LRPSLFPRVPPYLKFISHDDTAPLKMPMAIQKHLKWKLTTITPIVVKKTLTNSGFRLVKS
ECDTSECPQEETVDWIGIWGKHMKSIMFRAIKDGQKMNHFPGTFQIGRKDRLWRNLQKLA
SRHGVSEFGIMPKTYVLPHDLKILKHDWEKYAANNEKWIIKPVTDLFMLSVDSFPHQTHG
VLYVTISEIIFITKFSHNCRN
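
Consistency check (illustrative sketch, not part of the original record):

The nucleotide and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The Python sketch below assumes the standard code (NCBI translation table 1) applies to this gene model. The names CODON_TABLE and translate are hypothetical helpers introduced here for illustration. Only the first 30 and last 18 bases of the CDS are used as input, copied from the nucleotide sequence above. A 1686-base CDS that ends in a stop codon should encode 1686 / 3 - 1 = 561 residues, which matches the length of the protein sequence listed.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order. Assumed to apply to this gene model.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 18 bases of the CDS, copied from the record above.
first_bases = "ATGGCACACGAAAGTAACTTACGCCATGGT"
last_bases = "CACAACTGTCGTAACTAA"

print(translate(first_bases))  # MAHESNLRHG (start of the protein)
print(translate(last_bases))   # HNCRN (end of the protein, before TAA)
print(1686 // 3 - 1)           # 561 residues expected from the CDS length
```

Passing the full nucleotide sequence to translate should reproduce the complete protein sequence if the record is internally consistent.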