DPGLEAN16350 in OGS1.0

New model in OGS2.0DPOGS214351 
Genomic Positionscaffold29:+ 209793-214853
See gene structure
CDS Length1869
Paired RNAseq reads  1582
Single RNAseq reads  3686
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003971 (0.0)
Best Drosophila hit  hillarin, isoform A (0.0)
Best Human hitkyphoscoliosis peptidase (2e-32)
Best NR hit (blastp)  PREDICTED: similar to AGAP005020-PA isoform 2 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC009044 [Tribolium castaneum] (0.0)
GeneOntology terms  GO:0008270 zinc ion binding
InterPro families
  
IPR013998 Nebulin
IPR002931 Transglutaminase-like
Orthology groupMCL13879

Nucleotide sequence:

ATGAAACACAGGCAGGAAGAAGATGATTTGTATAGAAAGTTTTCTAAACACAGAGAGGAA
GAAAATCGTAGAATACGAGAAGAAATACAGGACGAGTGGGAGAGGGAGTTAGAAAGATTA
ACAAACCGATTCCAACAAGAGATGCAAGTGAAGAAACGAAGACCAGAATCTGAGATAGGA
GCCCTGACACTTCGACATCAACAGGAGAGAGCTGATCTGGAGAAGAATATGACTCTCCGG
AGAGACAAGAAGAAGGAGAGCTTGACTAGAAAGATGTTAGAACACGAGAGGGCTGCTACT
GCAGCACTGGTTGAAAAGCAAAGTCACGAGATGATGGAACTGATCCAGGAGCGTAGATCT
GAATACATGGCAGCATCCTCCATATTCCTGGACGGAGAAGAAGCACCCCCTTATCCTTCT
CGTGCTCCGCCCCCTTTGCCACCGCTTGTATCCAAATTCCACATATACACAGATCCTGCG
GAATTCGCGGATGTTGATAAGATTGCTATTTCCGTAGCGCAAGAGGATCAAAAAACTTTT
ACCGATTTGGTCCGACAACTCGTGGGTAGATGTGCGAGTGATGTCGAGAAAGCAAGAACC
ATTTTCCGCTGGATAACTGTGAAGAACCTCAACAACATACAGTTTGACGAGAACCTCCGA
GGGGATTCCCCCCTGGGATTACTTAGAGGCATCAAGCACGGCACCGAGAGTTATCACGTC
CTGTTTAAGAGACTGTGCAGTTATGCTGGTCTCCACTGCGTGGTAATCAAGGGGTACAGT
AAATCAGCTGGCTACCAGCCTGGAGTACGTTTCGAAGACAATCGCTTCCGCAACTCTTGG
AACGCGGTGTACGTGGCCGGGGCCTGGCGCTTTGTGCAATGCAACTGGGGGGCGAGACAC
CTTGTTAACGCTAAAGATGCTCCCAAGCCAGGAAACAGAGGAAAGAGCGACAGCTTGAGA
TATGAATACGACGATCACTATTTCCTGACGGATCCTCGCGAGTTCATCTACGAGTTCTAC
CCGCTTCAGCCTGACTGGCAGCTGTTGAAGACGCCCATCACTCTACACGATTTCGAGGAA
CTTCCCTTCGTGAGGTCGCTGTTCTTTAGATACGGACTCTACTTCAGCGATCCCAACACC
AAAGCTGTTATGTACACCGACTCTACTGGTGCGGCGACTATGCGTATAGCCATGCCGGCA
CACATGCAGAGCTCGTTGATCTTCCACTATAACCTTAAGTTCTACGACACGGAGGGCGAC
GGTTTTGACGGGGTCAGCCTTAAGCGGTTCGTCATGCAGTCTGTGGTTGGTAATGTTGTT
TCGTTCCGTGTACACGCGCCCTGTTCCGGGGCCTTTCTCCTGGACATTTTCGCGAACGCC
GTCACACCCAGGGAATACCTCACCGGCGAGCCCATGAAATTCAAAAGCGTTTGCAAATTT
AAGATTTGCTGCGCCGAACTACAAACAGTAATGGTGCCGCTACCAGATTGTGCTAGCGGT
GAGTGGGGGCCGACTAAAGCGACCAGACTCTTCGGCCTCGTCCCCATCACGCACCAGGAA
GCACTTGTATTCGCCGGCAGAGAACTAGAGATTCAGTTCCGAATGTCGCGCCCTCTAGCG
GACTTTATGGCGACTTTACACAAAAATGGCATCGATGAGAAACGGCTGTCCAAATACGTG
CAACAAAACGTCTCGGACGATATCGTCAGCTTTTACATAACATTCCCAGAGGAAGGTCAA
TACGGTTTGGACATATACACTCGCGAGCGCGGGGGACCCACGGCCATACACAACGGCTCC
AGCGAGAAGGAGAAACACCTACTTACACACTGCTGCAAATATCTCATCAACAGCAGTAAA
CGGAACTAA

Protein sequence:

MKHRQEEDDLYRKFSKHREEENRRIREEIQDEWERELERLTNRFQQEMQVKKRRPESEIG
ALTLRHQQERADLEKNMTLRRDKKKESLTRKMLEHERAATAALVEKQSHEMMELIQERRS
EYMAASSIFLDGEEAPPYPSRAPPPLPPLVSKFHIYTDPAEFADVDKIAISVAQEDQKTF
TDLVRQLVGRCASDVEKARTIFRWITVKNLNNIQFDENLRGDSPLGLLRGIKHGTESYHV
LFKRLCSYAGLHCVVIKGYSKSAGYQPGVRFEDNRFRNSWNAVYVAGAWRFVQCNWGARH
LVNAKDAPKPGNRGKSDSLRYEYDDHYFLTDPREFIYEFYPLQPDWQLLKTPITLHDFEE
LPFVRSLFFRYGLYFSDPNTKAVMYTDSTGAATMRIAMPAHMQSSLIFHYNLKFYDTEGD
GFDGVSLKRFVMQSVVGNVVSFRVHAPCSGAFLLDIFANAVTPREYLTGEPMKFKSVCKF
KICCAELQTVMVPLPDCASGEWGPTKATRLFGLVPITHQEALVFAGRELEIQFRMSRPLA
DFMATLHKNGIDEKRLSKYVQQNVSDDIVSFYITFPEEGQYGLDIYTRERGGPTAIHNGS
SEKEKHLLTHCCKYLINSSKRN