DPGLEAN17708 in OGS1.0

New model in OGS2.0  DPOGS208836
Genomic Position  scaffold209:+ 94243-102022
CDS Length  3435
Paired RNAseq reads  4681
Single RNAseq reads  11130
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA007943 (0.0)
Best Drosophila hit  ND
Best Human hit  protein SMG7 isoform 1 (1e-46)
Best NR hit (blastp)  PREDICTED: hypothetical protein [Nasonia vitripennis] (3e-105)
Best NR hit (blastx)  PREDICTED: hypothetical protein [Nasonia vitripennis] (3e-80)
Gene Ontology terms
GO:0016740 transferase activity
GO:0009790 embryo development
InterPro families  IPR019458 Telomerase activating protein Est1
Orthology group  MCL17645

Nucleotide sequence:

ATGGTTTTAAATGCCGCTGTGCAGTTATTAAGAGAGGCGGAGGAATTGAAACAGAAGATT
TTAAAATTCAATAGTTGCATTTCTATGCTTCAGGATAGAAGTTTATGGGTAACACAGCAA
CAGTTACAGAAGGTGTACCAGAAAGTTTTGGTACTGGATCTTGATTATGCTCTAGAGAAG
AAAGTGGAACAAGATCTATGGAATGTTGGGTTCAAGCAGCAGATTGAGGCTTTGCAGGCC
ATTTCCAAAGATAGAAAGAGTGTCCTCAGAAGTGAAGCTCAAGGTATGCTGTCATGGGTG
CTGCAGGCTGCCGCTGGGTTCTACCTATGCCTCTTGCATCAAATCTGCACAACATTTAAA
CTAGATCTACCATTTAGACGTAGGGCGTCCCTTCTTGGCTCGGTTGAAGGGTGGGAGGCC
GGTGGGTGTCCGGAACCCGTTCGAGCCGGTGCTGGAGCTGCCCGGTATGCGTGCCAGCAC
TGTCTCGTACACCTAGGGGACCTCGCCCGTTACAGACACCAGCTGAAAGTCGCACACACC
TTTTACAGGCATGCCCTAGCGGTGTCTGTGCATTCAGGGCAGCCATACAATCAGTTGGCG
CTGGTCGCTTGGCGTCGTGGCCGCCGTCTGGCCGCCCTCTACTGGCACGTCCGGTCGCTG
CTGGTCCGAGCGCCCTTCCCTCCCGCCCCCGCGAACCTCACCCGGACCCTGGCGGCCGCG
GGAGACACTGTGCAAAAATGTTTCAGTCGTGACGTCAAGGAGACGCCGCTGCCCGTGCTG
CCGGGGCTGTCCGGGGTGGAGGGGCACGCCACCAGCGCACCTTCAAAGGCAACTGTAACT
GAAAAACTCGACTCGCACTCCTATGTAAATGAACTAGTACGAGCACTCCACTACCTGCAC
AGCTTGGAACATCTCGACACGGCCGAAGAGCTGGTTGGGAAGCTGAACTCGTCCCTGACA
CACCTCGTGGCCACTGACAGCTTCGATTCCATGACTCTGGTTAAGATGGCGTGCGTCACA
ATCTGGCTGGTTCACTCCAGTACGGAGGACCTTTCGGTGGAGCCGTCGTCCATGAGCGAG
TCGGAAGGTCGGGCCGCAGTGCTGGCGTGCTCGCTGGCCGCTCACAGCGTGCTGGCACTG
CTGCTCGCAGCACACACTGGGGACACGCCCAACAAGGGCTTGCCGGCATTGCGTGTGTGG
CTTCAGTGGTCGTGGTGTCGGCCCGCGGCGCTCCGGTCGCATGCTTGGGGCTCCAGACCT
CACATGTGGGCAGCGCTCGCACACGCACTCAACAACATGGGAGACGCCCTCGAAGACCCC
GCCTATGAGACCCTCCCTCTGCCGGAGGATGAAGAGTTACACGGCTTCTTACCGCTGGAG
GAGGCTTTGAAGGGACTCAAGTTTCCAAACCACTGCGGCTGGGACTCCAACAAACTGCCT
CAAGAGGAACCCGAAGAAGACACGGCGTCCAGTGTATCAGCGTCGTGGGGGTCGTCGTAC
CTGGCGCTGGTCAGCGACACCGAGCTGCAGGCCCGCGTGAGGACGGCGCGGCTAAGACGA
CTCGGGGAGAAACTGGCGGAGCAGCACCCGGGACTACTCACCTGCGATACTGACGAAGAC
GGGGTGATGACATTTTCCACGAGCGAGTCTAGTAAGGAGCAGCTGTCCTTGGTGTTGGCG
ACCCTGACCCCGCCCTCCGCCCCGCCCACAGAGCCTAAGACCCCACCTCCGGCACCCCCA
CCACCACCACTCATCATATCGGAGGCTGACTTTCGAGAGAAAGTACGAGAAAAACGCGCT
GGCATTCTCAAGCCGCAGGGGTCGCTGGAGCGCGCGAGGGAGGAGAGACGAGCCGCGCCC
GCCGCTGAGGATCAGGACGGCGAGGAGTGTGAGGAGGGAAGTAAGAACGAGGACAAGAAG
GAGGCTCGCAAACCACGAGTCAACATCGCCATGGCGGCCATCATGAGGAAACAGGAGGAG
AGCAACAAACAGGTTAAATTTGTAACTCCACCCCCCACGCCGGAGACCACAGATGAAGCG
AGCGAGAGTTCGTCGAAGGATGAGAAAACCAAAGTCATTCAACCGAAGGCCATTAAATCA
TTAGCAAATTTACCGGTGGGAAGAAAAACGGGGGGAATTCTCTCGTTGAAAGATAAGTCG
GCCGGATATCCGCACCTCCAGAATACGGAAACGGAAACGAAGAAACCGGAACAGGAGGAG
ATGAACGAAGAGAAGTCTGCCCAAAACAGTTCCGTCTCGCAGAGCTACCATCAACGCGAT
CAAGGTACCAACTGGCCGACGATGCCGGCGCCCTACGGTGACAATAATAAAATGAACTTC
CAAAAGAATTACGGAATACAAAACAGCGGCATAAGTTACAACCCCAACTACCAACCTCCC
CCCAACACTCAGGGGATACGACTACCTGTTGTCAACCCCAAGGAGATCGACGTCAGGACG
GCGGCGCTTCAGAAACAGAACTCTCGCCAGGAAATATTCCAGGAGGCCAACAAATTCAAT
CACGGATACCAAATATCGGGGGACAAAAAGAATTTCCTCAACGACCTGCCGCCGAGATTC
GCGAATCAGTACCGCTACTGGCAGAGTCCGCAGGAAAACCAGTTCAACGACAACAAGTTC
AGGGACGACAGCAACAAACTCACCGCGCCCTTCACGGCACAACCTCCGAGACAAAATTGG
CCGAACCAGAGTGAAAACTTCCAACAGGGGATTCCCTGGTGGAAACCTGATACCCGCACC
AATTTCAATCAGCCTAACTTCTCCACCACACCCATGAATGTACCAAACTTCTATTCTCAA
ATGTCCGGGAACGTGCCTAATATTTATCCAAATTTACAATACAGCCAGATGCAAGGACAG
AATATTGGCCAAAACATGCCAACCGTGGGCCAGAATTTGGCAAATATAGGACGAAATAAA
CAGGATAGTTTGGCGCCGTCGTTCGGTCAGTCGGCCGTCGGTCAGCCGCAGCTCCAGACC
CTGGCGAGCATGGTGTCGTCTCCCGGCTACGGCTCGGCCTTGAACAGCTTCACCCCCTAC
CCGGCGGCCGTCAGCTACGACTCTTCCTTGTATCCTCAGTTCAACAAACTCGGCTACCAG
CCCCTGCAGCTGAACAAACAGAACTTCCAAGGGAAGGAGTCGGAGCCCGGAGTCAGCTTC
GGCAGCAACGTGCTGGACGTACAGCATATGAATTATAACGAACCGTTCGTCGCCGACGGA
GCCAACGACGCCTCGGAAGACGCGGCGGGCGCTCAGTCGGAGGCCGGCGTCTCCAACACA
TACTCGCTGTTCCGACAAGACGCGCACGCCTGGCCGCCCTCCACACATCAGTCGCTGTGG
TCGGGGCCGGGCGGGTCTCCGCTGGAGCGTCTCCTCGAGCAACAGAAGCAGATGAAGCCG
CCGTCGACGCACTGA

Protein sequence:

MVLNAAVQLLREAEELKQKILKFNSCISMLQDRSLWVTQQQLQKVYQKVLVLDLDYALEK
KVEQDLWNVGFKQQIEALQAISKDRKSVLRSEAQGMLSWVLQAAAGFYLCLLHQICTTFK
LDLPFRRRASLLGSVEGWEAGGCPEPVRAGAGAARYACQHCLVHLGDLARYRHQLKVAHT
FYRHALAVSVHSGQPYNQLALVAWRRGRRLAALYWHVRSLLVRAPFPPAPANLTRTLAAA
GDTVQKCFSRDVKETPLPVLPGLSGVEGHATSAPSKATVTEKLDSHSYVNELVRALHYLH
SLEHLDTAEELVGKLNSSLTHLVATDSFDSMTLVKMACVTIWLVHSSTEDLSVEPSSMSE
SEGRAAVLACSLAAHSVLALLLAAHTGDTPNKGLPALRVWLQWSWCRPAALRSHAWGSRP
HMWAALAHALNNMGDALEDPAYETLPLPEDEELHGFLPLEEALKGLKFPNHCGWDSNKLP
QEEPEEDTASSVSASWGSSYLALVSDTELQARVRTARLRRLGEKLAEQHPGLLTCDTDED
GVMTFSTSESSKEQLSLVLATLTPPSAPPTEPKTPPPAPPPPPLIISEADFREKVREKRA
GILKPQGSLERAREERRAAPAAEDQDGEECEEGSKNEDKKEARKPRVNIAMAAIMRKQEE
SNKQVKFVTPPPTPETTDEASESSSKDEKTKVIQPKAIKSLANLPVGRKTGGILSLKDKS
AGYPHLQNTETETKKPEQEEMNEEKSAQNSSVSQSYHQRDQGTNWPTMPAPYGDNNKMNF
QKNYGIQNSGISYNPNYQPPPNTQGIRLPVVNPKEIDVRTAALQKQNSRQEIFQEANKFN
HGYQISGDKKNFLNDLPPRFANQYRYWQSPQENQFNDNKFRDDSNKLTAPFTAQPPRQNW
PNQSENFQQGIPWWKPDTRTNFNQPNFSTTPMNVPNFYSQMSGNVPNIYPNLQYSQMQGQ
NIGQNMPTVGQNLANIGRNKQDSLAPSFGQSAVGQPQLQTLASMVSSPGYGSALNSFTPY
PAAVSYDSSLYPQFNKLGYQPLQLNKQNFQGKESEPGVSFGSNVLDVQHMNYNEPFVADG
ANDASEDAAGAQSEAGVSNTYSLFRQDAHAWPPSTHQSLWSGPGGSPLERLLEQQKQMKP
PSTH
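
The CDS length and the two sequences above can be cross-checked with simple arithmetic: a CDS of 3435 nucleotides that includes its stop codon holds 1145 codons, so it should encode 1144 amino acids. The following is a minimal sketch of that check, not part of the database record. The function names are hypothetical, and it assumes the standard stop codons (TAA, TAG, TGA) and that the reported CDS length counts the stop codon.

```python
# Illustrative consistency check for a CDS record (helper names are hypothetical).

STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard genetic code


def expected_protein_length(cds_length: int) -> int:
    """Return the number of amino acids encoded by a CDS that includes its stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length // 3 - 1


def has_start_and_stop(cds: str) -> bool:
    """Check that a CDS begins with ATG and ends with a stop codon."""
    cds = cds.upper()
    return cds.startswith("ATG") and cds[-3:] in STOP_CODONS


if __name__ == "__main__":
    print(expected_protein_length(3435))
    # First and last codons of the nucleotide sequence above, joined for brevity.
    print(has_start_and_stop("ATGGTTTTAAATGCC" + "CCGTCGACGCACTGA"))
```

Passing the full nucleotide sequence (with line breaks removed) to `has_start_and_stop` would apply the same check to the complete CDS.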