New model in OGS2.0 | DPOGS208836  |
---|---|
Genomic Position | scaffold209:+ 94243-102022 |
See gene structure | |
CDS Length | 3435 |
Paired RNAseq reads   | 4681 |
Single RNAseq reads   | 11130 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA007943 (0.0) |
Best Drosophila hit   | ND |
Best Human hit | protein SMG7 isoform 1 (1e-46) |
Best NR hit (blastp)   | PREDICTED: hypothetical protein [Nasonia vitripennis] (3e-105) |
Best NR hit (blastx)   | PREDICTED: hypothetical protein [Nasonia vitripennis] (3e-80) |
GeneOntology terms    | GO:0016740 transferase activity GO:0009790 embryo development |
InterPro families   | IPR019458 Telomerase activating protein Est1 |
Orthology group | MCL17645 |
Nucleotide sequence:
ATGGTTTTAAATGCCGCTGTGCAGTTATTAAGAGAGGCGGAGGAATTGAAACAGAAGATT
TTAAAATTCAATAGTTGCATTTCTATGCTTCAGGATAGAAGTTTATGGGTAACACAGCAA
CAGTTACAGAAGGTGTACCAGAAAGTTTTGGTACTGGATCTTGATTATGCTCTAGAGAAG
AAAGTGGAACAAGATCTATGGAATGTTGGGTTCAAGCAGCAGATTGAGGCTTTGCAGGCC
ATTTCCAAAGATAGAAAGAGTGTCCTCAGAAGTGAAGCTCAAGGTATGCTGTCATGGGTG
CTGCAGGCTGCCGCTGGGTTCTACCTATGCCTCTTGCATCAAATCTGCACAACATTTAAA
CTAGATCTACCATTTAGACGTAGGGCGTCCCTTCTTGGCTCGGTTGAAGGGTGGGAGGCC
GGTGGGTGTCCGGAACCCGTTCGAGCCGGTGCTGGAGCTGCCCGGTATGCGTGCCAGCAC
TGTCTCGTACACCTAGGGGACCTCGCCCGTTACAGACACCAGCTGAAAGTCGCACACACC
TTTTACAGGCATGCCCTAGCGGTGTCTGTGCATTCAGGGCAGCCATACAATCAGTTGGCG
CTGGTCGCTTGGCGTCGTGGCCGCCGTCTGGCCGCCCTCTACTGGCACGTCCGGTCGCTG
CTGGTCCGAGCGCCCTTCCCTCCCGCCCCCGCGAACCTCACCCGGACCCTGGCGGCCGCG
GGAGACACTGTGCAAAAATGTTTCAGTCGTGACGTCAAGGAGACGCCGCTGCCCGTGCTG
CCGGGGCTGTCCGGGGTGGAGGGGCACGCCACCAGCGCACCTTCAAAGGCAACTGTAACT
GAAAAACTCGACTCGCACTCCTATGTAAATGAACTAGTACGAGCACTCCACTACCTGCAC
AGCTTGGAACATCTCGACACGGCCGAAGAGCTGGTTGGGAAGCTGAACTCGTCCCTGACA
CACCTCGTGGCCACTGACAGCTTCGATTCCATGACTCTGGTTAAGATGGCGTGCGTCACA
ATCTGGCTGGTTCACTCCAGTACGGAGGACCTTTCGGTGGAGCCGTCGTCCATGAGCGAG
TCGGAAGGTCGGGCCGCAGTGCTGGCGTGCTCGCTGGCCGCTCACAGCGTGCTGGCACTG
CTGCTCGCAGCACACACTGGGGACACGCCCAACAAGGGCTTGCCGGCATTGCGTGTGTGG
CTTCAGTGGTCGTGGTGTCGGCCCGCGGCGCTCCGGTCGCATGCTTGGGGCTCCAGACCT
CACATGTGGGCAGCGCTCGCACACGCACTCAACAACATGGGAGACGCCCTCGAAGACCCC
GCCTATGAGACCCTCCCTCTGCCGGAGGATGAAGAGTTACACGGCTTCTTACCGCTGGAG
GAGGCTTTGAAGGGACTCAAGTTTCCAAACCACTGCGGCTGGGACTCCAACAAACTGCCT
CAAGAGGAACCCGAAGAAGACACGGCGTCCAGTGTATCAGCGTCGTGGGGGTCGTCGTAC
CTGGCGCTGGTCAGCGACACCGAGCTGCAGGCCCGCGTGAGGACGGCGCGGCTAAGACGA
CTCGGGGAGAAACTGGCGGAGCAGCACCCGGGACTACTCACCTGCGATACTGACGAAGAC
GGGGTGATGACATTTTCCACGAGCGAGTCTAGTAAGGAGCAGCTGTCCTTGGTGTTGGCG
ACCCTGACCCCGCCCTCCGCCCCGCCCACAGAGCCTAAGACCCCACCTCCGGCACCCCCA
CCACCACCACTCATCATATCGGAGGCTGACTTTCGAGAGAAAGTACGAGAAAAACGCGCT
GGCATTCTCAAGCCGCAGGGGTCGCTGGAGCGCGCGAGGGAGGAGAGACGAGCCGCGCCC
GCCGCTGAGGATCAGGACGGCGAGGAGTGTGAGGAGGGAAGTAAGAACGAGGACAAGAAG
GAGGCTCGCAAACCACGAGTCAACATCGCCATGGCGGCCATCATGAGGAAACAGGAGGAG
AGCAACAAACAGGTTAAATTTGTAACTCCACCCCCCACGCCGGAGACCACAGATGAAGCG
AGCGAGAGTTCGTCGAAGGATGAGAAAACCAAAGTCATTCAACCGAAGGCCATTAAATCA
TTAGCAAATTTACCGGTGGGAAGAAAAACGGGGGGAATTCTCTCGTTGAAAGATAAGTCG
GCCGGATATCCGCACCTCCAGAATACGGAAACGGAAACGAAGAAACCGGAACAGGAGGAG
ATGAACGAAGAGAAGTCTGCCCAAAACAGTTCCGTCTCGCAGAGCTACCATCAACGCGAT
CAAGGTACCAACTGGCCGACGATGCCGGCGCCCTACGGTGACAATAATAAAATGAACTTC
CAAAAGAATTACGGAATACAAAACAGCGGCATAAGTTACAACCCCAACTACCAACCTCCC
CCCAACACTCAGGGGATACGACTACCTGTTGTCAACCCCAAGGAGATCGACGTCAGGACG
GCGGCGCTTCAGAAACAGAACTCTCGCCAGGAAATATTCCAGGAGGCCAACAAATTCAAT
CACGGATACCAAATATCGGGGGACAAAAAGAATTTCCTCAACGACCTGCCGCCGAGATTC
GCGAATCAGTACCGCTACTGGCAGAGTCCGCAGGAAAACCAGTTCAACGACAACAAGTTC
AGGGACGACAGCAACAAACTCACCGCGCCCTTCACGGCACAACCTCCGAGACAAAATTGG
CCGAACCAGAGTGAAAACTTCCAACAGGGGATTCCCTGGTGGAAACCTGATACCCGCACC
AATTTCAATCAGCCTAACTTCTCCACCACACCCATGAATGTACCAAACTTCTATTCTCAA
ATGTCCGGGAACGTGCCTAATATTTATCCAAATTTACAATACAGCCAGATGCAAGGACAG
AATATTGGCCAAAACATGCCAACCGTGGGCCAGAATTTGGCAAATATAGGACGAAATAAA
CAGGATAGTTTGGCGCCGTCGTTCGGTCAGTCGGCCGTCGGTCAGCCGCAGCTCCAGACC
CTGGCGAGCATGGTGTCGTCTCCCGGCTACGGCTCGGCCTTGAACAGCTTCACCCCCTAC
CCGGCGGCCGTCAGCTACGACTCTTCCTTGTATCCTCAGTTCAACAAACTCGGCTACCAG
CCCCTGCAGCTGAACAAACAGAACTTCCAAGGGAAGGAGTCGGAGCCCGGAGTCAGCTTC
GGCAGCAACGTGCTGGACGTACAGCATATGAATTATAACGAACCGTTCGTCGCCGACGGA
GCCAACGACGCCTCGGAAGACGCGGCGGGCGCTCAGTCGGAGGCCGGCGTCTCCAACACA
TACTCGCTGTTCCGACAAGACGCGCACGCCTGGCCGCCCTCCACACATCAGTCGCTGTGG
TCGGGGCCGGGCGGGTCTCCGCTGGAGCGTCTCCTCGAGCAACAGAAGCAGATGAAGCCG
CCGTCGACGCACTGA
Protein sequence:
MVLNAAVQLLREAEELKQKILKFNSCISMLQDRSLWVTQQQLQKVYQKVLVLDLDYALEK
KVEQDLWNVGFKQQIEALQAISKDRKSVLRSEAQGMLSWVLQAAAGFYLCLLHQICTTFK
LDLPFRRRASLLGSVEGWEAGGCPEPVRAGAGAARYACQHCLVHLGDLARYRHQLKVAHT
FYRHALAVSVHSGQPYNQLALVAWRRGRRLAALYWHVRSLLVRAPFPPAPANLTRTLAAA
GDTVQKCFSRDVKETPLPVLPGLSGVEGHATSAPSKATVTEKLDSHSYVNELVRALHYLH
SLEHLDTAEELVGKLNSSLTHLVATDSFDSMTLVKMACVTIWLVHSSTEDLSVEPSSMSE
SEGRAAVLACSLAAHSVLALLLAAHTGDTPNKGLPALRVWLQWSWCRPAALRSHAWGSRP
HMWAALAHALNNMGDALEDPAYETLPLPEDEELHGFLPLEEALKGLKFPNHCGWDSNKLP
QEEPEEDTASSVSASWGSSYLALVSDTELQARVRTARLRRLGEKLAEQHPGLLTCDTDED
GVMTFSTSESSKEQLSLVLATLTPPSAPPTEPKTPPPAPPPPPLIISEADFREKVREKRA
GILKPQGSLERAREERRAAPAAEDQDGEECEEGSKNEDKKEARKPRVNIAMAAIMRKQEE
SNKQVKFVTPPPTPETTDEASESSSKDEKTKVIQPKAIKSLANLPVGRKTGGILSLKDKS
AGYPHLQNTETETKKPEQEEMNEEKSAQNSSVSQSYHQRDQGTNWPTMPAPYGDNNKMNF
QKNYGIQNSGISYNPNYQPPPNTQGIRLPVVNPKEIDVRTAALQKQNSRQEIFQEANKFN
HGYQISGDKKNFLNDLPPRFANQYRYWQSPQENQFNDNKFRDDSNKLTAPFTAQPPRQNW
PNQSENFQQGIPWWKPDTRTNFNQPNFSTTPMNVPNFYSQMSGNVPNIYPNLQYSQMQGQ
NIGQNMPTVGQNLANIGRNKQDSLAPSFGQSAVGQPQLQTLASMVSSPGYGSALNSFTPY
PAAVSYDSSLYPQFNKLGYQPLQLNKQNFQGKESEPGVSFGSNVLDVQHMNYNEPFVADG
ANDASEDAAGAQSEAGVSNTYSLFRQDAHAWPPSTHQSLWSGPGGSPLERLLEQQKQMKP
PSTH