New model in OGS2.0 | DPOGS209718  |
---|---|
Genomic Position | scaffold2326:+ 36257-43792 |
See gene structure | |
CDS Length | 3084 |
Paired RNAseq reads   | 2181 |
Single RNAseq reads   | 5331 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008929 (7e-54) |
Best Drosophila hit   | CG4644 (9e-112) |
Best Human hit | DNA-directed RNA polymerase, mitochondrial precursor (5e-58) |
Best NR hit (blastp)   | PREDICTED: similar to CG4644 CG4644-PA [Acyrthosiphon pisum] (4e-150) |
Best NR hit (blastx)   | PREDICTED: similar to CG4644 CG4644-PA [Acyrthosiphon pisum] (8e-129) |
GeneOntology terms    | GO:0003899 DNA-directed RNA polymerase activity GO:0003677 DNA binding GO:0006350 transcription GO:0016740 transferase activity GO:0016779 nucleotidyltransferase activity |
InterPro families    | IPR002885 Pentatricopeptide repeat IPR002092 DNA-directed RNA polymerase, bacteriophage type |
Orthology group | MCL12219 |
Nucleotide sequence:
ATGCATCGACTACTATCGGCAAAAAGTTTATGCCAAAATAATTTACACAGTGTGACAGTG
TCTGCATCAAATTCTTTAAAAAATATATCTTTACCCAAAATAAAATGTTCTTTTTGTCAG
AAAATTCTTCTTACAACACCAACTGCAGAAAACCTGTTGTCGGCAAGGCATCAGTCCACG
AGAACGGTTAATGCTCTGAAATCCTTGAAAAAGAAAAGCAAACATAAAAGCTACAAAAAG
TACGGTGAACTCCTGCAAGTAAGTGAGACTAGCATGACGGAAATGCAGGTTTCAATAAAT
AAGTTAAATGCAGCACATCTATCAAAGCTTGCATCAAGTCCTGTGTCACTTGGACAACTT
CATCAATTAACAACAAATCCTTCGAAAAAGTTGAAGGACATAGAGGTTGATAAAGAGCTT
CTTCGAACAGTCAGAAATAAAATTTTGAAGAAGACAAGTCCCGATATTAAAATAGAAAAT
GATATGTGTGCTCTTATATTTACAAATCACAAAATATCAACAACTGAAGACAGGTTGAAA
ACAAAAAAGATCATGGATATGGTAAAAGAGAGTTACTATAATTTTAGAAAGGCAACAATT
TATGATCAAAAACTACAAGACCTCAGGTATGCCATCAATGAAGGTTTGACACAGGAATTT
AATTTTGAAACTGACTCTGAACTGAAAAACTTAGATAACTCTCTGCCAAAAAGTCCACAT
GCTGTTTTTAAAGAATTGTTTACCGACAAGCAAATAGACCGCTATGACCAAGAACTTATT
AGTCATATAACACAGTTTCAAATGCAAGGTTTAGCACATCCAGCCATAGATGATATAGAT
GATCCATCTTTTCACACTGACCTCGGTGAGCTAAGAGATGAATCTATTTTCGATGGTGAC
ACTGAACAAGTTAAGAGCCTAAAGATTAAGAAAGCTCAGCAAGCAATAAAACAAAAGAAG
AAGAAGATGCGTGAGAAACGTCGGCAGGCTCAAGAAGCGAGTATGAAGCAGGACATGAAA
GAATTGGAGCTCCAGGTTAAAGAAGATGCTTTGCAGAGATTATTAACATCACACTTGCAA
CTACTGTGTTCACTGGATATGATTTCAGAGGGCCGTCAAGTCTTACAGTATTATAGGAAA
AGGAATGCTAAATCACCGGAGCTACCAAAACTCAGAAGTGTTAAAATTTACAACACATTG
CTTAATGGATATGCTTCTGTGGGTAACATAGAAAATACAAAAGAACTATTATCATTTATG
TCAGAAGACCAGATAGAACCAAACGCGCAGACCTACGCAGCTGTGTTTGAATGTGTGGAG
AGAAGTCATCTGGCAGACAAGTCAGCAATATTAAATAATTTTCACAAAGAAATAAAAGAC
AAGGGCATGACGTTAAACGATTTGTTGGATCAAAGTCAATTCTTGTATGACCAGCGTGAG
GTGGTTCTCAGAGCGGTGAGGAGGCTGCAGCCGGGTTTTGAACCACACTACACACCTCCG
ATACTGGACCACGAGTGTCCGCTACTTGAAGATTTGAAACTAGATAAAGTTAACAATAGA
TCGGGGCTGTTCACCTCGCCGGCTAAAGGATTGATGACCTTGGAACAGCTGAGAGAGAAG
GGCAGGGAACAATTAGACATGGAAATTAATGGAGAGGTCGAAGTACACAACATTTCTTTA
AAAGACGAAGCTTCCAAGGAGGTTCTGTTATATCGCGAGAAGCTATCTTCGTCGGAGGCC
GAGTGGCGTAGCTCCCTCCGTGAGGCTCTAATCCGCCATCTTGCCACTCTGCGGGCTCGC
AGCGGCGCCTCCCACGCCCCCGTTACCCTCTATCCGTATCTGAAGGTTTTAGAAGTTGAC
GAGTTTGTAGAATTAATGATGAATGAAATCATCAAACTAGTCGACGGAAGTGAATCTTAC
AGTCCCACCTTGAAGTTACTGCAGAGAGATCTGGGGACACAGGTCTATCAAAAATACCAG
ATAGAACAGTATCGCCGGAACGGTGTTCTGAAGAAGATCGAGCAAGTGTATGACAAATAT
TGCAAGTGGTACCTGGAGAGGCATTCCTTGGACGGCACAGACACTCCATACAACAGTAGA
CAGGCTTGGCAGTTGTTGGTACATCAGAACAGAGATGGCGCTAGTTTGGATGTCGAGGCA
TCCCCGTGGTCGATGGAAATGAGACAAAGTATAGGAAAGTTTCTATATAACATTATCATT
AATGATGTCAAGGTTGATGTGAACATGTTCAAGCCTAACGCCCAAGTTAAGAAGTTGCCA
GCAGTGTACAAGGTCCACCGTCCGTGGGGTCGCTTGGTCCGTCTCGAGCTGAAGCCTCAC
CCGACCCTATCCCGCCTCTGGTCCGCGGCGGCCCGGCCCCGCCTCCGCCTCCGCTCGTCG
CTCGTGCCCGCCCGCTCGCCGCCCGCTCCTTGGCACAGCGCCACCGCCTCAGGAGCTTGT
CTCCTCACTACTACATCACTTATACGGATGCCGTTCTATGTGATGGGTCTAACGAAAAGA
TTGGAAGAGGCTCCGCCGGCGACCATGTACCCAGTACTTGACGGATTGAACCAGCTGGGA
GACGTGCCCTGGGTTATTAACCAGAGAATACTTGATTTACAACTCAAAGTCTTCAGATCG
GGCGGCGACAAAAAGCTGGATATACCGCCTCCTGCGTCCTCGTTGGACGCATCTCAGTGG
AAAATGGAAGGAAAGACTGGCGGGGAAGCGTTAAGGAGACGAGTGGTCATCAACAGGGCT
AAGGCAGACATGCACTCCCTGTGGTGCGACGCGCTTTACAAACTGTCACTGGCCAATCAC
TACAGGAACGTAACATTCTGGTTGCCTCACAACATGGACTTCCGCGGTCGTGTGTACGCG
GTGGGTCCGCACGTGTCGGCGCTGGGTCCGGACGCAGCGCGCGCCCTCCTGCGGCTGGCA
GGCGTCCGGCCGCTCGGAGCGCGCGGACTCGACTGGCTCAAGATACACGCTGTCAACCTC
ACCGGCACCAAGAAGAGGAGCACCGTGGAAGAAAGGTACAGATACTTATTTAATATGGCA
GTCATACTAACAACTTTGTTTTAG
Protein sequence:
MHRLLSAKSLCQNNLHSVTVSASNSLKNISLPKIKCSFCQKILLTTPTAENLLSARHQST
RTVNALKSLKKKSKHKSYKKYGELLQVSETSMTEMQVSINKLNAAHLSKLASSPVSLGQL
HQLTTNPSKKLKDIEVDKELLRTVRNKILKKTSPDIKIENDMCALIFTNHKISTTEDRLK
TKKIMDMVKESYYNFRKATIYDQKLQDLRYAINEGLTQEFNFETDSELKNLDNSLPKSPH
AVFKELFTDKQIDRYDQELISHITQFQMQGLAHPAIDDIDDPSFHTDLGELRDESIFDGD
TEQVKSLKIKKAQQAIKQKKKKMREKRRQAQEASMKQDMKELELQVKEDALQRLLTSHLQ
LLCSLDMISEGRQVLQYYRKRNAKSPELPKLRSVKIYNTLLNGYASVGNIENTKELLSFM
SEDQIEPNAQTYAAVFECVERSHLADKSAILNNFHKEIKDKGMTLNDLLDQSQFLYDQRE
VVLRAVRRLQPGFEPHYTPPILDHECPLLEDLKLDKVNNRSGLFTSPAKGLMTLEQLREK
GREQLDMEINGEVEVHNISLKDEASKEVLLYREKLSSSEAEWRSSLREALIRHLATLRAR
SGASHAPVTLYPYLKVLEVDEFVELMMNEIIKLVDGSESYSPTLKLLQRDLGTQVYQKYQ
IEQYRRNGVLKKIEQVYDKYCKWYLERHSLDGTDTPYNSRQAWQLLVHQNRDGASLDVEA
SPWSMEMRQSIGKFLYNIIINDVKVDVNMFKPNAQVKKLPAVYKVHRPWGRLVRLELKPH
PTLSRLWSAAARPRLRLRSSLVPARSPPAPWHSATASGACLLTTTSLIRMPFYVMGLTKR
LEEAPPATMYPVLDGLNQLGDVPWVINQRILDLQLKVFRSGGDKKLDIPPPASSLDASQW
KMEGKTGGEALRRRVVINRAKADMHSLWCDALYKLSLANHYRNVTFWLPHNMDFRGRVYA
VGPHVSALGPDAARALLRLAGVRPLGARGLDWLKIHAVNLTGTKKRSTVEERYRYLFNMA
VILTTLF