Genomic Position | scaffold1061:+ 37997-43278 |
---|---|
See gene structure | |
CDS Length | 5190 |
Paired RNAseq reads   | 51 |
Single RNAseq reads   | 147 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013961 (2e-17) |
Best Drosophila hit   | ND |
Best Human hit | ND |
Best NR hit (blastp)   | reverse transcriptase [Papilio xuthus] (0.0) |
Best NR hit (blastx)   | reverse transcriptase [Papilio xuthus] (0.0) |
GeneOntology terms    | GO:0006278 RNA-dependent DNA replication GO:0003964 RNA-directed DNA polymerase activity GO:0003723 RNA binding |
InterPro families    | IPR013084 Zinc finger, CCHC retroviral-type IPR001878 Zinc finger, CCHC-type IPR000477 Reverse transcriptase IPR005135 Endonuclease/exonuclease/phosphatase |
Orthology group | MCL10007 |
Nucleotide sequence:
ATGAAAGCGAGTTGTAAAAACGATTTACCCCAGGAGGGTACCAGCCGCGCCGCGGCTGGA
GAATCCCTCACCGGTTCGCCGGTCTGCCCCTCGTATTCTGGGGGGGGCAACAACCGCTTC
GGGAGTCCTTCTTCTAAGGACTCTCTGGGACGAGGTGCAGTGGGCACGGGACCTGGCTTG
GGAAAAGGACACCGATCAGCAAAAGATTTGGACAAGGTGATGGTGGTAAGCTCTGACGAG
GAGCCTGTAGACGCGTCGGCGGTGCGACCAGTGGCGCGCCAGCCACTAGCGTCCAACGGT
GAGAAGGGAAGAGCAACCAGAAGGAGCCCGCGTACAACGAGCGGAAGTGAGATGGAGACG
GAGGAGACGCGCTCCGCCTTCTTCGCCGGTACGCCGACGAGCCTGGCGCCTCTCCGGAAG
CGGCCAGCGACAAGAAGACAACCAGGCGGGAGTTCGTCTGGCGGAAGTGACAAGGCTTCC
TTCGCTACTGCGGTGAAGAGAGGACGGGCAGTTGAGGAAGGCGAATCGAACTCGGAGGAG
GAAAACGTGGCGAGGTCGACGCGTCGGGTCGAGGTGGCCCTTTCCTCGGTTAAGACGCTG
CCAGCCTCGTGCCTCGCGAAAGAGATGGAGAGGGCCCTGAGCGTCATAGTCGACGTGGCC
CTCAAATCCAAGAATCTGAAGGGCGGATGCGTCAGGGCATTGAAGACGTCGGCGGCACTC
CTGGGGGAGGCAAAAGAGATTCTCCTGCAGCGGACCAGCGGCGAGGAGAATGAGATTCTC
CGAGCCCGGCTAGAGGAAGAAAGGAAGAAGAGCTCGCTGCTGGAGAAGGAGCTGGGGCTC
CTGAGAGAGGGGCAGGCCCGCTTGCGGGCAGACATGGACCTGCTCGCCACTGCCCCGAAA
CCAGCACGAGACGAGAAGAGCGAGGAGGAGCTTCGCGGGTCCCTCATGAGGGACCTAGGT
GCCATGATGGACGCGAAGCTCCAGGGGATCGCAGACCGGCTGCTCCCCGAAAAGCGCCTG
AGGCCGCCCCTAGCGGCGGACAAGAGGCCACCCCCAGCGCCTGCGTCGGCTGCTGTGGCT
GAGCCGGCAGGTAGAGTGGCGAGCAGGAAAAAGAACGGTGCCACAAGAGAACAGGAGAAG
ACGGCGAGACCTCTACCCCCGCCGCCTCCATCCATGGACAAGACATGGACGGAGGTGGTA
AAGAAGAAGGGCAAGAAGAAGACCGGCCCGCCAGTAGCCCCACCCCAAGAGGGGCCGAAG
GCGAAGAAGCCGGCCAAGGAGAAGAAGAAGGCAAAGAAGAAGAAGAAGAGGAGCCTCAAG
AGCGTGAAGAATCCTGCGGTGGTAATTACGCTGCCGCGGGATTCAGAAAGGACGTACGAG
GACGTCCTCAAAACGGCGCGGGCGGGACTCAACCTGGCAGAGCTAGGACTGCCGATGGGT
CTCGTCTGCAAGAAGACGGTGACGGGAGCCCGCATGTTCGAGCTGCCGGAGAGCGCGGAG
GAGGCGAAAGCCGACCTCCTGGCGCAAAAACTCCGGGAGCTCGTCCCTGAGGTGAAAGTC
GCCAGACCGGTGAAAAGCGGGGAACTGCGAATCACCGGTTTGGACGACTCCGTCACTAAG
GAGGACGTCCTCGCTGCGGTCGCTCGGCAGGGCAATTGCTCTGTCGAGCAAATACGGGTA
GGGCCGATCCGCTCCCGCGGGTCCTTCAGCACCGGGGCCGCTTGGGTGAGGTGCCCAGTG
GAGGTTGCCAAGTGCCTGGCCGAAGCCGGCCGCCTTTTGGTCGGCTGGAGCTCGGCACGG
GTGACACCGCTGGAGCCACGCCCTATGCGGTGCTACCGGTGCCTGGAGGTGGGACATGCG
GGCCTCAGATGTCCGTCCAAAACCGACCGCAGCCGTCTGTGCTTCCGTTGCGGCGGTGAA
GGACACATCGCTGAGGCCTGCACGTCGGACGCGAAGTGTCCGATCTGTGCGGAGGCGGGA
GTCGCCTCCAACCACATGGTGGGGGGCAAACAGTGCCACCCACCGAAGCGTAGGAAGGGT
AAGGGACCCGGAAAATCCTCGGTCCCCCCGCCCTCTAAATCTGCCGCGCTGGAGTCCATG
GTGGAGTGGTCCATCGATGTGGCAGTTCTCTGCGAGCCGTATTACGTGCCCGATCGCTCG
AATTGGGTATGTGATGCGGAGCAGTCTGTGGCGGTAGTCATCTCACCGGGTGCGCGCTCC
TCTTCTCCTCTCTCTGTCATCAGAAGGGGAGAAGGGTATGTCGTGGCTCAGTGGGGCGAC
TTCATACTTGTGGGCGTCTACCTCGCTCCGAGGAAGCCCGTCGTGGAGATCGAAAGACTC
CTCGACGAGGTCGGGGCTGAGGTCAGACGTGCGTCGCCGAGACAAGCAATTGTCCTCGGT
GACTTTAATGCCCACAGTACGGTGTGGCAATCGCCCGCTACGGACCCTCGCGGGGAGGTG
GTGGAGGAGTGGGCTGCAGCGGCTGGCCTATCCCTCCTCAACCGGGGCAGTGCAGTCACG
TGCGTACGACCGCAGGGCGAATCCATAGTGGACCTTTCGTTCGCTGCTCCTGCGGTTGCA
ACGCGTGTACGTGACTGGCGAGTCCTGGATGAGGTTGAGACGCTGTCGGACCACTTCTTC
ATCAGGTTCGAGTTGGCCCCACAGGATGGCTCTTCGTTCCGCCGCCGACCCGTCGGGAAC
GCTTCGGCGTTCCCACGGTGGTCCCTGACCGAGCTCGACAGGGACATGGCGGTGGAGGCG
TCGATCGTCCAAGCTTGGACGATGCGAGTGGAGCAGCCCGTTTCGGTGGATGGGGAGGCG
CGCAAATTTCGCCACAGCCTGTGGCGGATTTGCGATGCTGCGATGCCTCGCGTCGCGCAA
CGCTCCCCCAGACGCCAGGTGTACTGGTGGACGTCAGAGATCGCGCAGCTGCGTGCGGCT
TGCGCGGTGGCGAGACGCCAGTATATCCGACAACGGAGGCGGCACCCTCGTAACGAAGCC
GTCGAATGTCGGTTTCGTGACGCCTTCAACGGAGCGAAGAGCGGGCTTCAGCTCGCAATC
TGCAGGTCGAAGGAGTCGGCTCGCGAGGAGTTGCTTGCGCGTCTGGACAATGACCCGTGG
GGGCGTCCGTATCTCGGTGCCCGGAATAAGATCCGGGCCCAGATGGCCCCGGTCACGGAG
AGTCTGGAGCCGGAGCTTCTGCGAAGCGTCGTGAGTGCCTTATTCCCTGCGGAGACGGCG
CACTCGATGCTAGCGGAGGCTGGCACTTCACGCGGGCCGGACCGGGTGGAATTTATTCCG
CCCGTCTCTCTGGAGGAACTCGAGGAGTCTCTGCGGCCTCTCAAAGCCAAGAAAACCGCC
CCTGGGCCGGACGGCGTCCCCGGACGTGTCCTGGCCCTGGCTCTGGGCGAGTTGGCCGAG
TGGTTCGTGGAGATCCTCAACGAGTGTTTGAGATCGGGTCGCTTTCCATCGTGTTGGAAA
GAAGGGCGACTCGTTCTCCTCCAGAAAGAAGGTCGACCTGCAGACTCGCCATCTGCATAC
CGTCCCATAGTGCTGCTAGACGACGCGGGGAAGCTCTTCGAGCGTATCCTAGCTACCCGT
GTCGTCGCACACTTAAGCAGCAACGGACCCGACCTGGCCGAGTGCCAGTATGGTTTTCGG
GGTGGCCGGTCTACGATCGATGCCATCTCGAAGCTGAGGGAACTCGCCGATGATGCCGTT
TCGGGGGGTGGGGTGATGTTGGCGGTGTCCCTGGACATCTCCAACGCATTCAACACCCTT
CCGTTCGGTGTCATTGAAGAGGCCCTCAGATACCATGGTCTGCCACTTTACATCCGGCAG
ACCATCGGGTCCTATCTCCGCGAACGAGAGATCTCGTTCGTGGGGAGAGACGGTAGGGTC
CATCGCCACGAGGTGCGCTGCGGGGTTCCGCAGGGGTCAGTCCTCGGGCCGCTCCTGTGG
AACTTGGGGTACGACTTCGTACTTCGCGGCGCCCTCCAGACCGGGCTGAGCGTCGTCTGC
TGCGGACGACACGCTCTCGTCGGGGCCCAAGGCGTGGACCTGGCAGAGGCGACGTTGCGA
GCTGAAGCGGGAGCCGCCTCGATAGTACGGCGCATCGAGATGCTCGGGCTGAGGGTGGGT
CTCGACAAAACCGAGGCCCTCCTGTTCCACGGCCCTCGAGCACGACCACCGCCGGGCGCC
AGCATCAACATCTGCGGCGTCCGCGTCGAGCTCAGTCCCCGGATGAAATATCTGGGGCTG
ACTCTGGACGGTAGGTGGAACTTCCGGGAGCACTTTCGCGGTCTCGTCCCGAAATTACTC
GGGACGGCGAACGCTCTCGGAAGGCTTCTGCCTAATCTTGGTGGTCCAAGCGCGACATGC
CGGCGCTTGTACACCGGCGTGTTGCGTTCGATGGCGCTGTACGGAGCACCAGTGTGGGCC
GGTGCCCTCTCGAGGCCGAACGTGGCGTCGCTGCACAGAGCGCAGCGCGTCATGGCTTTG
AGAGTGGTGCGTGGATACCGCACGGTCTCCCACGAGGCGGCTTGCGTGCTCGCTGGGACG
CCTCCCTGGGACTTGGTCGCCCAAGTTCTGGCGGAGGTGCACCAGTGGCGCGCACGAGCT
CGATCCCAGGGTGTGAATCCGCCCTGGGATCCGGTGGATGGTTGGCGACGCTCCGCACAC
GAGGAGCTACTCCGTCGATGGAGGCGGCGGCTCTCCGAGCCAGTGGCCGGGTTGCGTACC
GTGGAGGCGATTCGACCGCTCCTTAAGGAGTGGGTGGATCGCCGACACGGTTCCTTGACC
TTCCGGCTGGTGCAGATCCTCTCGGGGCACGGTAGCTTCGGGCGGTATCTGTGCTACATA
GTCGGAAGAGAGCCGACGGCGGCGTGTCACCATTGCAGTTGCGTGGATGATACGCCCGAC
CATACATTAGCGGAGTGCCCTTCGTGGGAAGCGGAGCGTCGCCAACTAATCACCGAAGTT
GGCGCGGACCTTTCGCTGCCGGCCGTAGTCAAGGCTATGGTAGGTAGTGAGAGGGCCTGG
GCGGCGGTGGTCTCTTTCTGTGAGGTTGTCATCTCGCGGAAGGAGACTGCCGAAAGGGGG
AGGGAAGATGATCCCTCCTCGGCGCCGATGCGCCGACGTAGGCTGGGTCGTAGGCAGCGG
GCTTATGCCCGCCGAATGCCTCCCCAGTGA
Protein sequence:
MKASCKNDLPQEGTSRAAAGESLTGSPVCPSYSGGGNNRFGSPSSKDSLGRGAVGTGPGL
GKGHRSAKDLDKVMVVSSDEEPVDASAVRPVARQPLASNGEKGRATRRSPRTTSGSEMET
EETRSAFFAGTPTSLAPLRKRPATRRQPGGSSSGGSDKASFATAVKRGRAVEEGESNSEE
ENVARSTRRVEVALSSVKTLPASCLAKEMERALSVIVDVALKSKNLKGGCVRALKTSAAL
LGEAKEILLQRTSGEENEILRARLEEERKKSSLLEKELGLLREGQARLRADMDLLATAPK
PARDEKSEEELRGSLMRDLGAMMDAKLQGIADRLLPEKRLRPPLAADKRPPPAPASAAVA
EPAGRVASRKKNGATREQEKTARPLPPPPPSMDKTWTEVVKKKGKKKTGPPVAPPQEGPK
AKKPAKEKKKAKKKKKRSLKSVKNPAVVITLPRDSERTYEDVLKTARAGLNLAELGLPMG
LVCKKTVTGARMFELPESAEEAKADLLAQKLRELVPEVKVARPVKSGELRITGLDDSVTK
EDVLAAVARQGNCSVEQIRVGPIRSRGSFSTGAAWVRCPVEVAKCLAEAGRLLVGWSSAR
VTPLEPRPMRCYRCLEVGHAGLRCPSKTDRSRLCFRCGGEGHIAEACTSDAKCPICAEAG
VASNHMVGGKQCHPPKRRKGKGPGKSSVPPPSKSAALESMVEWSIDVAVLCEPYYVPDRS
NWVCDAEQSVAVVISPGARSSSPLSVIRRGEGYVVAQWGDFILVGVYLAPRKPVVEIERL
LDEVGAEVRRASPRQAIVLGDFNAHSTVWQSPATDPRGEVVEEWAAAAGLSLLNRGSAVT
CVRPQGESIVDLSFAAPAVATRVRDWRVLDEVETLSDHFFIRFELAPQDGSSFRRRPVGN
ASAFPRWSLTELDRDMAVEASIVQAWTMRVEQPVSVDGEARKFRHSLWRICDAAMPRVAQ
RSPRRQVYWWTSEIAQLRAACAVARRQYIRQRRRHPRNEAVECRFRDAFNGAKSGLQLAI
CRSKESAREELLARLDNDPWGRPYLGARNKIRAQMAPVTESLEPELLRSVVSALFPAETA
HSMLAEAGTSRGPDRVEFIPPVSLEELEESLRPLKAKKTAPGPDGVPGRVLALALGELAE
WFVEILNECLRSGRFPSCWKEGRLVLLQKEGRPADSPSAYRPIVLLDDAGKLFERILATR
VVAHLSSNGPDLAECQYGFRGGRSTIDAISKLRELADDAVSGGGVMLAVSLDISNAFNTL
PFGVIEEALRYHGLPLYIRQTIGSYLREREISFVGRDGRVHRHEVRCGVPQGSVLGPLLW
NLGYDFVLRGALQTGLSVVCCGRHALVGAQGVDLAEATLRAEAGAASIVRRIEMLGLRVG
LDKTEALLFHGPRARPPPGASINICGVRVELSPRMKYLGLTLDGRWNFREHFRGLVPKLL
GTANALGRLLPNLGGPSATCRRLYTGVLRSMALYGAPVWAGALSRPNVASLHRAQRVMAL
RVVRGYRTVSHEAACVLAGTPPWDLVAQVLAEVHQWRARARSQGVNPPWDPVDGWRRSAH
EELLRRWRRRLSEPVAGLRTVEAIRPLLKEWVDRRHGSLTFRLVQILSGHGSFGRYLCYI
VGREPTAACHHCSCVDDTPDHTLAECPSWEAERRQLITEVGADLSLPAVVKAMVGSERAW
AAVVSFCEVVISRKETAERGREDDPSSAPMRRRRLGRRQRAYARRMPPQ