New model in OGS2.0 | DPOGS210626  |
---|---|
Genomic Position | scaffold5756:+ 1521-5547 |
See gene structure | |
CDS Length | 3777 |
Paired RNAseq reads   | 16 |
Single RNAseq reads   | 43 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013961 (2e-25) |
Best Drosophila hit   | ND |
Best Human hit | ND |
Best NR hit (blastp)   | reverse transcriptase [Papilio xuthus] (2e-112) |
Best NR hit (blastx)   | reverse transcrpitase [Papilio xuthus] (8e-105) |
GeneOntology terms   | ND |
InterPro families    | IPR001878 Zinc finger, CCHC-type IPR005135 Endonuclease/exonuclease/phosphatase IPR013084 Zinc finger, CCHC retroviral-type |
Orthology group | MCL11163 |
Nucleotide sequence:
ATGGTGGGGCACGCCCCAAGTGATATTCAGTACGTCCCCGTCGTGTGCCCTGGACCCCTT
CCTGACCACCGTAACGGAGTGGCAGGTTGGGTGACCGAATCTGAATATCATTACTTGTGG
CTTTTCGGAGTGTTTTCGTTGCGTTTTGTACTCTGCTATTTGATCCTCTCCACTTGGTGC
GCGGTAACGCAGTGGCCAGTGGAAGAATCAGTCTTGGGCAGGGCCCTTGACTGGACTATG
TCCGAGAACTCAGAGTCGTCGGACAGGAGAGGGGAAGCTCCTCGGAAAGTCTCCCCAATC
CCCCGGCAGACTAGAAGCATGACGGCCGCTGGTTCGAGCGGCTTGGATTCCGAGGCTAAC
CCGAGGAAGCGGAAGGACTCGGACTCTCAGGGGGATTCTGCGTTGAAGTTCCCAATCGTT
GTCCTTCCGAAAGTCCCCCGAAGTTCGACTGGTAGAGGGACCTACCACAATGCTGGTTTG
TCCCAAGCCAGACGTGAGAGGAGGGAAGCGATGATGGTGGATACCGACACTGACTCCACC
GTGACCGAAGTGGAGAAGGAACAAACGCGTTCCCCCAGAAAGGTGCAGGGTAGGAGGTCG
CCAGACATCTCGATGGAGGACCTAGTGAATAAGGCCAAGGTGAGTTCCGAAAGGATTTGG
ATGCTGGTGTCCAAGTCCAAAAATCTGAAAGGAACCACCAAGAAAGAAATAAAGGATACC
TCCATGGCCATAAACCAGCTGGTCGAATCAATCGCTGAAAGGTCCACGAGCGAGGAGCTC
AAGAGGGTGCAGGCGGACAACAAGCGCCTAAGAGGCCAGCTGTCCGTACTCCAGGATGAA
GTTGCTGCGCTCAGGCGAGCATTTGCTGAGCAGGGCAACCGAAGGAAGAGCCCTTTGGCC
GAGGTGGAAGAAACCGTAGAGGAGATGCAGGCAAGCAAGCCTAGTGAAGGCGTCTCTAGG
ACGGAGCTGACCAACATCATGGAGCGCTTCCAGCGAGAGTTGACTGAGCAATTCGGGCGC
ATTCTGGGTGCCCGTTTGGAGGGGCTCGAGGTGGATGGCCGACTCCTTCCAGCCGTCTCG
CACCGTCCAGCACTAACATCTGACACCCGGAAAAGGAGCAACGAGCCAGTACCCACCACT
CTTGTCCCACCTGCGAACATGCCAAAGGAGGCAAGAGCTAAAAAGGGGGCCAAAGCAAAG
GCCGTGGCAACGATCTCAGGTTGGGCTGCAGAGCAGACGGCGGGTCCCTCACACCAGGAG
CCACCAATGGTTTTGGAAGGCTGGACAACCGTGGTGAAACAGGCAAAGCCCAAGAAGCCC
GCGCCAGCTCCCGCTGCCCCCAAGGCTCAAAAGAAGAAGCCATCACTCCCGAAGCAGCCT
AAAACCCTAGCGGTGGTCGTGACCCTCAAGCCAGAGGCCGTGGCTGAAGGCATCACATAC
AAGGATGCCATCACCAAGGCCAAACAATCCGTCAGCTTGGAAGAGCTGGGAATTGGCTCG
ACCAAATTTCGGACGGGGATAACGGGGGCTCGAATTATTGAGCTCCCTAAGGAGGTCTCT
GCCGCCCAGGCTGACAACCTTGCGTCCAGGATTGGCCAAGCTCTAGGGGATGCCGCCAAG
GTGACACGCCCGAGGAAAATGGCCAATGTCCGAATTTCCGGCCTGGACGACTCAGTAACC
CCGGAGGAAATTCGTCTGGCGCTGGCAGAAAAAACTGGAGTCTCCCCGGAAGACTTCAAA
GTCGGGCTTATTACCCACGGGTACACTGGAGTAGGCTCTACCATAACTGCCTGCCCCATT
GAAACTGTTGCCAAGCTGGCGGAAGTGGGTCGGCTTTGTGTGGGCTGGAGCGCTGCCTCC
ATCAGGGTTCTGGAGCAGCGCCCAATGCGGTGTCACCGGTGCTACGGCATCGGACACCCG
CAGCAACTCTGCCCCTCTAACAAGGACCGCAGTGGATTATGTTTCCGTTGCGGAGAAGAG
GGGCACATATCTAAAGATTGCACAGCCCCACTTTGCTGCGCGGTGTGCAAGGACCGGGGC
TTGGCATCGGGTCACCGAATGGGGGGAGCCCACTGTAACCCCCCTCCGGTCAAAGGCCGC
TCCCTACAATGGGGATCAGCCCTCCGCTCGACTTCCGTGTCGGCCGCCCCTGGGAACTTA
AACCACGCGGTCGCAGCACAGGACCTCTTGTGCCAGACTGTGGCCGAGTGGAACATAAAT
GTAGCTATCGTTGCGGAGCCATACTCTATTCCCCGAACCCATAAATGGGCCGGGTCCGTG
GATGGTTCCGCGGCTATTTTCTTTCCCGGCGTGGCCTGCACTCACTCCGTTGTGGAGAGA
GGAGTGGGCTTTGTGGCAGCTCGATGGGGAGAAGTAGTGGTGGTCTCTACATACTTCTCC
CCAAACCGCAGCCGGGCCGACTTTGAGTCGTTCCTGGCTACGGTTGAAGGAGTCATCCTT
CGGGTGGCCCCCAGTCCGGTGCTGGTGGCTGGGGACCTCAATGCGTGGTCTCGCGCTTGG
GGCTCTACCAGACCTAACGCCCGCGGTCGTGTCCTGGAGTCCTGGGTTCTGTCATTGGGA
CTCCAGATTCTCAATAGAGGCAACACTCCAACCTGCGTCCGGTGGCAAGGCACATCCATA
GTGGACGTGACCTTTGCCACCCCATCACTTGCGGCTCGCATCAGCGACTGGCGGGTGATG
GAGGAAGTGGTGACCCTATCGGACCATCGGTACGTTCGATATGATATCTCCCCAGCATTC
CCCGGGACCCCTATCCAGCTGGGTGGCAGACCACCTTTCCTAAGGTGGTCACTCGTTCGC
CTCCAGCCCGATGTGGCTGAGGAGGCAGCGATGGTGAGAGCATGGGCCGCAGTGCCCGAC
ACCATGGCTGGGGATGCCGACTGTATGGCGGACCTTTTCGCGGACGACATTAAGGTTGTC
TGCGATGCCGCTATGCCGAGGACGCAGGCCTGCCCCCGAAACAGAGGGCAGGTATACTGG
TGGACGCAGGAACTGTCCAGCCTACGTACCGCCAGTATGGGGGCTAGGCGCGCCTACCAG
CGTTACCGTAGGCGCGCCCGAGGAACGCTCGGTGTAGAAGAAAGTCTATACCGGGCCTAC
CAGGATGCCAACAAGGCATTGCGGACGGCCATTCGCAAGGCCAAAGAGGATGCCTGGGAC
CAGTTCCTGGGCATACTCAATAACGACCCCTGGGGTAGGCCCTACAGGACGATTAGGGGG
AAATTCTCTACTCCAGCTTCTCCTACCTCCTGTATGGAGCCTGGGTTGCTGCGGAGGGTA
CTTGGGACGTTGTTCCCTGATCCTGGACCGTTCGCACCTCCGCGCATGACTACTGCAGAT
CTCGCTCAAGGGGAGCGGGTCGACGGCCCTCCCGTGTCGGATGCTGAATTCAGCACGATC
CGTTTGAGGCTCCGGTGCAAACGCAAGGCGCCGGGGCCGGATGGGGCCCCCTCCAAGGTG
TTGGCTATCGCCTTAGGGCCCCTGGAGGACCGGTACCGCGCAGTGCTCAACACCTGCATT
GCGGCGGCCCACTTCCTCAGGCGATGGAGAGTACGGCGGCTCTGTCTACTCCGTAAGGAG
AACCGTCCGGCGGATGCCCCAGAGGGCTACCGGCCAGTGGTGTTACTGGATGAGGCGGGG
AAGACTTTCGAGAAGATCCTCGCCTCCCGCATCATTCAGCATCTAGAAGGCAGTGGGCCA
GATCTGGCGGAATGCCAGTACGGTTTCCACGATCCACGTCCACGATCGACGCGGTGA
Protein sequence:
MVGHAPSDIQYVPVVCPGPLPDHRNGVAGWVTESEYHYLWLFGVFSLRFVLCYLILSTWC
AVTQWPVEESVLGRALDWTMSENSESSDRRGEAPRKVSPIPRQTRSMTAAGSSGLDSEAN
PRKRKDSDSQGDSALKFPIVVLPKVPRSSTGRGTYHNAGLSQARRERREAMMVDTDTDST
VTEVEKEQTRSPRKVQGRRSPDISMEDLVNKAKVSSERIWMLVSKSKNLKGTTKKEIKDT
SMAINQLVESIAERSTSEELKRVQADNKRLRGQLSVLQDEVAALRRAFAEQGNRRKSPLA
EVEETVEEMQASKPSEGVSRTELTNIMERFQRELTEQFGRILGARLEGLEVDGRLLPAVS
HRPALTSDTRKRSNEPVPTTLVPPANMPKEARAKKGAKAKAVATISGWAAEQTAGPSHQE
PPMVLEGWTTVVKQAKPKKPAPAPAAPKAQKKKPSLPKQPKTLAVVVTLKPEAVAEGITY
KDAITKAKQSVSLEELGIGSTKFRTGITGARIIELPKEVSAAQADNLASRIGQALGDAAK
VTRPRKMANVRISGLDDSVTPEEIRLALAEKTGVSPEDFKVGLITHGYTGVGSTITACPI
ETVAKLAEVGRLCVGWSAASIRVLEQRPMRCHRCYGIGHPQQLCPSNKDRSGLCFRCGEE
GHISKDCTAPLCCAVCKDRGLASGHRMGGAHCNPPPVKGRSLQWGSALRSTSVSAAPGNL
NHAVAAQDLLCQTVAEWNINVAIVAEPYSIPRTHKWAGSVDGSAAIFFPGVACTHSVVER
GVGFVAARWGEVVVVSTYFSPNRSRADFESFLATVEGVILRVAPSPVLVAGDLNAWSRAW
GSTRPNARGRVLESWVLSLGLQILNRGNTPTCVRWQGTSIVDVTFATPSLAARISDWRVM
EEVVTLSDHRYVRYDISPAFPGTPIQLGGRPPFLRWSLVRLQPDVAEEAAMVRAWAAVPD
TMAGDADCMADLFADDIKVVCDAAMPRTQACPRNRGQVYWWTQELSSLRTASMGARRAYQ
RYRRRARGTLGVEESLYRAYQDANKALRTAIRKAKEDAWDQFLGILNNDPWGRPYRTIRG
KFSTPASPTSCMEPGLLRRVLGTLFPDPGPFAPPRMTTADLAQGERVDGPPVSDAEFSTI
RLRLRCKRKAPGPDGAPSKVLAIALGPLEDRYRAVLNTCIAAAHFLRRWRVRRLCLLRKE
NRPADAPEGYRPVVLLDEAGKTFEKILASRIIQHLEGSGPDLAECQYGFHDPRPRSTR