DPGLEAN19492 in OGS1.0

New model in OGS2.0  DPOGS210626
Genomic Position  scaffold5756:+ 1521-5547
CDS Length  3777
Paired RNAseq reads  16
Single RNAseq reads  43
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA013961 (2e-25)
Best Drosophila hit  ND
Best Human hit  ND
Best NR hit (blastp)  reverse transcriptase [Papilio xuthus] (2e-112)
Best NR hit (blastx)  reverse transcriptase [Papilio xuthus] (8e-105)
GeneOntology terms  ND
InterPro families
IPR001878 Zinc finger, CCHC-type
IPR005135 Endonuclease/exonuclease/phosphatase
IPR013084 Zinc finger, CCHC retroviral-type
Orthology group  MCL11163

Nucleotide sequence:

ATGGTGGGGCACGCCCCAAGTGATATTCAGTACGTCCCCGTCGTGTGCCCTGGACCCCTT
CCTGACCACCGTAACGGAGTGGCAGGTTGGGTGACCGAATCTGAATATCATTACTTGTGG
CTTTTCGGAGTGTTTTCGTTGCGTTTTGTACTCTGCTATTTGATCCTCTCCACTTGGTGC
GCGGTAACGCAGTGGCCAGTGGAAGAATCAGTCTTGGGCAGGGCCCTTGACTGGACTATG
TCCGAGAACTCAGAGTCGTCGGACAGGAGAGGGGAAGCTCCTCGGAAAGTCTCCCCAATC
CCCCGGCAGACTAGAAGCATGACGGCCGCTGGTTCGAGCGGCTTGGATTCCGAGGCTAAC
CCGAGGAAGCGGAAGGACTCGGACTCTCAGGGGGATTCTGCGTTGAAGTTCCCAATCGTT
GTCCTTCCGAAAGTCCCCCGAAGTTCGACTGGTAGAGGGACCTACCACAATGCTGGTTTG
TCCCAAGCCAGACGTGAGAGGAGGGAAGCGATGATGGTGGATACCGACACTGACTCCACC
GTGACCGAAGTGGAGAAGGAACAAACGCGTTCCCCCAGAAAGGTGCAGGGTAGGAGGTCG
CCAGACATCTCGATGGAGGACCTAGTGAATAAGGCCAAGGTGAGTTCCGAAAGGATTTGG
ATGCTGGTGTCCAAGTCCAAAAATCTGAAAGGAACCACCAAGAAAGAAATAAAGGATACC
TCCATGGCCATAAACCAGCTGGTCGAATCAATCGCTGAAAGGTCCACGAGCGAGGAGCTC
AAGAGGGTGCAGGCGGACAACAAGCGCCTAAGAGGCCAGCTGTCCGTACTCCAGGATGAA
GTTGCTGCGCTCAGGCGAGCATTTGCTGAGCAGGGCAACCGAAGGAAGAGCCCTTTGGCC
GAGGTGGAAGAAACCGTAGAGGAGATGCAGGCAAGCAAGCCTAGTGAAGGCGTCTCTAGG
ACGGAGCTGACCAACATCATGGAGCGCTTCCAGCGAGAGTTGACTGAGCAATTCGGGCGC
ATTCTGGGTGCCCGTTTGGAGGGGCTCGAGGTGGATGGCCGACTCCTTCCAGCCGTCTCG
CACCGTCCAGCACTAACATCTGACACCCGGAAAAGGAGCAACGAGCCAGTACCCACCACT
CTTGTCCCACCTGCGAACATGCCAAAGGAGGCAAGAGCTAAAAAGGGGGCCAAAGCAAAG
GCCGTGGCAACGATCTCAGGTTGGGCTGCAGAGCAGACGGCGGGTCCCTCACACCAGGAG
CCACCAATGGTTTTGGAAGGCTGGACAACCGTGGTGAAACAGGCAAAGCCCAAGAAGCCC
GCGCCAGCTCCCGCTGCCCCCAAGGCTCAAAAGAAGAAGCCATCACTCCCGAAGCAGCCT
AAAACCCTAGCGGTGGTCGTGACCCTCAAGCCAGAGGCCGTGGCTGAAGGCATCACATAC
AAGGATGCCATCACCAAGGCCAAACAATCCGTCAGCTTGGAAGAGCTGGGAATTGGCTCG
ACCAAATTTCGGACGGGGATAACGGGGGCTCGAATTATTGAGCTCCCTAAGGAGGTCTCT
GCCGCCCAGGCTGACAACCTTGCGTCCAGGATTGGCCAAGCTCTAGGGGATGCCGCCAAG
GTGACACGCCCGAGGAAAATGGCCAATGTCCGAATTTCCGGCCTGGACGACTCAGTAACC
CCGGAGGAAATTCGTCTGGCGCTGGCAGAAAAAACTGGAGTCTCCCCGGAAGACTTCAAA
GTCGGGCTTATTACCCACGGGTACACTGGAGTAGGCTCTACCATAACTGCCTGCCCCATT
GAAACTGTTGCCAAGCTGGCGGAAGTGGGTCGGCTTTGTGTGGGCTGGAGCGCTGCCTCC
ATCAGGGTTCTGGAGCAGCGCCCAATGCGGTGTCACCGGTGCTACGGCATCGGACACCCG
CAGCAACTCTGCCCCTCTAACAAGGACCGCAGTGGATTATGTTTCCGTTGCGGAGAAGAG
GGGCACATATCTAAAGATTGCACAGCCCCACTTTGCTGCGCGGTGTGCAAGGACCGGGGC
TTGGCATCGGGTCACCGAATGGGGGGAGCCCACTGTAACCCCCCTCCGGTCAAAGGCCGC
TCCCTACAATGGGGATCAGCCCTCCGCTCGACTTCCGTGTCGGCCGCCCCTGGGAACTTA
AACCACGCGGTCGCAGCACAGGACCTCTTGTGCCAGACTGTGGCCGAGTGGAACATAAAT
GTAGCTATCGTTGCGGAGCCATACTCTATTCCCCGAACCCATAAATGGGCCGGGTCCGTG
GATGGTTCCGCGGCTATTTTCTTTCCCGGCGTGGCCTGCACTCACTCCGTTGTGGAGAGA
GGAGTGGGCTTTGTGGCAGCTCGATGGGGAGAAGTAGTGGTGGTCTCTACATACTTCTCC
CCAAACCGCAGCCGGGCCGACTTTGAGTCGTTCCTGGCTACGGTTGAAGGAGTCATCCTT
CGGGTGGCCCCCAGTCCGGTGCTGGTGGCTGGGGACCTCAATGCGTGGTCTCGCGCTTGG
GGCTCTACCAGACCTAACGCCCGCGGTCGTGTCCTGGAGTCCTGGGTTCTGTCATTGGGA
CTCCAGATTCTCAATAGAGGCAACACTCCAACCTGCGTCCGGTGGCAAGGCACATCCATA
GTGGACGTGACCTTTGCCACCCCATCACTTGCGGCTCGCATCAGCGACTGGCGGGTGATG
GAGGAAGTGGTGACCCTATCGGACCATCGGTACGTTCGATATGATATCTCCCCAGCATTC
CCCGGGACCCCTATCCAGCTGGGTGGCAGACCACCTTTCCTAAGGTGGTCACTCGTTCGC
CTCCAGCCCGATGTGGCTGAGGAGGCAGCGATGGTGAGAGCATGGGCCGCAGTGCCCGAC
ACCATGGCTGGGGATGCCGACTGTATGGCGGACCTTTTCGCGGACGACATTAAGGTTGTC
TGCGATGCCGCTATGCCGAGGACGCAGGCCTGCCCCCGAAACAGAGGGCAGGTATACTGG
TGGACGCAGGAACTGTCCAGCCTACGTACCGCCAGTATGGGGGCTAGGCGCGCCTACCAG
CGTTACCGTAGGCGCGCCCGAGGAACGCTCGGTGTAGAAGAAAGTCTATACCGGGCCTAC
CAGGATGCCAACAAGGCATTGCGGACGGCCATTCGCAAGGCCAAAGAGGATGCCTGGGAC
CAGTTCCTGGGCATACTCAATAACGACCCCTGGGGTAGGCCCTACAGGACGATTAGGGGG
AAATTCTCTACTCCAGCTTCTCCTACCTCCTGTATGGAGCCTGGGTTGCTGCGGAGGGTA
CTTGGGACGTTGTTCCCTGATCCTGGACCGTTCGCACCTCCGCGCATGACTACTGCAGAT
CTCGCTCAAGGGGAGCGGGTCGACGGCCCTCCCGTGTCGGATGCTGAATTCAGCACGATC
CGTTTGAGGCTCCGGTGCAAACGCAAGGCGCCGGGGCCGGATGGGGCCCCCTCCAAGGTG
TTGGCTATCGCCTTAGGGCCCCTGGAGGACCGGTACCGCGCAGTGCTCAACACCTGCATT
GCGGCGGCCCACTTCCTCAGGCGATGGAGAGTACGGCGGCTCTGTCTACTCCGTAAGGAG
AACCGTCCGGCGGATGCCCCAGAGGGCTACCGGCCAGTGGTGTTACTGGATGAGGCGGGG
AAGACTTTCGAGAAGATCCTCGCCTCCCGCATCATTCAGCATCTAGAAGGCAGTGGGCCA
GATCTGGCGGAATGCCAGTACGGTTTCCACGATCCACGTCCACGATCGACGCGGTGA

Protein sequence:

MVGHAPSDIQYVPVVCPGPLPDHRNGVAGWVTESEYHYLWLFGVFSLRFVLCYLILSTWC
AVTQWPVEESVLGRALDWTMSENSESSDRRGEAPRKVSPIPRQTRSMTAAGSSGLDSEAN
PRKRKDSDSQGDSALKFPIVVLPKVPRSSTGRGTYHNAGLSQARRERREAMMVDTDTDST
VTEVEKEQTRSPRKVQGRRSPDISMEDLVNKAKVSSERIWMLVSKSKNLKGTTKKEIKDT
SMAINQLVESIAERSTSEELKRVQADNKRLRGQLSVLQDEVAALRRAFAEQGNRRKSPLA
EVEETVEEMQASKPSEGVSRTELTNIMERFQRELTEQFGRILGARLEGLEVDGRLLPAVS
HRPALTSDTRKRSNEPVPTTLVPPANMPKEARAKKGAKAKAVATISGWAAEQTAGPSHQE
PPMVLEGWTTVVKQAKPKKPAPAPAAPKAQKKKPSLPKQPKTLAVVVTLKPEAVAEGITY
KDAITKAKQSVSLEELGIGSTKFRTGITGARIIELPKEVSAAQADNLASRIGQALGDAAK
VTRPRKMANVRISGLDDSVTPEEIRLALAEKTGVSPEDFKVGLITHGYTGVGSTITACPI
ETVAKLAEVGRLCVGWSAASIRVLEQRPMRCHRCYGIGHPQQLCPSNKDRSGLCFRCGEE
GHISKDCTAPLCCAVCKDRGLASGHRMGGAHCNPPPVKGRSLQWGSALRSTSVSAAPGNL
NHAVAAQDLLCQTVAEWNINVAIVAEPYSIPRTHKWAGSVDGSAAIFFPGVACTHSVVER
GVGFVAARWGEVVVVSTYFSPNRSRADFESFLATVEGVILRVAPSPVLVAGDLNAWSRAW
GSTRPNARGRVLESWVLSLGLQILNRGNTPTCVRWQGTSIVDVTFATPSLAARISDWRVM
EEVVTLSDHRYVRYDISPAFPGTPIQLGGRPPFLRWSLVRLQPDVAEEAAMVRAWAAVPD
TMAGDADCMADLFADDIKVVCDAAMPRTQACPRNRGQVYWWTQELSSLRTASMGARRAYQ
RYRRRARGTLGVEESLYRAYQDANKALRTAIRKAKEDAWDQFLGILNNDPWGRPYRTIRG
KFSTPASPTSCMEPGLLRRVLGTLFPDPGPFAPPRMTTADLAQGERVDGPPVSDAEFSTI
RLRLRCKRKAPGPDGAPSKVLAIALGPLEDRYRAVLNTCIAAAHFLRRWRVRRLCLLRKE
NRPADAPEGYRPVVLLDEAGKTFEKILASRIIQHLEGSGPDLAECQYGFHDPRPRSTR
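
Consistency check (illustrative sketch, not part of the original record): the CDS length of 3777 nt and the protein sequence above can be cross-checked by translating the nucleotide sequence. The sketch below assumes the standard genetic code (NCBI translation table 1). The helper names translate and expected_protein_length are hypothetical and chosen only for this example. Only the first and last 60-column lines of the nucleotide sequence are used, so the block stays short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# First and last lines of the nucleotide sequence listed in this record.
first_line = "ATGGTGGGGCACGCCCCAAGTGATATTCAGTACGTCCCCGTCGTGTGCCCTGGACCCCTT"
last_line = "GATCTGGCGGAATGCCAGTACGGTTTCCACGATCCACGTCCACGATCGACGCGGTGA"

print(translate(first_line))
print(translate(last_line))
print(expected_protein_length(3777))
```

The full nucleotide sequence can be passed to translate in the same way, since whitespace and line breaks are ignored. Both lines shown are in frame because every preceding line is 60 nt long, a multiple of 3.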