DPGLEAN21149 in OGS1.0

Genomic Positionscaffold11391:- 636-2633
See gene structure
CDS Length1998
Paired RNAseq reads  49
Single RNAseq reads  124
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitND
Best Drosophila hit  ND
Best Human hitND
Best NR hit (blastp)  Gag protein [Bombyx mori] (7e-67)
Best NR hit (blastx)  Gag protein [Bombyx mori] (2e-44)
GeneOntology terms  ND
InterPro families
  
IPR001878 Zinc finger, CCHC-type
IPR013084 Zinc finger, CCHC retroviral-type
Orthology groupMCL10285

Nucleotide sequence:

ATGATGAGTGAGAGAGAGTATGATGGTGGGGGGGGTTCCCGGGGGGAAATACCGGGTAAG
GAGATGAATGAGAGTGTGTGTGCGGATGACTCAATGTCATCGCGCCCGGAATCGGTGTTA
GGAGCCAAACGGCTGTTCTCGGACGTATCTGGTTCCGACACCGAGACCGGGGGTGGGATG
TTTCGGGGTCCGGTAGTGTCGGGAAGGCGGGGCCGCGGCCGTACGCTGGCGAGAGCCCGC
AAATTTTTAAGGGAAAGGAAGGAGGAGGAGTCGGAGGCTGCCTTCGACTCCTGCCTCGAA
AAGAGGCTCCGAAAGGAGAGTGACGGGGGTGGGAAGGCGAAGGGGAAGGGGAGGGAGGTC
CCTTTGGACCTGCAAGCAATGGGGGCGGAGGCGATAAAGGTCGAGGCCGAGAAAAGCCTC
GACCTTATCAAAGAGTTGGTGGGAAAGAGTAGAAACCTCAAAGGGGGCTACTCGTCCCGA
ATTATGAAGGCCTCTGCCTTCCTGAGGGAGTCCCTGGATGCGCTCGTATCGCGCACCGAG
GCTGAAGAGACCCACCGCCTCAGAGCCGACATTGGTCGGCTCCAGAGGGAGAACGCGGGT
CTCAAAGAAGAGGTACGCGCTCACCGTCTCCAATACGAGGAGATGATGAGGGAGAGGGCG
GCGGCGGTTAAGGCAGGTGGGCCGGTCGGCCAGGACCAGCTTCAGGCGCTTGAAGCAAAC
ATCGCCAGGTTGGTGGGCAACCTGGTGGATGGACGGCTGACCGCGTTGGAGTCCCGGCTT
AGGCGGGAAGAGGTCGCCCGCCCTCCCCTGGCGGGAGATAACTCGGCCGTCGCCGTGGCT
GCCAGAGCGGCGATCCGCCAGGCGGCCATACAAAGAGGGGCCAAATCCTCAGCCCCTTTA
CCGGCGTCACCGGCACCGGTAATATTAGTGTCAGAGGATGATTTCCCTTCCCTCCCTGCG
CCCTCCAAGGGGAAGGCCAAGAAATCAAAGGGGGGGAAGGAGGTTGCCTGGACCGCCTTC
GGTCCATCCACCTCCGGTGGAAGGACTGAGGCGATAACTGCCGGGGCCGGGGCCGTTGCG
GCGGGGTGGACGGAGGTGGTCCGCCGCAAGGCCCTCAAAAAGAAGGAAGTGGTGCCGGTA
CCCAAAACACCGGCGCCACAACCAAAAAAGAAGGCGGGCCCTAAGGAGGGCCCCAAGAAA
GTGGCGCTCCCACGCTCACAGGCCGTAATGTTGAAGTTACGGCCTGAAGCAGCGGCTAAG
GGGGCGACCTACTTGTCGGTCCTCTTAAGGGCCGAGAGGGAGGTAAATACGAAGGAGCTG
GGTATCGGGCCCCTGAAAATCCGTTCATCGGCAACAGGGGCCCGCATCATCGAGGTGCCC
GGCTCAGCCAGCGCGGACAAAGCCGACGCTTTGGCCGCTAAGCTGAAGTCTGTGCTGGCG
GAGGAGGCGGAGGTGTCGAGGCCCGTGAAATTCACGGACGTAAGAGTAACGGGCCTCAAC
GACGCGACGACCGCGGACCGGCTGATAGCCGCGGTCGCGCAGGAGGGGGGCTGCACCGAG
GCCCAGGTCAGGGTTCGTAGCGTGCGACCTGGGCCTCGCGGCACAGGCTCCGCCCTGGTG
GAGGTGCCGGCTGCGGCGGCTAAAAAGCTGCTGCAGCTGGGCAGCCTGTCAGTCGGGTGG
AGCCAGGTGCGGCTCTCGCACATGGAGGCACGACCCAAGCACTGCTTCAAGTGCTTGGGA
ACGGGGCACGTTGCGGCCGCGTGCCCCAGTCCAAAAGACCGCAGCGGTTTGTGCTACCGC
TGCGGCAAGGAGGGCCACAAGTCCGCCCAGTGCTCCGCGTCACCGCGCTGCGCTGTGTGC
GCTGACGCGAGCAAGCCGGCGGACCATGTGATGGGGGGTCATTCGTGCCGCCCCCCATCC
ACCCGTGGGAAACTCCCGGTCCAGCCACAAAGGGCGGGGGTTTCGAGGCGGGCGTCGGGA
GCTGAGATGTCCGAATAA

Protein sequence:

MMSEREYDGGGGSRGEIPGKEMNESVCADDSMSSRPESVLGAKRLFSDVSGSDTETGGGM
FRGPVVSGRRGRGRTLARARKFLRERKEEESEAAFDSCLEKRLRKESDGGGKAKGKGREV
PLDLQAMGAEAIKVEAEKSLDLIKELVGKSRNLKGGYSSRIMKASAFLRESLDALVSRTE
AEETHRLRADIGRLQRENAGLKEEVRAHRLQYEEMMRERAAAVKAGGPVGQDQLQALEAN
IARLVGNLVDGRLTALESRLRREEVARPPLAGDNSAVAVAARAAIRQAAIQRGAKSSAPL
PASPAPVILVSEDDFPSLPAPSKGKAKKSKGGKEVAWTAFGPSTSGGRTEAITAGAGAVA
AGWTEVVRRKALKKKEVVPVPKTPAPQPKKKAGPKEGPKKVALPRSQAVMLKLRPEAAAK
GATYLSVLLRAEREVNTKELGIGPLKIRSSATGARIIEVPGSASADKADALAAKLKSVLA
EEAEVSRPVKFTDVRVTGLNDATTADRLIAAVAQEGGCTEAQVRVRSVRPGPRGTGSALV
EVPAAAAKKLLQLGSLSVGWSQVRLSHMEARPKHCFKCLGTGHVAAACPSPKDRSGLCYR
CGKEGHKSAQCSASPRCAVCADASKPADHVMGGHSCRPPSTRGKLPVQPQRAGVSRRASG
AEMSE