DPGLEAN18199 in OGS1.0

Genomic Position  scaffold2379:- 16122-19774
CDS Length  2358
Paired RNAseq reads  641
Single RNAseq reads  2247
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA009441 (2e-22)
Best Drosophila hit  CG14120 (2e-16)
Best Human hit  contactin-5 isoform short (4e-06)
Best NR hit (blastp)  endonuclease-reverse transcriptase HmRTE-e01 [Heliconius melpomene] (6e-157)
Best NR hit (blastx)  endonuclease-reverse transcriptase HmRTE-e01 [Heliconius melpomene] (1e-144)
GeneOntology terms
GO:0046872 metal ion binding
GO:0003676 nucleic acid binding
GO:0016787 hydrolase activity
InterPro families
IPR007110 Immunoglobulin-like
IPR000477 Reverse transcriptase
IPR003598 Immunoglobulin subtype 2
IPR001604 DNA/RNA non-specific endonuclease
IPR013098 Immunoglobulin I-set
IPR013783 Immunoglobulin-like fold
IPR020821 Extracellular Endonuclease, subunit A
Orthology group  MCL10012

Nucleotide sequence:

ATGTACCGCGATTCCGTTTCAATGGTTAGGACTGCTGTTGGCGATACAAAACCCTTTCCG
ATCTCAGTAGGGGTTCACCAAGGCTCGGCTCTTAGCCCCTTCTTGTTCAATGTAGTGCTG
GACACTGTCTCGGCTAACATCCAGGACCAGCCTCCATGGCTGATGATGTATGCCGATGAC
ATAGCGCTCATTGATGAGAGCAGGTTGACGCTAGAGCGAAGAGTGAACCTCTGGAAGGGT
ACGCTTGAGAACGGTGGTCTTAAACTAAATGTGACGAAGACCGAGTACATGGCTTGCGGA
AGCCCGGACTCTTGCACTATCCATATAGGTCCTGAACCAGCCGTTAAGTCGGAAAAGTTC
AGGTACCTTGGATCTATTCTGCATGAGTCCGGAGGCATCGATCACGATGTCCAAGCCCGG
ATCAGCGCTGCTTGGGCGAAATGGCGTGAGGTCACAGGTGTGGTCTGCGACCGCAGAATA
CCGCCCAAGCTCAAGGGACTAATATACAAGAGCATAATCCGACCGGTTCTCTTATATGGA
AGCGAATGTTGGCCAACACTGTCCAGGCACACTCAGGAGCTTCACGTCACGGAGATGAAG
ATGCTGAGGTGGATGTGTGGCGTAACGCAGGCTGACCGTATACGTAACACATTTATCCGA
GGTAGTCTTGGAGTCCGTGACGTAGCGGATAAGCTTCAAGAGAGTCGCCTGAGATGGTAT
GGCCACGTTGCACGCCGGCCTGAGAACTACGTCGGAAAAATTTGCCTTGATATGACGGTC
CCTGGAGCAAAACCCCCAGGACGCCCAAGAAAGCGATGGCTGGACACCGTGAAGCAGGAT
ATGAGAGCCAATGGACTTACCACCGCGGATGCTAAAGACCGTGCAAATTGCCGTGTTAGT
ACGTTTTTATCAGCTCACACTGCACATAAAAAGGTGACTGTAGGTTATAAGCCAAGATTT
CTCAGTGACGAAGAAACTGTTATAGAATATTCGGAAGGCGATTTCTCTTATATGGACTGC
AATGCCGATGGCTATCCAAAGCCAAGCACGCAATGGATACGTAATGGTGATCCTGTACCT
ATAAATGGGTCGTATCTAATTATAGAAATGAAACTTGAAGATATCGGATACTACCAATGT
ACCGTAAGCAATGATCTTGGTTCAATTAGACGTACTTTTAAAATTAATTCAGGAGAATGC
CTGCTGCGTACTAAGCATGATTTTAATGATCAGCAGCCTTTACTTTTGACTCTATCCAGA
GACTGGCCAGAATTTAGAACATCAAATGAATATGTCCATATACCAATTTATAAATATTTT
CTTCTATCATGCCCCGGCAGTTCTGTAATTTACAATGGAGAAACATTCGGTCAAAACGTC
AAAACGAAGTGTTCAGAACTAAAAGACAAAATTGAAATAAAAAACAGAATCATTGATTAT
GATAAATTAAAATGTACCAAAAAAATAAAACCTCTAACAAAGCGAACAGGAATAAGCTGT
TTTGAACAACACGGAAACAACACAGAATTATTGCAAATTGGTTTTTTTTCACGAAGTAAG
TTTTTGAAGGTATACGACGTTTGTTTAGATCACGAACAGAAGATACCTCTCTTTGCAAAA
CAAACCTCAAATAAAGGTATCGCCCTGAATGCACCCCCCGGAGATTACACATTTGTTGAA
AGTAAATATTTGCCCTTTCATTTTGGGGACATGTATGACTGTGATTCTCAGTTGAGATTT
ATTTCATCGTCGATCGGAAAATCAATAAAACCAGTTAAAGATGTTGAATGCTGTTTTACA
AAAAGACAATTGATCAATCCTCGAGATGTTTTGCCGGGATTATCACAAGTGGCTGTATAT
AGCTATTTAAATGTTATACCTCATTGGAGTACCTGTGGAACTAAAAACTGGGATGAACTT
GAACTAAGAGTACGATATCTGGGAAAATATTCATCTAATGAGCTGACCATTTTCACTGGA
GCATCAGATCCGATGATGTTGCCAGGACAGACAGAAGATGCTTATGTGTCCTTAAGAGAC
AGATTAAACAGACGTCAACCAGTGCCCATGTATTTATGGAAGATAATTCAAAACCCGGCA
GATAATTCTTCCTTAGCTGTCATCCAACTAAATATTCCTAATGTTACGTCAGCGGAGGCC
TATTCTTATATGCCATGTAACGATATATGTCCCGAAGTCGAGTGGTTGCGTAATAACGAT
TGGCAGGATGTGAATAAGGGATTCACATTCTGTTGCAGTATTAGTGATTTTAATTCACGT
TTCGGCAAGCTTTTTGACGGATGTGAAAAAGTATTCAAGACTTTACCACCTTTATTACCT
GATTTTTCTCTTATCTAA

Protein sequence:

MYRDSVSMVRTAVGDTKPFPISVGVHQGSALSPFLFNVVLDTVSANIQDQPPWLMMYADD
IALIDESRLTLERRVNLWKGTLENGGLKLNVTKTEYMACGSPDSCTIHIGPEPAVKSEKF
RYLGSILHESGGIDHDVQARISAAWAKWREVTGVVCDRRIPPKLKGLIYKSIIRPVLLYG
SECWPTLSRHTQELHVTEMKMLRWMCGVTQADRIRNTFIRGSLGVRDVADKLQESRLRWY
GHVARRPENYVGKICLDMTVPGAKPPGRPRKRWLDTVKQDMRANGLTTADAKDRANCRVS
TFLSAHTAHKKVTVGYKPRFLSDEETVIEYSEGDFSYMDCNADGYPKPSTQWIRNGDPVP
INGSYLIIEMKLEDIGYYQCTVSNDLGSIRRTFKINSGECLLRTKHDFNDQQPLLLTLSR
DWPEFRTSNEYVHIPIYKYFLLSCPGSSVIYNGETFGQNVKTKCSELKDKIEIKNRIIDY
DKLKCTKKIKPLTKRTGISCFEQHGNNTELLQIGFFSRSKFLKVYDVCLDHEQKIPLFAK
QTSNKGIALNAPPGDYTFVESKYLPFHFGDMYDCDSQLRFISSSIGKSIKPVKDVECCFT
KRQLINPRDVLPGLSQVAVYSYLNVIPHWSTCGTKNWDELELRVRYLGKYSSNELTIFTG
ASDPMMLPGQTEDAYVSLRDRLNRRQPVPMYLWKIIQNPADNSSLAVIQLNIPNVTSAEA
YSYMPCNDICPEVEWLRNNDWQDVNKGFTFCCSISDFNSRFGKLFDGCEKVFKTLPPLLP
DFSLI
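
Consistency check (illustrative sketch):

The nucleotide and protein listings above can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record itself does not state. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration only. Only the first and last lines of the nucleotide listing are translated here; the full CDS can be passed in the same way once its lines are joined.

```python
from itertools import product

# Standard genetic code (NCBI table 1), assumed here; codons are enumerated
# in T, C, A, G order so they line up with the amino-acid string below.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; stop codons are rendered as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide listing in this record.
first_line = "ATGTACCGCGATTCCGTTTCAATGGTTAGGACTGCTGTTGGCGATACAAAACCCTTTCCG"
last_line = "GATTTTTCTCTTATCTAA"

print(translate(first_line))  # MYRDSVSMVRTAVGDTKPFP
print(translate(last_line))   # DFSLI*

# Length bookkeeping from the record's own fields.
cds_length = 2358                  # "CDS Length"
span_length = 19774 - 16122 + 1    # inclusive genomic span on scaffold2379
print(cds_length // 3 - 1)         # 785 residues once the stop codon is excluded
print(span_length)                 # 3653
```

The translated fragments match the start (MYRDSVSMVRTAVGDTKPFP) and end (DFSLI) of the protein listing. A CDS of 2358 nt corresponds to 786 codons, that is, 785 residues plus the terminal TAA stop codon. The genomic span (3653 bp) is longer than the CDS, which would be consistent with non-coding sequence inside the gene model, although the record does not list the exon boundaries.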