DPGLEAN03954 in OGS1.0

New model in OGS2.0DPOGS201605 
Genomic Positionscaffold1498:+ 19448-59638
See gene structure
CDS Length3183
Paired RNAseq reads  2421
Single RNAseq reads  6920
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012168 (7e-29)
Best Drosophila hit  murashka, isoform C (5e-43)
Best Human hitRING finger protein 38 isoform 1 (4e-29)
Best NR hit (blastp)  eIF2B-beta protein [Bombyx mori] (5e-56)
Best NR hit (blastx)  ring finger protein [Culex quinquefasciatus] (1e-52)
GeneOntology terms



  
GO:0005575 cellular_component
GO:0008355 olfactory learning
GO:0007611 learning or memory
GO:0008270 zinc ion binding
GO:0005515 protein binding
InterPro families


  
IPR000649 Initiation factor 2B-related
IPR018957 Zinc finger, C3HC4 RING-type
IPR013083 Zinc finger, RING/FYVE/PHD-type
IPR001841 Zinc finger, RING-type
Orthology groupMCL29417

Nucleotide sequence:

ATGGGCCCACCTGAAGGTATGAAAGAATTGGACGAAAAAAGTATGGAAAGTTTAGTGAAT
TTCGTAGCAGATGTACGGACAGGCAAGGTCCAAGGTTCAGAGAACATAGCGCTGGCGACT
GTCAATCTATTGGAGCAGATAATAACCGACCTGGAAACGTCTACGGCCATGTCTTTAATA
AATGCTGTCCGTGTCGCTGGACGGCAGCTGGCTAGGGCGCTGCCATCAGAACACTTCGCT
GCTAACATGGTGAGGCGAGTGCTGCGTGCTGTTAGGGACGAGCAGCGTGCGCAACATAAC
CAGAGTGTGGAAGGTGCTGGCGAGTCTCTAGCTAGGCTGGTGCTGGCGGCGCCCCTCAGA
CGTGGGACGTTGCCCAGTGGGAGAGACTTGAGGGAACCGCTGAGAGATCATATAGCTGAG
CTGCGAGCCGAGCTGGATTCCAGCACTTCATCGATCTGCTCCCAGGCCAAGGAACACCTG
CACGCTGAAGACCTAGTACTGTCGTATGGAGGTGGCGCTTTACTGGAAAAGTTCCTCAAG
AGCAGCACAAGGAAATACAAACTCCTACTAGCAGCTGGTCCAGATGTTACAGAGTGTCAC
TCGATGGCTGTCCGTCTGTCACACAGCGGTGTCTCCGTAACAGTGACCAGCGCGGCCGCT
GTGGGCGCCCTGATGTCCAGGGTCAACAAGCTGCGAGCCGAGCTGGATTCCAGCACTTCA
TCGATCTGCTCCCAGGCCAAGGAACACCTGCACGCTGAAGACCTGGTACTGTCGTATGGA
GGTGGCGCTTTACTGGAAAAGTTCCTCAAGAGCAGCACGAGGAAATACAAACTCCTACTA
GCAGCTGGTCCAGATGTTACAGAGTGTCACTCGATGGCTGTCCGTCTATCACACAGCGGC
GTCTCCGTAACAGTGACCAGCGCGGCCGCTGTGGGCGCCCTGATGTCCAGGGTCAACAAG
GTGGTGTTGGAGGCCGTGGGAGCGCTGTCCGGGGGCTCCGCCCTGGCCGCGGCGGGGACC
CTCGCCCTCACCACCGCCGCCGCCCACCGAGCTGTGCCGGTGGTGGTGCTGTGTCCGCTC
CACGCTCTGTGTGCTGTCCACGCCTGCGAGAGACGTCTGCTAGCATCAGATTCTCCACCG
GCTGAAGCTATACCATATCAGAACTTGGAATCTTCTGTGTCACGAGTGGTGTCTCCCAAG
TATGACGTCCTGCCTCCGGACCACATTTCACTGTTCATAACAAATCTCGGAGGCAGCTCA
CCTTCGTACATCTACCGACTTCTCTCTGAGATATACGATCCGAGTGATAAAGAACAGCGG
GCGTCCGAGCCCGCTTCGGCGCCGATCGATACGTTGTGGTATGGCGCAGGAGATATGAGC
CAGGGGGGCTCGAGACGGCCCTATTCGCGCGGGGGCCCGCGCTGGCACCCGCGCTACAAG
GACTACTGGGACTACGAGCAGAACTACAGCGATGGCAGCAGTGCAGGTAGCCCCAAGGAG
GCGTGTGTGGCTCAGTCCCCGCCACCGGTGCCAGCCAAGAGACACCCTCACCATCACCAC
GAACAGCGCCGTCTCAGTGTCGGCGGCGGAGCGGGCGGCTCGCGTCGCGCCGAGCGTCGC
GCCGCCGGCCCTTCAAACCGCCGAACAGAACTGCGAGTGAGCCCGGGGGCGGGGGCCGGG
ACCTCCGCTGAGTCTCCACACCTACATCACACTGCACACCATCATCACATGGACACCAAT
CCTCTTGAGGGTTCAGCGCCCCTGGAAGCCGTTGTCGGCGCCAGCGCTGGATCTTCAGAC
GAACTGTCCAGCAAACCTGGTGACAGTCCCACAAGGAAAAGACGTCGCATATCAAGGCAT
CTGTCGACAGGTGAGAGTAACGTGACGGTAGCTGCGCCGCCGGTCGAGAGGCGGACACCG
CGCCACCACTACCAGCCACCTCGCCGCGTGCGTTACGTCAGCGGGAACGGTGCGGGGGTG
TGGCATGAGCGGCTGTTGGACCAGCACGCTGCTCACCCGTCACACCCTCACCCCGCTCAC
CCCGCGCACCCCGCCCATCCTGCTCACCCCGCTCACCCTGCGCACCTATCACACGCGCAC
CACACGCCGCTGCTGTTGGACATCAACCAGATGTCGCTGCGCGGCGCTCGCCTCAGTGCG
CTGGGTGGTGGCGTGGGTTGGGGGCACCACGCCCCCCACCACGCTCCACATCACGCCCCA
CATCAACACCAACACCATCGAGAGCATCCAACTAGGACCCAGATGGGCGGCGTGTACGCG
GGACTGCAGTACCACGGGAGCTTCGCGCCGGCGCCGCCCCCACGAGGACCCTTCGCCTCC
CCCCCTCATCCACACACGCACTACATCACCAACTCACAGCGTTCGGAAGGAGGTCGTCTG
GAAATGTTGGGTAGCGGTGAAGGTGGACTGTCTCCGCTGCAGCCGACAGCTGATCTCCAC
CACGCGCCCCTGTTACTCGCCACCGAGGCGCGGGGTGCTCCGTTGGAACTGATGGCACCT
CATCACGCGAGACACGCGCTATCACATCACCACCATCGCCGTAGTGGTGGCGGCGTGGGC
GTCGGCGGTGGCGTGGGCGTGGGCGTCGGTGTCGGTGTGGGCGTGGTGGGCGCCGGTTCG
CGAGCAGCCCGGCCGTACGTCCGTGCAGCTCCGCGCTGGGCCGCCCATCCTCACACCGTG
CATCACATACACCAGGGTGGTGGAGGTCTGGTCCAAGCGACGCTCCCCGCTCAGCTGCAC
GTAGCGTCGCCACTGTCACTGCCACCGCCGCCCCCCACTTACCAGGTGTTCCTAAATCTG
TTGGCAATGTTCCCGCTGTCACCGTACGCGGAGGCCCGGGACGAGGCCGGGGACTCCCCG
GAGACGGAGAACTACGAGGCGCTGCTGTCGCTGGCGGAGCGCCTGGGAGAGGCGAAGCCC
CGGGGGCTCGCCAGACACGAGATAGACCTACTGCCTTCATACAAATACTCTGAACAAACT
CACCAGGGTGAGCAGACCTCGTGCGTGGTGTGCATGTGTGAGTTCGAGGCCCGGCAGACG
CTCCGCGTGCTGCCTTGCGCCCATGAGTTCCACGCTAAGTGTGTTGATAAGTGGCTTCGG
TCAAATCGAACGTGTCCCATATGCCGCGGGAACGCGTCGGAGTACTTCACCAACTCGGAG
TGA

Protein sequence:

MGPPEGMKELDEKSMESLVNFVADVRTGKVQGSENIALATVNLLEQIITDLETSTAMSLI
NAVRVAGRQLARALPSEHFAANMVRRVLRAVRDEQRAQHNQSVEGAGESLARLVLAAPLR
RGTLPSGRDLREPLRDHIAELRAELDSSTSSICSQAKEHLHAEDLVLSYGGGALLEKFLK
SSTRKYKLLLAAGPDVTECHSMAVRLSHSGVSVTVTSAAAVGALMSRVNKLRAELDSSTS
SICSQAKEHLHAEDLVLSYGGGALLEKFLKSSTRKYKLLLAAGPDVTECHSMAVRLSHSG
VSVTVTSAAAVGALMSRVNKVVLEAVGALSGGSALAAAGTLALTTAAAHRAVPVVVLCPL
HALCAVHACERRLLASDSPPAEAIPYQNLESSVSRVVSPKYDVLPPDHISLFITNLGGSS
PSYIYRLLSEIYDPSDKEQRASEPASAPIDTLWYGAGDMSQGGSRRPYSRGGPRWHPRYK
DYWDYEQNYSDGSSAGSPKEACVAQSPPPVPAKRHPHHHHEQRRLSVGGGAGGSRRAERR
AAGPSNRRTELRVSPGAGAGTSAESPHLHHTAHHHHMDTNPLEGSAPLEAVVGASAGSSD
ELSSKPGDSPTRKRRRISRHLSTGESNVTVAAPPVERRTPRHHYQPPRRVRYVSGNGAGV
WHERLLDQHAAHPSHPHPAHPAHPAHPAHPAHPAHLSHAHHTPLLLDINQMSLRGARLSA
LGGGVGWGHHAPHHAPHHAPHQHQHHREHPTRTQMGGVYAGLQYHGSFAPAPPPRGPFAS
PPHPHTHYITNSQRSEGGRLEMLGSGEGGLSPLQPTADLHHAPLLLATEARGAPLELMAP
HHARHALSHHHHRRSGGGVGVGGGVGVGVGVGVGVVGAGSRAARPYVRAAPRWAAHPHTV
HHIHQGGGGLVQATLPAQLHVASPLSLPPPPPTYQVFLNLLAMFPLSPYAEARDEAGDSP
ETENYEALLSLAERLGEAKPRGLARHEIDLLPSYKYSEQTHQGEQTSCVVCMCEFEARQT
LRVLPCAHEFHAKCVDKWLRSNRTCPICRGNASEYFTNSE