DPGLEAN21235 in OGS1.0

Genomic Position  scaffold492:+ 45964-54963
CDS Length  2112
Paired RNAseq reads  4781
Single RNAseq reads  12746
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA001097 (4e-101)
Best Drosophila hit  calnexin 99A, isoform C (3e-73)
Best Human hit  calmegin precursor (6e-95)
Best NR hit (blastp)  GF22893 [Drosophila ananassae] (1e-154)
Best NR hit (blastx)  calnexin 99A, isoform C [Drosophila melanogaster] (1e-143)
GeneOntology terms
GO:0005509 calcium ion binding
GO:0005635 nuclear envelope
GO:0005783 endoplasmic reticulum
GO:0006457 protein folding
GO:0006461 protein complex assembly
GO:0007339 binding of sperm to zona pellucida
GO:0051082 unfolded protein binding
InterPro families
IPR001580 Calreticulin/calnexin
IPR008985 Concanavalin A-like lectin/glucanase
IPR009033 Calreticulin/calnexin, P
IPR013320 Concanavalin A-like lectin/glucanase, subgroup
IPR018124 Calreticulin/calnexin, conserved site
Orthology group  MCL10820

Nucleotide sequence:

ATGTTAAAAGAATACGAATACTGTCAAGTAGTGTTAAATCAGATAAAGACTTTTATTCCA
ATTGGATGTTTAATTAGTGACCGGCGCATTGTAACTAAAAATCGAAAGTGCATGATGGCA
CCTGGTATTATGCGGGTCTTTTTATTAAGCTTCTTGGTAGTCTCTGGCTCGCTGCAAGTT
ACGGCCGATGTCGACGATGCCGAAGATGGAGTAACTGTTGAGACAGAAGAGGAAATCTAC
CAAAGTCCTAAGGCCGATCCCAAGAAGGTGTATCTGGCGGAGAACTTTGATGATGTGGCA
TTGTTCAAGAAGAAGTGGATTAAGTCTGAAGCAAAGAAACAGGGTGTGGACGAAGATATC
GCCAAATATGATGGGAAATGGGAGATACAAATACCAACAAGAAAAATATTCAATAGCGAC
TCAGGGTTGGTGCTGACTACAGAGGCTAAGCATGCAGCTATATCAACACTGCTCGACCGG
CCGTTCGAGTTCAAAGACAAACCACTCATTGTACAATACGAAGTGACTATGCAGGAGGGT
CAAAATTGTGGTGGTGCTTACCTAAAACTTCTATCACGCGGTGTGAACACGAAAGCAGAC
CTCAAACAGTTCCACGACCAGACTGCGTACACCATCATGTTTGGGCCCGACAAATGTGGC
AACGACAACAAACTGCACTTCATCTTCAGACACAAAAACCCCAAGAATGGGACCATCGAA
GAAAAACACTGCAAGAAACCAACCCAACGTCTTGAAGACATCTACAAAGACAAGGAGCCT
CACCTGTACACTCTGATAGTGCGGCCAGACAACACATTCTCAGTCCTCGTCGACAACAAG
GAGTTCAACGCCGGTTCGTTGCTAGAAGACTTCACCCCACCCGTCAACCCTCCGGAGGAG
GTGGACGATCCCAACGACGAGAAGCCAGAGGACTGGGACGAGAGGGAGAAGGTTTTGTGG
CAAGTTCTTGTACGTCCTGCTGAGGGACCTGGTCGCAATGGTGATGAGCTGATCGTAGTC
GTTTTAAGGAACTTCCGCGGCTCAGAGGCAGCTGATAGATCTATAAAATATATAGATGTT
AGTTCTGAACGGACCGCTGTAAATTTATTTGCGAATATTATCAATGTTCTGGAGGAGATC
GTGGATCCCTCAGCGAGTAAGCCAGATGACTGGGATGAGAGTGAGCCGGCACAGATCATA
GACTTCAACGCTGTCAAACCAGACGGCTGGTTGGAAGACGAGCCTGACATGATACCAGAC
CCGGAGGCCAAGAAACCTGCGGATTGGGACGAGGAGATGGACGGGGAGTGGGAGGCGCCT
CTCGTGGATAACCCTCGCTGTGCCTCCGCACCCGGCTGTGGAACCTGGGCGCCGCCCACC
ATTCCCAACCCTAAATACAAGGGTATCTGGCGGGCACCTCTCATCCCCAACCCCAACTAC
AAGGGCAAGTGGAGTCCAAGGCGGATCCCCAACCCGGACTACTTCAACGATGAGCATCCC
TTCAGGATGACGCCCATTCACGCTGTTGGATTTGAACTGTGGTCGATGTCGCCCATGCTC
TTGTTCGACAACCTGATCATCACGGACGATCCGGCGGTGGCGGAGGCCTGGGCCGCTCAG
GGCTTCGCTCTCAAGAAACAGAGGATATCCAGTGACTCGAAAACGTGGTGGGGCAGACTG
CTGAGAGCCGTGAAGTACCGGCCGGGCGCGGTGTCGCTGTACGTGGTGTACTGCGCCGTA
CCTATCGTTATATACGTCGCCTACCTTATAAGGAGATCCTATGAGGAGTCCGTGGTGGAG
CTCGTCCTGCGCTCGGTGGGTGACAGACCCTGGCTGTGGGGAGCCGCGCTTCTGGTTTCC
TTCGCTGTGTTGGCCTTCGTCGCATACATGTGTTGTGGACCTCGAGTGGATCCGGAAGCG
GATGTCAAGAAGACGGACGCGGTTGTAGAGGATGATCCTCATCAAGAAGAAGTTGAAGAA
ACCAGTGAGAAGACGAGCAAAGCTGATCTGGAAGGCCCCGAGCCTGAGGCTGACACCAGT
GATACCACACCCTTAGTGGACTCGGAAGCAGCCGGCGACGGACAGAGGAAGAGGAAACCA
CGCAAGGAGTGA

Protein sequence:

MLKEYEYCQVVLNQIKTFIPIGCLISDRRIVTKNRKCMMAPGIMRVFLLSFLVVSGSLQV
TADVDDAEDGVTVETEEEIYQSPKADPKKVYLAENFDDVALFKKKWIKSEAKKQGVDEDI
AKYDGKWEIQIPTRKIFNSDSGLVLTTEAKHAAISTLLDRPFEFKDKPLIVQYEVTMQEG
QNCGGAYLKLLSRGVNTKADLKQFHDQTAYTIMFGPDKCGNDNKLHFIFRHKNPKNGTIE
EKHCKKPTQRLEDIYKDKEPHLYTLIVRPDNTFSVLVDNKEFNAGSLLEDFTPPVNPPEE
VDDPNDEKPEDWDEREKVLWQVLVRPAEGPGRNGDELIVVVLRNFRGSEAADRSIKYIDV
SSERTAVNLFANIINVLEEIVDPSASKPDDWDESEPAQIIDFNAVKPDGWLEDEPDMIPD
PEAKKPADWDEEMDGEWEAPLVDNPRCASAPGCGTWAPPTIPNPKYKGIWRAPLIPNPNY
KGKWSPRRIPNPDYFNDEHPFRMTPIHAVGFELWSMSPMLLFDNLIITDDPAVAEAWAAQ
GFALKKQRISSDSKTWWGRLLRAVKYRPGAVSLYVVYCAVPIVIYVAYLIRRSYEESVVE
LVLRSVGDRPWLWGAALLVSFAVLAFVAYMCCGPRVDPEADVKKTDAVVEDDPHQEEVEE
TSEKTSKADLEGPEPEADTSDTTPLVDSEAAGDGQRKRKPRKE