Genomic Position | scaffold492:+ 45964-54963 |
---|---|
See gene structure | |
CDS Length | 2112 |
Paired RNAseq reads   | 4781 |
Single RNAseq reads   | 12746 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001097 (4e-101) |
Best Drosophila hit   | calnexin 99A, isoform C (3e-73) |
Best Human hit | calmegin precursor (6e-95) |
Best NR hit (blastp)   | GF22893 [Drosophila ananassae] (1e-154) |
Best NR hit (blastx)   | calnexin 99A, isoform C [Drosophila melanogaster] (1e-143) |
GeneOntology terms    | GO:0005509 calcium ion binding GO:0005635 nuclear envelope GO:0005783 endoplasmic reticulum GO:0006457 protein folding GO:0006461 protein complex assembly GO:0007339 binding of sperm to zona pellucida GO:0051082 unfolded protein binding |
InterPro families    | IPR001580 Calreticulin/calnexin IPR008985 Concanavalin A-like lectin/glucanase IPR009033 Calreticulin/calnexin, P IPR013320 Concanavalin A-like lectin/glucanase, subgroup IPR018124 Calreticulin/calnexin, conserved site |
Orthology group | MCL10820 |
Nucleotide sequence:
ATGTTAAAAGAATACGAATACTGTCAAGTAGTGTTAAATCAGATAAAGACTTTTATTCCA
ATTGGATGTTTAATTAGTGACCGGCGCATTGTAACTAAAAATCGAAAGTGCATGATGGCA
CCTGGTATTATGCGGGTCTTTTTATTAAGCTTCTTGGTAGTCTCTGGCTCGCTGCAAGTT
ACGGCCGATGTCGACGATGCCGAAGATGGAGTAACTGTTGAGACAGAAGAGGAAATCTAC
CAAAGTCCTAAGGCCGATCCCAAGAAGGTGTATCTGGCGGAGAACTTTGATGATGTGGCA
TTGTTCAAGAAGAAGTGGATTAAGTCTGAAGCAAAGAAACAGGGTGTGGACGAAGATATC
GCCAAATATGATGGGAAATGGGAGATACAAATACCAACAAGAAAAATATTCAATAGCGAC
TCAGGGTTGGTGCTGACTACAGAGGCTAAGCATGCAGCTATATCAACACTGCTCGACCGG
CCGTTCGAGTTCAAAGACAAACCACTCATTGTACAATACGAAGTGACTATGCAGGAGGGT
CAAAATTGTGGTGGTGCTTACCTAAAACTTCTATCACGCGGTGTGAACACGAAAGCAGAC
CTCAAACAGTTCCACGACCAGACTGCGTACACCATCATGTTTGGGCCCGACAAATGTGGC
AACGACAACAAACTGCACTTCATCTTCAGACACAAAAACCCCAAGAATGGGACCATCGAA
GAAAAACACTGCAAGAAACCAACCCAACGTCTTGAAGACATCTACAAAGACAAGGAGCCT
CACCTGTACACTCTGATAGTGCGGCCAGACAACACATTCTCAGTCCTCGTCGACAACAAG
GAGTTCAACGCCGGTTCGTTGCTAGAAGACTTCACCCCACCCGTCAACCCTCCGGAGGAG
GTGGACGATCCCAACGACGAGAAGCCAGAGGACTGGGACGAGAGGGAGAAGGTTTTGTGG
CAAGTTCTTGTACGTCCTGCTGAGGGACCTGGTCGCAATGGTGATGAGCTGATCGTAGTC
GTTTTAAGGAACTTCCGCGGCTCAGAGGCAGCTGATAGATCTATAAAATATATAGATGTT
AGTTCTGAACGGACCGCTGTAAATTTATTTGCGAATATTATCAATGTTCTGGAGGAGATC
GTGGATCCCTCAGCGAGTAAGCCAGATGACTGGGATGAGAGTGAGCCGGCACAGATCATA
GACTTCAACGCTGTCAAACCAGACGGCTGGTTGGAAGACGAGCCTGACATGATACCAGAC
CCGGAGGCCAAGAAACCTGCGGATTGGGACGAGGAGATGGACGGGGAGTGGGAGGCGCCT
CTCGTGGATAACCCTCGCTGTGCCTCCGCACCCGGCTGTGGAACCTGGGCGCCGCCCACC
ATTCCCAACCCTAAATACAAGGGTATCTGGCGGGCACCTCTCATCCCCAACCCCAACTAC
AAGGGCAAGTGGAGTCCAAGGCGGATCCCCAACCCGGACTACTTCAACGATGAGCATCCC
TTCAGGATGACGCCCATTCACGCTGTTGGATTTGAACTGTGGTCGATGTCGCCCATGCTC
TTGTTCGACAACCTGATCATCACGGACGATCCGGCGGTGGCGGAGGCCTGGGCCGCTCAG
GGCTTCGCTCTCAAGAAACAGAGGATATCCAGTGACTCGAAAACGTGGTGGGGCAGACTG
CTGAGAGCCGTGAAGTACCGGCCGGGCGCGGTGTCGCTGTACGTGGTGTACTGCGCCGTA
CCTATCGTTATATACGTCGCCTACCTTATAAGGAGATCCTATGAGGAGTCCGTGGTGGAG
CTCGTCCTGCGCTCGGTGGGTGACAGACCCTGGCTGTGGGGAGCCGCGCTTCTGGTTTCC
TTCGCTGTGTTGGCCTTCGTCGCATACATGTGTTGTGGACCTCGAGTGGATCCGGAAGCG
GATGTCAAGAAGACGGACGCGGTTGTAGAGGATGATCCTCATCAAGAAGAAGTTGAAGAA
ACCAGTGAGAAGACGAGCAAAGCTGATCTGGAAGGCCCCGAGCCTGAGGCTGACACCAGT
GATACCACACCCTTAGTGGACTCGGAAGCAGCCGGCGACGGACAGAGGAAGAGGAAACCA
CGCAAGGAGTGA
Protein sequence:
MLKEYEYCQVVLNQIKTFIPIGCLISDRRIVTKNRKCMMAPGIMRVFLLSFLVVSGSLQV
TADVDDAEDGVTVETEEEIYQSPKADPKKVYLAENFDDVALFKKKWIKSEAKKQGVDEDI
AKYDGKWEIQIPTRKIFNSDSGLVLTTEAKHAAISTLLDRPFEFKDKPLIVQYEVTMQEG
QNCGGAYLKLLSRGVNTKADLKQFHDQTAYTIMFGPDKCGNDNKLHFIFRHKNPKNGTIE
EKHCKKPTQRLEDIYKDKEPHLYTLIVRPDNTFSVLVDNKEFNAGSLLEDFTPPVNPPEE
VDDPNDEKPEDWDEREKVLWQVLVRPAEGPGRNGDELIVVVLRNFRGSEAADRSIKYIDV
SSERTAVNLFANIINVLEEIVDPSASKPDDWDESEPAQIIDFNAVKPDGWLEDEPDMIPD
PEAKKPADWDEEMDGEWEAPLVDNPRCASAPGCGTWAPPTIPNPKYKGIWRAPLIPNPNY
KGKWSPRRIPNPDYFNDEHPFRMTPIHAVGFELWSMSPMLLFDNLIITDDPAVAEAWAAQ
GFALKKQRISSDSKTWWGRLLRAVKYRPGAVSLYVVYCAVPIVIYVAYLIRRSYEESVVE
LVLRSVGDRPWLWGAALLVSFAVLAFVAYMCCGPRVDPEADVKKTDAVVEDDPHQEEVEE
TSEKTSKADLEGPEPEADTSDTTPLVDSEAAGDGQRKRKPRKE