DPGLEAN20514 in OGS1.0

New model in OGS2.0DPOGS201735 
Genomic Positionscaffold1591:+ 18543-26477
See gene structure
CDS Length1515
Paired RNAseq reads  3616
Single RNAseq reads  10561
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001353 (3e-65)
Best Drosophila hit  ergic53, isoform A (5e-145)
Best Human hitprotein ERGIC-53 precursor (2e-89)
Best NR hit (blastp)  PREDICTED: similar to AGAP005404-PA [Tribolium castaneum] (4e-173)
Best NR hit (blastx)  PREDICTED: similar to AGAP005404-PA [Tribolium castaneum] (4e-154)
GeneOntology terms
  
GO:0005537 mannose binding
GO:0016020 membrane
InterPro families

  
IPR005052 Legume-like lectin
IPR008985 Concanavalin A-like lectin/glucanase
IPR013320 Concanavalin A-like lectin/glucanase, subgroup
Orthology groupMCL12818

Nucleotide sequence:

ATGTCGTCGTATAGTTTAAATTTATTATTGTTGACGGTGTTTGGTATATTTGTGACTTCG
AACACCCAGACGATACATAAACGATTTGAATACAAATATTCGTTCAAACCGCCTTATTTA
GCACAAAAAGATGGTTCTGTGCCGTTTTGGGAGTACGGAGGCAATGCGATCGCGTCTGGT
GAGAGCGTCCGGTTGGCGCCGTCTCTGAGGAGTCAGAAGGGCGCTATATGGTCCAAGCAT
CCGATCAACTTCGACTGGTGGGAAGTGGACATCATGTTCAAGGTCACCGGCAGGGGAAGG
ATAGGAGCCGACGGGCTGGCCTTCTGGTACGTGACGAAGCGCGGGGAGTACACCGGCGAG
GTGTTCGGCTCCTCGGACCGCTGGAACGGTCTGGGGATCATCTTCGACTCCTTCGATAAC
GACAACAAACACAACAACCCTTACATCATGGCGGTCCTGAACGACGGCACTAAGAGCTTC
GACCACAAGAGCGATGGTTCCAGCCAGCTGTTATCCGGCTGCTTGCGGGACTTCAGGAAC
AAGCCCTTCCCGACCCGCGCCCGGATAGAATACTACGCGAACACACTCACAGTCTACTTC
CATAACGGTCTAACCAACAACGAGGCGGACTATGACTTGTGTTTCCGAGCTGAGAATGTC
CAGTTGCCTCGTGGAGGGTTCCTGGGCGTCTCCGCAGCCACCGGCGGGCTGGCGGACGAC
CATGACGTCATACACCTGCTGACGTCATCGCTGCACTCCACACAGCAGGGAGGACAGCAA
ATAAACAGTGCAGAGCAAGCCAAGCTGTCCCAGGAGTACCAGGAGTACCAGAAGAAGCTG
GAGCAGCAGAAAGAGGATTACAGGAAGGAACATCCCGATGAGGTCCGAGACAAGGACGGT
GAGTTTGATGACTGGTTTGAGTCTGATGGACAACGGGAGCTCAGACAGATCTTCGCCAGC
GTTGGGCATGTGCAGGACGGGGTCAGGGAACTCAGCAAGAAGATGGACGAGGTTATTGGC
AAACAGACAAACTCGCTGTCCATGTTGTCAGCCGTGTACAGTCAGACACAGACGATGCAG
GTCCAGCAGCCTGGACAAGCTGGCCAACCACCTGTACAGCAGATGCCGATGCTGCCCATC
ACCAGACACGACTGGGACCAGCTGATGGCCAACAACCAGCTCGTCATCAACACCATCGCC
GAGCTCAAGGGTTTCATAATAGACGTGTCGCGTAAGACGGACAGCGTGGTGGGCGGCGTG
GCGGGCGGCGCGGCGGGCGGCGCTCTAAACCAACAGGTGGTGAACGAACTGAGGGAGGGG
ATCAATCATGTTAAAAACAATGTGGCGGGAGTCGCACAGAGGCTGTCGTCCGCTCCGCCG
CAGCCCGCGTGTCCGTCTGTGTCGTGTGTGTCCACCACCATGCTGCTGACGGTAGTGGCG
TCGCAGCTGGCCGTCATGTTCCTGTATTCGTTGTACAAAGAAAGGAAAGAGGCGCAGGCT
AAGAAATTCTTCTGA

Protein sequence:

MSSYSLNLLLLTVFGIFVTSNTQTIHKRFEYKYSFKPPYLAQKDGSVPFWEYGGNAIASG
ESVRLAPSLRSQKGAIWSKHPINFDWWEVDIMFKVTGRGRIGADGLAFWYVTKRGEYTGE
VFGSSDRWNGLGIIFDSFDNDNKHNNPYIMAVLNDGTKSFDHKSDGSSQLLSGCLRDFRN
KPFPTRARIEYYANTLTVYFHNGLTNNEADYDLCFRAENVQLPRGGFLGVSAATGGLADD
HDVIHLLTSSLHSTQQGGQQINSAEQAKLSQEYQEYQKKLEQQKEDYRKEHPDEVRDKDG
EFDDWFESDGQRELRQIFASVGHVQDGVRELSKKMDEVIGKQTNSLSMLSAVYSQTQTMQ
VQQPGQAGQPPVQQMPMLPITRHDWDQLMANNQLVINTIAELKGFIIDVSRKTDSVVGGV
AGGAAGGALNQQVVNELREGINHVKNNVAGVAQRLSSAPPQPACPSVSCVSTTMLLTVVA
SQLAVMFLYSLYKERKEAQAKKFF