DPGLEAN13097 in OGS1.0

New model in OGS2.0DPOGS201438 
Genomic Positionscaffold544:+ 24083-36573
See gene structure
CDS Length4092
Paired RNAseq reads  8318
Single RNAseq reads  28073
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002596 (2e-24)
Best Drosophila hit  glycogenin, isoform B (2e-93)
Best Human hitglycogenin-1 isoform 2 (9e-81)
Best NR hit (blastp)  PREDICTED: similar to Glycogenin CG9480-PB, isoform B [Apis mellifera] (2e-110)
Best NR hit (blastx)  AGAP007724-PC [Anopheles gambiae str. PEST] (3e-104)
GeneOntology terms
  
GO:0016757 transferase activity, transferring glycosyl groups
GO:0005515 protein binding
InterPro families  IPR002495 Glycosyl transferase, family 8
Orthology groupMCL15009

Nucleotide sequence:

ATGTCAAACCGAGCATGGGTGACGCTAGCTACAAACGATTCCTATGGTCTTGGGGCTCTG
GTTCTGGCGCATTCTTTGCGTCGGGCGTCCTCCTCTTACCCAGCAGTCGTGCTGATTACA
CCCTCGGTCACTGAGCCCATGAGGGAGCGTCTCCGTGCAGTATTCGCTGAAGTAATCCTA
GTGGACGTTTTGGATTCCAAGGATGCAGCTCATCTCGCCTTGCTGCAGCGGCCGGAACTG
GGCATCACATTTACCAAAATACATTGTTGGAATCTCACGCAGTATGAGAAATGTGTCTTC
CTTGATGCCGATACACTGATCGTTCAGAACTGTGATGAGTTATTTGAACGCGAGGAGTTG
TCCGCGGCTCCCGACGTCGGCTGGCCCGACTGTTTCAATTCGGGAGTTTTTGTATTCAAA
CCTTCCGCCGATACATTCAGCAAACTCGTCACATTTGCATCCGAAAGGGGCAGTTTTGAT
GGTGGTGACCAGGGACTCTTGAATTCCTACTTCTCGGATTGGGCCCATGGTGACATTAAC
AAGCATTTGCCCTTTCTGTACAATGTGACATCTGCTGCCTTTTACTCCTATATCCCAGCC
TTAAAGCATTACGGCCAAAATTTAAAAATTATCCATTTCATCGGCGCCGCTAAGCCGTGG
CTCCAGCATTTCAACTGGCAGTCTCGGTCAGTCGAGGCCCCCGAACATTTACGAGGTTTC
TTGCAACTATGGTGGGACCTCTTTGTTGCACAAGTTCATTCACAGCTAGACACACAAATG
GCTGAGGAAGTTCCTCTGGGGATTGACTTAGAAGAAGAAGAACCGAGTGAATATGATGAA
CCAGTACAGGATTATAGTTTCTATGAACCGACACTGGATCCCAGTTCTGAGTTTCCATGG
CATCGTCCTTATGATCAGATCAAAAACACAGAATCCATTGAGCCCAGCATTGACATAGGT
CAATTTCATGATCCATGGCAAATTTACAGAGGGAACATACCTCCTAGTAAAGATGATGCA
AGTTGTATAAATGCAACGGAAAGTCATAGACAGTATGCCTGGGACTACATGCAACCGCAA
ACACAGCATTATACACCTGAGAATAGTCACAATTCTGAAAACACTTACACTCAAAATTAT
AATAGCGAAATATGGCAATATAATTCCGAACATAGTTCTCAACCTCAAACTACGCAACAG
TTTACGACATTTACACCTTCAATAAGTTCTCAATGGGAAGAGAGTCAATGCAACATAAAT
GTTCACGATCAACACTATCACACACCGATCCAAGAAATAATTGTTCACCATGATCATTAC
CCAAGTCACAGTAATCAACAAAGTTCCCCTGAATCTCAAAATCAGCCTGGTGATCATCAA
GGTCATAACAACCACCAAAGTCACACCCATCACCAACATATCGAACATCATCAAAACAAT
TATCAGAATTACAATCAATCTCATCACCATGAAAATGATCAGAACCAAAACGATTTCACT
CAGCAGCGTTTTGAAAGTCATGTACATGAACAAAATCAAAATTATAATTATCAACATTCA
TCAGAACATCATCACGAGTCACAATCCTATCACGACCCTGGTTTTGAGCAATCACATCAA
AGAGCCGATTATTATCAAGACAAACACGATTCACAGTCTCTTTTTCATAACCATTCCCAT
AGCAATATCACAGAAAATAACAAAAATGTTAACAATGATGAAAGATTCAATAACAGTTAT
ATGAAAAATGTTGAAATAAATTATTCGCAGTTTAAAAAACAAACTCAACCACAAATATAC
ACTGTCATGATGGATCATGAAAGGTTGCATAATGTTCGTAAATTACATTCAAATCTAAAT
GGCTGTGAAGCAGAATACTATAGTAATACTTTTGAAGATATCCCTAGGCATCCGTACGAT
GGATTTTATCTAAGACATAGAACTACTATAGATTCTCGGGGGCGAAAAATCTGTATTCAT
GAAATACCTTTATCCCCTCCTTCGCCAACACCGTCACTCGAGTCGTCACTTGAAAGCGAT
GATGAAAATGAAATATTCAAAGATATTAACTATGACAGGCTGAACGGTGAGGAAAGTCAA
ACTGGTGTAGCAGGTAACCTGGCTAAAGTAGTGCCGGGTGAACCGCAGCAACAAGAGGCA
GTCGATGAGCTTACGAGGCGACAGGGCTGGGAAGCCGGCAATATTGATTATATGGGTGCT
GACTCCTTTGACAATATCTGGGCGAAAATATCGCAAACACTAAGTCAACCACCAAGCTCT
CCGCCCCGACAACCTTCTCCATCTAATGACCAGTCTGTTCAACCTAGCGAAGATCGTGCT
GTCGCAATAGAAGAAGTAAAAGAGGCCGTGGTAGCACCGGTTGAATCAAAACCAGAAGAA
CCAGTCAAAGGTAGTATACCGTCAGACGCATCCTCTGAAACACCCGTTGCAGTAGAGGCA
CCGGTCATGGCATCCGAAGTAGATGCCTCTACTGCTTCTACTGAAACTGTAGAAAATGTT
GTGTCACTGGACGCTGCAGAAACTGTCGCGCCTACTGAGACAAGTGAAAGCGTTCCCCCT
CCAGAAGCACCTGCATGTGTTTCCCTTCCCGTGGCATCTGATTGTGTTTCCTCGCCTGAA
GTTCCAGCAACTGTTGATGCAACAGAACCTGTTGTACAGATTGAAGCTGAAGCAACAGAA
AGTGTTTCCCCTCCCGAAGCATCAAAACCTGTTGCACAGATTGAAGCAACAGAAAGTGTT
TCCCCTCCCGAAGCATCAGAGCATGTTGCACAGATTGAAGCAACAGAAAGTGTTTCCCCT
CCCGAAGCATCAGAGCATGTTGCACAGATTGAAGCAACAGAAAGTGTTTCCCCTCCCGAA
GCATCAGAGCATGTTGCACAGATTGAAGCAACAGAAAGTGTTTCCCCTCCCGAAGCATCA
GAGCATGTTGCACAGATTGAAGCAACCTGTGTTTCCTCACCCCAACCATCGGAAAATATT
CCAATAGTTGAATCAAAAGAAACTGTGGCACCACCTGTCGCAGACGTAAACGTTGCTTCG
ACTGAAGTCTTAGAAAATGTTTCTGCGCCTGCAACACCTGTTGCTCCGGCTGAATCGATC
GATAGTTTGACTGCTCCTGAACCACCTGCACCTGCAACTGAAAATAAAGATGATGTTGTC
CCTGAAACTTCACCAGCAGCAGCGTCAATTGAAGAGCCTGTCGTAGATGTCCCAGTTCCA
GCGAGTGAAGCCGCCGAATCTCCAGGTAAACAAGAAGATCCTATTCCTGTTCCAGCCGAG
ATTGCTGCTATACCAAAAAGCATGCCAGACGAATCCCAAACGGAGATGGAGCCATCAGGT
GACATCCCAGTGACTGCTGAAGAAGCAATAAAAATTGACATTGCATGCGTCCCTAAATCT
AGTGAAATGCCAGCAGTGGATGAGCATTCAGATTCTCCTTTAAATAAATCATTGGCAACC
AATGAACAATCCTCAGACAAAAAGGAAGTGGCCTCTGACAGTCCTCCTCTGGCCAATACC
CCTTCCAAAGAAGAAATACCTAGTCCGCCAGCTGCCAAAACGGAGGAGCGTCGCAAGCCG
TTGGGGAAACTGTCGCTGCCGCCCGCGGCTTGTGACACGCTGCCAACACCTGACAGCGAG
CTAGAGGACGCGGCCTCGCTTGCACACGCCATCATCGCCGGTGAACTGCGCACGCCTACT
GTCACTTCCCCCTCACCTCCCGTCATATCTTCTTCACCTCAAACACAACCCTCACAAACA
CAAGCCCGCAGTCTATCCATCGACCAGCCCGAAGCACCAACTCCTCCCCTTGATTCCCCC
CTATCATTATCTCAGATCGGCGTCAAATCAAAACCCACCATCGCATCTCAAATAGAAACC
TCGGTTTCTAAGACCGAATCGGCCCCGACTTCCGAGGTGTCCGAAGCACCTAAACCGAAG
TCAGACGCTCCCAAAAAGAAAATAGTGAAGAAAGTGGTGAAGAAGGTGGAAAAGGAAGGC
GGTGCCAGTGGTGACGCGCCAGTCCCCGTCCCGCCGCCGCGGAAAAAGGAAAAGAAACCC
AAGGAGAAATAA

Protein sequence:

MSNRAWVTLATNDSYGLGALVLAHSLRRASSSYPAVVLITPSVTEPMRERLRAVFAEVIL
VDVLDSKDAAHLALLQRPELGITFTKIHCWNLTQYEKCVFLDADTLIVQNCDELFEREEL
SAAPDVGWPDCFNSGVFVFKPSADTFSKLVTFASERGSFDGGDQGLLNSYFSDWAHGDIN
KHLPFLYNVTSAAFYSYIPALKHYGQNLKIIHFIGAAKPWLQHFNWQSRSVEAPEHLRGF
LQLWWDLFVAQVHSQLDTQMAEEVPLGIDLEEEEPSEYDEPVQDYSFYEPTLDPSSEFPW
HRPYDQIKNTESIEPSIDIGQFHDPWQIYRGNIPPSKDDASCINATESHRQYAWDYMQPQ
TQHYTPENSHNSENTYTQNYNSEIWQYNSEHSSQPQTTQQFTTFTPSISSQWEESQCNIN
VHDQHYHTPIQEIIVHHDHYPSHSNQQSSPESQNQPGDHQGHNNHQSHTHHQHIEHHQNN
YQNYNQSHHHENDQNQNDFTQQRFESHVHEQNQNYNYQHSSEHHHESQSYHDPGFEQSHQ
RADYYQDKHDSQSLFHNHSHSNITENNKNVNNDERFNNSYMKNVEINYSQFKKQTQPQIY
TVMMDHERLHNVRKLHSNLNGCEAEYYSNTFEDIPRHPYDGFYLRHRTTIDSRGRKICIH
EIPLSPPSPTPSLESSLESDDENEIFKDINYDRLNGEESQTGVAGNLAKVVPGEPQQQEA
VDELTRRQGWEAGNIDYMGADSFDNIWAKISQTLSQPPSSPPRQPSPSNDQSVQPSEDRA
VAIEEVKEAVVAPVESKPEEPVKGSIPSDASSETPVAVEAPVMASEVDASTASTETVENV
VSLDAAETVAPTETSESVPPPEAPACVSLPVASDCVSSPEVPATVDATEPVVQIEAEATE
SVSPPEASKPVAQIEATESVSPPEASEHVAQIEATESVSPPEASEHVAQIEATESVSPPE
ASEHVAQIEATESVSPPEASEHVAQIEATCVSSPQPSENIPIVESKETVAPPVADVNVAS
TEVLENVSAPATPVAPAESIDSLTAPEPPAPATENKDDVVPETSPAAASIEEPVVDVPVP
ASEAAESPGKQEDPIPVPAEIAAIPKSMPDESQTEMEPSGDIPVTAEEAIKIDIACVPKS
SEMPAVDEHSDSPLNKSLATNEQSSDKKEVASDSPPLANTPSKEEIPSPPAAKTEERRKP
LGKLSLPPAACDTLPTPDSELEDAASLAHAIIAGELRTPTVTSPSPPVISSSPQTQPSQT
QARSLSIDQPEAPTPPLDSPLSLSQIGVKSKPTIASQIETSVSKTESAPTSEVSEAPKPK
SDAPKKKIVKKVVKKVEKEGGASGDAPVPVPPPRKKEKKPKEK