New model in OGS2.0 | DPOGS201438  |
---|---|
Genomic Position | scaffold544:+ 24083-36573 |
CDS Length | 4092 |
Paired RNAseq reads   | 8318 |
Single RNAseq reads   | 28073 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA002596 (2e-24) |
Best Drosophila hit   | glycogenin, isoform B (2e-93) |
Best Human hit | glycogenin-1 isoform 2 (9e-81) |
Best NR hit (blastp)   | PREDICTED: similar to Glycogenin CG9480-PB, isoform B [Apis mellifera] (2e-110) |
Best NR hit (blastx)   | AGAP007724-PC [Anopheles gambiae str. PEST] (3e-104) |
GeneOntology terms    | GO:0016757 transferase activity, transferring glycosyl groups GO:0005515 protein binding |
InterPro families   | IPR002495 Glycosyl transferase, family 8 |
Orthology group | MCL15009 |
Nucleotide sequence:
ATGTCAAACCGAGCATGGGTGACGCTAGCTACAAACGATTCCTATGGTCTTGGGGCTCTG
GTTCTGGCGCATTCTTTGCGTCGGGCGTCCTCCTCTTACCCAGCAGTCGTGCTGATTACA
CCCTCGGTCACTGAGCCCATGAGGGAGCGTCTCCGTGCAGTATTCGCTGAAGTAATCCTA
GTGGACGTTTTGGATTCCAAGGATGCAGCTCATCTCGCCTTGCTGCAGCGGCCGGAACTG
GGCATCACATTTACCAAAATACATTGTTGGAATCTCACGCAGTATGAGAAATGTGTCTTC
CTTGATGCCGATACACTGATCGTTCAGAACTGTGATGAGTTATTTGAACGCGAGGAGTTG
TCCGCGGCTCCCGACGTCGGCTGGCCCGACTGTTTCAATTCGGGAGTTTTTGTATTCAAA
CCTTCCGCCGATACATTCAGCAAACTCGTCACATTTGCATCCGAAAGGGGCAGTTTTGAT
GGTGGTGACCAGGGACTCTTGAATTCCTACTTCTCGGATTGGGCCCATGGTGACATTAAC
AAGCATTTGCCCTTTCTGTACAATGTGACATCTGCTGCCTTTTACTCCTATATCCCAGCC
TTAAAGCATTACGGCCAAAATTTAAAAATTATCCATTTCATCGGCGCCGCTAAGCCGTGG
CTCCAGCATTTCAACTGGCAGTCTCGGTCAGTCGAGGCCCCCGAACATTTACGAGGTTTC
TTGCAACTATGGTGGGACCTCTTTGTTGCACAAGTTCATTCACAGCTAGACACACAAATG
GCTGAGGAAGTTCCTCTGGGGATTGACTTAGAAGAAGAAGAACCGAGTGAATATGATGAA
CCAGTACAGGATTATAGTTTCTATGAACCGACACTGGATCCCAGTTCTGAGTTTCCATGG
CATCGTCCTTATGATCAGATCAAAAACACAGAATCCATTGAGCCCAGCATTGACATAGGT
CAATTTCATGATCCATGGCAAATTTACAGAGGGAACATACCTCCTAGTAAAGATGATGCA
AGTTGTATAAATGCAACGGAAAGTCATAGACAGTATGCCTGGGACTACATGCAACCGCAA
ACACAGCATTATACACCTGAGAATAGTCACAATTCTGAAAACACTTACACTCAAAATTAT
AATAGCGAAATATGGCAATATAATTCCGAACATAGTTCTCAACCTCAAACTACGCAACAG
TTTACGACATTTACACCTTCAATAAGTTCTCAATGGGAAGAGAGTCAATGCAACATAAAT
GTTCACGATCAACACTATCACACACCGATCCAAGAAATAATTGTTCACCATGATCATTAC
CCAAGTCACAGTAATCAACAAAGTTCCCCTGAATCTCAAAATCAGCCTGGTGATCATCAA
GGTCATAACAACCACCAAAGTCACACCCATCACCAACATATCGAACATCATCAAAACAAT
TATCAGAATTACAATCAATCTCATCACCATGAAAATGATCAGAACCAAAACGATTTCACT
CAGCAGCGTTTTGAAAGTCATGTACATGAACAAAATCAAAATTATAATTATCAACATTCA
TCAGAACATCATCACGAGTCACAATCCTATCACGACCCTGGTTTTGAGCAATCACATCAA
AGAGCCGATTATTATCAAGACAAACACGATTCACAGTCTCTTTTTCATAACCATTCCCAT
AGCAATATCACAGAAAATAACAAAAATGTTAACAATGATGAAAGATTCAATAACAGTTAT
ATGAAAAATGTTGAAATAAATTATTCGCAGTTTAAAAAACAAACTCAACCACAAATATAC
ACTGTCATGATGGATCATGAAAGGTTGCATAATGTTCGTAAATTACATTCAAATCTAAAT
GGCTGTGAAGCAGAATACTATAGTAATACTTTTGAAGATATCCCTAGGCATCCGTACGAT
GGATTTTATCTAAGACATAGAACTACTATAGATTCTCGGGGGCGAAAAATCTGTATTCAT
GAAATACCTTTATCCCCTCCTTCGCCAACACCGTCACTCGAGTCGTCACTTGAAAGCGAT
GATGAAAATGAAATATTCAAAGATATTAACTATGACAGGCTGAACGGTGAGGAAAGTCAA
ACTGGTGTAGCAGGTAACCTGGCTAAAGTAGTGCCGGGTGAACCGCAGCAACAAGAGGCA
GTCGATGAGCTTACGAGGCGACAGGGCTGGGAAGCCGGCAATATTGATTATATGGGTGCT
GACTCCTTTGACAATATCTGGGCGAAAATATCGCAAACACTAAGTCAACCACCAAGCTCT
CCGCCCCGACAACCTTCTCCATCTAATGACCAGTCTGTTCAACCTAGCGAAGATCGTGCT
GTCGCAATAGAAGAAGTAAAAGAGGCCGTGGTAGCACCGGTTGAATCAAAACCAGAAGAA
CCAGTCAAAGGTAGTATACCGTCAGACGCATCCTCTGAAACACCCGTTGCAGTAGAGGCA
CCGGTCATGGCATCCGAAGTAGATGCCTCTACTGCTTCTACTGAAACTGTAGAAAATGTT
GTGTCACTGGACGCTGCAGAAACTGTCGCGCCTACTGAGACAAGTGAAAGCGTTCCCCCT
CCAGAAGCACCTGCATGTGTTTCCCTTCCCGTGGCATCTGATTGTGTTTCCTCGCCTGAA
GTTCCAGCAACTGTTGATGCAACAGAACCTGTTGTACAGATTGAAGCTGAAGCAACAGAA
AGTGTTTCCCCTCCCGAAGCATCAAAACCTGTTGCACAGATTGAAGCAACAGAAAGTGTT
TCCCCTCCCGAAGCATCAGAGCATGTTGCACAGATTGAAGCAACAGAAAGTGTTTCCCCT
CCCGAAGCATCAGAGCATGTTGCACAGATTGAAGCAACAGAAAGTGTTTCCCCTCCCGAA
GCATCAGAGCATGTTGCACAGATTGAAGCAACAGAAAGTGTTTCCCCTCCCGAAGCATCA
GAGCATGTTGCACAGATTGAAGCAACCTGTGTTTCCTCACCCCAACCATCGGAAAATATT
CCAATAGTTGAATCAAAAGAAACTGTGGCACCACCTGTCGCAGACGTAAACGTTGCTTCG
ACTGAAGTCTTAGAAAATGTTTCTGCGCCTGCAACACCTGTTGCTCCGGCTGAATCGATC
GATAGTTTGACTGCTCCTGAACCACCTGCACCTGCAACTGAAAATAAAGATGATGTTGTC
CCTGAAACTTCACCAGCAGCAGCGTCAATTGAAGAGCCTGTCGTAGATGTCCCAGTTCCA
GCGAGTGAAGCCGCCGAATCTCCAGGTAAACAAGAAGATCCTATTCCTGTTCCAGCCGAG
ATTGCTGCTATACCAAAAAGCATGCCAGACGAATCCCAAACGGAGATGGAGCCATCAGGT
GACATCCCAGTGACTGCTGAAGAAGCAATAAAAATTGACATTGCATGCGTCCCTAAATCT
AGTGAAATGCCAGCAGTGGATGAGCATTCAGATTCTCCTTTAAATAAATCATTGGCAACC
AATGAACAATCCTCAGACAAAAAGGAAGTGGCCTCTGACAGTCCTCCTCTGGCCAATACC
CCTTCCAAAGAAGAAATACCTAGTCCGCCAGCTGCCAAAACGGAGGAGCGTCGCAAGCCG
TTGGGGAAACTGTCGCTGCCGCCCGCGGCTTGTGACACGCTGCCAACACCTGACAGCGAG
CTAGAGGACGCGGCCTCGCTTGCACACGCCATCATCGCCGGTGAACTGCGCACGCCTACT
GTCACTTCCCCCTCACCTCCCGTCATATCTTCTTCACCTCAAACACAACCCTCACAAACA
CAAGCCCGCAGTCTATCCATCGACCAGCCCGAAGCACCAACTCCTCCCCTTGATTCCCCC
CTATCATTATCTCAGATCGGCGTCAAATCAAAACCCACCATCGCATCTCAAATAGAAACC
TCGGTTTCTAAGACCGAATCGGCCCCGACTTCCGAGGTGTCCGAAGCACCTAAACCGAAG
TCAGACGCTCCCAAAAAGAAAATAGTGAAGAAAGTGGTGAAGAAGGTGGAAAAGGAAGGC
GGTGCCAGTGGTGACGCGCCAGTCCCCGTCCCGCCGCCGCGGAAAAAGGAAAAGAAACCC
AAGGAGAAATAA
Protein sequence:
MSNRAWVTLATNDSYGLGALVLAHSLRRASSSYPAVVLITPSVTEPMRERLRAVFAEVIL
VDVLDSKDAAHLALLQRPELGITFTKIHCWNLTQYEKCVFLDADTLIVQNCDELFEREEL
SAAPDVGWPDCFNSGVFVFKPSADTFSKLVTFASERGSFDGGDQGLLNSYFSDWAHGDIN
KHLPFLYNVTSAAFYSYIPALKHYGQNLKIIHFIGAAKPWLQHFNWQSRSVEAPEHLRGF
LQLWWDLFVAQVHSQLDTQMAEEVPLGIDLEEEEPSEYDEPVQDYSFYEPTLDPSSEFPW
HRPYDQIKNTESIEPSIDIGQFHDPWQIYRGNIPPSKDDASCINATESHRQYAWDYMQPQ
TQHYTPENSHNSENTYTQNYNSEIWQYNSEHSSQPQTTQQFTTFTPSISSQWEESQCNIN
VHDQHYHTPIQEIIVHHDHYPSHSNQQSSPESQNQPGDHQGHNNHQSHTHHQHIEHHQNN
YQNYNQSHHHENDQNQNDFTQQRFESHVHEQNQNYNYQHSSEHHHESQSYHDPGFEQSHQ
RADYYQDKHDSQSLFHNHSHSNITENNKNVNNDERFNNSYMKNVEINYSQFKKQTQPQIY
TVMMDHERLHNVRKLHSNLNGCEAEYYSNTFEDIPRHPYDGFYLRHRTTIDSRGRKICIH
EIPLSPPSPTPSLESSLESDDENEIFKDINYDRLNGEESQTGVAGNLAKVVPGEPQQQEA
VDELTRRQGWEAGNIDYMGADSFDNIWAKISQTLSQPPSSPPRQPSPSNDQSVQPSEDRA
VAIEEVKEAVVAPVESKPEEPVKGSIPSDASSETPVAVEAPVMASEVDASTASTETVENV
VSLDAAETVAPTETSESVPPPEAPACVSLPVASDCVSSPEVPATVDATEPVVQIEAEATE
SVSPPEASKPVAQIEATESVSPPEASEHVAQIEATESVSPPEASEHVAQIEATESVSPPE
ASEHVAQIEATESVSPPEASEHVAQIEATCVSSPQPSENIPIVESKETVAPPVADVNVAS
TEVLENVSAPATPVAPAESIDSLTAPEPPAPATENKDDVVPETSPAAASIEEPVVDVPVP
ASEAAESPGKQEDPIPVPAEIAAIPKSMPDESQTEMEPSGDIPVTAEEAIKIDIACVPKS
SEMPAVDEHSDSPLNKSLATNEQSSDKKEVASDSPPLANTPSKEEIPSPPAAKTEERRKP
LGKLSLPPAACDTLPTPDSELEDAASLAHAIIAGELRTPTVTSPSPPVISSSPQTQPSQT
QARSLSIDQPEAPTPPLDSPLSLSQIGVKSKPTIASQIETSVSKTESAPTSEVSEAPKPK
SDAPKKKIVKKVVKKVEKEGGASGDAPVPVPPPRKKEKKPKEK
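
The record lists a CDS length of 4092 alongside the nucleotide and protein sequences, so the three can be checked against one another. The sketch below is one way to do that, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The helper names `translate` and `expected_protein_length` are hypothetical and not part of any database tooling. Only the first and last lines of the nucleotide sequence are translated here; the full sequence could be passed in the same way after joining its lines.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. Assumption: this table applies here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; stop codons become '*'.

    Raises ValueError if the length is not a multiple of 3, and KeyError
    on any codon containing a character other than A, C, G, or T.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS that ends in a single stop codon."""
    return cds_length // 3 - 1


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGTCAAACCGAGCATGGGTGACGCTAGCTACAAACGATTCCTATGGTCTTGGGGCTCTG"
last_line = "AAGGAGAAATAA"

print(translate(first_line))          # start of the protein sequence
print(translate(last_line))           # final residues plus the stop codon
print(expected_protein_length(4092))  # residues implied by the CDS length
```

Under these assumptions, the first line translates to `MSNRAWVTLATNDSYGLGAL` and the last to `KEK*`, matching the start and end of the listed protein, and a 4092-nt CDS implies 1363 residues, which is the length of the protein sequence given above.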