New model in OGS2.0 | DPOGS205946  |
---|---|
Genomic Position | scaffold20:+ 211246-214855 |
See gene structure | |
CDS Length | 1674 |
Paired RNAseq reads   | 1449 |
Single RNAseq reads   | 3372 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002868 (0.0) |
Best Drosophila hit   | CG3194, isoform A (3e-133) |
Best Human hit | D-glucuronyl C5-epimerase (3e-128) |
Best NR hit (blastp)   | PREDICTED: similar to CG3194-PA [Apis mellifera] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to CG3194-PA [Apis mellifera] (2e-171) |
GeneOntology terms    | GO:0006024 glycosaminoglycan biosynthetic process GO:0016021 integral to membrane GO:0016857 racemase and epimerase activity, acting on carbohydrates and derivatives |
InterPro families   | IPR010598 D-glucuronyl C5-epimerase |
Orthology group | MCL14684 |
Nucleotide sequence:
ATGATGCTTATGCGTGTGAACATGCGCCTGGTGCTGGTGGTGCTGTGCGGTGCTGTGCTG
CTCGCTACACTGTACTGGGGGCACTGCGGAGACATCTCATTGAGACCCACAGTCTCATTG
CTGTCGGAGAGCCAGACCGCTCTGTCGGAGATCGAGTGTTCCATCAACGGGGAATACTCG
GTCTCGTGCAGGCGGGACCGGGACCAGGTCTACTTGCCCTTCTCCTTCATACACAAGTAC
TTCGAGATTTATGGTAAGATTACGGCGACGGATGGAGTTGAAAAATTTGAGTGGTCGCAC
AGCTACAGTAAGGTGTACCATCCCAAGAAAAAGTACGACCCTCGAGGCACCTTCACCACC
TTCGAGAACTACAACGTGGAAGTCCGGGATCGAGTGAAATGCATCAGCGGGATCGAAGGT
GTTCCGGTGTCCACCCAGTGGGAGCCCCGGGGCTTCTTTTATCCCACCCAAATAGCGCAA
TTCGGTTTGGCGCACTACAGCAAGAATCTAACTGAACCAGAACCGAGGATCAGGGTCATC
GACGACGGAGACAAAGTCCTCGAGAACTGGATCGTTAGCCAGGACGCACTCATGTCCCGG
GAATTGGATCCGGAGCTGAAGGCGAATGTTATAAGATTCACGACAACCGACCAGCCGGCT
AGTCAAGTACGAGCCAGACTTAACATCAGCCAAGACTTCGTACTCAGCATGGATCTCATG
CTGAAGCCGAATTCTTCTGTGACTGTGATCTTACAGAATAAAGACAAGAAGGAAACGGTA
TACTTGCATTACGTAACCAGTACCCAGTTAATATATGCACAGGAGGACCACATATACTAT
GGCGTCGGCTTAGAACAGAAATGGCGGAGGTTAACACGAGATCTGATCATAGACATGCAA
AAAGGTTGGGCGCTGCAAGACAGACCCAAGAGGAAGTCACCTAGAAATAAGTTTAAGGTG
TCGGGGCTGGCCCTGTCGGGTGCGGGGAGTGTGGGTAACGTGCGCGTGTCGTCCAGCGAG
CACATGTTCCACTTCTTCTCTGCCGCTCGCTGGCTGGTGTCCTCTCAGCGAGCGGACGGC
GGCTGGCCGGTGCCCGCCAGGAGACGCCTCGCCCCCCGCGTCCCCGACCTCATGCCGGGC
TGGCACTCGGCCATGAGTCAGGGTCACGCCATCTCTCTGCTGTCCCGGGCCTACCACCGC
AGCGGTGACTCGCGGTACCTGCGGGCAGCCAGGCGAGCGCTGGGCCCTCTGGACCGGCCC
CCGGACAAGGGCGGAGTGCGGGCGCTGTTCATGAACAGATATGTCTGGTATGAGGAGTAC
CCCACCAAGCCGCCCATCTTCGTGCTGAACGGGTTCATCTACACTCTGCTGGGCCTGTAC
GACCTGCACTACACCGAGGGTGGGCGGGCCGCGTCCCCCGCCAAGGATATGTTCGAGGCC
GGAATGCTGTCTCTCAAGACGCTGCTGCCGCTATTCGACACCGGCAGCGGCAGCTTCTAC
GACCTGCGGCACTTCACGCTGGGAGCGAGCCCCAACATCGCGCGCTGGGACTACCACGCC
ACGCACGTCAACCAGCTGTATCTGCTGGCGGGCCTCGACCCCGACCCCGTGCTGCTCGCC
ACGGCCCGCAGATGGGAGGGCTACATGCAGGGCCGGCGGGCCGCGCACAACTGA
Protein sequence:
MMLMRVNMRLVLVVLCGAVLLATLYWGHCGDISLRPTVSLLSESQTALSEIECSINGEYS
VSCRRDRDQVYLPFSFIHKYFEIYGKITATDGVEKFEWSHSYSKVYHPKKKYDPRGTFTT
FENYNVEVRDRVKCISGIEGVPVSTQWEPRGFFYPTQIAQFGLAHYSKNLTEPEPRIRVI
DDGDKVLENWIVSQDALMSRELDPELKANVIRFTTTDQPASQVRARLNISQDFVLSMDLM
LKPNSSVTVILQNKDKKETVYLHYVTSTQLIYAQEDHIYYGVGLEQKWRRLTRDLIIDMQ
KGWALQDRPKRKSPRNKFKVSGLALSGAGSVGNVRVSSSEHMFHFFSAARWLVSSQRADG
GWPVPARRRLAPRVPDLMPGWHSAMSQGHAISLLSRAYHRSGDSRYLRAARRALGPLDRP
PDKGGVRALFMNRYVWYEEYPTKPPIFVLNGFIYTLLGLYDLHYTEGGRAASPAKDMFEA
GMLSLKTLLPLFDTGSGSFYDLRHFTLGASPNIARWDYHATHVNQLYLLAGLDPDPVLLA
TARRWEGYMQGRRAAHN