Monarch geneset OGS2.0

DPOGS205946
Transcript: DPOGS205946-TA (1674 bp)
Protein: DPOGS205946-PA (557 aa)
Genomic position: DPSCF300156 + 232496-236105
RNAseq coverage: 538x (Rank: top 23%)
Annotation
Heliconius: HMEL006459; 0.0; 76.57%
Bombyx: BGIBMGA002868-TA; 0.0; 73.17%
Drosophila: CG3194-PA; 2e-149; 52.88%
EBI UniRef50: UniRef50_F4X3F8; 0.0; 54.24%; D-glucuronyl C5-epimerase n=5 Tax=Coelomata RepID=F4X3F8_ACREC
NCBI RefSeq: XP_393602.3; 0.0; 55.41%; PREDICTED: similar to CG3194-PA [Apis mellifera]
NCBI nr blastp: gi|307177516; 0.0; 54.95%; D-glucuronyl C5-epimerase [Camponotus floridanus]
NCBI nr blastx: gi|307177516; 0.0; 54.95%; D-glucuronyl C5-epimerase [Camponotus floridanus]
Group
Gene Ontology: GO:0016021; 1.3e-65; integral to membrane
    GO:0016857; 1.3e-65; racemase and epimerase activity, acting on carbohydrates and derivatives
    GO:0006024; 1.3e-65; glycosaminoglycan biosynthetic process
KEGG pathway: ame:410118; 0.0
    K01793 (E5.1.3.17) maps-> Glycosaminoglycan biosynthesis - heparan sulfate
InterPro domain: [358-546] IPR010598; 1.3e-65; D-glucuronyl C5-epimerase
Orthology group: MCL13005; Single-copy universal gene

Nucleotide sequence:

>DPOGS205946-TA
ATGATGCTTATGCGTGTGAACATGCGCCTGGTGCTGGTGGTGCTGTGCGGTGCTGTGCTGCTCGCTACACTGTACTGGGGGCACTGCGGAGACATCTCATTGAGACCCACAGTCTCATTGCTGTCGGAGAGCCAGACCGCTCTGTCGGAGATCGAGTGTTCCATCAACGGGGAATACTCGGTCTCGTGCAGGCGGGACCGGGACCAGGTCTACTTGCCCTTCTCCTTCATACACAAGTACTTCGAGATTTATGGTAAGATTACGGCGACGGATGGAGTTGAAAAATTTGAGTGGTCGCACAGCTACAGTAAGGTGTACCATCCCAAGAAAAAGTACGACCCTCGAGGCACCTTCACCACCTTCGAGAACTACAACGTGGAAGTCCGGGATCGAGTGAAATGCATCAGCGGGATCGAAGGTGTTCCGGTGTCCACCCAGTGGGAGCCCCGGGGCTTCTTTTATCCCACCCAAATAGCGCAATTCGGTTTGGCGCACTACAGCAAGAATCTAACTGAACCAGAACCGAGGATCAGGGTCATCGACGACGGAGACAAAGTCCTCGAGAACTGGATCGTTAGCCAGGACGCACTCATGTCCCGGGAATTGGATCCGGAGCTGAAGGCGAATGTTATAAGATTCACGACAACCGACCAGCCGGCTAGTCAAGTACGAGCCAGACTTAACATCAGCCAAGACTTCGTACTCAGCATGGATCTCATGCTGAAGCCGAATTCTTCTGTGACTGTGATCTTACAGAATAAAGACAAGAAGGAAACGGTATACTTGCATTACGTAACCAGTACCCAGTTAATATATGCACAGGAGGACCACATATACTATGGCGTCGGCTTAGAACAGAAATGGCGGAGGTTAACACGAGATCTGATCATAGACATGCAAAAAGGTTGGGCGCTGCAAGACAGACCCAAGAGGAAGTCACCTAGAAATAAGTTTAAGGTGTCGGGGCTGGCCCTGTCGGGTGCGGGGAGTGTGGGTAACGTGCGCGTGTCGTCCAGCGAGCACATGTTCCACTTCTTCTCTGCCGCTCGCTGGCTGGTGTCCTCTCAGCGAGCGGACGGCGGCTGGCCGGTGCCCGCCAGGAGACGCCTCGCCCCCCGCGTCCCCGACCTCATGCCGGGCTGGCACTCGGCCATGAGTCAGGGTCACGCCATCTCTCTGCTGTCCCGGGCCTACCACCGCAGCGGTGACTCGCGGTACCTGCGGGCAGCCAGGCGAGCGCTGGGCCCTCTGGACCGGCCCCCGGACAAGGGCGGAGTGCGGGCGCTGTTCATGAACAGATATGTCTGGTATGAGGAGTACCCCACCAAGCCGCCCATCTTCGTGCTGAACGGGTTCATCTACACTCTGCTGGGCCTGTACGACCTGCACTACACCGAGGGTGGGCGGGCCGCGTCCCCCGCCAAGGATATGTTCGAGGCCGGAATGCTGTCTCTCAAGACGCTGCTGCCGCTATTCGACACCGGCAGCGGCAGCTTCTACGACCTGCGGCACTTCACGCTGGGAGCGAGCCCCAACATCGCGCGCTGGGACTACCACGCCACGCACGTCAACCAGCTGTATCTGCTGGCGGGCCTCGACCCCGACCCCGTGCTGCTCGCCACGGCCCGCAGATGGGAGGGCTACATGCAGGGCCGGCGGGCCGCGCACAACTGA

Protein sequence:

>DPOGS205946-PA
MMLMRVNMRLVLVVLCGAVLLATLYWGHCGDISLRPTVSLLSESQTALSEIECSINGEYSVSCRRDRDQVYLPFSFIHKYFEIYGKITATDGVEKFEWSHSYSKVYHPKKKYDPRGTFTTFENYNVEVRDRVKCISGIEGVPVSTQWEPRGFFYPTQIAQFGLAHYSKNLTEPEPRIRVIDDGDKVLENWIVSQDALMSRELDPELKANVIRFTTTDQPASQVRARLNISQDFVLSMDLMLKPNSSVTVILQNKDKKETVYLHYVTSTQLIYAQEDHIYYGVGLEQKWRRLTRDLIIDMQKGWALQDRPKRKSPRNKFKVSGLALSGAGSVGNVRVSSSEHMFHFFSAARWLVSSQRADGGWPVPARRRLAPRVPDLMPGWHSAMSQGHAISLLSRAYHRSGDSRYLRAARRALGPLDRPPDKGGVRALFMNRYVWYEEYPTKPPIFVLNGFIYTLLGLYDLHYTEGGRAASPAKDMFEAGMLSLKTLLPLFDTGSGSFYDLRHFTLGASPNIARWDYHATHVNQLYLLAGLDPDPVLLATARRWEGYMQGRRAAHN-
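
The transcript and protein records above should agree codon by codon: 1674 bp corresponds to 557 amino acids plus one stop codon, and the protein line marks the stop with "-". A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1). The `translate` function and the "-" stop symbol are illustrative choices made to match this record's notation, not part of the database's own tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order. "*" marks a stop codon in this table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First ten and last four codons of DPOGS205946-TA, copied from the record.
first_codons = "ATGATGCTTATGCGTGTGAACATGCGCCTG"
last_codons = "GCGCACAACTGA"

print(translate(first_codons))  # start of DPOGS205946-PA
print(translate(last_codons))   # end of DPOGS205946-PA, including the stop
```

Running the same function over the full nucleotide sequence and comparing the result with the protein sequence is one way to confirm that the two FASTA entries were copied without truncation.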