Monarch geneset OGS2.0

DPOGS207810
TranscriptDPOGS207810-TA2088 bp
ProteinDPOGS207810-PA695 aa
Genomic positionDPSCF300042 + 442095-523772
RNAseq coverage276x (Rank: top 39%)
Annotation
HeliconiusHMEL0119640.081.96% 
BombyxBGIBMGA005500-TA0.077.95% 
DrosophilaCG15117-PB0.056.25% 
EBI UniRef50UniRef50_Q6NL660.056.25%RE15795p n=27 Tax=cellular organisms RepID=Q6NL66_DROME
NCBI RefSeqXP_969423.10.055.67%PREDICTED: similar to CG15117 CG15117-PA [Tribolium castaneum]
NCBI nr blastpgi|910894830.055.67%PREDICTED: similar to CG15117 CG15117-PA [Tribolium castaneum]
NCBI nr blastxgi|910894830.057.58%PREDICTED: similar to CG15117 CG15117-PA [Tribolium castaneum]
Group
Gene OntologyGO:00431696.8e-81cation binding
GO:00059756.8e-81carbohydrate metabolic process
GO:00038246.8e-81catalytic activity
GO:00045535.9e-69hydrolase activity, hydrolyzing O-glycosyl compounds
KEGG pathwaytca:6579030.0 
 K01195 (E3.2.1.31, GUSB, uidA)maps-> Starch and sucrose metabolism
    Lysosome
    Porphyrin and chlorophyll metabolism
    Drug metabolism - other enzymes
    Glycosaminoglycan degradation
    Pentose and glucuronate interconversions
    Flavone and flavonol biosynthesis
InterPro domain[370-665] IPR0178534e-82Glycoside hydrolase, superfamily
[372-665] IPR0137816.8e-81Glycoside hydrolase, subgroup, catalytic core
[373-666] IPR0061035.9e-69Glycoside hydrolase, family 2, TIM barrel
[72-268] IPR0089794.8e-48Galactose-binding domain-like
[86-267] IPR0061042.2e-39Glycoside hydrolase, family 2, N-terminal
[269-369] IPR0138121.1e-24Glycoside hydrolase, family 2/20, immunoglobulin-like beta-sandwich domain
[269-369] IPR0061021.7e-22Glycoside hydrolase, family 2, immunoglobulin-like beta-sandwich
[174-189] IPR0061017.3e-18Glycoside hydrolase, family 2
Orthology groupMCL10429 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207810-TA
ATGGCGCGCGAAATGCTCCAAGTCGGGGTTGCTATGCGTTCGACAGTTAGTTTGATGCCGGTCAGAATACTCCTTCTCTCAGGCGAAATTCAGATGATGTTATCAACCAGATCTCGTCTGTTCCTCATCGCAGCGGTTTTAACAGTAGTGGAGGCGGAGTTGCCTGGTACATCCAGCACGAACGAGATTAACCAAACCCCAAAGAGACGATCCACATTTGTCGGGGGTACATTATACCCCCAAGCATCGGAGACGAGAGACCTAAAAAGACTAGACGGTATATGGAAATTCAGAAAATCACCCACCGACCCTGAATACGGTCAACGTAATGGCTGGTACGAACAGGATCTTGAAAAGACTGGTCCCGTGATCGATATGCCGGTCCCTTCTTCATACAATGACGTGGGAGAGGATCCTTCGCTGAGGGATCACGTTGGTCTAGTTTGGTACGATCGCCGTTTCTACGTCCCTCACTGGTGGAAAACCGCGGGACAACGAGTGTGGCTGAGATTCAGCAGCGTACATTACGCGGCTCTAGTTTATGTCAACGGTCAAGCTGCCACGTATCACGAGGTGGGACACCTTCCATTCGAAGTGGAGATCACTGATATTGTCTCATACAATACGAGCAATCTACTCACCGTCGTTGTTGACAACACTTTGCTTAGTGACACCGTACCACAGGGCAATATCAAGGACATATTTGTGGGAAACTCCAAAATCCGTCAAGAGCAGACGTACACCTTCGATTTCTTCAACTACGCCGGCATTCACCGCTCCGTGTTCCTGTACTCGACACCACAGACATACATAGATGACGTCATCGTGAATACAGACATACAAGGACTCACAGGCTTCGTTGTTTACAACATAACATACAAGGGTACCCCGCGAGCGCAATGTTTCGTTCAATTATACGACAAACTTGGCAACCAAGTGACAGCGGCTAATGAGTGCGCTGGTCTACTGGAGATCGGGAACGCTAACTTCTGGTGGCCTTATCTGATGCACCCGGAACCAGGTTACCTCTATACTTTGAAGACCACATTAATAGGCTCGCTCGGTGAAACTATAGACACTTACAGTCTTAAAGTTGGCATTAGAACTGTCACGTGGACGAACACCTCAATCTACCTCAACGATAAGCCCATCTACCTCAGAGGGTTCGGGATGCACGAAGACTCAGACTTGCGTGGTAAAGGTTGGGACCCGGTGTTGTGGGTGAAGAATTTCAACTTGATAAAGTGGACCGGCGGTAACGCATTCCGAACCTCGCACTATCCTTACGCCGAAGAAATATACCAGCTGGCCGACGAGCACGGCATCATGATCATTGACGAATGCCCCAGTGTCGATACCGACATTTTCACGGATTCACTGCTGGAGAAGCACAAACAGTCCCTCACTGAGCTCATAAGACGTGATAAGAACCACGCCAGCGTCATCATGTGGTCCATCGCCAACGAGCCGCGGTCCGCTAACATCAGAGCCGACGCGTATTTCCAAAAAGTTGTTAAACATGTCAAATCAATGGATCTCTCTAGACCGGTCACTATAGCTATAGCTCAGAGCCATATCGCTGATAGATCGGGTCAACATCTAGATGTGATATCGTTCAACCGCTACAACGGCTGGTACTCTAACACCGGTTCGTTATTAAACATCGCCGCTAACGTCGCGGACGAGGCCACGGCCTTCAACATCAGATACAACAAACCCATCATCATGATGGAGTACGGAGCTGACACTATCGCTGGTCTCCATTTGTTGCCAGAATACGTATGGTCTGAGGAGTACCAAGTATCGTTGATGTCGGAACACTTCAAGGCTTTCGATCGTCTGCGACAGGCGGGCTTCTTCGTGGGAGAGTTCATATGGAACTTCGCTGACTTTAAAACAGCTCAGACAATAACCCGAGTTGGCGGGAACAAGAAAGGTATATTCACACGTTCGCGCCAACCGAAAGCGTCCGCTCATCACCTCCGCGAGCGTTACCTCGCGCTCGCCGCCGCCGACACTAACTCGCCACCACCCGAATCACCGTACTACGTCAGCGACCATCTACCATTTAAACACGAAGAATTATAA

Protein sequence:

>DPOGS207810-PA
MAREMLQVGVAMRSTVSLMPVRILLLSGEIQMMLSTRSRLFLIAAVLTVVEAELPGTSSTNEINQTPKRRSTFVGGTLYPQASETRDLKRLDGIWKFRKSPTDPEYGQRNGWYEQDLEKTGPVIDMPVPSSYNDVGEDPSLRDHVGLVWYDRRFYVPHWWKTAGQRVWLRFSSVHYAALVYVNGQAATYHEVGHLPFEVEITDIVSYNTSNLLTVVVDNTLLSDTVPQGNIKDIFVGNSKIRQEQTYTFDFFNYAGIHRSVFLYSTPQTYIDDVIVNTDIQGLTGFVVYNITYKGTPRAQCFVQLYDKLGNQVTAANECAGLLEIGNANFWWPYLMHPEPGYLYTLKTTLIGSLGETIDTYSLKVGIRTVTWTNTSIYLNDKPIYLRGFGMHEDSDLRGKGWDPVLWVKNFNLIKWTGGNAFRTSHYPYAEEIYQLADEHGIMIIDECPSVDTDIFTDSLLEKHKQSLTELIRRDKNHASVIMWSIANEPRSANIRADAYFQKVVKHVKSMDLSRPVTIAIAQSHIADRSGQHLDVISFNRYNGWYSNTGSLLNIAANVADEATAFNIRYNKPIIMMEYGADTIAGLHLLPEYVWSEEYQVSLMSEHFKAFDRLRQAGFFVGEFIWNFADFKTAQTITRVGGNKKGIFTRSRQPKASAHHLRERYLALAAADTNSPPPESPYYVSDHLPFKHEEL-