Monarch geneset OGS2.0

DPOGS214966
Transcript        DPOGS214966-TA  1143 bp
Protein           DPOGS214966-PA  380 aa
Genomic position  DPSCF300546 - 19388-22326
RNAseq coverage   101x (Rank: top 61%)
Annotation
Heliconius      HMEL014483        3e-108  52.82%
Bombyx          BGIBMGA010811-TA  2e-72   38.00%
Drosophila      CG9701-PA         6e-49   43.10%
EBI UniRef50    UniRef50_G6DAN3   7e-91   45.55%  Glycoside hydrolase n=10 Tax=Obtectomera RepID=G6DAN3_DANPL
NCBI RefSeq     XP_001183226.1    2e-83   44.75%  PREDICTED: similar to lactase-phlorizin hydrolase [Strongylocentrotus purpuratus]
NCBI nr blastp  gi|115710020      4e-82   44.75%  PREDICTED: similar to lactase-phlorizin hydrolase [Strongylocentrotus purpuratus]
NCBI nr blastx  gi|115710020      9e-83   44.75%  PREDICTED: similar to lactase-phlorizin hydrolase [Strongylocentrotus purpuratus]
Group
Gene Ontology   GO:0004553  6.5e-149  hydrolase activity, hydrolyzing O-glycosyl compounds
                GO:0005975  6.5e-149  carbohydrate metabolic process
                GO:0043169  2.6e-67   cation binding
                GO:0003824  2.6e-67   catalytic activity
KEGG pathway    bta:514332  1e-48
                K01229 (LCT) maps-> Galactose metabolism
InterPro domain  [24-378]   IPR001360  6.5e-149  Glycoside hydrolase, family 1
                 [22-380]   IPR017853  5.1e-116  Glycoside hydrolase, superfamily
                 [153-379]  IPR013781  2.6e-67   Glycoside hydrolase, subgroup, catalytic core
Orthology group  MCL16206  Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214966-TA
ATGAGGAGTAATAACGTGTATATACAATACATTTACAGTGCACTGTTAGGAGTTGGATTTTGTAGAAAATTTCCACCCGGGTTCAAATTTGGTGCAGCCACAGCTGCTTACCAGGTCGAGGGCGCCTGGAACGTCAGCGACAAATCCGCAAGTATCTGGGACACGTTCGTGCACACTAGACCAGAGATTATAGCAGATAGATCCAACGGGGACGTCGCCTGTGACAGCTACAACCAATGGATGAAAGACGTGGAAATAGCTTCGGAGTTGGGATTAGATTTCTACAGATTTTCTCTCTCCTGGCCAAGAATTTTGCCCAATGGTTTTGCAAATAAGATAAGTGAAGACGGTGTAAAATTTTACTCAAATCTCATTGATGCTTTATTGGAGAGAGGAATTGAGCCTGTCGTAACAATATATCACTGGGATTTACCACAAAATTTACAAGATCTTGGTGAAGCGGCTGAACTGGCTCTACAGTTAATGGGAGGATTGTACTCACATCCAATCTTCTCTAAGAAAGGCGGCTGGCCTGAGCAAATAGAAAGACTCGTAGCGGAAAAGAGCAAACAAGAGGGTTTCTCCAAATCCAGATTGCCAGAATTTACGAAAGAAGAAAAAAAAATAGTAAGAGGCACATATGATTTCTTCGGCTTGAACTACTATACCTCACGAACTGCTCGCCGTGCCCGAGGAGAAGTTGTTGGTCCTTGGCCTCTCTCCGGTGCACCAGACATTGATGTAATAATATCAGTCCGACCAGAATGGCCGCAGGCTGGCACCAGCTGGTTGTATGTATACCCGGAAGGTTTCCGGAAGCTCATATCTTGGTTGAAGAAACAGTACGGAAACGTGGAAATCTTTATAACAGAGAACGGTTTCTTAACCAGCGGCGAGGATTTAGAGGATCAAGCTCGTATAGATTATCATAAGGAGCATTTGGAACAGGTTCTCCTCGCGATTCAAGAAGATAAAGCCAATGTCGTGGCGTACACTGCTTGGTCCATGTTAGACAACTTTGAATGGAGCGATGGCTATCGTTCCAAATTCGGTTTGTACGAAGTGGACTTCAACGACCCAGCTCGCGTCCGGCGCCCGAGAGCCTCCGCACAGTTTTACAAAGAGATTGTGCAAGCGAAATAA

Protein sequence:

>DPOGS214966-PA
MRSNNVYIQYIYSALLGVGFCRKFPPGFKFGAATAAYQVEGAWNVSDKSASIWDTFVHTRPEIIADRSNGDVACDSYNQWMKDVEIASELGLDFYRFSLSWPRILPNGFANKISEDGVKFYSNLIDALLERGIEPVVTIYHWDLPQNLQDLGEAAELALQLMGGLYSHPIFSKKGGWPEQIERLVAEKSKQEGFSKSRLPEFTKEEKKIVRGTYDFFGLNYYTSRTARRARGEVVGPWPLSGAPDIDVIISVRPEWPQAGTSWLYVYPEGFRKLISWLKKQYGNVEIFITENGFLTSGEDLEDQARIDYHKEHLEQVLLAIQEDKANVVAYTAWSMLDNFEWSDGYRSKFGLYEVDFNDPARVRRPRASAQFYKEIVQAK-
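
The transcript and protein entries above can be checked against each other. The following is a minimal sketch, not part of the original record, assuming the standard genetic code (NCBI translation table 1) and that the trailing "-" in the protein sequence marks the stop codon. The names `translate` and `CODON_TABLE` are illustrative only. It translates short excerpts copied from the start and end of the nucleotide sequence and checks the length arithmetic (1143 bp = 381 codons = 380 residues plus one stop).

```python
from itertools import product

# Standard genetic code (NCBI table 1), listed in TCAG order for each
# codon position, with the first base varying slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Excerpts copied from the DPOGS214966-TA sequence above.
first_codons = "ATGAGGAGTAATAACGTGTATATACAATAC"  # first 30 nt
last_codons = "GCGAAATAA"                        # last 9 nt

print(translate(first_codons))  # compare with the start of DPOGS214966-PA
print(translate(last_codons))   # ends in '*', shown as '-' in the record

# Length check: codons in the transcript, minus the stop codon.
transcript_bp = 1143
print(transcript_bp // 3 - 1)   # expected protein length in aa
```

Running the same function on the full nucleotide sequence should reproduce the protein sequence, with "*" in place of the final "-".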