Monarch geneset OGS2.0

DPOGS201332
Transcript: DPOGS201332-TA (2718 bp)
Protein: DPOGS201332-PA (905 aa)
Genomic position: DPSCF300176 + 615418-624425
RNAseq coverage: 2x (Rank: top 91%)
Annotation
Source            Hit                 E-value     Identity   Description
Heliconius        HMEL012399          0.0         68.13%
Bombyx            BGIBMGA010537-TA    0.0         63.39%
Drosophila        CG9701-PA           2e-133      50.11%
EBI UniRef50      UniRef50_G6DAN3     0.0         100.00%    Glycoside hydrolase n=10 Tax=Obtectomera RepID=G6DAN3_DANPL
NCBI RefSeq       XP_001659853.1      0.0         47.78%     glycoside hydrolases [Aedes aegypti]
NCBI nr blastp    gi|157121159        0.0         47.78%     glycoside hydrolases [Aedes aegypti]
NCBI nr blastx    gi|270009721        0.0         45.54%     hypothetical protein TcasGA2_TC009016 [Tribolium castaneum]
Group
Gene Ontology     GO:0004553    4.4e-231    hydrolase activity, hydrolyzing O-glycosyl compounds
                  GO:0005975    4.4e-231    carbohydrate metabolic process
                  GO:0043169    2.7e-160    cation binding
                  GO:0003824    2.7e-160    catalytic activity
KEGG pathway      bta:514332    0.0
                  K01229 (LCT)  maps to Galactose metabolism
InterPro domain   [1-898]       IPR001360   4.4e-231    Glycoside hydrolase, family 1
                  [447-886]     IPR013781   2.7e-160    Glycoside hydrolase, subgroup, catalytic core
                  [450-899]     IPR017853   1e-143      Glycoside hydrolase, superfamily
Orthology group   MCL10040      Multiple-copy universal gene

Nucleotide sequence:

>DPOGS201332-TA
ATGTTTGGAACGGCCACAGCCTCTTACCAAGTTGAAGGTGCTTGGGACGTAGACGGAAAATCGGAAAATATCTGGGACCATTTAACTCATACAAATCCTTGCAAAGTGCTGGATTGCTCTAATGGTGATATTGCTGACAACTCTTATTATCTCTATAAAAGAGATGTGGAAATGATGCGAGAGTTAGGACTCGAAACTTACAGGTTTTCTATCTCGTGGTCTAGAATCCTTCCTACTGGTTTTCCAAATTACATTAATGAAGCTGGCGTCGCATATTACAACAACTTAATTGATGAAATGCTTAAATATAACATTCAGCCGATAATAACTTTATACCATTGGGATCTACCTCAAAAATTACAAGACATGGGAGGCTGGGCAAATAATGAAATTGTTAATTGGTTTGGGGACTACGCACGAGTTATATTTAATCTTTTTGGTGATAGAGTAAAATATTTTATCACTATTAATGAACCTCATCAAGTGTGCGCTCTTGGATACGGTAATGAATTACTGGCGCCAGCACTAAATATACAAGGTATCGCTGACTACTTATGCGTGAAGAATCTGCTGTTAGCTCACGCTAGAGCTTATCACATTTATGATAAAGAATTTAGAGCGAAACAAAGTGGAAATATATTTATTACCATAAACGCCGAGTGGCATGAATCAGAGTCATTGGATCACGAGGATGCAGCCCGGGATGCTAGACAATTTCAATGGGGGGTATATGCTCATCCAATATTTTCAAAATCGGGAAACTACCCACCAGAAATGATAAAGAGAATAGCGGATAAAAGTGCTGCCCAAGGTTTTCTCAGATCAAGATTACCAGAACTAACTAACGAGGAAATTAAATTTGTACAAGGAACCTCTGACTTTTTTGGGCTGAATCATTATTCAACAAGTTATGTCTATAGAAATGAGAGTTCTTCTGAAATTTATCCTGTACCTTCATACAACGACGACCTTGATGTAATACAATATAAATTACCTGTATGGAAAATTGGTGAATCAGATATGACAAAGTTTGTCCCATGGGGTTTTCGATCAGTACTAAACAGCATCAGCCAACTGTATGGAAATCCCCCTATATTGGTCACTGAAAATGGATTTGCGACCAATGGTGGAATCGACGACAAAGACAGGGTGACATATTACAGAGGATATTTAAACGCCCTCCTAGATGCTATCGACGACGGCGTTGATATACGAGGTTATACGGCCTGGAGTCTCATGGATAATTTTGAGTGGTCACAAGGATACACTGTTAACACATTAAATAAAACTGCAAGTTTACAGTTCGGTAAATCACAAATTTCACTCGTCTCATCGCGACGGTCAAGTAAGTCTGAAAACATATGGGATCGCGTATCACACAGGGAACCTTGTGTTGTCGACAACTGCGACACAGGTGACGTTGCCGGTGATTCGTATCATCAATATAAGCGTGATGTGGAAATGATGCGGGAGCTAGGTCTCGACTTTTATAGGTTCTCTCTCTCCTGGTCGAGAATATTACCAACGAGTTTTCCAGACCAAATTAATGAAAAAGCAGTACAATATTATAATAATTTGATAAATGAGATGCTCAAATACAACATACAACCCATGGTGACTCTTTATCACTGGGATTTGCCTCAGAAGCTGCAAGATCTGGGAGGATGGACCAATCCCCATATCGTTGATTGGTTTACCGATTACTCCAGAGTAGTGTTTCGGTTATTTGGAGATAGGGTTAAGTATTGGATAACTATCAATGAACCGCGAGAGGTTTGTTATCAGGGATATGCAGCACAGTCTCTAGCTCCTCTTTACAATATTTCTGGATATGCTGATTACATGTGTGCCAAAAATTTATTGCTAGTTCATGCCAACGTCTATCATTTATATAACAATGTATTTCGTAAAGCCCAAGGTGGTCAAATCGGTATAACAATAAGCGCACAATGGTACGAACCTGAATCAGAGGAAGATGTAGAGGCTGCTGAGGATTACAG
ACAGTTTGAGTGGGGAATTTACGCAAATCCAATATTTTCGGAATCTGGAGACTTTCCAGCAGTCATGAAACGTAGGATAGCAGCAAAGAGTAAGGAACAAGGATTTCCAAGATCACGATTACCACAATTCACTCCGGAGGAGGTTGATTTAATTAAAGGCAGTTTCGACTTCTTTGGGTTGAATCATTATACTACTTATAGGGTTTACAGAAATGAATCAGTCTATGGACATTATAATTCACCATCTACTTACGATGATCTCGAAGCAATAAGTTATCAAGATAGTTCATGGGATTCAGCTGCTTCAAAGTGGTTAAAGCGTGTGCCCTGGGGATTTTATAATTTGCTTACAAAAATACGAAAGGACTACAACAACCCGCCAGTTTTCATCACTGAGAATGGATTCTCAACCCGAGGTGGTTTAGTTGACGACGACCGCATAAAGTATTACAGAACATACATTGACGCTATGCTCGATGCTATTGAAGATGGATCAGATATAAGAGTTTACGCAGCGTGGAGTTTGATGGACAATTTTGAATGGATGAGGGGATACAGCGAACGTTTCGGACTGTACGAGGTGGACTACGAGAGTCCTGAACGCACCCGAACTCCTCGCAAATCTGCTTATGTATACAAGGAGATGCTGCGCACACGAACACTGGACTATCATTATGAACCTGATATGAGCCTGGGAATGAATGTCGAAGAAAACTAA

Protein sequence:

>DPOGS201332-PA
MFGTATASYQVEGAWDVDGKSENIWDHLTHTNPCKVLDCSNGDIADNSYYLYKRDVEMMRELGLETYRFSISWSRILPTGFPNYINEAGVAYYNNLIDEMLKYNIQPIITLYHWDLPQKLQDMGGWANNEIVNWFGDYARVIFNLFGDRVKYFITINEPHQVCALGYGNELLAPALNIQGIADYLCVKNLLLAHARAYHIYDKEFRAKQSGNIFITINAEWHESESLDHEDAARDARQFQWGVYAHPIFSKSGNYPPEMIKRIADKSAAQGFLRSRLPELTNEEIKFVQGTSDFFGLNHYSTSYVYRNESSSEIYPVPSYNDDLDVIQYKLPVWKIGESDMTKFVPWGFRSVLNSISQLYGNPPILVTENGFATNGGIDDKDRVTYYRGYLNALLDAIDDGVDIRGYTAWSLMDNFEWSQGYTVNTLNKTASLQFGKSQISLVSSRRSSKSENIWDRVSHREPCVVDNCDTGDVAGDSYHQYKRDVEMMRELGLDFYRFSLSWSRILPTSFPDQINEKAVQYYNNLINEMLKYNIQPMVTLYHWDLPQKLQDLGGWTNPHIVDWFTDYSRVVFRLFGDRVKYWITINEPREVCYQGYAAQSLAPLYNISGYADYMCAKNLLLVHANVYHLYNNVFRKAQGGQIGITISAQWYEPESEEDVEAAEDYRQFEWGIYANPIFSESGDFPAVMKRRIAAKSKEQGFPRSRLPQFTPEEVDLIKGSFDFFGLNHYTTYRVYRNESVYGHYNSPSTYDDLEAISYQDSSWDSAASKWLKRVPWGFYNLLTKIRKDYNNPPVFITENGFSTRGGLVDDDRIKYYRTYIDAMLDAIEDGSDIRVYAAWSLMDNFEWMRGYSERFGLYEVDYESPERTRTPRKSAYVYKEMLRTRTLDYHYEPDMSLGMNVEEN-
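
The transcript and protein lengths listed in this record (2718 bp and 905 aa) are consistent with each other if the transcript is read as a coding sequence of 905 codons followed by one stop codon. The short sketch below works through that arithmetic. The function name and the assumption that the transcript contains only coding sequence plus a stop codon (no untranslated regions) are illustrative and are not stated by the record itself.

```python
def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Return the coding length in bp implied by a protein of `protein_aa` residues.

    Assumption (not from the record): the transcript is pure coding sequence,
    three bases per residue, optionally followed by a single stop codon.
    """
    return 3 * protein_aa + (3 if include_stop else 0)


# Lengths as listed for DPOGS201332-TA and DPOGS201332-PA.
transcript_bp = 2718
protein_aa = 905

print(expected_cds_length(protein_aa) == transcript_bp)
```

Under that assumption, 3 x 905 + 3 = 2718, which matches the listed transcript length; the trailing "-" in the protein sequence appears to mark the stop codon.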