Monarch geneset OGS2.0

DPOGS201926
Transcript: DPOGS201926-TA (1590 bp)
Protein: DPOGS201926-PA (529 aa)
Genomic position: DPSCF300112, + strand, 14971-27595
RNAseq coverage: 389x (Rank: top 31%)
Annotation
Heliconius: HMEL007316; E-value 0.0; 77.97%
Bombyx: BGIBMGA002660-TA; E-value 0.0; 76.38%
Drosophila: CG9701-PA; E-value 1e-139; 51.05%
EBI UniRef50: UniRef50_Q16ET6; E-value 1e-148; 49.61%; Glycoside hydrolases n=9 Tax=Neoptera RepID=Q16ET6_AEDAE
NCBI RefSeq: XP_001850321.1; E-value 2e-152; 53.38%; glycoside hydrolase [Culex quinquefasciatus]
NCBI nr blastp: gi|364023609; E-value 0.0; 78.80%; seminal fluid protein CSSFP029 [Chilo suppressalis]
NCBI nr blastx: gi|364023609; E-value 0.0; 79.01%; seminal fluid protein CSSFP029 [Chilo suppressalis]
Group
Gene Ontology: GO:0004553; E-value 1.7e-243; hydrolase activity, hydrolyzing O-glycosyl compounds
    GO:0005975; E-value 1.7e-243; carbohydrate metabolic process
    GO:0043169; E-value 3.6e-173; cation binding
    GO:0003824; E-value 3.6e-173; catalytic activity
KEGG pathway: tca:664577; E-value 6e-136
    K05350 (bglB) maps to:
        Starch and sucrose metabolism
        Phenylpropanoid biosynthesis
        Cyanoamino acid metabolism
InterPro domain: [31-503] IPR001360; E-value 1.7e-243; Glycoside hydrolase, family 1
    [30-495] IPR013781; E-value 3.6e-173; Glycoside hydrolase, subgroup, catalytic core
    [30-507] IPR017853; E-value 6e-162; Glycoside hydrolase, superfamily
Orthology group: MCL10040 (Multiple-copy universal gene)

Nucleotide sequence:

>DPOGS201926-TA
ATGTATAAATATATTTTTTATTTTTCTTTATTTTTATGCCATGGGCAAGCGGAAGTATTAAATTTAGCTGGAGGCGCAAAGTCTAACTACACTTTTCCGAAGGATTTTCTTTTTGGTGTCTCAACAGCGGCAATACAAATTGAAGGAGCATGGAATGAAGATGGGAAGACGGAAAGCATATGGGATCACTTAGTGCGTGTAAATCCTAACTTCACTAAAGACGGATCTACCCCTGACGTAGCAGCAGATTCCTATCATTTATACAAGAGAGATGCTGAAATGGTCCACGAACTTGGAGTGAATATGTATAGGTTTTCAATATCCTGGCCAAGGATACTACCAACTGGTTTAGCCAATCAAGTTAATCCTCTCGGAATTGAGTACTATAAAAATCTCATAAGCGAGTTGGAAAGGTACAATATTACTCCTATGGTCACCATTTACCATTGGGATCTACCTCAGAAATTACAGGATATTGGGGGTTGGACGAACGCGCATATCATAGATTATTATACGGACTACGCGAATGTATTGTTTGAAAATTTCGCGGATAAAGTTAAATATTGGATAACATTCAACGAGCCAATGCAAACCTGCCTGGAAGGTTACGGCAACACGTACCGAGCGCCTGCACTGAACCGACACGGTATAGCTGAATATCTGTGCACACACAATTTGTTAAAAGCGCACGCAAGCGTTTACCATTTGTTCAATAAGCAGTATCGTCCACTGTATGGAGGGAAAATGGGTATGTCACTGGACTCTAATTGGGCAGAACCCAAAACAGATACACCAAGAGACAAGAAAGCTGCGGAGTTGTACCTTAAAACTCATCTTGGATGGTATGCACATCCTGTATATTCGGAAACTGGAAATTATCCAGAAGAGCTTATCAAACTTGTTGATGAAAAAAGTAAGAAACAGAACTACACCCACTCTCGACTTCCCAAGTTTACTCCTGAGGAAATAGCCTATATACGAGGAACTGCAGACTTCTTCGGTTTAAACCATTACACCACGTATCTTTTGAGCATGGCTGACAGTGAAGTTGGTGAGGTGCCATCACATGCAAACGATGTTGGTATTGTTAGGGTTCAAGATCCCAAGTGGCCGTCGAAGTCCTCTTCCTCTTGGCTAAAGGTGGTGCCATTTGGATTTCGTCGCCTCTTAAATTGGATAACTAAAACGTACAATAACGTGCCAATAATCGTTACGGAGAACGGATATGCTGACTTTAGTGGAGTGAAAGATGAAGCAAGAGTTTCTTACTATTGCCACTATTTAAATTCTCTCCTCCATTCAATACACGAAGATAAGACAAACGTTCAAGGGTATTTCGCTTGGAGTCTGATGGATAATTGGGAATGGGACGACGGCTATGCGTCCCGCTTCGGTCTTTACTTGGTCGATTTCAATAGTCCCAACAAGACGAGAACTGCTAAGGAATCGGCGAAATTGTACACGAGCGTAATATCCTCTCGAGGCCTGCCCGCCGACTACGACCCAGAAGATTTCACCGCCTTTTCCAGTGCTTCTCTTCTCGTTCCAACTCTACTCTCACTCTTACCCTTTTATAGGCTACTTACATGA

Protein sequence:

>DPOGS201926-PA
MYKYIFYFSLFLCHGQAEVLNLAGGAKSNYTFPKDFLFGVSTAAIQIEGAWNEDGKTESIWDHLVRVNPNFTKDGSTPDVAADSYHLYKRDAEMVHELGVNMYRFSISWPRILPTGLANQVNPLGIEYYKNLISELERYNITPMVTIYHWDLPQKLQDIGGWTNAHIIDYYTDYANVLFENFADKVKYWITFNEPMQTCLEGYGNTYRAPALNRHGIAEYLCTHNLLKAHASVYHLFNKQYRPLYGGKMGMSLDSNWAEPKTDTPRDKKAAELYLKTHLGWYAHPVYSETGNYPEELIKLVDEKSKKQNYTHSRLPKFTPEEIAYIRGTADFFGLNHYTTYLLSMADSEVGEVPSHANDVGIVRVQDPKWPSKSSSSWLKVVPFGFRRLLNWITKTYNNVPIIVTENGYADFSGVKDEARVSYYCHYLNSLLHSIHEDKTNVQGYFAWSLMDNWEWDDGYASRFGLYLVDFNSPNKTRTAKESAKLYTSVISSRGLPADYDPEDFTAFSSASLLVPTLLSLLPFYRLLT-
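The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, and it rests on two assumptions: that the transcript is coding sequence only (no UTRs) and that it is translated with the standard genetic code. The names `translate` and `lengths_consistent` are hypothetical helpers introduced for illustration. The trailing "-" in the protein record is treated here as the stop codon symbol.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1).
# Codons are enumerated with bases in the order T, C, A, G,
# first base varying slowest and third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon, starting at position 0.

    Stop codons are written as stop_symbol ("-" matches this record).
    Trailing bases that do not fill a codon are ignored; any character
    other than A, C, G, T raises KeyError.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript length equals 3 bases per residue plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First 24 bases of DPOGS201926-TA give the first 8 residues of DPOGS201926-PA.
print(translate("ATGTATAAATATATTTTTTATTTT"))  # MYKYIFYF
# Last 12 bases of DPOGS201926-TA end in the TGA stop codon.
print(translate("CTACTTACATGA"))              # LLT-
# 1590 bp = 3 * (529 aa + 1 stop codon)
print(lengths_consistent(1590, 529))          # True
```

Passing the full nucleotide sequence to `translate` and comparing the result with the protein sequence would extend the same check to the whole record.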