Monarch geneset OGS2.0

DPOGS204071
Transcript: DPOGS204071-TA; 2679 bp
Protein: DPOGS204071-PA; 892 aa
Genomic position: DPSCF300200 + 111374-116728
RNAseq coverage: 2x (Rank: top 91%)

Annotation
Heliconius: HMEL013136; 0.0; 64.94%
Bombyx: BGIBMGA010812-TA; 0.0; 67.35%
Drosophila: CG9701-PA; 5e-119; 48.07%
EBI UniRef50: UniRef50_E3WTL5; 0.0; 40.55%; Putative uncharacterized protein n=4 Tax=Endopterygota RepID=E3WTL5_ANODA
NCBI RefSeq: XP_001659853.1; 0.0; 44.05%; glycoside hydrolases [Aedes aegypti]
NCBI nr blastp: gi|157121159; 0.0; 44.05%; glycoside hydrolases [Aedes aegypti]
NCBI nr blastx: gi|157121159; 0.0; 44.44%; glycoside hydrolases [Aedes aegypti]
Group:
Gene Ontology:
  GO:0004553; 1e-202; hydrolase activity, hydrolyzing O-glycosyl compounds
  GO:0005975; 1e-202; carbohydrate metabolic process
  GO:0043169; 7.3e-153; cation binding
  GO:0003824; 7.3e-153; catalytic activity
KEGG pathway: bta:514332; 0.0
  K01229 (LCT) maps -> Galactose metabolism
InterPro domain:
  [18-608] IPR001360; 1e-202; Glycoside hydrolase, family 1
  [19-450] IPR013781; 7.3e-153; Glycoside hydrolase, subgroup, catalytic core
  [18-454] IPR017853; 1.1e-145; Glycoside hydrolase, superfamily
Orthology group: MCL10040; Multiple-copy universal gene
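
The lengths and coordinates listed in this record can be cross-checked against each other. The following is a minimal Python sketch of that check, not part of the original record. It assumes the genomic position field reads as scaffold, strand, then 1-based inclusive start-end coordinates, and that the transcript length counts the stop codon. `parse_position` is a hypothetical helper introduced only for this illustration.

```python
import re

# Values copied from the record; the field layout is an assumption.
TRANSCRIPT_BP = 2679
PROTEIN_AA = 892
GENOMIC_POSITION = "DPSCF300200 + 111374-116728"


def parse_position(text):
    """Split 'scaffold strand start-end' into its parts (hypothetical helper)."""
    match = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized position: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)


scaffold, strand, start, end = parse_position(GENOMIC_POSITION)

# Span on the scaffold, assuming 1-based inclusive coordinates.
genomic_span = end - start + 1

# Number of complete codons in the transcript, including the stop codon.
codons = TRANSCRIPT_BP // 3

print(f"{scaffold} ({strand} strand): genomic span {genomic_span} bp")
print(f"transcript: {TRANSCRIPT_BP} bp = {codons} codons")
print(f"codons minus stop == protein length: {codons - 1 == PROTEIN_AA}")
```

Under these assumptions the transcript length is a multiple of three and equals the protein length plus one stop codon. The genomic span is longer than the transcript, which would be consistent with introns, although the record itself does not list the exon structure.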

Nucleotide sequence:

>DPOGS204071-TA
ATGAAGTTGTTATCATTTCTTTGCTTGTTAATCGGCAGCCATGCTCAAGAAAGAAGATTTCCTGAGGACTTCATGTTCGGGGCTGCCACATCAGCATATCAGATAGAAGGAGGATGGAGCGCTGATGACAAAGGAGAGAATATATGGGATCGTTTGACTCACACCAAACCTAACGTAATCAAGGATGTGAGCAATGGTGATGTTGCAGCCGACACATACAATAACTACAAACGTGATGTGGAGATGATGAGGGAGTTGGGGCTAGATGCTTACAGGTTCTCTCTCTCCTGGTCTAGAATACTACCCAATGGCCTGGCCAACAAAGTCAGCGATGCCGGAGTTGAGTTCTACAACAACTATATAGATGAAATGATCAAATACGGTATAAAGCCCATGGTCACTCTGTTCCACTGGGACTTGCCACAGAAGTTACAAGATTTGGGAGGATTCATGAATCCATTATTCCCAGAGTGGTTTGAAGACTACGTCCGTGTGGTCTTTGGAAAGTTTGGAGACAGAGTCAAGCACTGGATTACTTTCAATGAACCAAAAGAGTTTTGTCATCAAGGTTATGGCAGTGCGACCAAAGCTCCACTTTTAAATATAACTGATATTGGTACTTACTATTGCGCTCGAACTCTACTCCTCGCCCATGCTCGGGCGTACAGGGTTTATAAAAACGAATTCAAGAAATCTCAAGGCGGTGTGTGTGGTATTGCCATCAGCGTAGGATGGTATCTACCACTAACTGATTCTGAACAAGATAAGTTTGCTGCCGAAATTAAAAGACAAGCGGACTTCGATCTCTACATAAAACCGATATATTCGAAGGAAGGAGGTTTTCCGAAGGAGATTGTTAAAATTGTTGCAGAGAAAAGCGCTAATCAGGGTTACCTACGATCTCGTTTACCAAATTTCACTGAAGAAGAGAAAGAACTTATACGAGGTACTGGTGACTTCATTGGAGTAAACCATTACACCAGCAGCTTTGTGTCTGCCACGGAATACAAGATAATTCATCCAGTGCCATCTTTATACGACGACATTGATGTTGGGTCTTACGCACCCCCGGAGTGGCCAAAATCTGCGTCTGCGTGGTTAGTTCAAACACCTAACAGTCTTTACGATGCCTTGATCAATCTTCACAAGAGGTACAACGGTCCAATATTCTACATCACGGAGAACGGCTGGTCCTCGTCTCCGGAAGCTGATATCCTTGATGATGATAGGATTAGATACTACCGTGCTGCGCTAAACGATGCTCTCAAAGCCATAGAAGATGGAGTGGATCTGCGAGGATTCATGGCTTGGAGCTTGATGGATAATTTTGAATGGTTTGAGGGTTACGGCTTGTTAATCGGCAGCCATGCTCAAGAAAGAAGATTTCCTGAGGACTTCATGTTCGGGGCTGCCACATCAGCATATCAGATAGAAGGAGGATGGAGCGCTGATGACAAAGGAGAGAATATATGGGATCGTTTGACTCACACCAAACCTAACGTAATCAAGGATGTGAGCAATGGTGATGTTGCAGCCGACACATACAATAACTACAAACGTGATGTGGAGATGATGAGGGAGTTGGGGCTAGATGCTTACAGGTTCTCTCTCTCCTGGTCTAGAATACTACCCAATGGCCTGGCCAACAAAGTCAGCGATGCCGGAGTTGAGTTTTACAACAACTATATAGATGAAATGATCAAATACGGTATAAAGCCCATGGTCACTCTGTACCACTGGGACTTGCCACAGAAGTTGCAAGATTTGGGAGGATTCACGAATCCATTATTCCCAGAGTGGTTTGAAGACTACGTCCGGGTGGTTTTTGGAAAGTTTGGAGACAGAGTCAAGCACTGGATTACTTTCAATGAACCCAGAGAAATCTGTTTCGAAGGCTATGGTTCAGACACCAAAGCGCCTATCCTAAATGCAACCGACGTCGGTGTTTATTACTGTGCCAAAAATCTGGTTATGGGTCACGCTAGAGCTTATTACGCATA
TGTCAATGACTTCAAGCCGAGCCAAGAAGGTGTCTGTGGTATCACAATAAGTGTGAATTGGTTCGGGGCGTTGACAGATTCCGAGGAAGATCAATTTGCTGCCGAAATGAAGAGACAAGCAGAATGGGGGCTCTATGCTGAACCTATTTTCTCTGAAGAGGGTGGTTTTCCTAAGGAATTAGCAGAAATTGTTGCCAAAAAAAGCGCTGAACAGGGTTATCCTCGGTCGCGTATGCCAGAATTCTCTGATGAAGAGAAGGATTTCGTAAAAGGCACTGCTGACTTTTTAGGAGTAAATCATTACACAGCCGGCTTAGTATCTGCAACTGAATATAAGACTCACCACCCAGTGCCGTCTTTATATGATGATATTGATGTAGGAAGCTACACTCCGCCGGAGTGGCCAAAATCTGCTTCATCTTGGTTAAAATTAGCACCAAACAGTATTTACAATGCCCTCACTCACCTTCACAAGAAGTACAACGGTCCCATATTCTACATCACGGAGAACGGCTGGTCCTCGCCTCCGGAAGCTGATATCCTTGATGATGACAGGATTAGATACTACCGAGCGGCTTTGAACAGTGTGCTCGATACCTTGGAGGCTGGAGTGGATCTACGGGGGTACATGGCATGGAGTCTGATGGACAACTTTGAGTGGATGGAGGGTTACACGTAA

Protein sequence:

>DPOGS204071-PA
MKLLSFLCLLIGSHAQERRFPEDFMFGAATSAYQIEGGWSADDKGENIWDRLTHTKPNVIKDVSNGDVAADTYNNYKRDVEMMRELGLDAYRFSLSWSRILPNGLANKVSDAGVEFYNNYIDEMIKYGIKPMVTLFHWDLPQKLQDLGGFMNPLFPEWFEDYVRVVFGKFGDRVKHWITFNEPKEFCHQGYGSATKAPLLNITDIGTYYCARTLLLAHARAYRVYKNEFKKSQGGVCGIAISVGWYLPLTDSEQDKFAAEIKRQADFDLYIKPIYSKEGGFPKEIVKIVAEKSANQGYLRSRLPNFTEEEKELIRGTGDFIGVNHYTSSFVSATEYKIIHPVPSLYDDIDVGSYAPPEWPKSASAWLVQTPNSLYDALINLHKRYNGPIFYITENGWSSSPEADILDDDRIRYYRAALNDALKAIEDGVDLRGFMAWSLMDNFEWFEGYGLLIGSHAQERRFPEDFMFGAATSAYQIEGGWSADDKGENIWDRLTHTKPNVIKDVSNGDVAADTYNNYKRDVEMMRELGLDAYRFSLSWSRILPNGLANKVSDAGVEFYNNYIDEMIKYGIKPMVTLYHWDLPQKLQDLGGFTNPLFPEWFEDYVRVVFGKFGDRVKHWITFNEPREICFEGYGSDTKAPILNATDVGVYYCAKNLVMGHARAYYAYVNDFKPSQEGVCGITISVNWFGALTDSEEDQFAAEMKRQAEWGLYAEPIFSEEGGFPKELAEIVAKKSAEQGYPRSRMPEFSDEEKDFVKGTADFLGVNHYTAGLVSATEYKTHHPVPSLYDDIDVGSYTPPEWPKSASSWLKLAPNSIYNALTHLHKKYNGPIFYITENGWSSPPEADILDDDRIRYYRAALNSVLDTLEAGVDLRGYMAWSLMDNFEWMEGYT-
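
The relationship between the two sequences above can be illustrated with a short translation check. This is a minimal Python sketch under two assumptions: the standard genetic code applies, and the trailing `-` in the protein record marks the stop codon. `translate` is a hypothetical helper, and only the first and last few codons of DPOGS204071-TA are used so the example stays short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides, stop_symbol="-"):
    """Translate complete codons; stop codons are written as stop_symbol."""
    seq = nucleotides.upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First six codons and last four codons of DPOGS204071-TA, copied from the record.
first_codons = "ATGAAGTTGTTATCATTT"
last_codons = "GGTTACACGTAA"

print(translate(first_codons))
print(translate(last_codons))
```

Applied to a full FASTA record, the same function could be compared against the whole DPOGS204071-PA sequence after joining the wrapped sequence lines.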