Monarch geneset OGS2.0

DPOGS213645
TranscriptDPOGS213645-TA1935 bp
ProteinDPOGS213645-PA644 aa
Genomic positionDPSCF300165 - 15617-29479
RNAseq coverage2036x (Rank: top 6%)
Annotation
HeliconiusHMEL0226506e-14142.43% 
BombyxBGIBMGA004586-TA0.074.26% 
DrosophilaTreh-PC5e-15851.29% 
EBI UniRef50UniRef50_A3RLQ40.069.53%Trehalase-2 n=9 Tax=Endopterygota RepID=A3RLQ4_9NEOP
NCBI RefSeqNP_001036910.10.068.03%trehalase-2 [Bombyx mori]
NCBI nr blastpgi|1567674990.071.74%trehalase-2 [Spodoptera exigua]
NCBI nr blastxgi|1567674990.071.85%trehalase-2 [Spodoptera exigua]
Group
Gene OntologyGO:00045555.7e-269alpha,alpha-trehalase activity
GO:00059915.7e-269trehalose metabolic process
GO:00038243.2e-52catalytic activity
KEGG pathwayphu:Phum_PHUM2677400.0 
 K01194 (E3.2.1.28, treA)maps-> Starch and sucrose metabolism
InterPro domain[10-553] IPR0016615.7e-269Glycoside hydrolase, family 37
[109-546] IPR0089283.2e-52Six-hairpin glycosidase-like
Orthology groupMCL11246 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213645-TA
ATGTTAGTGGCCCTGGAGCTGGTCGCCGGGGACCGCACCAAGCTGCCGCCCACCTGTGATAGTATGATTTACTGCCACGGACCGCTGCTGGACACGGTCCAAATGGCGGGCCTCTATAAGGACTCCAAGACCTTCGTGGATATGAAACTGAAGCTCTCCGCCAACATCACCATGGAGCACTTCCACGAGATGATGGCCAGGACGAGTGACCGACCGACCAAAGCCGACATTCAGGAGTTCGTAGACAATAACTTTGACCCCGCGGGCTCCGAGTTTGAAGAGTGGAGACCCACGGATTGGAAGGACAACCCAGCCTTCCTGTCTCGTATCAAGGACCCTCGCTTCCACCAGTGGGCGTCCGACCTGAACGCTCTGTGGCTTCAGCTCGGCAGGAAGATGAAGGACGACGTCAAAAACAACGAGCACCTCTACTCCATTATATACGTACACAACCCGGTTATAGTACCAGGTGGCAGGTTCCGCGAGTTCTACTACTGGGACTCCTACTGGATTATCAAGGGTCTGCTTCTATCGGAGATGAGGTCCACCGCCATGGGGATGATCTCCAACTTCCTGGAGATAGTCGACAGGTTCGGATTCATACCTAACGGAGGACGGATCTACTACCTCATGAGATCCCAGCCTCCGCTACTGATCCCGATGGTGAAGCTGCTGATGGATGATTTTGAGGACCTGGGCTTCCTCCGGTCACACATACACACGCTGGACAGAGAGTTCGAGTTCTGGATGAACAACCACACGGTCAGCGTCGACTATGATGGGAAAAAGTATCAGATGGCTCGATACAACGACATGTCTCAAGGTCCGCGACCCGAGAGTTACAAGGAGGACATAGATTGTGCCAAACATCTGGACAGTCGCGAGTTGAAGGAGGAGCTATACGCAGAGCTGAAAGCGGGCGCGGAGTCAGGGTGGGATTTCTCCTCCAGATGGTTCATACTCAACGGAACCAATAAAGGTAACTTGACGAACCTGAAGACTCGCTCCATAGTCCCGGTGGACCTGAACGCCATCTTGTGTTGGAACGCTGGTCTGTTGTCAGAGTTCCACGCGCGGCTCGGGGACACCTCCCGGGCCGATTACTACCGGGAGGTCAGGGCTCGCCTGCTGGAGGCCATCGAGAAGGTGTTGTGGCACGAGGAGGTGGGCGCGTGGCTGGACTTCAGCTTGGAGTCGGGGCGAGCGCGGGACTACTTCTACCCCTCGAACGTGGCGCCGCTCTGGACCGGCGCCTACGATCGCGGCCGCGAGGAATACTACGTCAATAGGGTCATCAACTACCTCGATAAAGTCAAGGTGGACATCTTCGAGGGCGGCATCCCGGCGACGTTCGAGCACTCCGGGGAGCAGTGGGACTATCCGAACGCCTGGCCGCCGCTACAGCACATGGTGGTGGAGGGTCTGGCGGGCACCAGGCACGCCGCCGCCAACAGGCTGGCCGGGGAGATCGCCGCCAAGTGGGTGCGCTCCAACTACGAGGTCTGGAGACACAAGACCGCCATGCTGGAGAAGTACGACGCCACGGTGTTCGGTGGGTTCGGCGGCGGCGGCGAGTACGTGGTGCAGACCGGCTTCGGCTGGACCAACGGAGTGGTGATGGTGCTGCTCAACGAGTACGGAGAGGACGCGTTCGGCGGAGGGGACGAGGAAGGCGGGGCGAGAGGGGAAGGGGGAGAGGGGCTGCACGTCCAGGGGGCGGTTCCAGGGGGCGGGGGTGGGGGCGGGAGGGGTCGCCACCGCCCTGCTGGTGGTCATGGCGTCGCTCGCTGCCGGGACGCTTGGGTGAGTGTGATGGTGTACAGGAAGCGTCGTGATTACACTCCCCTCGTCACCGGCGAGGATCTGAAGCTCCTCAAGCGCCCCTACACCGAGCTGAGGTCTATCAACGGAGCCTCGGACACGAGACTGCGGTGA

Protein sequence:

>DPOGS213645-PA
MLVALELVAGDRTKLPPTCDSMIYCHGPLLDTVQMAGLYKDSKTFVDMKLKLSANITMEHFHEMMARTSDRPTKADIQEFVDNNFDPAGSEFEEWRPTDWKDNPAFLSRIKDPRFHQWASDLNALWLQLGRKMKDDVKNNEHLYSIIYVHNPVIVPGGRFREFYYWDSYWIIKGLLLSEMRSTAMGMISNFLEIVDRFGFIPNGGRIYYLMRSQPPLLIPMVKLLMDDFEDLGFLRSHIHTLDREFEFWMNNHTVSVDYDGKKYQMARYNDMSQGPRPESYKEDIDCAKHLDSRELKEELYAELKAGAESGWDFSSRWFILNGTNKGNLTNLKTRSIVPVDLNAILCWNAGLLSEFHARLGDTSRADYYREVRARLLEAIEKVLWHEEVGAWLDFSLESGRARDYFYPSNVAPLWTGAYDRGREEYYVNRVINYLDKVKVDIFEGGIPATFEHSGEQWDYPNAWPPLQHMVVEGLAGTRHAAANRLAGEIAAKWVRSNYEVWRHKTAMLEKYDATVFGGFGGGGEYVVQTGFGWTNGVVMVLLNEYGEDAFGGGDEEGGARGEGGEGLHVQGAVPGGGGGGGRGRHRPAGGHGVARCRDAWVSVMVYRKRRDYTPLVTGEDLKLLKRPYTELRSINGASDTRLR-