Monarch geneset OGS2.0

DPOGS215036
Transcript: DPOGS215036-TA  1608 bp
Protein: DPOGS215036-PA  535 aa
Genomic position: DPSCF300208 - 538728-540335
RNAseq coverage: 411x (Rank: top 30%)
Annotation
Heliconius: HMEL022650  0.0  68.51%
Bombyx: BGIBMGA005664-TA  0.0  57.43%
Drosophila: Treh-PC  1e-118  42.46%
EBI UniRef50: UniRef50_C9S264  0.0  68.32%  Putative Trehalase-1A n=198 Tax=Nymphalidae RepID=C9S264_9NEOP
NCBI RefSeq: NP_001037458.1  0.0  57.28%  trehalase precursor [Bombyx mori]
NCBI nr blastp: gi|261335930  0.0  68.32%  putative Trehalase-1A [Heliconius melpomene]
NCBI nr blastx: gi|314913135  0.0  70.02%  trehalase 1a [Heliconius doris]
Group
Gene Ontology: GO:0004555  7.2e-211  alpha,alpha-trehalase activity
  GO:0005991  7.2e-211  trehalose metabolic process
  GO:0003824  2.2e-51  catalytic activity
KEGG pathway: phu:Phum_PHUM617010  2e-139
  K01194 (E3.2.1.28, treA) maps-> Starch and sucrose metabolism
InterPro domain: [1-502] IPR001661  7.2e-211  Glycoside hydrolase, family 37
  [62-497] IPR008928  2.2e-51  Six-hairpin glycosidase-like
Orthology group: MCL16294  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215036-TA
ATGCGAGATGAACCCGATTCAATTCTAGCTGACTTCGATTCTCTTTTAAACCAAACTAATGATAATCCTTCCCGAGATCAGATTCAAAAATTCGTCGACGATCACTTTGACTCCATTGGGGAGTTGGAGGAATGGATACCAACAGATTTTAATAAAAACCCAGCATTCCTGAATGGTGTGCGCGATGATAATCTTAGAAAAATGGGACAAAATATTAACGACATTTGGCCGACCCTCGGTAGAAAAGTCAAGTCCGTCGTTTTCGAACATTCAGAACGCTTCAGCTTCATCCCTGTGACTAATGGATTTATCATACCTGGTGGAAGGTTTAAAGAATTATACTATTGGGATACCTATTGGATTATAGAGGGTCTTCTTGTCAGTGGTATGGAAGACACTGTTAAAGGAGTTATAGGAAATCTCATAGAATTGGTAAAGAAACTTGGCCACGTTCCTAACGGTAGTAGATGGTACTACCAAGAAAGAAGTCAACCTCCTCTATTGACAGCAATGATGTCACTTTATATAAAAGCAACGGATGATCTTGAATTCCTTAAGAAAAATGTAGACGTCCTTGAACAAGAGTTGAGATATTGGCTAGACACGCAATTGGTTACATTTAATGTTGGTGACGCGACTTACACTCTTCTGAGATATTACGCACCTAGTAAAGGTCCACGACCCGAGTCGTATTACGAAGATTACAAGGACTCAAGAATGTTTGAAACTCAGGAACTTCAGCAGAGTTTTTATACAGAAATAAAAAGTGCCGCTGAAAGTGGATGGGACTTCTCGACTCGTTGGTTTATTAATAATGACGGAGAGAATAAAGGAAATCTCTCAACAATCAACACCAGCTACCTTATACCCGTTGACCTAAACGCAATATTTGCGAATGCTCTAGATAACATGGCTCACTTCCAAGCTTTACTTCTAAACTATCGCCAGAGTAGTCACTGGGCCTACTTGGCCAAACAGTGGCGCTACAATATAAAGGAAGTCTTCTGGAACAAAGAGGACGGAATTTGGTACGATTGGGATATGAAAAATAACCGCCACAGAAAATACTTCTACCCTAGTAACCTTGCACCGTTGTGGATGAAAGTAGCCAACAAAAGTTTTGTTAATTTAAACTCTAAACGCATTCTCCAATGGCTCAAAAATTCAAACGGCATCGATTATCCTGGTGGAGTACCAGCTTCTTTGATTCGAAGTGGAGAGCAATGGGACTTTCCAAATGCTTGGCCACCTCTAGTCAGTATAGTAGTAAATGCTCTGGAGGCTTTGGAAACTAAAGAATCGTTAGAAGTGGCCTTTGAAATCGCTCAATCGTGGGTTAGGGCGTGCTACAAAGGCTTTAATGCAACCAATCAGTTATTCGAAAAGTATGACGTCGAAATCCCCGGCCGGATAGGAGGTGGAGGTGAATACACTGTTCAGACAGGTTTTGGATGGTCCAATGGGGTAATATTGGAGTTCCTTGCAAAATATGGGCATCGAATGACTTTGTATGACAAAAGTGATGACTATCTTCTAGTACTCCCTTATGACTCACAGCCACAAAAATCTACAGAGCTTTTATCGAAAAAAAACAAATCTGAAAGCTAA

Protein sequence:

>DPOGS215036-PA
MRDEPDSILADFDSLLNQTNDNPSRDQIQKFVDDHFDSIGELEEWIPTDFNKNPAFLNGVRDDNLRKMGQNINDIWPTLGRKVKSVVFEHSERFSFIPVTNGFIIPGGRFKELYYWDTYWIIEGLLVSGMEDTVKGVIGNLIELVKKLGHVPNGSRWYYQERSQPPLLTAMMSLYIKATDDLEFLKKNVDVLEQELRYWLDTQLVTFNVGDATYTLLRYYAPSKGPRPESYYEDYKDSRMFETQELQQSFYTEIKSAAESGWDFSTRWFINNDGENKGNLSTINTSYLIPVDLNAIFANALDNMAHFQALLLNYRQSSHWAYLAKQWRYNIKEVFWNKEDGIWYDWDMKNNRHRKYFYPSNLAPLWMKVANKSFVNLNSKRILQWLKNSNGIDYPGGVPASLIRSGEQWDFPNAWPPLVSIVVNALEALETKESLEVAFEIAQSWVRACYKGFNATNQLFEKYDVEIPGRIGGGGEYTVQTGFGWSNGVILEFLAKYGHRMTLYDKSDDYLLVLPYDSQPQKSTELLSKKNKSES-
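
Consistency check (illustrative sketch): the lengths reported in this record can be cross-checked with simple arithmetic. The Python sketch below uses only the coordinates and lengths listed in the record. It assumes that the genomic coordinates are 1-based and inclusive, and that the transcript consists entirely of coding sequence ending in a stop codon; neither assumption is stated in the record. The function names are hypothetical helpers, not part of any database tooling.

```python
# Values copied from the record; nothing here is fetched from a database.
GENOMIC_START = 538728      # scaffold DPSCF300208
GENOMIC_END = 540335
TRANSCRIPT_LEN_BP = 1608    # DPOGS215036-TA
PROTEIN_LEN_AA = 535        # DPOGS215036-PA


def span_length(start: int, end: int) -> int:
    """Length of a closed interval (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_len_bp: int) -> int:
    """Residues encoded by a CDS that ends in a stop codon (assumption)."""
    if cds_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_len_bp // 3 - 1  # the stop codon encodes no residue


genomic_len = span_length(GENOMIC_START, GENOMIC_END)
protein_len = expected_protein_length(TRANSCRIPT_LEN_BP)

print(genomic_len, protein_len)  # prints: 1608 535
print(genomic_len == TRANSCRIPT_LEN_BP, protein_len == PROTEIN_LEN_AA)
```

Under these assumptions the genomic span equals the transcript length, which would be consistent with a single-exon gene model, and 1608 bp corresponds to 535 residues plus a stop codon, matching the trailing "-" in the protein sequence.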