Monarch geneset OGS2.0

DPOGS215037
TranscriptDPOGS215037-TA1509 bp
ProteinDPOGS215037-PA502 aa
Genomic positionDPSCF300208 - 535074-536582
RNAseq coverage367x (Rank: top 32%)
Annotation
HeliconiusHMEL0226270.069.64% 
BombyxBGIBMGA005665-TA0.068.39% 
DrosophilaTreh-PC3e-10940.64% 
EBI UniRef50UniRef50_E3UKQ73e-16172.29%Trehalase 1B (Fragment) n=1 Tax=Biston betularia RepID=E3UKQ7_9NEOP
NCBI RefSeqNP_001037458.11e-14350.72%trehalase precursor [Bombyx mori]
NCBI nr blastpgi|3267862139e-16172.29%trehalase 1B [Biston betularia]
NCBI nr blastxgi|3267862132e-17472.29%trehalase 1B [Biston betularia]
Group
Gene OntologyGO:00045551.1e-205alpha,alpha-trehalase activity
GO:00059911.1e-205trehalose metabolic process
GO:00038241.2e-54catalytic activity
KEGG pathwayphu:Phum_PHUM6170105e-137 
 K01194 (E3.2.1.28, treA)maps-> Starch and sucrose metabolism
InterPro domain[1-502] IPR0016611.1e-205Glycoside hydrolase, family 37
[50-483] IPR0089281.2e-54Six-hairpin glycosidase-like
Orthology groupMCL16294 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215037-TA
ATGAAAGAAACAAACAATAAACCTTCACGTGACCGTCTTATGAATTTCGTTGATCAAAATTTCGGAGAAGGCAATGAACTCGAGCATTGGATGCCGCCTGACTTCGATCCGAATCCACCAATCTTAGAACAAATATCTAATCCTAAGTTAAAAGCATTTGCTAAGAGTGTTATAGCGTTATGGGCCAAACTGGGACGAAAAGTACAATCGGATGTAATGCTAAACCCAGATCAGTATAGCTTCTTGTACGTGCCGAACGGATTCATCGTACCGGGAGGTCGATTTAAGGAATTGTATTACTGGGACTCATTTTGGACGATACGCGGCCTCATGATCAGCAACATGACTAAAACAGCTAAAGGTATGATAGAAAACCTGTTGCATTTAGTGAGGAAATATGGTTATATACCGAATGGGAGTCGTGTGTATTACTTAGGGAGAAGTCAACCTCCTTTACTAGCAGCTATGGTAGCCAGCTACTACGAGGCAACAGGGGATCTAGCGTGGATAGAACAACATATTAGCACTGTGGAAAAAGAACTATACTATTGGTTAGACAAAAAGAAAGTGACCGTCGAAATCAATGGTAACAAGTTTATGCTGCTGAGATATCTAGCGGATAAGAAAGATAGAGGTCCTCGTCCGGAATCATATTACGAAGACTATACAAACTCCAGAATCTTCCCTAATGAAGACAGACGAGATGATTTTTATTTAGAAATCAAGAGCGCTGCCGAAAGTGGCTGGGACTTCTCATCACGCTGGTTCGTGTCAGCCGACGGTGCAATTGGAAACTTGACTGATGTTCACGCGACACGGATCCTACCCGTTGATTTGAATGCCATATTCGCTGGTGCCTTGCAAACCGTCGGCGATCTCCATGACATACTGAAACACAGGAGAGAGGCGCAGAAATGGTGGAGCCTAGCCAGGTATTGGAGAAGCGCAATCGAGAATATCATGTATCATGAGGTAGATGGAGTGTGGTACGATTTTGACGCACAGACTGGCTCCCCTAGGAAACATTTTTATCCCAGCTGCGCTACTCCGCTATGGGCTAAGGTTGTCAATGAAACGAAAGCTGATAAATATGCTCTGCTGTTGGTTAATTATCTCAAATCAACCGGCGCCCTTAACTTCCCGGGCGGCGTCCCATCCTCCATACTACATTCGGGTGAACAATGGGATTTTCCGAATGCATGGCCCCCGATGCAAAGTATTTTAATCGGAGCATTGGACACCAGTGGAAACGTAGAAGCACGGAAATTGGCGAAGGAGCTAGCTGGTGTATGGATACGATCAAATTACATAGGTTATAACAACTGGCAGAAAATGTTTGAGAAGTACAGTGCAGTGCATCCAGGGCACGAGGGTGGTGGTGGGGAGTACGTGGTGCAGGACGGCTTCGGGTGGACCAATGGGGTTGTGTTAGAACTGTTACAGAGGTATGGAAAAGACCTCACGCTTCATGAACGACCGGGAGCCACGCCAACGGTTGCTCTTATATAA

Protein sequence:

>DPOGS215037-PA
MKETNNKPSRDRLMNFVDQNFGEGNELEHWMPPDFDPNPPILEQISNPKLKAFAKSVIALWAKLGRKVQSDVMLNPDQYSFLYVPNGFIVPGGRFKELYYWDSFWTIRGLMISNMTKTAKGMIENLLHLVRKYGYIPNGSRVYYLGRSQPPLLAAMVASYYEATGDLAWIEQHISTVEKELYYWLDKKKVTVEINGNKFMLLRYLADKKDRGPRPESYYEDYTNSRIFPNEDRRDDFYLEIKSAAESGWDFSSRWFVSADGAIGNLTDVHATRILPVDLNAIFAGALQTVGDLHDILKHRREAQKWWSLARYWRSAIENIMYHEVDGVWYDFDAQTGSPRKHFYPSCATPLWAKVVNETKADKYALLLVNYLKSTGALNFPGGVPSSILHSGEQWDFPNAWPPMQSILIGALDTSGNVEARKLAKELAGVWIRSNYIGYNNWQKMFEKYSAVHPGHEGGGGEYVVQDGFGWTNGVVLELLQRYGKDLTLHERPGATPTVALI-