Monarch geneset OGS2.0

DPOGS212595
Transcript: DPOGS212595-TA  2103 bp
Protein: DPOGS212595-PA  700 aa
Genomic position: DPSCF300245 - 216708-223023
RNAseq coverage: 1793x (Rank: top 7%)
Annotation
Heliconius: HMEL006784  0.0  88.15%
Bombyx: BGIBMGA005182-TA  0.0  91.32%
Drosophila: Tps1-PA  0.0  72.27%
EBI UniRef50: UniRef50_E2BI26  0.0  78.83%  Alpha,alpha-trehalose-phosphate synthase [UDP-forming] A n=14 Tax=Pancrustacea RepID=E2BI26_HARSA
NCBI RefSeq: XP_001603693.1  0.0  78.47%  PREDICTED: similar to trehalose 6-phosphate synthase [Nasonia vitripennis]
NCBI nr blastp: gi|281372519  0.0  92.30%  trehalose-6-phosphate synthase [Spodoptera litura]
NCBI nr blastx: gi|148540960  0.0  92.44%  trehalose 6-phosphate synthase [Spodoptera exigua]
Group
Gene Ontology: GO:0005992  1.9e-113  trehalose biosynthetic process
    GO:0003824  1.9e-113  catalytic activity
    GO:0008152  8.6e-19  metabolic process
KEGG pathway: nvi:100120006  0.0
    K00697 (E2.4.1.15, otsA)  maps-> Starch and sucrose metabolism
InterPro domain: [5-369]  IPR001830  1.9e-113  Glycosyl transferase, family 20
    [404-643]  IPR003337  8.1e-41  Trehalose-phosphatase
    [557-653]  IPR023214  1.9e-40  HAD-like domain
    [408-604]  IPR006379  8.6e-19  HAD-superfamily hydrolase, subfamily IIB
Orthology group: MCL15680  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212595-TA
ATGCCGGATCGCGCCACCTTCATCGCGGACCACTGGCGAGCCTATTCAAAAATCAACGAGCAATTTGCCGAAAAGACCATCTACGCGCTTAAGCTTTTAAAAGACAGCAAGGAAAAGAGCGGGAACTCCCCGCCTATAGTTTGGGTTCACGATTATCATCTGATGCTGGCAGCTAATTGGATCAGACAGAGAGCGGAGGAAGACGAAATAAGATGCAAACTGGCATTTTTCCTACACATACCCTTCCCCCCCTGGGACATATTCAGACTGTTCCCGTGGTCCGATGAAGTCTTGCAGGGAATCTTGGGTTGCGACATGGTCGGGTTCCACATAACAGACTACTGTTTGAACTTTATAGATTGTTGCCAGAGAAATTTAGGTTGTCGAGTCGATAGGAAGAATCTTCTCGTCGAGTTAGGCGGCCGTACCATTTGCGTGCGGCCGTTGCCCATCGGCGTACCCTTTGACAGATTTGTCTCATTAGCCCAGAATGCAAAACCAGTCCTTTTTACGAATCAGAAAGTCTTACTGGGTGTAGACAGACTGGATTACACCAAAGGCTTAGTACACAGACTGAAAGCCTTCGAAAGACTGCTCGAGAAACATCCGGAACATAGAGAGGAAGTCGTCCTGCTACAGATATCGGTTCCTTCGAGAACTGACGTCAAAGAATACCAGGACCTGAAGGAAGAAATGGATCAATTGGTCGGCAGAATAAACGGAAGGTTCACCACACCGAACTGGTCTCCAATAAGATACATATACGGCTGCGTGGGTCAAGATGAATTGGCGGCTTTCTATCGCGATGCAGCCGTAGCTCTGGTCACACCGCTACGTGATGGGATGAATCTGGTCGCTAAGGAGTTCGTGGCCTGTCAGATAAAGAAGCCTCCGGGGGTCCTGATAGTGTCACCGTTCGCCGGCGCCGGTGAAATGATGCACGAAGCTCTGATATGCAACCCTTACGAACTAGACGACGCCGCTGAAGTCATTCACAGGGCGCTGATTATGCCAGAGGACGAGCGCACGGTCCGCATGAATCATCTGAGACGGCGGGAAAAACAGAACGACGTGGACAGTTGGATGAAGGCGTTCCTCAAGGCGATGGACTCGCTGGAGGAGGAGGCGGACGACATCGGAGCCACGTCCATGCAACCCGTCACCATCGACGACTTCGACGAGTACCTGTCCAAATACATCGGCTACACCCAGAAGTTAGCTCTCCTGCTAGATTATGACGGGACCCTGGCTCCGATCGCGCCACATCCGGACCTCGCCACACTGCCGTTGGAGACGAAGAATACTTTGCAACGACTCTCCAACATGGCCGACGTGTATATAGCTATTGTTTCCGGCAGAAACGTCAACAACGTTAAGGAAATGGTTGGTATAGAGGGCATCACCTACGCAGGAAACCACGGCTTAGAGATATTGCACCCGGACGGCAGCAAGTTCGTTCATCCCATGCCTATGGAGCTGCAAGACAAAGTTGTCGATCTACTCAAAGCATTGCAAGAACAAGTGTGTCGAGACGGAGCCTGGGTGGAAAACAAAGGAGCTCTACTGACCTTCCACTTCCGGGAGACTCCATTGGCTAAGAGGGCGGCTCTAGAGAATACTGCCAAGAAACTAATCACAGCCGCAGGTTTCACACCGGCCCCCGCACATTGCGCCATCGAAGCCAGACCGCCCGTACAGTGGAATAAAGGTCGCGCGTCCATATACATTTTAAGAACGGCTTTCGGTTTGGATTGGAGCGAACGTATCCGAGTAATATACGCTGGCGATGACGTCACCGACGAAGACGCCATGTTGGCTCTGAAAGGTATGGCGGCCACATTCCGTATAGCGTCGTCCACCATAACGAAGACGTCAGCCGAGCGCCGCCTGTCGTCCACCGACTCGGTGCTAGCTATGCTGAAGTGGGTAGAACGACATTTCTCCAACCGCAAGCCGAGAGCTAACTCACTGACATACAAAAACGCAAGATTAGCCCGCGA
CACTATACAAATGCATATGTCTTATCAAATACCGAAAAGGTCGCCGCGTCACACACCCCCCTGCACTCCAGAGAAATCCTCGAGCGGATCGGAATCCAATTAA

Protein sequence:

>DPOGS212595-PA
MPDRATFIADHWRAYSKINEQFAEKTIYALKLLKDSKEKSGNSPPIVWVHDYHLMLAANWIRQRAEEDEIRCKLAFFLHIPFPPWDIFRLFPWSDEVLQGILGCDMVGFHITDYCLNFIDCCQRNLGCRVDRKNLLVELGGRTICVRPLPIGVPFDRFVSLAQNAKPVLFTNQKVLLGVDRLDYTKGLVHRLKAFERLLEKHPEHREEVVLLQISVPSRTDVKEYQDLKEEMDQLVGRINGRFTTPNWSPIRYIYGCVGQDELAAFYRDAAVALVTPLRDGMNLVAKEFVACQIKKPPGVLIVSPFAGAGEMMHEALICNPYELDDAAEVIHRALIMPEDERTVRMNHLRRREKQNDVDSWMKAFLKAMDSLEEEADDIGATSMQPVTIDDFDEYLSKYIGYTQKLALLLDYDGTLAPIAPHPDLATLPLETKNTLQRLSNMADVYIAIVSGRNVNNVKEMVGIEGITYAGNHGLEILHPDGSKFVHPMPMELQDKVVDLLKALQEQVCRDGAWVENKGALLTFHFRETPLAKRAALENTAKKLITAAGFTPAPAHCAIEARPPVQWNKGRASIYILRTAFGLDWSERIRVIYAGDDVTDEDAMLALKGMAATFRIASSTITKTSAERRLSSTDSVLAMLKWVERHFSNRKPRANSLTYKNARLARDTIQMHMSYQIPKRSPRHTPPCTPEKSSSGSESN-
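
The transcript and protein entries above can be cross-checked against each other. The following is a minimal sketch, assuming the transcript is a plain coding sequence translated with the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative and not part of the database. It translates the first and last codons of DPOGS212595-TA, copied from the record, and checks the reported lengths. Note that this sketch writes the stop codon as `*`, whereas the record marks it with `-`.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, one letter per codon."""
    # Drop whitespace so a FASTA body wrapped over several lines can be passed in.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 15 and last 36 nucleotides of DPOGS212595-TA, copied from the record.
head = "ATGCCGGATCGCGCC"
tail = "CCAGAGAAATCCTCGAGCGGATCGGAATCCAATTAA"

print(translate(head))
print(translate(tail))

# Length check: 700 residues plus one stop codon account for 2103 bp.
print(3 * (700 + 1) == 2103)
```

Passing the full nucleotide sequence to `translate` and comparing the result with the protein entry (after replacing the trailing `-` with `*`) would extend the same check to the whole record.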