Monarch geneset OGS2.0

DPOGS215737
Transcript        DPOGS215737-TA  948 bp
Protein           DPOGS215737-PA  315 aa
Genomic position  DPSCF300041 + 652798-671496
RNAseq coverage   538x (Rank: top 23%)
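
The summary values above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the database record: it assumes the genomic position reads as scaffold, strand, then 1-based inclusive start-end coordinates, and that the transcript length covers only the coding sequence plus one stop codon. The regular expression is a hypothetical parser written for this one line.

```python
import re

# Values copied from the record summary.
transcript_bp = 948
protein_aa = 315
position = "DPSCF300041 + 652798-671496"

# Hypothetical parser: assumes "<scaffold> <strand> <start>-<end>".
match = re.fullmatch(r"(\S+) ([+-]) (\d+)-(\d+)", position)
scaffold, strand = match.group(1), match.group(2)
start, end = int(match.group(3)), int(match.group(4))

# Assumption: coordinates are 1-based and inclusive.
genomic_span = end - start + 1

# A complete open reading frame has one codon per residue plus a stop codon.
expected_cds_bp = 3 * (protein_aa + 1)

print(scaffold, strand, genomic_span)      # DPSCF300041 + 18699
print(expected_cds_bp == transcript_bp)    # True
```

Under these assumptions the locus spans 18,699 bp on the scaffold while the transcript is 948 bp, which would be consistent with a gene model containing introns.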
Annotation
Heliconius        HMEL004071        2e-165  99.28%
Bombyx            BGIBMGA003610-TA  6e-128  95.48%
Drosophila        CG9008-PA         2e-136  79.78%
EBI UniRef50      UniRef50_Q9V3D1   2e-134  79.78%  CG9008, isoform A n=28 Tax=Pancrustacea RepID=Q9V3D1_DROME
NCBI RefSeq       XP_002428429.1    2e-144  85.51%  conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastp    gi|242015576      3e-143  85.51%  conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastx    gi|242015576      2e-142  85.51%  conserved hypothetical protein [Pediculus humanus corporis]
Group
Gene Ontology     GO:0030246  1.2e-77  carbohydrate binding
                  GO:0005975  1.2e-77  carbohydrate metabolic process
                  GO:0003824  1.2e-77  catalytic activity
                  GO:0016853  1.5e-43  isomerase activity
KEGG pathway      phu:Phum_PHUM380460  5e-144
                  K01792 (E5.1.3.15) maps-> Glycolysis / Gluconeogenesis
InterPro domain   [6-275]  IPR014718  1.2e-77  Glycoside hydrolase-type carbohydrate-binding, subgroup
                  [8-275]  IPR011013  4.4e-55  Glycoside hydrolase-type carbohydrate-binding
                  [6-274]  IPR008183  1.5e-43  Aldose 1-epimerase
Orthology group   MCL12659  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215737-TA
ATGGCAGCGACGAGTGTTGTCGTTCTGGACAGAGGCAACAATACAACTTGTACCGTAAATCTTTTCGGTGCTACAGTGGTATCATGGCGGGTTAATAATCAAGAACAATTATTTGTAAGTAAACAGGCCGTATTCGATGGGAAGAGGGCGATACGAGGAGGAATACCATTCGTATTTCCTCAATTCGGTCAATGGGCGTTCGGACCCCAGCATGGGTTCGCGCGCGTGGCTCGCTGGCACGTCGAGAAGATGCCAGAGAGACTGCCGTCTGGAGACGTGGAAGCTGTCTTCAGTCTCATGGATGACGACTTCACTAGATCCATGTGGCACTTCCAGTTCAGATTGACTTACCGGCTCATACTCCGCGAGAAGGAGTTGCACTTCAACATCGGCGTGTACAACCCCAGCAAGGAGTTGACCTTCAGCTGTCAACTGTTACTGCACACGTACTTCAAGGTGCCGGACGTGAGGCGCTGTCAGATAACCGGCATGCACGGCTGTATGTTTATTGATAAGACCCGTGAAGGCGCCGTGTACCAGGAAACCCGCGAGGTGGTCACCATCAATGAGTGGACGGACCGCGTGTATCAGAACACGATGCAGGAGCACATCATCACCAACGTGGTCAGCGGCCGGAAGATGAGGATACAGAAGTACAACTTCCCAGATACAGTGATTTGGAATCCTTGGGCGGAGTTCGCTAAGGAGATACCTGATTTCGGTGACGACGAGTTCCCGAACATGGTGTGTGTGGAAGCGGGCCGGGTCGCTGCACCCATTGTGCTGCTCCCAGGGACGGCCTTCGAAGCCTCACAGATATTACAGGTTTGGAGAGAAATAGGATTGGCTAGCCTCTCACTGCCATTATTTTGCTACCAGAAATATCTCAAGGATGTATTAGACTGCATTGTTGTCATGGAAGACAATTGTCCAACATGTAGAAGTTGA

Protein sequence:

>DPOGS215737-PA
MAATSVVVLDRGNNTTCTVNLFGATVVSWRVNNQEQLFVSKQAVFDGKRAIRGGIPFVFPQFGQWAFGPQHGFARVARWHVEKMPERLPSGDVEAVFSLMDDDFTRSMWHFQFRLTYRLILREKELHFNIGVYNPSKELTFSCQLLLHTYFKVPDVRRCQITGMHGCMFIDKTREGAVYQETREVVTINEWTDRVYQNTMQEHIITNVVSGRKMRIQKYNFPDTVIWNPWAEFAKEIPDFGDDEFPNMVCVEAGRVAAPIVLLPGTAFEASQILQVWREIGLASLSLPLFCYQKYLKDVLDCIVVMEDNCPTCRS-
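
The two sequences above can be spot-checked against each other by translating the ends of the transcript. This is a minimal sketch, assuming the standard genetic code and assuming the trailing `-` in the protein record marks the stop codon; `translate` is an illustrative helper, not a tool used by the database. Only the first 15 and last 12 nucleotides of DPOGS215737-TA are used.

```python
# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, stop="-"):
    """Translate in frame 0; stop codons are written as `stop`."""
    usable = len(dna) - len(dna) % 3  # ignore any incomplete trailing codon
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        residues.append(stop if aa == "*" else aa)
    return "".join(residues)


# First five codons and last four codons of the transcript record.
head = translate("ATGGCAGCGACGAGT")
tail = translate("TGTAGAAGTTGA")

print(head)  # MAATS
print(tail)  # CRS-
```

Both fragments agree with the start (`MAATS`) and end (`CRS-`) of DPOGS215737-PA. Translating the full transcript the same way would be the natural next step for a complete check.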