Monarch geneset OGS2.0

DPOGS213742
TranscriptDPOGS213742-TA1263 bp
ProteinDPOGS213742-PA420 aa
Genomic positionDPSCF300212 - 874630-883479
RNAseq coverage396x (Rank: top 30%)
Annotation
HeliconiusHMEL0084543e-13672.24% 
BombyxBGIBMGA009232-TA4e-13467.37% 
DrosophilaCG10467-PA1e-8551.90% 
EBI UniRef50UniRef50_E3WQS36e-9751.63%Putative uncharacterized protein n=2 Tax=cellular organisms RepID=E3WQS3_ANODA
NCBI RefSeqXP_001654671.13e-9854.30%aldose-1-epimerase [Aedes aegypti]
NCBI nr blastpgi|1571266107e-9754.30%aldose-1-epimerase [Aedes aegypti]
NCBI nr blastxgi|1571266101e-9754.30%aldose-1-epimerase [Aedes aegypti]
Group
Gene OntologyGO:00168533.3e-132isomerase activity
GO:00193183.3e-132hexose metabolic process
GO:00302465.3e-115carbohydrate binding
GO:00059755.3e-115carbohydrate metabolic process
GO:00038245.3e-115catalytic activity
GO:00082341e-22cysteine-type peptidase activity
GO:00065081e-22proteolysis
KEGG pathwayaag:AaeL_AAEL0105901e-97 
 K01785 (E5.1.3.3, galM)maps-> Glycolysis / Gluconeogenesis
InterPro domain[104-411] IPR0154433.3e-132Aldose 1-epimerase, subgroup
[105-409] IPR0147185.3e-115Glycoside hydrolase-type carbohydrate-binding, subgroup
[105-408] IPR0110134.3e-100Glycoside hydrolase-type carbohydrate-binding
[105-406] IPR0081832.8e-78Aldose 1-epimerase
[12-105] IPR0006681e-22Peptidase C1A, papain C-terminal
Orthology groupMCL12256 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213742-TA
ATGATTACTAATGTCCATACTGTCGCCAGTCTGGTGGGCGCGGAGGATAAGATAATCGAGTCTATAGCAACCCATGGTCCAGTGGCCGTCGCGGTGAACGCGCTCACGTGGCAGAACTACCTTGGCGGTGTCATACAGTACCATTGCAGCGGTAGCCCCAAAGAACTGAACCACGCTGTGGAGCTGGTTGGTTATGATCTAACAGCAGAGGTACCTTACTACATAGCCAAGAACTCGTGGGGCAAAGGCTTTGGTCTCGACGGATATCTTAAACTGGCGATCGGCTGCAACATATGCGGACTAGCCAATGAGGTCCCTGATAGAAACGGTGATATAGAAGATGTAGTCTTGGGTTTTGATGACCTAGAAAGCTATGTCAACCGTAATACACCTTACCTTGGGGCGACGGTAGGAAGATGTGCAAACAGAATCGGGGGAGCCAGTTTTAATATAGACGGAGTTGAATACAAACTTGCCAAGAACGTTGGGCAGGATCACTTACATGGCGGAATTGTTGGCTTTAACAAGGCAAATTGGAATTACATTAGAGATGGCAATAAAGTTATTTTCAGTCATTTATCTAAAGACGGCGACGAGGGTTACCCTGGGGATTTGTTAGCGAATGTCATTTACGAGGTCAAGGATGACGATACACTTTACGTAGAATTTCTGGCGACCGCTACCAAGAGAACCGTTGTTAATCTGACCAACCATTCGTATTTCAATTTGGCGGGACACGATAGTGGTGCGGAGGAATTATACAACCATGTCATCATGATTAATGCCGATAAAATAACAGAGACAACATCAGAATCTATTCCTACGGGTAACTTTATCAAGGTTGGTGGAACGGCTTACGACCTGCGTGCCCCTAAACGACTTGGGGATGCGATGACGTCTACTGGATATGGATTCGATGATAACTTCTGCGTAAACATGTACGATAAGGATTTGACGTTCGTGTCCCGGGTTAGTCACCCCTCATCAGGCAGATACCTGGAAGTATATAGCGACCAGCCGGGAGTACAGCTGTACACGAGCAATTTTCTCCCATCTCCTTACGAAGAGGCTCTTGTTGGAAAGAAAGGTGTTGGATATCGAAGGCATGGTGCCTTCTGCCTGGAGACGCAGAAGTTCCCAGATGCTGTACATCATGACAATTTCCCAAGCGCCATACTCACTCCCGGGGATGTTTACGTACATAAAGTGAATTACAGATTCGGTGTCAATGATGGCGACGCACCGCGCGTCGTCCTAAATTAG

Protein sequence:

>DPOGS213742-PA
MITNVHTVASLVGAEDKIIESIATHGPVAVAVNALTWQNYLGGVIQYHCSGSPKELNHAVELVGYDLTAEVPYYIAKNSWGKGFGLDGYLKLAIGCNICGLANEVPDRNGDIEDVVLGFDDLESYVNRNTPYLGATVGRCANRIGGASFNIDGVEYKLAKNVGQDHLHGGIVGFNKANWNYIRDGNKVIFSHLSKDGDEGYPGDLLANVIYEVKDDDTLYVEFLATATKRTVVNLTNHSYFNLAGHDSGAEELYNHVIMINADKITETTSESIPTGNFIKVGGTAYDLRAPKRLGDAMTSTGYGFDDNFCVNMYDKDLTFVSRVSHPSSGRYLEVYSDQPGVQLYTSNFLPSPYEEALVGKKGVGYRRHGAFCLETQKFPDAVHHDNFPSAILTPGDVYVHKVNYRFGVNDGDAPRVVLN-