Monarch geneset OGS2.0

DPOGS213269
Transcript        DPOGS213269-TA  1194 bp
Protein           DPOGS213269-PA  397 aa
Genomic position  DPSCF300264 - 53338-56011
RNAseq coverage   269x (Rank: top 40%)
Annotation
Heliconius      HMEL016669        0.0     79.35%
Bombyx          BGIBMGA001185-TA  0.0     75.44%
Drosophila      CG17065-PB        9e-127  53.75%
EBI UniRef50    UniRef50_B4JKA6   3e-121  53.90%  GH12083 n=3 Tax=Metazoa RepID=B4JKA6_DROGR
NCBI RefSeq     XP_001655220.1    2e-141  58.90%  n-acetylglucosamine-6-phosphate deacetylase [Aedes aegypti]
NCBI nr blastp  gi|157128842      3e-140  58.90%  n-acetylglucosamine-6-phosphate deacetylase [Aedes aegypti]
NCBI nr blastx  gi|157128842      1e-137  58.90%  n-acetylglucosamine-6-phosphate deacetylase [Aedes aegypti]
Group
Gene Ontology    GO:0008448  4.8e-87  N-acetylglucosamine-6-phosphate deacetylase activity
                 GO:0006044  4.8e-87  N-acetylglucosamine metabolic process
                 GO:0016787  5.5e-16  hydrolase activity
                 GO:0016810  3e-11    hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds
KEGG pathway     aag:AaeL_AAEL002430  5e-141
                 K01443 (E3.5.1.25, nagA, AMDHD2) maps-> Amino sugar and nucleotide sugar metabolism
InterPro domain  [10-390] IPR003764  4.8e-87  N-acetylglucosamine-6-phosphate deacetylase
                 [58-376] IPR006680  5.5e-16  Amidohydrolase 1
                 [6-397]  IPR011059  3e-11    Metal-dependent hydrolase, composite domain
Orthology group  MCL12630  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213269-TA
ATGAAGGCAAAATCGGGGTTAACAAGATTTTCTAATTGTTATATTTTGCGTGACGGAAACATTATAAAAGAAGATTTATGGATACGCAATGGAAAAATTGTAAACCCTGAACAGGTATTTTATGTAGAACAAGAAGAAGCTGACATAACGGTAAACAGCGAAGACTCTCTCATAGTGCCAGGATTTATAGACATTCAAATAAATGGTGGATGGGGTGTAGATTTTTCCTATGATTCAGAAAATGTTGAAGAAGGGGTAAATAAAGTATCAAAACAGCTATTGGCTCATGGAGTGACCTCATTCTGTCCAACTATGGTTACATCTGAGAAAGATAAATATTATAAGATTTTACCCAAAATACAAAAAAGGCAAGGAGGAGAACATGGAGCTACGGTTCTTGGAGTGCATCTTGAAGGGCCATTTATTAGTTTAGCAAAAAAGGGTGCACACAAAGATGAATATATTTTAAATCCTGAAAAGGGGCTCGAATCAATTAAAGAGGTGTATGGATCTTTAGACAATGTAATTTTAGTTACAATAGCCCCAGAATTGCCTGGGGCTTTGGATGCTATAAGAGGGTTATCAAACATGGGCATCAAAGTGGCCCTAGGACATTCATCTGCTAGCCTTGCTCAAGGTGAAGAAGGCATTAAAAAGGGAGCAAACTTAATAACACACTTATTCAATGCTATGCTTCCATTTCATCATCGGGACCCTGGTTTGGTGGGCTTACTTGCTTCGAAGACTGATAGACAAGTTTATTATGGGATAATATCAGATGGCATTCACACTCATCCTGCAGCTTTGAGAATTGCTTGTCGAACTAATCAAGAAGGTTTGATATTAGTGAGTGATGCCGTAGCGGCTCAAGGCCTACCAGATGGTGCATACCGCATCGGACCTCAAGCTGTAAATGTCAATGAAGGCCGCGCATATGTCGCTGGGACCAAAACTCTCTGTGGCAGTACTACTGCCCTGGACCAGTCAATTAAAACATTCAAAGAAGCCACAGAATGTTCACTGGAATATGCTATAGAAGCAGCAACTCTGCATCCAGCCAAGGCTTTGGGAATAGATGATAGGAAGGGCAAATTAAATTTTGGCTTCGACGCAGATTTCGTCATCTTACATCCCAAATCTCTGAATGTTTTGTCCACTTGGATTGCTGGCGAATGCGTCTATCGATCTTCTTAG

Protein sequence:

>DPOGS213269-PA
MKAKSGLTRFSNCYILRDGNIIKEDLWIRNGKIVNPEQVFYVEQEEADITVNSEDSLIVPGFIDIQINGGWGVDFSYDSENVEEGVNKVSKQLLAHGVTSFCPTMVTSEKDKYYKILPKIQKRQGGEHGATVLGVHLEGPFISLAKKGAHKDEYILNPEKGLESIKEVYGSLDNVILVTIAPELPGALDAIRGLSNMGIKVALGHSSASLAQGEEGIKKGANLITHLFNAMLPFHHRDPGLVGLLASKTDRQVYYGIISDGIHTHPAALRIACRTNQEGLILVSDAVAAQGLPDGAYRIGPQAVNVNEGRAYVAGTKTLCGSTTALDQSIKTFKEATECSLEYAIEAATLHPAKALGIDDRKGKLNFGFDADFVILHPKSLNVLSTWIAGECVYRSS-
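
The transcript and protein lengths in this record are consistent with each other: 1194 bp is 398 codons, which is 397 amino acids plus one stop codon (shown as "-" at the end of the protein sequence). The sketch below illustrates this with the standard genetic code, translating only the first 18 and last 15 nucleotides of DPOGS213269-TA. The function name `translate` and its `stop_symbol` parameter are hypothetical names chosen for this example, and use of the standard code is an assumption appropriate for a nuclear gene.

```python
# Minimal sketch: translate short fragments of DPOGS213269-TA with the
# standard genetic code (assumed here; the record does not state the table).

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG x TCAG x TCAG order; "*" marks stops.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(nucleotides, stop_symbol="-"):
    """Translate an in-frame DNA string; stops are written as stop_symbol."""
    if len(nucleotides) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    residues = []
    for i in range(0, len(nucleotides), 3):
        aa = CODON_TABLE[nucleotides[i:i + 3].upper()]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 18 nt and last 15 nt of the transcript, copied from the record.
first_fragment = "ATGAAGGCAAAATCGGGG"
last_fragment = "TATCGATCTTCTTAG"

print(translate(first_fragment))  # start of the protein
print(translate(last_fragment))   # end of the protein, including the stop

# Length bookkeeping: 397 residues plus one stop codon, 3 nt each.
transcript_bp = 1194
protein_aa = 397
print(transcript_bp == 3 * (protein_aa + 1))
```

The two fragments translate to the beginning (MKAKSG) and the end (YRSS-) of DPOGS213269-PA as listed above. Passing the full nucleotide sequence to the same function should reproduce the whole protein sequence.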