Monarch geneset OGS2.0

DPOGS202722
Transcript: DPOGS202722-TA, 861 bp
Protein: DPOGS202722-PA, 286 aa
Genomic position: DPSCF300272 + 184809-186917
RNAseq coverage: 207x (Rank: top 46%)
Annotation
Heliconius       HMEL007890        6e-137  81.32%
Bombyx           BGIBMGA008433-TA  4e-147  88.97%
Drosophila       Gmd-PA            1e-121  67.48%
EBI UniRef50     UniRef50_F1KSH1   3e-113  66.08%  GDP-mannose 4,6 dehydratase 1 n=3 Tax=Opisthokonta RepID=F1KSH1_ASCSU
NCBI RefSeq      XP_001863206.1    4e-128  72.73%  GDP-mannose 4,6 dehydratase [Culex quinquefasciatus]
NCBI nr blastp   gi|170054607      7e-127  72.73%  GDP-mannose 4,6 dehydratase [Culex quinquefasciatus]
NCBI nr blastx   gi|170054607      7e-123  72.73%  GDP-mannose 4,6 dehydratase [Culex quinquefasciatus]
Group
Gene Ontology    GO:0019673  1.5e-158  GDP-mannose metabolic process
                 GO:0005622  1.5e-158  intracellular
                 GO:0008446  1.5e-158  GDP-mannose 4,6-dehydratase activity
                 GO:0044237  6.3e-50   cellular metabolic process
                 GO:0003824  6.3e-50   catalytic activity
                 GO:0050662  6.3e-50   coenzyme binding
                 GO:0005488  2e-32     binding
KEGG pathway     cqu:CpipJ_CPIJ012919  1e-127
                 K01711 (E4.2.1.47, gmd) maps-> Amino sugar and nucleotide sugar metabolism
                                                Fructose and mannose metabolism
InterPro domain  [15-285]  IPR006368  1.5e-158  GDP-mannose 4,6-dehydratase
                 [15-188]  IPR001509  6.3e-50   NAD-dependent epimerase/dehydratase
                 [14-126]  IPR016040  2e-32     NAD(P)-binding domain
Orthology group  MCL12996  Single-copy universal gene

Nucleotide sequence:

>DPOGS202722-TA
ATGGTTTCTCGTCAAACAATACCTCGTTTACAATACCTGATAAGACCAAAGGAAATATATAACTTGGGGGCTCAATCCCACGTTAAGGTGTCGTTCGAGTTGAGTGAATATACAGCTCAAGTAGACGCTCTGGGTACCCTTAGACTGCTGGAAGCGGTGCGGACTGCTGGATTAGAGAAGGAAACTAGAATATACCAGGCTTCCACCTCGGAACTATACGGGAAAGTATTAGAAGTTCCCCAGAATGAGAAAACGCCGTTCTATCCAAGGTCGCCTTATGCGTGTGCTAAGCTCTACGGTCACTGGATAGTAGTGAACTATAGGGAGGCGTACGGGATGTTCGCGTGTAACGGCTTACTCTTCAACCATGAGAGCCCGAGGCGGGGGGAGAACTTCGTAACGAGAAAAATAACCCGCGGAGTGGCCAAGATACAGCTGGGATTGATGTCGCATCTGGAAATGGGCAATTTGGATAGCAAAAGAGATTGGGGACATGCCAAGGATTATGTAGAGGCTATGTGGCTGATACTGCAGCAGGATGAGCCGGAAGATTTCGTGGTTGCCTCCGGCGAGGCTCACAGTGTGAGGGAATTCATAGAGAAGGCTTTCGCGTGTGTGGGGAGAGGGGTGGTGTGGAGGGGGGAGGGGGTGCACGAAACTGGTCACGATAAACATACGGACCAACTACTCGTTAAGGTCAACCCAAAATATTTCAGACCCACGGAAGTGGATCTTCTTCTAGGTGACGCGTCAAAAGCCAAACAGAAATTAGGTTGGACCAACAAAACCACCTTCGAAGAGCTGGTCAAAGATATGGTGGAGGCCGACCTTGAACTCATGAAGAAAAACCCCGAGGCATAA

Protein sequence:

>DPOGS202722-PA
MVSRQTIPRLQYLIRPKEIYNLGAQSHVKVSFELSEYTAQVDALGTLRLLEAVRTAGLEKETRIYQASTSELYGKVLEVPQNEKTPFYPRSPYACAKLYGHWIVVNYREAYGMFACNGLLFNHESPRRGENFVTRKITRGVAKIQLGLMSHLEMGNLDSKRDWGHAKDYVEAMWLILQQDEPEDFVVASGEAHSVREFIEKAFACVGRGVVWRGEGVHETGHDKHTDQLLVKVNPKYFRPTEVDLLLGDASKAKQKLGWTNKTTFEELVKDMVEADLELMKKNPEA-