Monarch geneset OGS2.0

DPOGS202692
Transcript: DPOGS202692-TA  912 bp
Protein: DPOGS202692-PA  303 aa
Genomic position: DPSCF300324 - 71906-74232
RNAseq coverage: 170x (Rank: top 51%)
Annotation
Heliconius: HMEL007870  2e-124  69.77%
Bombyx: BGIBMGA004859-TA  4e-99  67.06%
Drosophila: (no hit listed)
EBI UniRef50: UniRef50_E2AJ86  6e-67  45.88%  N-acetylneuraminate lyase n=11 Tax=Endopterygota RepID=E2AJ86_CAMFO
NCBI RefSeq: XP_001813448.1  4e-74  46.53%  PREDICTED: similar to N-acetyl neuraminate lyase [Tribolium castaneum]
NCBI nr blastp: gi|332376232  5e-77  47.97%  unknown [Dendroctonus ponderosae]
NCBI nr blastx: gi|332376232  4e-75  47.97%  unknown [Dendroctonus ponderosae]
Group
Gene Ontology: GO:0008152  1.8e-72  metabolic process
    GO:0003824  1.8e-72  catalytic activity
    GO:0016829  1.4e-70  lyase activity
KEGG pathway: tca:100142385  1e-73
    K01639 (E4.1.3.3, nanA, NPL) maps-> Amino sugar and nucleotide sugar metabolism
InterPro domain: [7-302]  IPR013785  1.8e-72  Aldolase-type TIM barrel
    [4-303]  IPR002220  1.4e-70  Dihydrodipicolinate synthetase-like
Orthology group: MCL11472 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202692-TA
ATGGTTGTTTTTAACGCTAGAGGCTTAGTGCCGCCGGTTTTCACCGCTCTGAATGATGACTACAGTGTGAATTATGCTGCCATTCCTGGTTACGCCAATTATCTGGCTTCGAATGGAATAAAGGGAGTCCTGGTTGGAGGCACTACAGGAGAAAATATGTCTCTCTCTGTATCTGAAAGAAAGAAAATAGCTGATGAGTGGGTGAAGGCCGGAAAGATTCACGGCTTGCACATCATGGTACAAGTTGGTGGGGCGCCTTTTGTAGACGTCATTGAATTGGCAAAACATTGCTCGAAAATCGGAGTGGATTCGTTACTTACATTACCGGAGCTCTACTTCAAGCCTCAATCGGTAACCGAGTTGGTTTCGTATGTTGAATTGGTAGCGCAGGCTGCCCCCAACTTACCGGTTTTATACTATCATATACCATTTATGAGTAATGTTGCCATGAACATGCCAGCTTTTGTGACAGAAGCGACAAAACGGATCCCCAATTTCAAAGGTCTCAAATTCACCAGCAACGATTTGTCTGAAGGGTCCCAAACCTTACGAGCCTTGAAGAATGATCAGGAGATATTCTTAGGAGCGGATACTCTGCTGGCTCCAGCGGTACTTCTCGGTATAAAGTCCAGTATCGGTACCACGTACAATCTGTTCCCGCGGCAAGCTCAGGATATAATGGATGCGGTAGCTTGCTCAGACCTTGAACGTGCTAAAGCTTTACAGGAACAGTTGAACAAAGCTGTAGAGGCTTTCACGGCCGAAGGCCCCTGGGTCCCCACTTTAAAGGCTGCGATGGAGATCGTTACCGGCATGAAATTCGGCCCCCCGGCCTTACCTCAAAGACCGATTTCTGAAGCAGCGAGAAAAAGAATTGAAGAAGAGCTTAGGATTTTGAAACTAATAAATTGA

Protein sequence:

>DPOGS202692-PA
MVVFNARGLVPPVFTALNDDYSVNYAAIPGYANYLASNGIKGVLVGGTTGENMSLSVSERKKIADEWVKAGKIHGLHIMVQVGGAPFVDVIELAKHCSKIGVDSLLTLPELYFKPQSVTELVSYVELVAQAAPNLPVLYYHIPFMSNVAMNMPAFVTEATKRIPNFKGLKFTSNDLSEGSQTLRALKNDQEIFLGADTLLAPAVLLGIKSSIGTTYNLFPRQAQDIMDAVACSDLERAKALQEQLNKAVEAFTAEGPWVPTLKAAMEIVTGMKFGPPALPQRPISEAARKRIEEELRILKLIN-
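
Consistency check (sketch): the 912 bp transcript and the 303 aa protein above should agree, since 303 codons plus one stop codon is 304 x 3 = 912 nucleotides. The Python sketch below shows one way to verify this by translating the nucleotide sequence. It assumes the standard genetic code and writes the stop codon as "-", as the protein record above does. The names CODON_TABLE, translate and expected_protein_length are hypothetical helpers introduced for illustration, not part of the database.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varies fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are written as stop_symbol, matching the "-" used at the
    end of the protein record above.
    """
    seq = nucleotides.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(transcript_bp):
    """Residue count implied by a CDS length that includes one stop codon."""
    return transcript_bp // 3 - 1


# First eight and last five codons of DPOGS202692-TA, copied from the record.
first_codons = "ATGGTTGTTTTTAACGCTAGAGGC"
last_codons = "AAACTAATAAATTGA"
print(translate(first_codons))
print(translate(last_codons))
print(expected_protein_length(912))
```

Passing the full DPOGS202692-TA sequence to translate and comparing the result with DPOGS202692-PA would extend the same check to every residue.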