Monarch geneset OGS2.0

DPOGS214127
Transcript        DPOGS214127-TA  1167 bp
Protein           DPOGS214127-PA  388 aa
Genomic position  DPSCF300014 - 1501316-1503836
RNAseq coverage   350x (Rank: top 33%)
Annotation
Heliconius      HMEL011374              2e-149  66.84%
Bombyx          BGIBMGA006176-TA        4e-128  55.84%
Drosophila      CG9449-PF               3e-43   29.77%
EBI UniRef50    UniRef50_UPI00015B5770  4e-53   34.10%  UPI00015B5770 related cluster n=1 Tax=unknown RepID=UPI00015B5770
NCBI RefSeq     XP_001605452.1          7e-54   34.10%  PREDICTED: similar to venom acid phosphatase [Nasonia vitripennis]
NCBI nr blastp  gi|156550075            1e-52   34.10%  PREDICTED: venom acid phosphatase Acph-1-like isoform 1 [Nasonia vitripennis]
NCBI nr blastx  gi|156550075            3e-52   34.48%  PREDICTED: venom acid phosphatase Acph-1-like isoform 1 [Nasonia vitripennis]
Group
Gene Ontology    GO:0003993  3.8e-32  acid phosphatase activity
KEGG pathway     ame:100192203  3e-45
                 K01078 (E3.1.3.2) maps-> Riboflavin metabolism
                 gamma-Hexachlorocyclohexane degradation
InterPro domain  [43-331] IPR000560  3.8e-32  Histidine phosphatase superfamily, clade-2
Orthology group  MCL10493  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214127-TA
ATGAAGTCGTTTTGTTGGATAGCCTTAATGGTGGTGGCTGTGTGTGGTGAAGAGACACGTCAGAATCCGTCTGTAGTTGGTCGTATACGAAACAATGGTGCGCTGAATGAAGACGCCGAAAACACCGAATTAGTTCAGGCTTTCGTGGTTTTCCGTCACGGCGATAGGACCCCAGATGAGGCTGAGATAGAAAAGTATCCCGCTGATGTTAAAAATAATGACATTTTCTTTCCTTATGGAACTAAAGCATTGACTAATAAAGGGAAGCAGCGAGGATATCTCGTAGGAGAGTATCTCAGAAAACGCTACGATAACTTCATATCCCGCTTATATCTGCCGGACGAGATTTCAATACGAACGACATCCTTTGCACGGACTAAGATGACGGCGTTGACTGCTCTGGCCGCACTATATATACCTCCACCAGCTCAGAAATGGAACCCCTTCCTTAATTGGCAGCCTGTTCCATATGACACCATGGCGGCGGAAGACGACGACTTAATGTATTACTACAATTGCCCACGTTATCTAAAACTGAAAGATGCTGTCAACGATTACCCAGAGTTCCAACCAAAGGTGAAATCCTACGAAGGACTATTTAATTTTATAAGTTCACAAACTGGAACTAATATCACAACTCCTGATGACGTTTTCTTCCTAGATAATCTCTTTCAAACGTTGGAGAACGTTGGCGTGAGCCCTCCAAATTGGGCACAAAAGGTGATGCCGAAAATAAAAGAAGTAACAAAATTAGAATACGCCATAGAATTTTATACAAGTGAGGAGATACGATTAGCCTCAGGAGTCCTTCTGATGGATATACTGAACGCGACGAGCTCCGTTATAGCAGGTGATAAGGATCGCCCGAAATTATATTTAATATCAGCTCATGAAAACAATGTAGCCGGACTCATGGCCGCCATAAGGGTCTTTAAACCTCATCAGCCCCGATATGGAGCAACCATATCGCTTGAATTGAGGAGGCGACCATCGACGGGACAGTACGGATTTATGGCTGTGTATGCTGGAAATGCTGGCGGTCCGGGAGAGATTTTGCCTATCGCTGGCTGCGGGGGACAAACTTTCTGCGACTATAACACCTTTATACAACTGACAAGAGATAATGTTTTGCCGAGAAGTGAATTAAAAACTCTGTGTTATAGTTAA

Protein sequence:

>DPOGS214127-PA
MKSFCWIALMVVAVCGEETRQNPSVVGRIRNNGALNEDAENTELVQAFVVFRHGDRTPDEAEIEKYPADVKNNDIFFPYGTKALTNKGKQRGYLVGEYLRKRYDNFISRLYLPDEISIRTTSFARTKMTALTALAALYIPPPAQKWNPFLNWQPVPYDTMAAEDDDLMYYYNCPRYLKLKDAVNDYPEFQPKVKSYEGLFNFISSQTGTNITTPDDVFFLDNLFQTLENVGVSPPNWAQKVMPKIKEVTKLEYAIEFYTSEEIRLASGVLLMDILNATSSVIAGDKDRPKLYLISAHENNVAGLMAAIRVFKPHQPRYGATISLELRRRPSTGQYGFMAVYAGNAGGPGEILPIAGCGGQTFCDYNTFIQLTRDNVLPRSELKTLCYS-
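
As a consistency check on the two sequences above, here is a minimal Python sketch, assuming the standard genetic code applies to this transcript. The function name `translate` and the use of `-` as the stop symbol (chosen to match the trailing `-` in the protein record) are illustrative assumptions, not part of the geneset release. It translates only the first and last few codons of DPOGS214127-TA and checks that the stated lengths agree.

```python
# Build the standard genetic code table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(nt, stop="-"):
    """Translate a nucleotide string codon by codon (hypothetical helper).

    Trailing bases that do not fill a complete codon are ignored.
    Stop codons are written as `stop`.
    """
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        protein.append(stop if aa == "*" else aa)
    return "".join(protein)


# First 15 and last 12 bases of DPOGS214127-TA, copied from the record.
head = "ATGAAGTCGTTTTGT"
tail = "TGTTATAGTTAA"

print(translate(head))  # MKSFC, the first five residues of DPOGS214127-PA
print(translate(tail))  # CYS-, the last three residues plus the stop

# Length check: 388 residues plus one stop codon, three bases each.
print((388 + 1) * 3 == 1167)  # True
```

To check the whole record, pass the full nucleotide string to `translate` and compare the result with the protein line.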