Monarch geneset OGS2.0

DPOGS203487
TranscriptDPOGS203487-TA840 bp
ProteinDPOGS203487-PA279 aa
Genomic positionDPSCF300055 - 844038-848630
RNAseq coverage1263x (Rank: top 10%)
Annotation
HeliconiusHMEL0142986e-13283.27% 
BombyxBGIBMGA008293-TA1e-12576.59% 
DrosophilaCG1637-PB4e-8158.17% 
EBI UniRef50UniRef50_F4W5K71e-8659.76%Iron/zinc purple acid phosphatase-like protein n=5 Tax=Coelomata RepID=F4W5K7_ACREC
NCBI RefSeqXP_002426882.18e-8655.31%acid phosphatase precursor, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3071801683e-8656.16%Iron/zinc purple acid phosphatase-like protein [Camponotus floridanus]
NCBI nr blastxgi|3838645461e-8659.36%PREDICTED: iron/zinc purple acid phosphatase-like protein-like [Megachile rotundata]
Group
Gene OntologyGO:00468721.4e-29metal ion binding
GO:00039931.4e-29acid phosphatase activity
GO:00167875.9e-15hydrolase activity
KEGG pathwayafm:AFUA_7G008005e-08 
 K01078 (E3.1.3.2)maps-> Riboflavin metabolism
    gamma-Hexachlorocyclohexane degradation
InterPro domain[8-125] IPR0089631.4e-29Purple acid phosphatase-like, N-terminal
[31-126] IPR0159146.7e-24Purple acid phosphatase, N-terminal
[133-278] IPR0048435.9e-15Metallophosphoesterase domain
Orthology groupMCL13376 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203487-TA
ATGAAATTGTTAATTTTTGTTGTTGTTATAACTTTATCGAAAGCCAATAAAACTCCCCGCGTTAGTCCCGGATACGATTGTGATTATTGCCAACCAGAACAGATTCATATTTCCTTTGGATCCAAAACCAACGACATAGTGGTAACATGGACAACATTCAACGACACTCAAGAGTCACGAGTACAGTATGGGGTGGGGGTCATGGACCAGGAGGCCGTGGGCTCCAGCACCGTGTTCACTGACGGAGGAAGGAGGAAGAGGAACATGTGGATACATCGCGTCCTGTTGAAAGACCTCAACTTTAATACTAAATATGTATATCACGCGGGGTCGGTGTACGGTTGGTCGGAGCAGTTGTCGTTCAAGACTCCGCCCCAGGGCGAAGACTGGGTGGTGAGGGCCGCCGTCTACGGAGACATGGGCAGCAAGAACGCGCATTCGCTGTCGTACCTCCAGGACGAGGCCGAGCGCGGCCACTTCGACCTGATCCTGCACGTGGGAGACTTCGCCTACGACATGGACACAGACGACGCGCTCGTGGGAGACGAGTTCATGAGGCAGATACAACCGCTGGCGGCCGGCCTGCCCTACATGACCTGTCCGGGGAACCACGAGTCCAAGTACAACTTCAGTAACTACCGCAACAGATTCTCGATGCCGGGCGACTCTGAGAGTATGTTCTACTCCTTCGACCTGGGCCCGGTCCACTTCGTGTCCATCTCCACGGAGTTCTACTACTTCCTCAACTACGGCTTCAAGATGGTCGCCAACCAGTTCTACTGGCTCGAGGAAGATCTCAGGAAGGCCAACGAACCGGAAAATAGGCAAGACATCCTGTGA

Protein sequence:

>DPOGS203487-PA
MKLLIFVVVITLSKANKTPRVSPGYDCDYCQPEQIHISFGSKTNDIVVTWTTFNDTQESRVQYGVGVMDQEAVGSSTVFTDGGRRKRNMWIHRVLLKDLNFNTKYVYHAGSVYGWSEQLSFKTPPQGEDWVVRAAVYGDMGSKNAHSLSYLQDEAERGHFDLILHVGDFAYDMDTDDALVGDEFMRQIQPLAAGLPYMTCPGNHESKYNFSNYRNRFSMPGDSESMFYSFDLGPVHFVSISTEFYYFLNYGFKMVANQFYWLEEDLRKANEPENRQDIL-