Monarch geneset OGS2.0

DPOGS213511
Transcript: DPOGS213511-TA, 909 bp
Protein: DPOGS213511-PA, 302 aa
Genomic position: DPSCF300033 - 998409-1000727
RNAseq coverage: 392x (Rank: top 31%)
Annotation
Heliconius       HMEL007781         1e-164   89.40%
Bombyx           BGIBMGA011791-TA   2e-138   78.16%
Drosophila       CG5567-PA          9e-85    51.15%
EBI UniRef50     UniRef50_D2A390    1e-90    54.64%   Putative uncharacterized protein GLEAN_07560 n=1 Tax=Tribolium castaneum RepID=D2A390_TRICA
NCBI RefSeq      XP_974660.1        3e-91    54.64%   PREDICTED: similar to 4-nitrophenylphosphatase [Tribolium castaneum]
NCBI nr blastp   gi|322786161       5e-91    54.52%   hypothetical protein SINV_01329 [Solenopsis invicta]
NCBI nr blastx   gi|322786161       6e-88    54.52%   hypothetical protein SINV_01329 [Solenopsis invicta]
Group
Gene Ontology    GO:0008152   8.1e-87   metabolic process
                 GO:0016791   8.1e-87   phosphatase activity
                 GO:0016787   3.2e-50   hydrolase activity
                 GO:0003824   2e-08     catalytic activity
KEGG pathway     tca:663527   8e-91
                 K01101 (E3.1.3.41) maps-> gamma-Hexachlorocyclohexane degradation
InterPro domain  [25-296]   IPR006349   8.1e-87   2-phosphoglycolate phosphatase, eukaryotic
                 [23-299]   IPR023214   1.6e-58   HAD-like domain
                 [27-268]   IPR006357   3.2e-50   HAD-superfamily hydrolase, subfamily IIA
                 [92-219]   IPR023215   1.1e-32   Nitrophenylphosphatase-like domain
                 [25-252]   IPR005834   2e-08     Haloacid dehalogenase-like hydrolase
Orthology group  MCL12544   Single-copy universal gene

Nucleotide sequence:

>DPOGS213511-TA
ATGTTCAAATCTTCAGTTTTTAATCTTCAAGAGGCGACAGCTGATCAGGTCAAGGAGTTTTTGAATTCCTTTGATACTGTTTTAAGTGATTGTGATGGTGTGCTATGGATTAATAACAGCGCCATACCAGGTTCAGCGGAGGCTATGAACTTTTTTCGGAAATTAGGAAAAAGAATTTTTTACGTTACCAATAACTCTACTAAAATTCGTAGCGATTTTGCTGTCAAAGCGCAACAACTAGGGTTTATTGCAGAACCGGAAGAAATTCTATCAACAGCTTATTTAGTAGCCCATTACTTAAAAGGTATCGGATTCAGAAAAAAAGTTTACCTAATAGGTTCCAATGGCATCGGTGACGAGTTGAAAGCTGTGGGAATACGACACATAGGTGTTGGACCAGACCAAGTCAAACAGGATTTCAAATCCATGAATTCATCAGATTTGGATCCAGAAGTGGGAGCGGTAGTTGTTGGTTTCGATGAACACATAAGCTATCCGAAGTTTATGAAGGCTGCATCCTATCTCGCCAACGAGCAGTGTCTATTTGTAGCCACAAACACAGACGAGAGGTTCCCTAAATCTAGTACGGTCATAATACCCGGAACTGGGACACTGGTTAGAGCGGTGGAAACATGCTCAGAGAGAAAAGCACTAGTTCTCGGGAAACCACATGATTATGTAAGGAAGTTTCTAGAATCATTCGGCTTGGATCCTGCGAGGACTTTGATGATTGGTGACAGATGCAACACGGACATAGAGTTTGGTGTACGTTGTGGTTTCCAAACTTTGCTCGTCATGACCGGAGTGACTTCACCAAAAGACTTGGAAAGAATGAGAAGTGACAAGAAACCTCCGCTACCCGATGTCGTTTTACCAAAACTGGGCGACATACTGAGCCTCGCCTCATAG

Protein sequence:

>DPOGS213511-PA
MFKSSVFNLQEATADQVKEFLNSFDTVLSDCDGVLWINNSAIPGSAEAMNFFRKLGKRIFYVTNNSTKIRSDFAVKAQQLGFIAEPEEILSTAYLVAHYLKGIGFRKKVYLIGSNGIGDELKAVGIRHIGVGPDQVKQDFKSMNSSDLDPEVGAVVVGFDEHISYPKFMKAASYLANEQCLFVATNTDERFPKSSTVIIPGTGTLVRAVETCSERKALVLGKPHDYVRKFLESFGLDPARTLMIGDRCNTDIEFGVRCGFQTLLVMTGVTSPKDLERMRSDKKPPLPDVVLPKLGDILSLAS-
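
The record lists a 909 bp transcript and a 302 aa protein, which is consistent with 303 codons where the last one is a stop (shown as "-" at the end of the protein sequence above). As a minimal sketch of how the two sequences relate, the Python snippet below translates a coding sequence with the standard genetic code. The function name `translate` and the `stop` parameter are illustrative and not part of the database; the short fragments used in the checks are copied from the start and end of DPOGS213511-TA.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def translate(cds: str, stop: str = "-") -> str:
    """Translate a coding sequence in frame 1 (hypothetical helper).

    Stop codons are written as `stop` to match the "-" used in this record.
    Trailing bases that do not fill a codon are ignored.
    """
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop if aa == "*" else aa)
    return "".join(residues)


# Length check from the record: 909 bp = 303 codons = 302 residues + 1 stop.
TRANSCRIPT_BP = 909
PROTEIN_AA = 302
codons = TRANSCRIPT_BP // 3

# First 60 bases and last 12 bases of DPOGS213511-TA.
head = "ATGTTCAAATCTTCAGTTTTTAATCTTCAAGAGGCGACAGCTGATCAGGTCAAGGAGTTT"
tail = "CTCGCCTCATAG"
print(translate(head))
print(translate(tail))
```

Passing the full nucleotide sequence to `translate` should give the full protein sequence; only the two fragments are checked here.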