Monarch geneset OGS2.0

DPOGS208133
Transcript        DPOGS208133-TA  1332 bp
Protein           DPOGS208133-PA  443 aa
Genomic position  DPSCF300154 + 361509-365997
RNAseq coverage   123x (Rank: top 57%)
Annotation
Heliconius        HMEL006223        3e-127  65.87%
Bombyx            BGIBMGA006575-TA  1e-54   61.07%
Drosophila        Mipp2-PA          4e-85   40.72%
EBI UniRef50      UniRef50_E0VP92   2e-91   41.69%  Multiple inositol polyphosphate phosphatase 1, putative n=1 Tax=Pediculus humanus corporis RepID=E0VP92_PEDHC
NCBI RefSeq       XP_002427936.1    4e-92   41.69%  multiple inositol polyphosphate phosphatase 1 precursor, putative [Pediculus humanus corporis]
NCBI nr blastp    gi|242014521      7e-91   41.69%  multiple inositol polyphosphate phosphatase 1 precursor, putative [Pediculus humanus corporis]
NCBI nr blastx    gi|242014521      1e-91   41.78%  multiple inositol polyphosphate phosphatase 1 precursor, putative [Pediculus humanus corporis]
Group
Gene Ontology     GO:0016791  2.6e-61  phosphatase activity
                  GO:0003993  3.2e-20  acid phosphatase activity
KEGG pathway      dme:Dmel_CG4317  3e-83
                  K03103 (MINPP1) maps-> Inositol phosphate metabolism
InterPro domain   [3-443]   IPR016274  2.6e-61  Histidine acid phosphatase, eukaryotic
                  [73-397]  IPR000560  3.2e-20  Histidine phosphatase superfamily, clade-2
Orthology group   MCL16412  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208133-TA
ATGAAACACTTTAGTATAAATTACTTTGCAAGTATAATATATATAATTGTTATCAGTTTGTTAGTAGTTATTAATTATAGTTCGAAAGCTGTGGCTTTAGTAACGAATTCTTCTGATATTCGCAATTATTTAGGAACTCGCACGCCTTATAGATTTAAATCAAACAAAGATGATTCCAAAATAAAATATCCTAATTGTAAACACTCGAAGATTTGGATGCTTTTGAGGCATGGAACCCGTTTGCCTAGTGCCAAGGATATACTGGGCATGAATACGATTCTAAAAGATTTAAAATATAAAATTCTAATGCAAAACAATCATGGAAAGGTTACTTCTATGTTAGGACCGTTAAACAAGGAACAGTTACATTGGTTTTCAAAATGGTCCAGTAATATATCGGTGGAGCAAGAGAAGTTTCTTACCTATGAGGGTCAGGATGAGATGATACTTTTGGCTGAAAGGATGCGGAAGAGATTCCCAAATGCCATAAAAGAGAAATATGACAACAAATCATTTCTGTTTAAATACACGGCAACACAGAGAGCTCAGCAGAGTGCGTTATATTTTACTATTGGGCTATTCGACAGGAAGAAATCAAGAGATGTTATTTTTGAACCTGCCATGAAAGTTGACACAACATTGAGATTTTACAAACACTGTGATAAGTGGCAGAAGCAAGTGAAAAAAAATCCTGAGACATATAAGGAACAAAGGGCATTCGCTGCAAGCCAGGCCATGAATGACACTTTCGATGCTGTGGCAAAGAATCTCGGATTGGAAGGAGTTTTGTCTAAAGAGATTGTTATTTTAATGTACAAGATATGTGGCTATGAAACTTCGTGGCACAAATACTATACATCGCCATGGTGTTACGGGTTCGATCTGAAAAGTATTAAAACCTTGGAATATTATCATGATTTGAAGCACTATTGGTTAGATGGATACGGCCACGAGTTAACGTCTGTCCAGGCCTGCATGATACTCAAAAATATGAAAGAAAATAAAACAGCTGCTTTCCTGTTTTCACATTCTGGAACATTGTTGAAATTGTTGACACATTTAGGCTTATACAAACCACAGACACATCTGAGAGGAGATAGTGTCATAGAGGATAGACTCTGGAGAGCGTCGAATATTGACTGTTTCGCATCAAATATAGCCTTCGTTTTATACAAATGTGATGACGGAGATAAGATATTAACGCTGCATCAAGAGAGAGTTATCAAGCTGCCAATGTGCGAGACGGAGCTCTGTCCGTTGGAACATTTGAAGGCTTATTTTCGTAACACCATACATAATTGTGACTTCGCTGATATGTGTGAAGCTGTTTAA

Protein sequence:

>DPOGS208133-PA
MKHFSINYFASIIYIIVISLLVVINYSSKAVALVTNSSDIRNYLGTRTPYRFKSNKDDSKIKYPNCKHSKIWMLLRHGTRLPSAKDILGMNTILKDLKYKILMQNNHGKVTSMLGPLNKEQLHWFSKWSSNISVEQEKFLTYEGQDEMILLAERMRKRFPNAIKEKYDNKSFLFKYTATQRAQQSALYFTIGLFDRKKSRDVIFEPAMKVDTTLRFYKHCDKWQKQVKKNPETYKEQRAFAASQAMNDTFDAVAKNLGLEGVLSKEIVILMYKICGYETSWHKYYTSPWCYGFDLKSIKTLEYYHDLKHYWLDGYGHELTSVQACMILKNMKENKTAAFLFSHSGTLLKLLTHLGLYKPQTHLRGDSVIEDRLWRASNIDCFASNIAFVLYKCDDGDKILTLHQERVIKLPMCETELCPLEHLKAYFRNTIHNCDFADMCEAV-
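
Consistency check (illustrative sketch, not part of the original record):

The transcript and protein records above can be cross-checked by translating the coding sequence. The sketch below assumes the standard genetic code (NCBI translation table 1) and assumes the trailing "-" in the protein record marks the stop codon. The record does not say how the translation was produced, and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for this example.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
# Assumption: the record's protein was translated with this table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`, mirroring the trailing '-'
    in the protein record (an assumption about that record's convention).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 24 and last 12 nucleotides of DPOGS208133-TA, copied from the record.
first_codons = "ATGAAACACTTTAGTATAAATTAC"
last_codons = "GAAGCTGTTTAA"

print(translate(first_codons))  # MKHFSINY, the start of DPOGS208133-PA
print(translate(last_codons))   # EAV-, the end of DPOGS208133-PA

# Length check: 443 residues plus one stop codon should account for 1332 bp.
print(3 * (443 + 1) == 1332)    # True
```

Passing the full nucleotide sequence to `translate` should reproduce the whole protein record if both assumptions hold.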