Monarch geneset OGS2.0

DPOGS206366
Transcript: DPOGS206366-TA (1143 bp)
Protein: DPOGS206366-PA (380 aa)
Genomic position: DPSCF300082, + strand, 1357289-1360053
RNAseq coverage: 2x (Rank: top 91%)
Annotation
Heliconius: HMEL017128 (E-value 3e-70, identity 35.10%)
Bombyx: BGIBMGA000402-TA (E-value 3e-59, identity 36.80%)
Drosophila: Mipp1-PB (E-value 4e-30, identity 25.79%)
EBI UniRef50: UniRef50_E0VT88 (E-value 8e-42, identity 30.43%) Multiple inositol polyphosphate phosphatase 1, putative n=1 Tax=Pediculus humanus corporis RepID=E0VT88_PEDHC
NCBI RefSeq: XP_002429332.1 (E-value 1e-42, identity 30.43%) multiple inositol polyphosphate phosphatase 1 precursor, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|242017714 (E-value 3e-41, identity 30.43%) multiple inositol polyphosphate phosphatase 1 precursor, putative [Pediculus humanus corporis]
NCBI nr blastx: gi|242017714 (E-value 6e-41, identity 30.19%) multiple inositol polyphosphate phosphatase 1 precursor, putative [Pediculus humanus corporis]
Group
Gene Ontology: GO:0003993 (E-value 8.7e-10) acid phosphatase activity
KEGG pathway: dme:Dmel_CG4317 (E-value 4e-17); K03103 (MINPP1) maps to Inositol phosphate metabolism
InterPro domain: IPR000560 (E-value 8.7e-10) Histidine phosphatase superfamily, clade-2, residues 221-338
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206366-TA
ATGTGTATCTGTATTAATCTGCCGTTAGTGTACAGCCAGTGTTACTGGAATAGAACACCGTATAGTTATTATGGTAGCAAGACACCATATGATGCAACAAGAGGAGATTTTAGAGACGTTCCTCCATTGAAAGGTTGTAAACTTGAAAGCATATGGTTCATGGCGCGGGATGGAGCTCGGCACCCTGATAAAGAAGATAAGAATGACATGAAGGACATTCTAGATCTGAAGGATGATATCTTGGACAATTATGAAGATGGAAGAGGTGACTTATGTGCACAGGATATTGCAGATTTGCGGGCATGGACCTGGAATGATAAATTGGATAGAGCTGTTTACCACCTAACTCCAGAAGGTTACAGGGAATTACTGGGTTTTGGTGAAAGGTTTTCAGTAATGTTTCAATCATTATTGGAAAACCTGGATGTATCCCTCATAAGATCGACTAAAGAGCAAAGAACCATTAAAAATGCAAGATCTTTTATCGAAGGTTTGAAAAATATCAAAAAACCTATTGTAGTTGACGATCCTATTTTGGACGACCCTGTAGCAAGACTCCAAGCTAATGTACAAAGACGAGTAGGTCTCGATTTTGAATTGAATCCTAAAAGTATTCTCAGTGTATACAATCTCTGTAGGAATTACCGATCATACTCTGTGCTGAAGAGAAGTCCTTGGTGTGCGCTTTTCACTGACGATGAGCTTATGATCTTGGAATATGTTGAGAATATCACACATTATTACAGGAATGGCTACGGGCATTCCACCAACATACTTCTTGGAGCTCTGGGTCTGAAAGATCTTTATCAGAAGTTTGAAGAGGCGTCAAAGGGGGGCTTGAAAACGTTGACGGGTTACTTTACTCATGAAACTATGCTGCATATGATTTACACGGCCATGGGACTTTATATGGATTACCCAGAAGTATCCGGTCTCGAGAGGGTTAAACATAGAAAGTGGCGGACGAGTTTCTTGACACCATTCGCTGCTAATTTCGTCGCCGTTCTACACAGAGTGCAGTTCTTAGTCAACGAGAAAGAGCTACATCTTTGCGGGGACCGCTCCTGTTCCCTTGAGGAATTCAGACATAAGTTCCAAAAATTCAACAACGCCTCGTACGACTTCTGCAATGAGGACTATTAA

Protein sequence:

>DPOGS206366-PA
MCICINLPLVYSQCYWNRTPYSYYGSKTPYDATRGDFRDVPPLKGCKLESIWFMARDGARHPDKEDKNDMKDILDLKDDILDNYEDGRGDLCAQDIADLRAWTWNDKLDRAVYHLTPEGYRELLGFGERFSVMFQSLLENLDVSLIRSTKEQRTIKNARSFIEGLKNIKKPIVVDDPILDDPVARLQANVQRRVGLDFELNPKSILSVYNLCRNYRSYSVLKRSPWCALFTDDELMILEYVENITHYYRNGYGHSTNILLGALGLKDLYQKFEEASKGGLKTLTGYFTHETMLHMIYTAMGLYMDYPEVSGLERVKHRKWRTSFLTPFAANFVAVLHRVQFLVNEKELHLCGDRSCSLEEFRHKFQKFNNASYDFCNEDY-
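
The two sequences in this record can be cross-checked against each other: a 1143 bp coding sequence holds 381 codons, which is 380 residues plus one stop codon, matching the 380 aa listed for the protein. The following is a minimal Python sketch of that check. It assumes the standard genetic code (NCBI translation table 1) and that the trailing "-" in the protein sequence marks the stop codon. The names translate and lengths_consistent are hypothetical helpers for illustration, not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    The stop codon is rendered as stop_symbol; "-" is assumed here to
    match the convention seen in the protein sequence of this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript length equals 3 * (residues + 1 stop codon)."""
    return transcript_bp == 3 * (protein_aa + 1)


# First six and last five codons of DPOGS206366-TA, copied from the record.
print(translate("ATGTGTATCTGTATTAAT"))
print(translate("AATGAGGACTATTAA"))
print(lengths_consistent(1143, 380))
```

Passing the full DPOGS206366-TA sequence to translate and comparing the result with DPOGS206366-PA would extend the same check to every residue.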