Monarch geneset OGS2.0

DPOGS216080
Transcript        DPOGS216080-TA  2547 bp
Protein           DPOGS216080-PA  848 aa
Genomic position  DPSCF300529 - 10032-19173
RNAseq coverage   13x (Rank: top 83%)
Annotation
Heliconius      HMEL017128              5e-103  44.85%
Bombyx          BGIBMGA000402-TA        2e-141  36.38%
Drosophila      Mipp1-PB                9e-44   32.05%
EBI UniRef50    UniRef50_UPI0002246144  6e-58   32.95%  UPI0002246144 related cluster n=2 Tax=unknown RepID=UPI0002246144
NCBI RefSeq     XP_393246.3             4e-82   27.39%  PREDICTED: similar to Multiple inositol polyphosphate phosphatase 1 CG4123-PA, isoform A [Apis mellifera]
NCBI nr blastp  gi|345478779            2e-57   32.95%  PREDICTED: multiple inositol polyphosphate phosphatase 1-like [Nasonia vitripennis]
NCBI nr blastx  gi|345478779            1e-56   32.95%  PREDICTED: multiple inositol polyphosphate phosphatase 1-like [Nasonia vitripennis]
Group
Gene Ontology    GO:0003993  6.8e-24  acid phosphatase activity
KEGG pathway     dme:Dmel_CG4317  7e-31
                 K03103 (MINPP1) maps-> Inositol phosphate metabolism
InterPro domain  [48-387] IPR000560  6.8e-24  Histidine phosphatase superfamily, clade-2
Orthology group  MCL35089  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216080-TA
ATGTCGAGTGTCGTGGTCAGTAAAGAGGAGTGTTACTGGAATAGACAGTGCAAGTATCAGTTATTTTCAACGACTACACCCTATGATATCATACGAGGGGACATTCGTGACCAGCCCAACCCTGACGGTTGTAAGGTCGTGAGTTTGTGGTCAATACATCGACACGGGAACCGTCATCCTGGAAGCAGGGTCGTAAAGGACACCAACGAGTTATGGGTCAAGTTGAGGGACCAAATTATAAGAAGTGAGGCAGAGTCAAGGAACTCACTTTGCTCACAGGACTTGGAAGATATATTAAATTGGAAATGGGATTCTTCGCTAGAAACTACACCATCTTACCTCACACAAGTGGGTAACGACGAAATATACTCGATCGGTAAAAGGGTAGCGAAAAAATACAATGAACTGATGCACGAAAGGATTGACCGATATTATTTCAGAGGAACCAACGAACAACGTACGAAAGCAAGTGTTCTAGCGTACGTCAATGGCCTAACTCATGGTTCAGACATGATCCTCACTTCACGGATAGAGGAATCTAGGGAACGAGATGACACTATTCGGCCTTACGAAAACTGTGATCGCTACCAGGAGTCAGTTAAGAACGGTTCGCTGTTGCCGGATCAGTTGGCTGAGTATGATCAAAGCTCCGAGTATTTAGCGGTCAGAGACCGAGTTTTCAAGCGACTAGGTATAACAAACGACACGGAAGAAATAAACGTATTCAATCTTTATGAGCTATGTCGGTTCTATCGGACCTGGAGTCCTAATCTTCAGTGTCCATGGTGCTCGCTCTTCTCCGACGAAGACCTGGTTGTGTTGGAGTACAGAGATGATGTACGGCATTATTACAAAAACGGATACGGGTTTGATATTAATGCAGATTTAGGTACACTCCCACTGAGGGATTTATTTGAGAATTTTGAGTTAGCGACGAGAGGGGAGGGTAAGAACATAGTTTCGTACTTTACCCACGACACTATGATGGAAATGATGTTCTGCGCTCTTGGGCTCTATAAGGACAAGAGCGTCATAAAAGGATCCTCAAGAAATCCAGACAGATTATGGCGGACAAGTTATATAGCATCGTTTTCTACAAATTTTATCGCCGTCCTTCACAGATGTGACTCCGATACTCATAGAGTCCAGCTGTTCATCAACGAGAAGCCCACCAGTCTTTGCCCTGTCGAAGGCTGCTCGTGGTCAGAGTTCGTCGAAACTTTCCAAAGGTTCTCCAACTCCTCTGACCGTAAAACATGTTTAGCTGACTCTGATGTGGATGAAGACAGCAATAATTTAAGTTGTAAGGTCGTGAGTTTGTGGTCAATACATCGACACGGGAACCGTCATCCTGGAAGCAGGGTCGTAAAGGACACCAACGAGTTATGGGTCAAGTTGAGGGACCAAATTATAAGAAGTGAGGCAGAGTCAAGGAACTCACTTTGCTCACAGGACTTGGAAGATATATTAAATTGGAAATGGGATTCTTCGCTAGAAACTACACCATCTTACCTCACACAAGTGGGTAACGACGAAATATACTCGATCGGTAAAAGGGTAGCGAAAAAATACAATGAACTGATGCACGAAAGGATTGACCGATATTATTTCAGAGGAACCAACGAACAACGTACGAAAGCAAGTGTTCTAGCGTACGTCAATGGCCTAACTCATGGTTCAGACATGATCCTCACTTCACGGATAGAGGAATCTAGGGAACGAGATGACACTATTCGGCCTTACGAAAACTGTGATCGCTACCAGGAGTCAGTTAAGAACGGTTCGCTGTTGCCGGATCAGTTGGCTGAGTATGATCAAAGCTCCGAGTATTTAGCGGTCAGAGACCGAGTTTTCAAGCGACTAGGTATAACAAACGACACGGAAGAAATAAACGTATTCAATCTTTATGAGCTATGTCGGTTCTATCGGACCTGGAGTCCTAATCTTCAGTGTCCATGGTGCTCGCTCTTCTCCGACGAAGACCTGGTTGTGTTGGAGTA
CAGAGATGATGTACGGCATTATTACAAAAACGGATACGGGTTTGATATTAATGCAGATTTAGGTACACTCCCACTGAGGGATTTATTTGAGAATTTTGAGTTAGCGACGAGAGGGGAGGGTAAGAACATAGTTTCGTACTTTACCCACGACACTATGATGGAAATGATGTTCTGCGCTCTTGGGCTCTATAAGGACAAGAGCGTCATAAAAGGATCCTCAAGAAATCCAGACAGATTATGGCGGACAAGTTATATAGCATCGTTTTCTACAAATTTTATCGCCGTCCTTCACAGATGTGACTCCGATACTCATAGAGTCCAGCTGTTCATCAACGAGAAGCCCACCAGTCTTTGCCCTGTCGAAGGCTGCTCGTGGTCAGAGTTCGTCGAAACTTTCCAAAGGTTCTCCAACTCCTCACTGGCATTTTGTACAAATCGACGCTCTGTTGTGGATGAAGACAGCAATAATTTAAGTAATATAATTACCGTCTCGAAATTTTTGACCTCACTTTTAATGTTGCTTCCATTGGTGCTTTCTGCTAATTAA

Protein sequence:

>DPOGS216080-PA
MSSVVVSKEECYWNRQCKYQLFSTTTPYDIIRGDIRDQPNPDGCKVVSLWSIHRHGNRHPGSRVVKDTNELWVKLRDQIIRSEAESRNSLCSQDLEDILNWKWDSSLETTPSYLTQVGNDEIYSIGKRVAKKYNELMHERIDRYYFRGTNEQRTKASVLAYVNGLTHGSDMILTSRIEESRERDDTIRPYENCDRYQESVKNGSLLPDQLAEYDQSSEYLAVRDRVFKRLGITNDTEEINVFNLYELCRFYRTWSPNLQCPWCSLFSDEDLVVLEYRDDVRHYYKNGYGFDINADLGTLPLRDLFENFELATRGEGKNIVSYFTHDTMMEMMFCALGLYKDKSVIKGSSRNPDRLWRTSYIASFSTNFIAVLHRCDSDTHRVQLFINEKPTSLCPVEGCSWSEFVETFQRFSNSSDRKTCLADSDVDEDSNNLSCKVVSLWSIHRHGNRHPGSRVVKDTNELWVKLRDQIIRSEAESRNSLCSQDLEDILNWKWDSSLETTPSYLTQVGNDEIYSIGKRVAKKYNELMHERIDRYYFRGTNEQRTKASVLAYVNGLTHGSDMILTSRIEESRERDDTIRPYENCDRYQESVKNGSLLPDQLAEYDQSSEYLAVRDRVFKRLGITNDTEEINVFNLYELCRFYRTWSPNLQCPWCSLFSDEDLVVLEYRDDVRHYYKNGYGFDINADLGTLPLRDLFENFELATRGEGKNIVSYFTHDTMMEMMFCALGLYKDKSVIKGSSRNPDRLWRTSYIASFSTNFIAVLHRCDSDTHRVQLFINEKPTSLCPVEGCSWSEFVETFQRFSNSSLAFCTNRRSVVDEDSNNLSNIITVSKFLTSLLMLLPLVLSAN-
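
As a quick consistency check on the record above: the listed lengths (2547 bp transcript, 848 aa protein) agree if the transcript is a pure coding sequence ending in one stop codon, since 3 x (848 + 1) = 2547. The Python sketch below illustrates that arithmetic, along with a small FASTA reader that joins sequences wrapped over several lines. The helper names `read_fasta` and `expected_cds_length` are hypothetical and not part of any geneset tooling, and reading the trailing `-` in the protein record as the stop-codon marker is an assumption.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def expected_cds_length(protein_aa, has_stop=True):
    """Nucleotides needed to encode protein_aa residues, plus an optional stop codon."""
    return 3 * (protein_aa + (1 if has_stop else 0))


# Toy record (not the real sequence), wrapped over two lines.
toy = read_fasta(">toy-TA\nATGTCG\nAGTTAA\n")
print(len(toy["toy-TA"]))        # 12
print(expected_cds_length(3))    # 12: three codons plus one stop codon
print(expected_cds_length(848))  # 2547, the transcript length listed for DPOGS216080-TA
```

To apply the same check to the real record, one could pass the nucleotide FASTA text to `read_fasta` and compare the length of the joined sequence with `expected_cds_length(848)`.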