Monarch geneset OGS2.0

DPOGS215854
TranscriptDPOGS215854-TA2367 bp
ProteinDPOGS215854-PA788 aa
Genomic positionDPSCF300029 - 1228395-1235357
RNAseq coverage33x (Rank: top 75%)
Annotation
HeliconiusHMEL0118105e-14656.81% 
BombyxBGIBMGA000402-TA8e-16743.30% 
DrosophilaMipp1-PB3e-3126.43% 
EBI UniRef50UniRef50_D6WUK95e-5431.15%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WUK9_TRICA
NCBI RefSeqXP_393246.34e-7728.78%PREDICTED: similar to Multiple inositol polyphosphate phosphatase 1 CG4123-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|910879612e-5331.15%PREDICTED: similar to multiple inositol polyphosphate phosphatase [Tribolium castaneum]
NCBI nr blastxgi|910879611e-5530.99%PREDICTED: similar to multiple inositol polyphosphate phosphatase [Tribolium castaneum]
Group
Gene OntologyGO:00039932.7e-24acid phosphatase activity
KEGG pathwaymdo:1000224881e-26 
 K03103 (MINPP1)maps-> Inositol phosphate metabolism
InterPro domain[63-398] IPR0005602.7e-24Histidine phosphatase superfamily, clade-2
Orthology groupMCL20750 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215854-TA
ATGTCGTCTAATTCACACTATAATTTCATCGTTCTCTGTTCTTTTATAATATCAGTGTTTGGAAAAAATTGCTTTTGGAATTCTCAGTGTGCCTTTAACCATTTCTCAAGCAAAACCCCTTATAATTGTATCAGAGGAGACCTGAGAGATTCATTCGTGAAGGTTAAAGGCTGCGAACCAATAAGCGTTTGGGGACTTATAAGACATGGTAAAAGAACTCCAGGTACTGAATTTGTATATCAAATAAGAGAAGCGGTGAAATTTAAGGATTACATAGTTGATAGTTTTAATAAGGGTAATAGCTTTTTGTGTGCACAAGACGTCGAAAATTTAAAGAACTGGTTCGATGAGAGGAAAGTATTTGATAGCGTGCAGTCTTTAACTGAAGAGGGCTATCAAGAAATTTTTGGTATTGGTAAAAGATTAAGAGCGACATTTAAAGAACTATTAAGAGATCTCGGTAATAGTAGTTATAGAGTAAGGTCGGCTTACGGTCCGTGGGTCGAAAATGGAGCAGAGGCCTTTGTTAAAGGTTTCTCTGATATTCCAATAAACATAGAACCGGCGAATCCCAACGATAATATTATAGCTCCGTATGAATCGTGTCCCAAATTCCTCGATGAAGTTAGAGATAATCCAGATACGTATTATGAAGCTTCTAAATATAAAGAATCTGAAGAATTCCAGGCGTCAAAGGCGCAAATTCAGAAAAGAATGGCTATTGAATATAACCTCACAAACAAAAACATTACCGCCCTATATGACCTTTGTCGATACAGCTGGTCTGGTATAGACAACAAGCGAAGTCCCTGGTGTGCCCTGTTTACGTTAGAAGACCTCATAGTAAATGAGTACCATGGAGATTTGCGGCATTACTACAGAAATGGTCCTGGAAATCGTTACAGTGAAATATTCGGTCGCTTGCCCTTATCTAATCTCTATGAAACATTTGAAAATGTGAAATTGGGTGAAAAAATGAAAATGACTATTTATTTCTCTCATGCTACTATGATGGATATGGTTTATAGTGCTCTAGGATGGTTCACGGACAAGGAACCTTTAACTCACGCATATAGAAATCCGAAAAGAAAATGGAAGTCTACCAAATTGGGTGCCTTTGCGGCAAACTTAATAGTAGTTTTACATAGATGCTTAGAGGATGACAATGAAGAGTATAAAGTAACGTACTATATTAACGAAGAATTTGTGACATCAGTATGCTCTGACGGCATTTGTTCCTGGCAACAATTTGAAAACACATTAAAGCCTTTCCTCAACACAACATTAGATTTTTGCTATTTATTCACTTTTAAAATGAAATTATTTTTAATAATATGCATTTGTTGTTTCTTTAAATTAACGGCATCGAATTTTTGTTACTGGAATACGGGTTGCCCTTATAAATACTTATCAACGGAAACCCCATACAATTCGGTTAGAGGAGATATAAGAGATTCTATTGTAAGGTTACGAGGTTGTGAACCAGTAAGTATATGGGGAATATATCGCCATGGAAAAAGAGAACCGGGCGCTAAATTCGCAGAAAGCATGAAGCAGGCTTTACCTATAAGGAATTATATCACAACAAGTTACAAAAAAGGACGTAGTTCTCTTTGTGCTCAGGATGTTGAAAATTTACAGAACTGGCAATTAAATCAAAACACTTTAAACGGTAAAAGTGATCTGACAGAAGAAGGCCGCCAAGAAATGCTTGGGCTTAGCAAGCGCCTTAAAGAAGTGTTTCCAGATTTATTGAGTGAACTTCGCAATGGAGATTATTCTTTTAGATCAGCTTCAGGTTCTTGGATAGAAAAAAGTATTCAACATTTCGTAAAGGGCTTAGGAGACGACTTAACAATAGAAAAAGTAAAAGCAGGTGCAGATGTGATGGCTCCATATGCAACATGTGGTTCCTACCAGAAAGATGTGCAGCGAAATCCTAATATTTATGTTGAAGCTGCGAAGTATATGCAGAATTCAGAATATTTAGCAACCAAAGACAGAATACAGAGACGTACTGGCATTGATTATATGCTCACCGATGATAATATAACAGCTTTATACGATTTGTGTCGATACACGTGGTCTGCAGTTGACAATAAATTCAGTCCCTGGTGTGCTGTTTTTACCAAAGATGATTTAGAAGTCCTCGAATATATTCAAGATCTGAAACATTACTATAGGAATGGTTATGGTACATGTACAGACAATTCAGAAGACTATAACGTGGTGTTCTACTTAAACGAAGAGCCAATGAGATCGATATGCGAGGAGGGTGTTTGTTCCTGGCGAGAATTCGAGAATAAATTGCGGCCATTTATTAACACAACTATCGACTTTTGCGAGTTTCGAAGTGAGCCTTATTAG

Protein sequence:

>DPOGS215854-PA
MSSNSHYNFIVLCSFIISVFGKNCFWNSQCAFNHFSSKTPYNCIRGDLRDSFVKVKGCEPISVWGLIRHGKRTPGTEFVYQIREAVKFKDYIVDSFNKGNSFLCAQDVENLKNWFDERKVFDSVQSLTEEGYQEIFGIGKRLRATFKELLRDLGNSSYRVRSAYGPWVENGAEAFVKGFSDIPINIEPANPNDNIIAPYESCPKFLDEVRDNPDTYYEASKYKESEEFQASKAQIQKRMAIEYNLTNKNITALYDLCRYSWSGIDNKRSPWCALFTLEDLIVNEYHGDLRHYYRNGPGNRYSEIFGRLPLSNLYETFENVKLGEKMKMTIYFSHATMMDMVYSALGWFTDKEPLTHAYRNPKRKWKSTKLGAFAANLIVVLHRCLEDDNEEYKVTYYINEEFVTSVCSDGICSWQQFENTLKPFLNTTLDFCYLFTFKMKLFLIICICCFFKLTASNFCYWNTGCPYKYLSTETPYNSVRGDIRDSIVRLRGCEPVSIWGIYRHGKREPGAKFAESMKQALPIRNYITTSYKKGRSSLCAQDVENLQNWQLNQNTLNGKSDLTEEGRQEMLGLSKRLKEVFPDLLSELRNGDYSFRSASGSWIEKSIQHFVKGLGDDLTIEKVKAGADVMAPYATCGSYQKDVQRNPNIYVEAAKYMQNSEYLATKDRIQRRTGIDYMLTDDNITALYDLCRYTWSAVDNKFSPWCAVFTKDDLEVLEYIQDLKHYYRNGYGTCTDNSEDYNVVFYLNEEPMRSICEEGVCSWREFENKLRPFINTTIDFCEFRSEPY-