Monarch geneset OGS2.0

DPOGS210868
TranscriptDPOGS210868-TA1599 bp
ProteinDPOGS210868-PA532 aa
Genomic positionDPSCF300027 + 1097266-1110078
RNAseq coverage633x (Rank: top 20%)
Annotation
HeliconiusHMEL0141783e-17560.68% 
BombyxBGIBMGA006993-TA3e-14161.64% 
DrosophilaMipp1-PB4e-5234.05% 
EBI UniRef50UniRef50_D6WUK91e-7934.20%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WUK9_TRICA
NCBI RefSeqXP_972932.12e-8034.20%PREDICTED: similar to multiple inositol polyphosphate phosphatase [Tribolium castaneum]
NCBI nr blastpgi|910879614e-7934.20%PREDICTED: similar to multiple inositol polyphosphate phosphatase [Tribolium castaneum]
NCBI nr blastxgi|910879619e-7732.27%PREDICTED: similar to multiple inositol polyphosphate phosphatase [Tribolium castaneum]
Group
Gene OntologyGO:00039932.9e-16acid phosphatase activity
KEGG pathwaydme:Dmel_CG43172e-31 
 K03103 (MINPP1)maps-> Inositol phosphate metabolism
InterPro domain[183-457] IPR0005602.9e-16Histidine phosphatase superfamily, clade-2
Orthology groupMCL11033 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210868-TA
ATGCGCCTCATCCTCGGCGTGTTGACGCTCACAAGCCTGGCCTATTGCCAGGAAACTTGCCAGACCGCTGATGAGGACCCCTATCTTCTTTTCGGCACCAAAACAGCTTACACCTTCGCTAACAAAGGCATTCCGGTCAACAGAGCTCATGATATACCCGGTTGTCAACCAATCGCAATTTGGTTGCTGAACCGCCACGGCTCCCACAATCCCGAGGCAGACGAAATACCAGATCTCCAGAAGTTAACAGATCTTAAAAATAATATCATCGCGAACTACAAGAATGGCAACTTTAGGAACACTAATATCCGTATGTGCACATCGGACGTCAATCTCCTAGAGCGATGGGAATGGAATTCTCGTCAGAACGTGACATTCGCTGGAGAACTCACCAGCGACGGATATATATCCACTCAGGAGCTGGCACAAGCTTGGAAACAACGGTTCCCTGGACTACTGACAGATAATAGACACGATTATTTGATCCGTATGTGCACATCGGACGTCAATCTCCTAGAGCGATGGGAATGGAATTCTCGTCAGAACGCGACATTCGCTGGAGAACTTACCAGCGATGGATATATATCCACTCAGGAGCTGGCACAAGCTTGGAAACAACGGTTCCCTGGACTACTGACAGATAATAGACATGATTATTTGTTCAAATTTGTGAACGACCAGCGGTCGGAAACAACGTTCCGCGCTTTCACCGAAGGTCTGTTCAGGTCTCAGGCAGACAATTACGATATACCGAAGGAAAGCGATGAGAAGTTACTGAGGCCTTATAAATTCTGCCCATCATGGACCAAACAAGTCGAAGAGAATAACGACACTTTGTCACAGTTACGAACGTTCGAGTCAAAACAAGAATTTAAAGAGATGATAACCAACATATCCCTTCGAATGGGCTTCAACTATGACGTCCAGCGTGAGGTAGTCCAGCGAGCGTACGACATGTGCCGATATAACAAGGCCTGGAATGTGGCACAAATATCTCCCTGGTGTGCTGTTTTTTCCAAAGACGATCTGAAGCGTCTAGAGTACGCAGAAGACTTGGAGACCTATTACAAATACGGCTACGGTTCATACATGAATCAACAGATAGGATGTACCGGCGTCAAGGATATGATGGACTTCTTTAAAATACACGTTGAACATGAAACTCCGCAACAGCCGCGCGCGACCGTTCACTTCACTGAGGCGGCCATGTTGTTTCTGTCGTTGACGTCTTTTGGCGCGAGACGTGACGCCGCGCCGCTCACAGGCGACAACTATCACACGCCGACAGCCACCGCTCGCCACTGGACATCCTCTAGCATTTCACCGTACAATGCGAATCTTGCTGCTATACTGTACAAATGCACACCAAATAGCAATTTTCAAATTAACGACAAATATCAGGTGCTATTCTTGGAGAACGAGAGACCTTTATACCTCGAGGGATGTCGAGTTGGTCTGTGCGAATGGAACCTTGTCAAGAATCGTTTCGGTTTGATCGCTGACAATTGCAATTTAAATTTCTGCAACTCAGCCACTAAAGCCAACAGCATCGGCTTAAGTTTAGCTGTATTTGTGTTCATAACCAAATATATATTCTAG

Protein sequence:

>DPOGS210868-PA
MRLILGVLTLTSLAYCQETCQTADEDPYLLFGTKTAYTFANKGIPVNRAHDIPGCQPIAIWLLNRHGSHNPEADEIPDLQKLTDLKNNIIANYKNGNFRNTNIRMCTSDVNLLERWEWNSRQNVTFAGELTSDGYISTQELAQAWKQRFPGLLTDNRHDYLIRMCTSDVNLLERWEWNSRQNATFAGELTSDGYISTQELAQAWKQRFPGLLTDNRHDYLFKFVNDQRSETTFRAFTEGLFRSQADNYDIPKESDEKLLRPYKFCPSWTKQVEENNDTLSQLRTFESKQEFKEMITNISLRMGFNYDVQREVVQRAYDMCRYNKAWNVAQISPWCAVFSKDDLKRLEYAEDLETYYKYGYGSYMNQQIGCTGVKDMMDFFKIHVEHETPQQPRATVHFTEAAMLFLSLTSFGARRDAAPLTGDNYHTPTATARHWTSSSISPYNANLAAILYKCTPNSNFQINDKYQVLFLENERPLYLEGCRVGLCEWNLVKNRFGLIADNCNLNFCNSATKANSIGLSLAVFVFITKYIF-