Monarch geneset OGS2.0

DPOGS204663
TranscriptDPOGS204663-TA2439 bp
ProteinDPOGS204663-PA812 aa
Genomic positionDPSCF300170 - 415583-425405
RNAseq coverage55x (Rank: top 69%)
Annotation
HeliconiusHMEL0082540.079.84% 
BombyxBGIBMGA007465-TA0.071.27% 
DrosophilaCG15385-PA2e-6830.71% 
EBI UniRef50UniRef50_Q7Q0T73e-8233.89%AGAP010170-PA n=3 Tax=Culicidae RepID=Q7Q0T7_ANOGA
NCBI RefSeqXP_319343.46e-8333.89%AGAP010170-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582992171e-8133.89%AGAP010170-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3800205343e-4339.92%PREDICTED: acid phosphatase-like protein 2-like [Apis florea]
Group
Gene OntologyGO:00039933.3e-12acid phosphatase activity
KEGG pathway 
InterPro domain[528-740] IPR0005603.3e-12Histidine phosphatase superfamily, clade-2
Orthology groupMCL15071 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204663-TA
ATGATGAAGTTTTCGTTTCAGCACAGGGCATTTTACTGCTATCTTGGTATGAGTATTTGGATATGTTTTCTTATAACTGTAGTTTACAAATACATGTCTGTAGCTGAAGACACGGTCGCGTTGAAAGTTACTCATAACCAGTATGTGTCCAAGACTGATTCAAAATATAGGAAGCTCTTCATGCGGGCCTGCAATCCCCCTGATAGTATAGTCAGGGGATCTGAAGCTGCAGTAGACTCAGACAACTGGCTGTTACAAGGCATCTTAGTTATCACGAGGCACGGAGATCGCGGACCATTGACACATCTGAAGGGCGGGGATAAGCTGCCCTGCGATGTGGTTCCCGTCTCGCCGTTGCTTAAAAGCTATGAGGAGTTCGTTTTGAACGCCTCATCATCAGGTCGCGCCTGGTGGGTGTCAAGCGCTGGGCCGTTTCACAACTTCCCTTCCTTGCCACGAGCCGCGGCCACGCACTGCGCCCTCGGGCAGCTCACACCCACCGGACTGCTTCAGATGATCACCGTCGGCAACATCATTCGTGAGGCGTACAGTGAGAAATTGGGTCCAGAATATTTAGATTTAACGGGGAAACACGAGAGAATAGTTTACAAATACATGTCTGTAGCTGAAGACACGGTCGCGTTGAAAGTTACTCATAACCAGTATGTGTCCAAGACTGATTCAAAATATAGGAAGCTCTTCATGCGGGCCTGCAATCCCCCTGATAGTATAGTCAGGGGATCTGAAGCTGCAGTAGACTCAGACAACTGGCTGCTACAAGGCATCTTAGTTATCACGAGGCACGGAGATCGCGGACCATTGACACATCTGAAGGGCGGGGATAAGCTGCCCTGCGATGTGGTTCCCGTCTCGCCGTTGCTTAAAAGCTATGAGGAGTTCGTTTTGAACGCCTCATCATCAGGTCGCGCGTGGTGGGTGTCAAGCGCTGGGCCGTTTCACAACTTCCCTTCCTTGCCACGAGCCGCGGCCACGCACTGCGCCCTCGGGCAGCTCACACCCACCGGCCTGCTTCAGATGATCACCGTCGGCAACATCATTCGTGAGGCGTACAGTGAGAAATTGGGTCCAGAATATTTAGATTTAACAGGGAAACACGAGAGAAGTGGCGTAGCGTATAGCACGCGGTACCGGCGTACGTTCCAGTCCTTGCAGGCGGTCTCTTGGGGCGTGGGCCGGGGCGCCGCCGCCGCCAGGGAAGCGCACAGTGTGGCCTTCTGTTACAGACACTGTGCCTGTCACGCACACCACCTCCTTGACAAAAAAATAAGCACTGAGGCAAAGAGACGTTTGGAATCCCATCCAGCAATGAAGGAACTGATCAAGAAATTATCGAGAGTATTGTTCGAATCACAGGAATACACGGATGCGGATGTGGTCAGGGACGCGCTGCTGGCTTACATGTGCCACGAAGCGCCGCTGCCGTGCTCGGAGAGAACGAAGAGAAATAAGAAAAAACTATCGCTCAATAAAGGGAAAAGAAAGTATAGATCGGAGACAATCCCGCAAAGAAATCTCTTAGACATAGACATAGACGCGTTGAACTTAGAGCTAGACTATATCAATAACCGACTGGACTTCAACAACGAAATAGGAAGAAAGGCCAGAGACATCATAGGGAAGTATGACAAAAAGACGCCCTTGGATTTCGACGCCCAGATGGAGAGGGAGAAGCTTTTATACTACCAGCAAAGGTACCTGGACAACGCCGAATCCTACGACGACGTTGTTGTCGTCAAGAAAAATCTCGACGCGGATTTTAACTTCCCAAACGAAGCCCGTGAAGATTTCGAAGAAGATTACAAGGAACCGACGCCGGATGCTGAGGATTTTTGCATAAAAAAGGAGCACATTATATCCGTTTTCGCCTATCTGGAGTGGAGCTACCGTCAGGACGTCAAGAACACGCACAACAGAAGACGCGGGCTCTTGATAGCTTACGGCCTCATACACAACGTCGTACAAAACATGATAAGAATTATATCTGAGAACAAACCCAAATTCGTCCTATACTCCGGCCACGACAAGACCTTGCAGGCGTTAGTATTGGCGCTGGGACTCAAGAGCTACCAGCATTACAACATACAGTACGCATCGAGAGTCATCTTCGAGGTCTACAGGAAGAAGGATTTACGCGACGAATTCAAATTCATGAAACGGAAAGCGGTCGCTCAGGACTTCTACTTCCGGGTGGTTTACAACGGGGAGGACGTGACGGATAAGCTCAGCTTTTGCGCGGACACGCAGCTCGTGACTATGAAGGTGGTGGACCCGATCGACGACGTCAAAGCCTACAACACACACCTCTGCCCGATAGAAAATATCGTCAGATTCATTCACGACGATTACTTCTCAAGTTTCAACGTGAGTAATTACAAAGACGCCTGCGCCACCTACGGCGGCTCTAAGACTGTTTATTGA

Protein sequence:

>DPOGS204663-PA
MMKFSFQHRAFYCYLGMSIWICFLITVVYKYMSVAEDTVALKVTHNQYVSKTDSKYRKLFMRACNPPDSIVRGSEAAVDSDNWLLQGILVITRHGDRGPLTHLKGGDKLPCDVVPVSPLLKSYEEFVLNASSSGRAWWVSSAGPFHNFPSLPRAAATHCALGQLTPTGLLQMITVGNIIREAYSEKLGPEYLDLTGKHERIVYKYMSVAEDTVALKVTHNQYVSKTDSKYRKLFMRACNPPDSIVRGSEAAVDSDNWLLQGILVITRHGDRGPLTHLKGGDKLPCDVVPVSPLLKSYEEFVLNASSSGRAWWVSSAGPFHNFPSLPRAAATHCALGQLTPTGLLQMITVGNIIREAYSEKLGPEYLDLTGKHERSGVAYSTRYRRTFQSLQAVSWGVGRGAAAAREAHSVAFCYRHCACHAHHLLDKKISTEAKRRLESHPAMKELIKKLSRVLFESQEYTDADVVRDALLAYMCHEAPLPCSERTKRNKKKLSLNKGKRKYRSETIPQRNLLDIDIDALNLELDYINNRLDFNNEIGRKARDIIGKYDKKTPLDFDAQMEREKLLYYQQRYLDNAESYDDVVVVKKNLDADFNFPNEAREDFEEDYKEPTPDAEDFCIKKEHIISVFAYLEWSYRQDVKNTHNRRRGLLIAYGLIHNVVQNMIRIISENKPKFVLYSGHDKTLQALVLALGLKSYQHYNIQYASRVIFEVYRKKDLRDEFKFMKRKAVAQDFYFRVVYNGEDVTDKLSFCADTQLVTMKVVDPIDDVKAYNTHLCPIENIVRFIHDDYFSSFNVSNYKDACATYGGSKTVY-