Monarch geneset OGS2.0

DPOGS214128
TranscriptDPOGS214128-TA1413 bp
ProteinDPOGS214128-PA470 aa
Genomic positionDPSCF300014 - 1493188-1498157
RNAseq coverage129x (Rank: top 56%)
Annotation
HeliconiusHMEL0113733e-17976.28% 
BombyxBGIBMGA006176-TA3e-5432.71% 
DrosophilaCG9449-PF1e-3426.88% 
EBI UniRef50UniRef50_UPI00015B57702e-4731.09%UPI00015B5770 related cluster n=1 Tax=unknown RepID=UPI00015B5770
NCBI RefSeqXP_001605452.13e-4831.09%PREDICTED: similar to venom acid phosphatase [Nasonia vitripennis]
NCBI nr blastpgi|1565500757e-4731.09%PREDICTED: venom acid phosphatase Acph-1-like isoform 1 [Nasonia vitripennis]
NCBI nr blastxgi|1565500752e-4731.09%PREDICTED: venom acid phosphatase Acph-1-like isoform 1 [Nasonia vitripennis]
Group
Gene OntologyGO:00039935.6e-17acid phosphatase activity
KEGG pathwayame:1001922031e-36 
 K01078 (E3.1.3.2)maps-> Riboflavin metabolism
    gamma-Hexachlorocyclohexane degradation
InterPro domain[115-404] IPR0005605.6e-17Histidine phosphatase superfamily, clade-2
Orthology groupMCL35010 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214128-TA
ATGTCTGTTCGCGTGTGTGTAAATATAATAGTACAATCGATTGGACGAAGTCAATCATCTGGTAGGAAAGAGGGCGGTAGCGAGGAGGTGCCAACCAACGCGGCTGATGAGCCCAGCATGAAGACCGAACATACTTGCTGTTGTCTAGGCGTGTTGGTGGGCGTCTCTCTCATCGCCGGTTTGAAAGAGGGCGGTAGCGAGGAGGTGCCAACCAACGCGGCTGATGAGCCCAGCATGAAGACCGAACATACTTGCTGTTGTCTAGGCGTGTTGGTGGGCGTCTCTCTCATCGCCGCCCTCGTAGCAGTCATACTAATGAAAGACACTACAGTTGAAGTTACAGTACTACGACAAGTTCACGTGTTAATGTCGCACGGGGAGCGCACGCCCAGCGAACGTGAATTGGAGATGCTGGGGGCGCCTCCATCGGAACACGTTTTTGTACCTTACGGTGCAGGAGCATTGACTAATGAAGGGAAGCTCTTGACGTACGAGATGGGCGCATTACTTAGAAAAAGATATAACGATTTCTTAGGACCGTATTACGAAGCTGAAAAAAGCATTGTGATAGCATCAGATACGAATTTGAGCAAGATGACGGCGTTGCTGATAGCGGCTGGTTTATGGCCGCCGATTTTAAATCAAATGTGGAACGATTCCATAAGTTGGCAGCCCGTTCCTTACACCTATCCACCTAGAAGTGAAGACTACCTTTTATACGAAGAAAACTGCCCGCGTTACAATCAAGAGAAACAAAGACTATTGAAAGTGTACATCAATGAAGGACTGCTTGTACCGTATCGGGATTTCTTTCACAAAATTGCACACATGACAAACACTAACTTCAGCACACCACAGGACGCGTACAATTTAAACAACTTATTCGTGATACAGGACGATATTAAAGTTGCGAATCCAAAATGGGCCAAGCATGTGAAGAGAAAATTGATGGACGTCGCTCGATTGGAATACTCAATGATGTTTCACAATAACCTTCTAAGAAAACTCAGCGGAGGTGCTCTACTGCAGCAAATAATAACGGAAGCTATATGGAACACGAGAGACGTCGATAGTCCAAAGGTGTTGGTTCGTATCGGCACACCCGTGTCGGTGGCGGCGCTACTGTCGGCGTGTGTGGCGCCCCCGCCGCGTCTGCCGGACCCGGGGGCAGCGTTGCTGTTTGAGTTACACGAAAAACAACCTTCCGCAAAAGGGAAGAAGGATAAAAATGACTTGACGCACGGTCAGCGATTTGGATTTAAGATATACTACTGGGATGACGATTCAGCCGAGCCTCGACTAATGGAAGTTCCCGGATGTAACGCATTCTGCCCTTTGGATACTTTCTCTAAGATAACAAAAAGCATTGTGTCCCACGACTATAAGAAAGACTGTGAGCTTACTAGTGCATAG

Protein sequence:

>DPOGS214128-PA
MSVRVCVNIIVQSIGRSQSSGRKEGGSEEVPTNAADEPSMKTEHTCCCLGVLVGVSLIAGLKEGGSEEVPTNAADEPSMKTEHTCCCLGVLVGVSLIAALVAVILMKDTTVEVTVLRQVHVLMSHGERTPSERELEMLGAPPSEHVFVPYGAGALTNEGKLLTYEMGALLRKRYNDFLGPYYEAEKSIVIASDTNLSKMTALLIAAGLWPPILNQMWNDSISWQPVPYTYPPRSEDYLLYEENCPRYNQEKQRLLKVYINEGLLVPYRDFFHKIAHMTNTNFSTPQDAYNLNNLFVIQDDIKVANPKWAKHVKRKLMDVARLEYSMMFHNNLLRKLSGGALLQQIITEAIWNTRDVDSPKVLVRIGTPVSVAALLSACVAPPPRLPDPGAALLFELHEKQPSAKGKKDKNDLTHGQRFGFKIYYWDDDSAEPRLMEVPGCNAFCPLDTFSKITKSIVSHDYKKDCELTSA-