Monarch geneset OGS2.0

DPOGS204644
Transcript        DPOGS204644-TA  879 bp
Protein           DPOGS204644-PA  292 aa
Genomic position  DPSCF300462 - 48479-60775
RNAseq coverage   225x (Rank: top 44%)
Annotation
Heliconius      HMEL021481        8e-102  85.02%
Bombyx          BGIBMGA001972-TA  3e-82   74.44%
Drosophila      CG11438-PA        2e-26   35.39%
EBI UniRef50    UniRef50_E2AJW7   5e-52   37.25%  Lipid phosphate phosphohydrolase 1 n=6 Tax=Formicidae RepID=E2AJW7_CAMFO
NCBI RefSeq     XP_001605796.1    5e-49   35.88%  PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastp  gi|307176977      2e-51   37.25%  Lipid phosphate phosphohydrolase 1 [Camponotus floridanus]
NCBI nr blastx  gi|307176977      4e-52   41.18%  Lipid phosphate phosphohydrolase 1 [Camponotus floridanus]
Group
Gene Ontology   GO:0016020  2.9e-24  membrane
                GO:0003824  2.9e-24  catalytic activity
KEGG pathway    api:100164616  1e-28
                K01080 (E3.1.3.4, PPAP2) maps-> Glycerolipid metabolism
                                                Glycerophospholipid metabolism
                                                Fc gamma R-mediated phagocytosis
                                                Sphingolipid metabolism
                                                Ether lipid metabolism
InterPro domain  [104-250]  IPR000326  2.9e-24  Phosphatidic acid phosphatase type 2/haloperoxidase
                 [109-252]  IPR016118  6.6e-16  Phosphatidic acid phosphatase/chloroperoxidase, N-terminal
Orthology group  MCL16389  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204644-TA
ATGCAGGCGCGTCACAGTCTCGCCTGGCACCTACTTATAGACCTGCCGATACTGCTGCTAGTGGCAGCAGTATGTATACTCCTGGAGGTCGGAGCTATTCCCAGCAGACGAAGCGGCTTCAGATGCAATGACCCCGACCTCTCATTCCCTCACACGGGAGACACTCTCTCCATATCTCTGATAGCAGCCATTACAGTCATTGTGCCATACCTCATTATATGGGCGGTAGAGTCGACACTACAGTTGGACGACGAGTACACCATCAAGCAAAACAAGCTGCTGACAAGCGCTAAAACCGCTGGATTCATCTACCGCGATTATATATACGCTGCTATTGTAAATCTCACTGTCTTGGAAGTGGTGAAATGTGTCGTCGGATCACCCAGACCGACCTTCTTTGATTTATGCAAGCCCGATACAGCAAGAACGTGTAATGGGTCGGATTACGTGAGCAGTTACACGTGCACCTCCACGCGCTTCTCTCGCTACCTCCAGATTGATTCCAGCCGGAGCTTCCCCTCGGCGCACACGTCGCTCTCAGTCTACTGCGGTCTCTTCTTGGCTTGGTATCTCCAAAGGCGAGCCTTCAGCTGGCATAGTCGGTCTGTTCTGGTGGTGCCGCTGCTGCAGATCCTGTGCATCATCTACGCTGTGACTTGCCCGCTCACCAGAATCACAGACCACAGACATCACTGGTGGGATGTCCTCGCTGGTGCAGTCATGGGGGTCGCTACTGTCATTTATACAGTATTAGTCCTATGCAAGAACTTCTCCCACCCAGAAGTGGTGAGCACCAGCGACATCTCCAGCAGCGATGGTAATAACCACCAGTCTGTGCGAAGACTGCTAGCCAGCGGACCGCGCGGGGTGGTGCCCTAG

Protein sequence:

>DPOGS204644-PA
MQARHSLAWHLLIDLPILLLVAAVCILLEVGAIPSRRSGFRCNDPDLSFPHTGDTLSISLIAAITVIVPYLIIWAVESTLQLDDEYTIKQNKLLTSAKTAGFIYRDYIYAAIVNLTVLEVVKCVVGSPRPTFFDLCKPDTARTCNGSDYVSSYTCTSTRFSRYLQIDSSRSFPSAHTSLSVYCGLFLAWYLQRRAFSWHSRSVLVVPLLQILCIIYAVTCPLTRITDHRHHWWDVLAGAVMGVATVIYTVLVLCKNFSHPEVVSTSDISSSDGNNHQSVRRLLASGPRGVVP-