Monarch geneset OGS2.0

DPOGS204645
TranscriptDPOGS204645-TA783 bp
ProteinDPOGS204645-PA260 aa
Genomic positionDPSCF300462 - 35157-38235
RNAseq coverage194x (Rank: top 48%)
Annotation
HeliconiusHMEL0212913e-8958.78% 
BombyxBGIBMGA001850-TA2e-3636.16% 
DrosophilaCG11426-PA8e-2730.04% 
EBI UniRef50UniRef50_E2AJW71e-3939.24%Lipid phosphate phosphohydrolase 1 n=6 Tax=Formicidae RepID=E2AJW7_CAMFO
NCBI RefSeqXP_001605796.16e-4038.33%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|3071769773e-3939.24%Lipid phosphate phosphohydrolase 1 [Camponotus floridanus]
NCBI nr blastxgi|3071769774e-4039.24%Lipid phosphate phosphohydrolase 1 [Camponotus floridanus]
Group
Gene OntologyGO:00160203.7e-23membrane
GO:00038243.7e-23catalytic activity
KEGG pathwaydme:Dmel_CG114266e-25 
 K01080 (E3.1.3.4, PPAP2)maps-> Glycerolipid metabolism
    Glycerophospholipid metabolism
    Fc gamma R-mediated phagocytosis
    Sphingolipid metabolism
    Ether lipid metabolism
InterPro domain[3-231] IPR0003263.7e-23Phosphatidic acid phosphatase type 2/haloperoxidase
[93-228] IPR0161182.8e-18Phosphatidic acid phosphatase/chloroperoxidase, N-terminal
Orthology groupMCL21022 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204645-TA
ATGTTAAATAAATTTAAATATTTCTGGGGGAAAACGAATCGATGGCATCGAGTTTTCCTCATATTTCTCATAATCGAGTTAAGGATATTGCCCGGTGGTAAATATGGTTTTAAATGCAACGACCCAGCACTCTCCCACACATTTACCGGAGACACTGTCAGTTGGAAACTGCTCTTGCTAATAACACTTACCTTACCCTTAATTGTGATGTTCATTGTTGAAAAGACACAAGAGAATTCAGAATCAGCCAAACAGCAAGCATTGTATTGGTACAAGGAATATTTATTTGGCTTCTTACTAAATCTAACATCAGTACAATTCATTAAGTTCCTCGTTGGATCACCGAGACCCCATTTCTTTGATACATGCCGTCCAGTAGAGGCAGAGACGTGCACAGAGTCACAGTACATATATAGTTATCAGTGTACGAACCCGGCTTGGATCAATCAATCGGACACCAGCTTCCCCTCGGGGCATTCCTCATTGGCGTTCCACTCTGCTCTCTTTATTGTTTACTACTTGTATCAAAGGAAGACATTGTGTAAAAACACATTAGTGACCCAGTCCCTCTGCGTGCTGCTGTCCGGGTACTGCGCCGTGTCCCGGTTGTCGGATCACCGTCATCACTGGTGGGACGTGGTCGCGGGCTTCGTCATAGCCATCGTTGTATTGATTTATACTATATTCCATTTATGCGGAAACTTTAAATGTGTGATGTCAAATAAGAAGCAAACACTCGAACGAAACGAGCACAGCGAGTGCACAATAGAGGGCGCCACGTAA

Protein sequence:

>DPOGS204645-PA
MLNKFKYFWGKTNRWHRVFLIFLIIELRILPGGKYGFKCNDPALSHTFTGDTVSWKLLLLITLTLPLIVMFIVEKTQENSESAKQQALYWYKEYLFGFLLNLTSVQFIKFLVGSPRPHFFDTCRPVEAETCTESQYIYSYQCTNPAWINQSDTSFPSGHSSLAFHSALFIVYYLYQRKTLCKNTLVTQSLCVLLSGYCAVSRLSDHRHHWWDVVAGFVIAIVVLIYTIFHLCGNFKCVMSNKKQTLERNEHSECTIEGAT-