Monarch geneset OGS2.0

DPOGS213427
TranscriptDPOGS213427-TA1167 bp
ProteinDPOGS213427-PA388 aa
Genomic positionDPSCF300271 + 106049-110616
RNAseq coverage24x (Rank: top 78%)
Annotation
HeliconiusHMEL0168052e-15265.13% 
BombyxBGIBMGA004452-TA4e-13156.09% 
DrosophilaCG33116-PA1e-9546.70% 
EBI UniRef50UniRef50_UPI0001792CFE7e-9546.35%UPI0001792CFE related cluster n=1 Tax=unknown RepID=UPI0001792CFE
NCBI RefSeqXP_001850857.11e-10249.48%phosphatidyltransferase [Culex quinquefasciatus]
NCBI nr blastpgi|1700466262e-10149.48%phosphatidyltransferase [Culex quinquefasciatus]
NCBI nr blastxgi|3320254883e-10147.78%Ethanolaminephosphotransferase 1 [Acromyrmex echinatior]
Group
Gene OntologyGO:00160203.2e-16membrane
GO:00086543.2e-16phospholipid biosynthetic process
GO:00167803.2e-16phosphotransferase activity, for other substituted phosphate groups
KEGG pathwaycqu:CpipJ_CPIJ0093623e-102 
 K00993 (EPT1)maps-> Glycerophospholipid metabolism
    Phosphonate and phosphinate metabolism
    Ether lipid metabolism
InterPro domain[3-389] IPR0144722.4e-107Choline/ethanolamine phosphotransferase
[47-155] IPR0004623.2e-16CDP-alcohol phosphatidyltransferase
Orthology groupMCL25023 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213427-TA
ATGTTTGGTTATAAATACTTAACGTCGAAACATTTGAAAGGATTTGACAAATACAAGTACAGCGCCATAGACACCAGTCCCCTCAGCAGATATGTGATGCATCCATCCTGGAACTTCATGGTGAAGTTCATCCCGAAATCGATAGCGCCGAACCTTCTGACCTTCTCCGGTTTCCTCTGTATGTTGTTGTGTGTTTTAATGCTGCTGGTCTATGACTACGACTGCACGGCATCTGGAAGACCTGGAGATGGAAGAAAGGATGAATACAGCATACCGAATACAGTATTCGTTCTGTGCGGAATACTAGTGTTCTTGGCGTACAATCTAGATGGTTTGGACGGGAAGCAGGCTCGTCGTATCGGAGTCTCTGGACCTCTGGGTGAGTTGTTCGACCACGGTCTGGACTCCTACATCGTGTTCCTCATACCCTACTGCCTATTTTCAGTCTTCGGAAGAGATCAGTTCTCCATAACCGTCTTCAGAGGCTACCTCATCATATTAAGCATAGTTCTTAATTTCTACGTGAGTCACTGGGAGAAGTACAACACAGGAACGTTGTACCTTCCATGGGGTTACGACCTCAGTATGTGGGTATCGACGATCCTGTTCCTACAAGCTGGCGCCCAAGGTCCGGTAATCTTTAAAACCTTCGTCTTCAATGACGTGACATTCGTCCAAGCCCTGGAGGTCGCTATACACGCCACTGGCTTATTCACGACCCTGCCGGTTGCCGTTTATAATGTTATCTTGTCACACAGGAACAGGACAGGTAAGATGTTGTCTATGAGGGAGGCTCTCCGTCCGCTGTATCCCATGACGGTACTGACGATCACCTCCACACTCTGGGCCCTGAAGACAGATGCCCTGGAACAAGACCCCAGAGCCTTCCTCCTGGCTTTTGGGACAATATTCAGTAACATAGCGAGCCGTCTCATCGTGTCGGAGATGAGCGGTCAGCGATGCGACGGTGTCAGTCTGCTGAACATTCCTCTAGTGGCGGTGGTGGTGGTGTCTTCGTATCTCCCACACTTGACGCTACCACTGCTATATCTGTTGTTGTTCGTCGTTACCACCGCTCATGTCCATTACGGTGTCTGTGTGGTCCGTCAGATGTGCGATCACTTTAAAGTTAATTGTTTCTCTGTGCCGCGAGATAAAATTAAATGA

Protein sequence:

>DPOGS213427-PA
MFGYKYLTSKHLKGFDKYKYSAIDTSPLSRYVMHPSWNFMVKFIPKSIAPNLLTFSGFLCMLLCVLMLLVYDYDCTASGRPGDGRKDEYSIPNTVFVLCGILVFLAYNLDGLDGKQARRIGVSGPLGELFDHGLDSYIVFLIPYCLFSVFGRDQFSITVFRGYLIILSIVLNFYVSHWEKYNTGTLYLPWGYDLSMWVSTILFLQAGAQGPVIFKTFVFNDVTFVQALEVAIHATGLFTTLPVAVYNVILSHRNRTGKMLSMREALRPLYPMTVLTITSTLWALKTDALEQDPRAFLLAFGTIFSNIASRLIVSEMSGQRCDGVSLLNIPLVAVVVVSSYLPHLTLPLLYLLLFVVTTAHVHYGVCVVRQMCDHFKVNCFSVPRDKIK-