Monarch geneset OGS2.0

DPOGS215958
TranscriptDPOGS215958-TA1458 bp
ProteinDPOGS215958-PA485 aa
Genomic positionDPSCF300078 - 898643-908072
RNAseq coverage694x (Rank: top 18%)
Annotation
HeliconiusHMEL0164515e-6973.84% 
BombyxBGIBMGA001073-TA0.072.84% 
Drosophilabbc-PC4e-9163.05% 
EBI UniRef50UniRef50_E2APR65e-17963.67%Choline/ethanolaminephosphotransferase 1 n=19 Tax=Coelomata RepID=E2APR6_CAMFO
NCBI RefSeqXP_002033716.11e-16157.06%GM20274 [Drosophila sechellia]
NCBI nr blastpgi|3838479990.063.52%PREDICTED: choline/ethanolaminephosphotransferase 1-like [Megachile rotundata]
NCBI nr blastxgi|3838479993e-17963.79%PREDICTED: choline/ethanolaminephosphotransferase 1-like [Megachile rotundata]
Group
Gene OntologyGO:00160206e-21membrane
GO:00086546e-21phospholipid biosynthetic process
GO:00167806e-21phosphotransferase activity, for other substituted phosphate groups
KEGG pathwaydse:Dsec_GM202744e-161 
 K00993 (EPT1)maps-> Glycerophospholipid metabolism
    Phosphonate and phosphinate metabolism
    Ether lipid metabolism
InterPro domain[48-153] IPR0004626e-21CDP-alcohol phosphatidyltransferase
Orthology groupMCL12638 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215958-TA
ATGCAGTTTTACAAGGAAAGGATTCTTAACGCTGCTCAGTTAAAAAGACTCAGTGAACACAAATATTCGTGTACGAGTGCTAGTATTCTAGATGCTTGGCTTCAACCTTGGTGGTGCTGGCTAGTTTCAAAGACGCCGCTATGGCTTGCTCCGAATCTCATTACTATTCTAGGATTAATAGTGAATATAGTTACTACTCTCATATTGGTCTGGTATAGTCCAGATGCAAGACAGGAACCGCCACAGTGGGCCTTCGCCCTCTGCGCGTTGGGAGTGTTCGTGTACCAGAGCTTGGACGCTATTGACGGCAAGCAGGCTCGTAGAACCGGCAGCCAGTCACCATTAGGAGAACTCTTCGATCACGGCTGTGACAGTATATCGACAGTTTTCATCGCACTGGGGGCCTGTATAGCTGTGAAGCTCGGGGAATATCCTACGTGGATGTTCTTTCAGTGTTTCTGCGCTATGACGTTATTTTATTGCGCTCATTGGCAAGCTTACGTCACTGGGACCCTCAAAATGGGCCGGATTGACGTCACCGAGGCCCAGTACACGATCATAGGGATCCATCTCATATCAGCGACCCTCGGGCCAGACGCCTGGGCCACTAAGATCGGCTCGCTGGAGGCTCGCTACTCGCTAGGTGGCGCTGCTGTGTTGGGGGCTACCCTGACGCTTGGCGCTCTCGGAGCAGCCATCGCCAAAGGGGGGGTCGGCAAGAATGGATCCACTGTCGCTATTCACGGTTTAGATATACCGGTACGGCTGTTCACGTGTACGTTAACGTTTTCAACGGCGTTGTACTTTGTCGTCAACTTCGCCGCCTCCTTCAGGCACAGGGGCTGTGGCAAGAACGGATCCACTGTGGCCTTGCCATCTACCGGCGTCGGTTTGAACCTGTTGTCGAATTACGCTGTGGTGTTGGTGACTGGCACCATCGTCCTCGGATACGTTAAGGTCATTATGAAAGGCGGAGTCGGCAGGAACGGGTCCACGGTCGCAGGTACCAGTATCCTGTCCCCGGTGATACCATTCTCATTGGTGGTAGTCCCAGCGTTTATCATCTTCCAGAAGAGCGAGTCCCAGGTCTATGAGAACCATCCCGCCTTGTATATTATAGCGTTCGGTATGGTTACTGCTAAAGTAACAAATCGCCTTGTGGTGGCGCATATGACGAAGAGCGAGATGGAATACTACGACTGGTCGCTCCTAGGCCCGGCCATGCTCTTCCTCAATCAGTACTTCAACCACGCCCTGCCCGAGTATTACGTGCTTTGGCTTTGCACGATATGGGTCGTGGTTGAACTGATCCGTTACTGTGGTCAGATATGTCTGGAGATATGTGACCACCTGCACATCAGCCTGTTCAGGATAACGAGACAGTCGCCGGCAACAGCGTCGCCTCACGATAAGAACGGTACGAACAGGGGCAGGAGGGCCAAGAGGGTGCCCGCTTAG

Protein sequence:

>DPOGS215958-PA
MQFYKERILNAAQLKRLSEHKYSCTSASILDAWLQPWWCWLVSKTPLWLAPNLITILGLIVNIVTTLILVWYSPDARQEPPQWAFALCALGVFVYQSLDAIDGKQARRTGSQSPLGELFDHGCDSISTVFIALGACIAVKLGEYPTWMFFQCFCAMTLFYCAHWQAYVTGTLKMGRIDVTEAQYTIIGIHLISATLGPDAWATKIGSLEARYSLGGAAVLGATLTLGALGAAIAKGGVGKNGSTVAIHGLDIPVRLFTCTLTFSTALYFVVNFAASFRHRGCGKNGSTVALPSTGVGLNLLSNYAVVLVTGTIVLGYVKVIMKGGVGRNGSTVAGTSILSPVIPFSLVVVPAFIIFQKSESQVYENHPALYIIAFGMVTAKVTNRLVVAHMTKSEMEYYDWSLLGPAMLFLNQYFNHALPEYYVLWLCTIWVVVELIRYCGQICLEICDHLHISLFRITRQSPATASPHDKNGTNRGRRAKRVPA-