Monarch geneset OGS2.0

DPOGS204339
TranscriptDPOGS204339-TA1512 bp
ProteinDPOGS204339-PA503 aa
Genomic positionDPSCF300142 + 18552-25362
RNAseq coverage235x (Rank: top 43%)
Annotation
HeliconiusHMEL0053950.078.88% 
BombyxBGIBMGA007242-TA0.071.23% 
Drosophilaoys-PA5e-11744.89% 
EBI UniRef50UniRef50_Q6NN557e-11544.89%CG18445 n=10 Tax=Drosophila RepID=Q6NN55_DROME
NCBI RefSeqXP_002063522.13e-11946.17%GK21363 [Drosophila willistoni]
NCBI nr blastpgi|3071882001e-12545.94%Membrane-bound O-acyltransferase domain-containing protein 2 [Camponotus floridanus]
NCBI nr blastxgi|3071882006e-12345.63%Membrane-bound O-acyltransferase domain-containing protein 2 [Camponotus floridanus]
Group
KEGG pathwaytca:6627875e-111 
 K13517 (MBOAT1_2)maps-> Glycerolipid metabolism
    Glycerophospholipid metabolism
InterPro domain[120-435] IPR0042993.9e-38Membrane bound O-acyl transferase, MBOAT
Orthology groupMCL14854 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204339-TA
ATGTCTACAGAATATTATGATTATTATGACGGCAGCAAATTATTTTTGTTTTTATCAAACCGACTCGGGCTACCCATCGACTTGGTTAATTTTTTGATCGCACAATTTGCTGCATTATGCGTGGCAAGACTTTTTCGGAAACCATTAAAACTAGCTTCGCCAGAATTTCGACATTCATTATGCCTAGTTATAGGGTTGCTAATGGGATATTTCTGCTTTGGAAAACAAGCGGTCCATTTATCGGTGCTACCTATGATAGGATATATTCTACTGAGGATGTTATCACCTAATTTAATGGGAAATGCAATTCTGGCCGCTTCTATGATATATTTGTCTTGTATACATTTGCATCGACAGATTTATCACACAGCTGATTATTCATTAGATATAACGGGTCCTCTGATGGTTATCACTCAGAGGGTTACATCCTTAGCATATTCGTTGCAAGATAAATTAACCTCAAAAGAGATAAACGCTAACTCTACAAGTAATTCATGGGGTAAGGACTTCAAAACGGATAAAATGCCCTCTCCACTCGAATATTTCGCATACACATTGGCGTTCCAGACGTTGATGTGCGGTCCGGTCGTGTTTTATTCAGATTATATAAGTTTCATTGAAGGCGATCATGTTAATGGAGAGGGAAAAAGTCATATGACAGAAAGAGAACCCTCTCCTAGATTGGCAGTACTTTATAAAGTAGCAGGCTCTGTGGCAGCAGCTGTTCTATATTTGTCTCTGGCTAAGAAGTATCCTCTTGCAGTGCTTGAAGAATTAACCAATCCACATTCAGAAGTGTCTCGGTCTTGGTCTGCTTTATATTTACTGTGGTACGCGTATCTATCTACGCTGGTTGTCCGTTGTAAGTACTATCACGCGTGGCTGCTGTCTGAGGCAATATGTAACAACTCAGGGATGGGTTTCAATGGATATGACACGAATGGGAGCCCAAAATGGGATAAGATGTCCAATATAGATGTACTGGGATTTGAGCTGGCTCAGAACTTCCGCGCGGCTGTATCAAGCTGGAACAAGAACACGAACGCCTGGCTGCGGTTGGTCGCATACAGCCGAGGTGGTTGGCGGACGACGCGAGTGTACGCTCTGTCTGCGATCTGGCACGGCTTCCATCCCGGTTACTACTTCACTTTCTTCGCGGGAGGTCTCTTCACAGTCGCTGCCAAGAAGGTGCGATCTTTCGCTCGTCCCATGTTTGTGGACAGTAAACCTAAGAAGATGTTGTACGACGCTCTAACCTTCATAACGACTCGCGTCGCCATGACGTACGCCACAGTGCCATTTATTCTACTAAATCTTTCACCGACACTTGCTTTTTATGGAAAATTCTACTATTCATTGCATTTTATTGCTCTGGGTGCGCTGTTTTTGCCCAATAACAATAAATACCACACGTCTCAGATCTCCAATTGTCCTAATACGTCTTCCAAATTAAAAGAAATTCCGGTTCAGGAAACCATAGTGGAGGCCAACGGTAAGATGAAGGTGCGCTAA

Protein sequence:

>DPOGS204339-PA
MSTEYYDYYDGSKLFLFLSNRLGLPIDLVNFLIAQFAALCVARLFRKPLKLASPEFRHSLCLVIGLLMGYFCFGKQAVHLSVLPMIGYILLRMLSPNLMGNAILAASMIYLSCIHLHRQIYHTADYSLDITGPLMVITQRVTSLAYSLQDKLTSKEINANSTSNSWGKDFKTDKMPSPLEYFAYTLAFQTLMCGPVVFYSDYISFIEGDHVNGEGKSHMTEREPSPRLAVLYKVAGSVAAAVLYLSLAKKYPLAVLEELTNPHSEVSRSWSALYLLWYAYLSTLVVRCKYYHAWLLSEAICNNSGMGFNGYDTNGSPKWDKMSNIDVLGFELAQNFRAAVSSWNKNTNAWLRLVAYSRGGWRTTRVYALSAIWHGFHPGYYFTFFAGGLFTVAAKKVRSFARPMFVDSKPKKMLYDALTFITTRVAMTYATVPFILLNLSPTLAFYGKFYYSLHFIALGALFLPNNNKYHTSQISNCPNTSSKLKEIPVQETIVEANGKMKVR-