Monarch geneset OGS2.0

DPOGS204546
TranscriptDPOGS204546-TA1359 bp
ProteinDPOGS204546-PA452 aa
Genomic positionDPSCF300297 + 11823-13882
RNAseq coverage53x (Rank: top 70%)
Annotation
HeliconiusHMEL0088773e-2627.32% 
BombyxBGIBMGA005334-TA4e-2638.89% 
Drosophilafrj-PA2e-1925.00% 
EBI UniRef50UniRef50_D6WX872e-3828.63%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WX87_TRICA
NCBI RefSeqXP_973277.11e-4230.13%PREDICTED: similar to CG9526 CG9526-PA [Tribolium castaneum]
NCBI nr blastpgi|910889233e-4130.13%PREDICTED: similar to CG9526 CG9526-PA [Tribolium castaneum]
NCBI nr blastxgi|910889235e-4429.76%PREDICTED: similar to CG9526 CG9526-PA [Tribolium castaneum]
Group
KEGG pathwaytca:6620624e-42 
 K13516 (MBOAT7)maps-> Glycerophospholipid metabolism
InterPro domain[90-384] IPR0042999.8e-09Membrane bound O-acyl transferase, MBOAT
Orthology groupMCL30328 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204546-TA
ATGTACGGCTCGGATGTCGTTTACTACATATCCCTAGTAATTTGCATATCTCTCGGCAGCTGCTACAAAAAAATTAATAATAATGAAATGAAAAAGAATTACGGAACGGGGCTCGGATTATTGCTCGTCTGTTTGATATGTGGAACTTCAATTTATCATACCGTGTTAATGGTATGGGGTAATATCGTAATTATTAAATGTTGCGACCGAAGATATTTACATCAAATCAGCATGGCCTATACGTGGTTGTATTTGTTATATCTACATTCGTACGTGATTAACGATTACATTATATGGATACATCAATCAATGGCATTAAAACTTGTCGGATTAGCGTTTGAAATGAATGCGGTTCATGTTAAGGCGGAAGGAAAGAGTCCTGTGTCGAAAATTAATCTTCGCGATATGGATAACACTCCGCCTGACCCGAGCGCAGCGGATATCATAGCATATGCGTTTTATTTTATTGGAATCCATAGAGGTCCTTATTATAGATATAAAATATTTAACGATCACTTCCAAAATACGTTTGGACTTTTAGGAGATTGCAGAATTATAACTGAACAAAAACTTAAAAAAGCATTAGTTTGCGTTATGGGGTACATAATTATAAGCAAGAATTATTCTCCCGAGTTGTATTATAAAGATATATTCTATACAACCTACGGGGCAGACTCTCGTTACCTGTACAATATTCCGCAGCTGGTGTTAAATGTTCTTGAGTATGAGGCTATCATGATGTTGAGTACCAGTGTCTTCACAGAAACCGGCTTTGGAGTGTATCCAGCTAAATGTCAACCGATACCTGGTTATGGACCGAGTGCCCAATTTAGTTTTCTTACACTAGCGTCCGCGTCTCCAGACATAGCTATTGAAGAGGAATATAACTTTTCGATGCTAAAATGTTTTGATAACGAACGTTTGATTCTTGGGCCAAAAATGAGAATGACGATGAGAGCTTTTGACATGCCTACTAGATTCTGGTTTTGGGCTTACGTGCAGAAAAGCTTCATAAAATCCAATAAGGAAGTCAGAAGCGCCTTTTCTTTTTTAGCATGGACCGTGTGGTATGGGCCTACGATACCAAAGTTCATAATTTCCTCGACATTGTGGGTGTACGTGCATCTTGAAGAGGAGTATAGCATTTTGTATAACACGTCTGGTCCTATGAAGTTACCATGGGACATCGGTTTTTCGATCATGAGGATGTTCTGCCTCCTGTATCTGACGCCGTGTTTCACTGTTAAAGATACACGTATTGTACTTAAATATTATAATTCGATATTTTGGGTCTTCCATTTTATACTGCTGCTCATGATGATCCTCTCCATAATACTGTACAAAACTAAACGTGATTAG

Protein sequence:

>DPOGS204546-PA
MYGSDVVYYISLVICISLGSCYKKINNNEMKKNYGTGLGLLLVCLICGTSIYHTVLMVWGNIVIIKCCDRRYLHQISMAYTWLYLLYLHSYVINDYIIWIHQSMALKLVGLAFEMNAVHVKAEGKSPVSKINLRDMDNTPPDPSAADIIAYAFYFIGIHRGPYYRYKIFNDHFQNTFGLLGDCRIITEQKLKKALVCVMGYIIISKNYSPELYYKDIFYTTYGADSRYLYNIPQLVLNVLEYEAIMMLSTSVFTETGFGVYPAKCQPIPGYGPSAQFSFLTLASASPDIAIEEEYNFSMLKCFDNERLILGPKMRMTMRAFDMPTRFWFWAYVQKSFIKSNKEVRSAFSFLAWTVWYGPTIPKFIISSTLWVYVHLEEEYSILYNTSGPMKLPWDIGFSIMRMFCLLYLTPCFTVKDTRIVLKYYNSIFWVFHFILLLMMILSIILYKTKRD-