Monarch geneset OGS2.0

DPOGS203601
TranscriptDPOGS203601-TA1173 bp
ProteinDPOGS203601-PA390 aa
Genomic positionDPSCF300063 - 563104-568440
RNAseq coverage495x (Rank: top 25%)
Annotation
HeliconiusHMEL0088770.080.56% 
BombyxBGIBMGA007271-TA9e-18075.26% 
Drosophilafrj-PA1e-7843.52% 
EBI UniRef50UniRef50_D6WX875e-8139.02%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WX87_TRICA
NCBI RefSeqXP_001847441.12e-9444.59%leukocyte receptor cluster member 4 protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700392214e-9344.59%leukocyte receptor cluster member 4 protein [Culex quinquefasciatus]
NCBI nr blastxgi|1700392212e-9545.03%leukocyte receptor cluster member 4 protein [Culex quinquefasciatus]
Group
KEGG pathwaycqu:CpipJ_CPIJ0054726e-94 
 K13516 (MBOAT7)maps-> Glycerophospholipid metabolism
InterPro domain[22-329] IPR0042996.3e-31Membrane bound O-acyl transferase, MBOAT
Orthology groupMCL11982 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203601-TA
ATGTTCGGCTACCTCACAATATTCCGTCTGGGCGAGCGGTTCGGTTTATCACCGTCTTCCGGGCACACGAATCTCATAGAGATGATCATAGTACTGCGTGTCGTGGGAGTGGCCTTTGAAATGAACGGCTCCTATCTGGCGATGATCGAGAGGAAGAGGGGCGATAGAAGAGACGACCGCCCAAAGGACAGGGATGAGGATTTCGTGGAACTCATCAACCCTGATTATGTCAACCTGTTCCATTACACGTTCAACTATATTGGATTATTGACTGGTCCGTATTATCGTTATCGGACCTACGAGGACTATTTCAAACTACCCTTCAGTAAGCACGTGGATTGTTTCGGCTTCACCATCAACACTCTCAAGGCTGTGCCCTTATACGTTACACTGTACTTGGCTCTGTCACATCTTTTTCCGTTGGAGTATGTTCTAACGGAAGAGCACAACAGCCGGCATTTCGTATATCGCATGTTCTATCCGTGGCTATTGTTCGCTGCTTTCCGCCAACGGATATACGCTGGCATGACTCTGGCTGAGAGCGTCTGCACTTCCGCTGGACTCGGGGCGTATCCGGTACAGGGCAGGAACAGATCCGGACACGGACCAACCGTGGGATACTTGAAGATGAAACAAATGACGACTCCCGAGGTTGAGGCCGCTCAATACGACTTCAAGACTGTGGAGTCGATGGAGGTTTGGGGCTGCGAGACTGTGGTCACTCTCAGGGATTCCATGAAGGTGTGGAACAAAGCTGTCCAGTACTGGGTCGCTATGGTGGTGTACAAACGATTCCCATTGAAGGCTTTCAAGATCCACGCGGCTCTGTTTGTGTCCGTCATCTGGCACGGTTTCCACCCCGGCTACTTCTTCTGTATCTACTTCTGTCCATTCTACTTGATGGCCGAAGATCTTTACTATCACCTTTATTACAAGGACGCTACCGGCACGAAAAAGAAAATAATCGGCTTCATAATGTGGTTCCTTCGTTCTCATTCCGAGTCCTATCAAGCTGCGGCATTTCTGCTACTAAGTATTGATAGAGTGTGGGCTTATTACTCCTCCGTATACCACTACTGGTACTTCGTGTGGTTCGGTTTCTTCGTACTAGGCCTCATACTGAACCGCGTCAGGCTCATGCTGGGCCCGAGGGTGCTAGTTCATACTGAGTAA

Protein sequence:

>DPOGS203601-PA
MFGYLTIFRLGERFGLSPSSGHTNLIEMIIVLRVVGVAFEMNGSYLAMIERKRGDRRDDRPKDRDEDFVELINPDYVNLFHYTFNYIGLLTGPYYRYRTYEDYFKLPFSKHVDCFGFTINTLKAVPLYVTLYLALSHLFPLEYVLTEEHNSRHFVYRMFYPWLLFAAFRQRIYAGMTLAESVCTSAGLGAYPVQGRNRSGHGPTVGYLKMKQMTTPEVEAAQYDFKTVESMEVWGCETVVTLRDSMKVWNKAVQYWVAMVVYKRFPLKAFKIHAALFVSVIWHGFHPGYFFCIYFCPFYLMAEDLYYHLYYKDATGTKKKIIGFIMWFLRSHSESYQAAAFLLLSIDRVWAYYSSVYHYWYFVWFGFFVLGLILNRVRLMLGPRVLVHTE-