Monarch geneset OGS2.0

DPOGS208554
Transcript         DPOGS208554-TA    1080 bp
Protein            DPOGS208554-PA    359 aa
Genomic position   DPSCF300064 + 1092116-1099917
RNAseq coverage    68x (Rank: top 67%)
Annotation
Heliconius         HMEL007959          4e-72     43.23%
Bombyx             BGIBMGA010327-TA    1e-127    64.58%
Drosophila         CG7149-PA           7e-101    48.08%
EBI UniRef50       UniRef50_Q8T0S3     1e-98     48.08%    CG7149 n=21 Tax=Endopterygota RepID=Q8T0S3_DROME
NCBI RefSeq        XP_395166.2         1e-105    48.18%    PREDICTED: similar to CG33116-PA [Apis mellifera]
NCBI nr blastp     gi|340729839        7e-106    48.70%    PREDICTED: ethanolaminephosphotransferase 1-like [Bombus terrestris]
NCBI nr blastx     gi|383851170        9e-109    49.35%    PREDICTED: ethanolaminephosphotransferase 1-like [Megachile rotundata]
Group
KEGG pathway       ame:411698    4e-105
  K00993 (EPT1) maps->
    Glycerophospholipid metabolism
    Phosphonate and phosphinate metabolism
    Ether lipid metabolism
InterPro domain    [1-360]    IPR014472    1.2e-96    Choline/ethanolamine phosphotransferase
Orthology group    MCL19582    Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208554-TA
ATGTCTCGTTTCCTATCAAAAGACCAGCTAGAAGGGTTCGAAAAATATAAGTATAACTCCATAGACACTAGTATACTGAGTACCTATGTTATGCATCCATTTTGGAACTGGTGTGTCCAGTTTTGTCCGGTATGGGTGGCACCCAACCTGCTAACATTTTCTGGTTTTCTGCTGACGGTTATTAACTTTCTGTTATTCTCCTATTATGATTATGGATTCCATGCGCTATCGAAGGAGAATTTCACAAACGATAGCATACCAAACTGGGTGTGGGCTGTTACCGCTGTCAATTTATTTGTGGCTTACACACTCGGTGAGAATCATATTAAAATAATCATTGCAGAACTAACATCACACGATATACAGGACGAGCTGACCCCTCTCAGATTTTATTTCGTTATATGGAACATATTTCTGAACTTTTATTTGACACATTGGGAGAAATACAACACCGGTGTCATGTTCCTACCGTGGGGCTACGACTTTACCATGCTCGGCTCCTGTATCCTCCTGCTGGTGACGTCACTGATAGGTCCTTCAGCGTGGCACGTCACCTTGCCGGGCGGTCTAACACCCGGCGTGGTGTTTGAGATCGTGCTATACTTCTCCGCCATCATAACGAGCCAGACGGTTATATTGTGGAACATTTATAAATCATATCGCGACGGTACTGGTAAAATGCGTCCATTCATAGAGGCCGTCCGTCCCTTGTTCCCGCTAGCCATCTTTTTTATTCTGAGCACAGCCTGGGCACTTTACTCTCCTAATGACGTCATTAACAAAGGCCCGCGACTGTTCTATATCTTGACAGGAACCATTTTCTCAAATATCAACTGTCGTCTGATCGTGTCTCAAATGAGTGACACATGCTGCGAGTCCTTCAATGATCTTCTAATACCATACGCGGCTGTAGTGTTCGCTTGCCTGTACGGCGTCTCCGAGGCGGTGGAGCTGGTGCTGCTGTCGCTCCTCACGGCGCTGGTGTCCGTAGCGCACATCTACTACGGAACACACGTGGTACAAGAAATGTGCGAACATTTTAAAATTTCCTGTTTTAAAATTAAGTCAAAATCGAATTAA

Protein sequence:

>DPOGS208554-PA
MSRFLSKDQLEGFEKYKYNSIDTSILSTYVMHPFWNWCVQFCPVWVAPNLLTFSGFLLTVINFLLFSYYDYGFHALSKENFTNDSIPNWVWAVTAVNLFVAYTLGENHIKIIIAELTSHDIQDELTPLRFYFVIWNIFLNFYLTHWEKYNTGVMFLPWGYDFTMLGSCILLLVTSLIGPSAWHVTLPGGLTPGVVFEIVLYFSAIITSQTVILWNIYKSYRDGTGKMRPFIEAVRPLFPLAIFFILSTAWALYSPNDVINKGPRLFYILTGTIFSNINCRLIVSQMSDTCCESFNDLLIPYAAVVFACLYGVSEAVELVLLSLLTALVSVAHIYYGTHVVQEMCEHFKISCFKIKSKSN-