Monarch geneset OGS2.0

DPOGS209876
TranscriptDPOGS209876-TA1185 bp
ProteinDPOGS209876-PA394 aa
Genomic positionDPSCF300302 + 179834-186928
RNAseq coverage565x (Rank: top 22%)
Annotation
HeliconiusHMEL0079599e-17680.79% 
BombyxBGIBMGA004438-TA5e-14872.73% 
DrosophilaCG7149-PA1e-10748.01% 
EBI UniRef50UniRef50_UPI0001792CFE4e-10749.11%UPI0001792CFE related cluster n=1 Tax=unknown RepID=UPI0001792CFE
NCBI RefSeqXP_395166.21e-12453.81%PREDICTED: similar to CG33116-PA [Apis mellifera]
NCBI nr blastpgi|3504020771e-12554.71%PREDICTED: ethanolaminephosphotransferase 1-like [Bombus impatiens]
NCBI nr blastxgi|3504020774e-12754.45%PREDICTED: ethanolaminephosphotransferase 1-like [Bombus impatiens]
Group
Gene OntologyGO:00160202e-13membrane
GO:00086542e-13phospholipid biosynthetic process
GO:00167802e-13phosphotransferase activity, for other substituted phosphate groups
KEGG pathwayame:4116984e-124 
 K00993 (EPT1)maps-> Glycerophospholipid metabolism
    Phosphonate and phosphinate metabolism
    Ether lipid metabolism
InterPro domain[3-395] IPR0144721.3e-110Choline/ethanolamine phosphotransferase
[47-165] IPR0004622e-13CDP-alcohol phosphatidyltransferase
Orthology groupMCL14880 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209876-TA
ATGTTTGACTATAAATACCTTGGTCGTGAACATCTGGAAGGCTTCGATAACTACAAGTATATGGCTAGGGATACAAGTCCTTTAAGTGTATATGTAATGCACCCGTTCTGGAACAAGGTTGTTGAGTTAGTACCCCGTTGGATAGCACCGAATGTCCTAACGTTTGCAGGGTTTATATTAACAGTGGCAGATTTCTTGCTGCTATCTTTCTATGACTACGATTACTATGCAGCGGCCACTAAATATAACACGACCAGTACTGAACCTTTGAACGGACACACGGAAGTTATACCGCAGTCATTGTGGTATCTATTGGCTGTGTTTCTCTTCTTAGCATACACTTTAGACGGCATTGATGGTAAACAGGCCCGTAGAACACAGACCTCAGGGCCTTTAGGAGAATTATTCGACCATGGATTAGATTCTTACTCTGTGTTCTTCATACCAGCATGTCTTTACTCTATATTTGGTCGTCTAGACTACTCAATATCGCCTATCAGGATGTATTACGTGATGTGGAATCTTTTACTGAATTTCTATCTGAGTCACTGGGAGAAGTACAACACTGGCGTGTTATTCCTTCCCTGGGGCTACGACTTCAGTATGTGGACGTCAACGTTCGTGTTCCTATGGACGGGATCTCACGGGACATCGTATTACAAAAAAAACCTCTTCGGTTCATATACACTGGCCGATGGTTTCGAGTGGCTCATCTACGCCACCGGAGTATTCACTAACTTGCCAGTCGCTGTATACAATATATATCAGTCATACAAGCTTAAGACAGGTAAGATGAGAGCTCCTCTTGAGGCAGTACGTCCATTGTGGTCCTTGCTCAGCATATTCATAGTGTGTACCGTCTGGGTTCACTGGTCGAACAATTTACCTAATAGCGACCCACGAGCTCTATTCCTTCTTATTGGGACACTATTCAGTAATGTAGCTTGTCGGCTGATAGTGAGTCAGATGAGCAACCAGCGTTGTGAGGCGGTCAGTTGGTTGTTGTGGCCGTTGTCTATAAGTGTGATGTCTTCTCTGGCCACTCCGCAGTACGAGCCGCTGTTCTTCTACACTATGACAGTGTTCAGTGTCCTCGCCCATGTCCACTACGGCACCTGTGTTGTCCGTCAAATGTGTGAGCACTTCAGGATAGGCTGTTTCCATATAAAGCAGCGTTCAGACTGA

Protein sequence:

>DPOGS209876-PA
MFDYKYLGREHLEGFDNYKYMARDTSPLSVYVMHPFWNKVVELVPRWIAPNVLTFAGFILTVADFLLLSFYDYDYYAAATKYNTTSTEPLNGHTEVIPQSLWYLLAVFLFLAYTLDGIDGKQARRTQTSGPLGELFDHGLDSYSVFFIPACLYSIFGRLDYSISPIRMYYVMWNLLLNFYLSHWEKYNTGVLFLPWGYDFSMWTSTFVFLWTGSHGTSYYKKNLFGSYTLADGFEWLIYATGVFTNLPVAVYNIYQSYKLKTGKMRAPLEAVRPLWSLLSIFIVCTVWVHWSNNLPNSDPRALFLLIGTLFSNVACRLIVSQMSNQRCEAVSWLLWPLSISVMSSLATPQYEPLFFYTMTVFSVLAHVHYGTCVVRQMCEHFRIGCFHIKQRSD-