Monarch geneset OGS2.0

DPOGS211979
TranscriptDPOGS211979-TA1047 bp
ProteinDPOGS211979-PA348 aa
Genomic positionDPSCF300011 + 1351317-1355827
RNAseq coverage261x (Rank: top 41%)
Annotation
HeliconiusHMEL0180462e-12262.73% 
BombyxBGIBMGA001219-TA2e-9551.17% 
Drosophilaeas-PE1e-7639.86% 
EBI UniRef50UniRef50_Q7QEI85e-8947.49%AGAP000010-PA n=1 Tax=Anopheles gambiae RepID=Q7QEI8_ANOGA
NCBI RefSeqXP_311150.44e-8848.62%AGAP000010-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3479629992e-8847.49%AGAP000010-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479629994e-8747.49%AGAP000010-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00167721.2e-84transferase activity, transferring phosphorus-containing groups
GO:00167732.6e-56phosphotransferase activity, alcohol group as acceptor
KEGG pathwayaga:AgaP_AGAP0000101e-87 
 K00894 (E2.7.1.82, EKI1)maps-> Glycerophospholipid metabolism
InterPro domain[10-345] IPR0110091.2e-84Protein kinase-like domain
[66-269] IPR0025732.6e-56Choline/ethanolamine kinase
Orthology groupMCL11717 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211979-TA
ATGTCTCGGGTTTGTCACGCTGCCGGTCACAAGTTCATACCGAATCAAATTAAAGAGGACGATCTGTATGGAGGAATATCGAACGTTCTTTCAGTCATAAGGCCCGAGTGGCCTTCGGAGAACATTAAATACAAGTTATTTACTGATGGAATAACAAATAAATTAGTAGCTTGCCAATTAAAATCAGGGGATGAAATTTTGCTCGTAAGGATCTATGGAAACAAAACGGATCTCTTGATTGACAGAGATGCTGAAATAAGAAACATCACCCTGTTAAACAAAGAGGGTTTGGCGCCAAAAATATATGGTGTTTTTAAAAATGGTCTAGTCTATGAATATTACCCTGGGGTGACTCTGAACACGGAAACTGTTACAGACACTAAAATATCAACATTGGTCGCACGACAAATGGCTAAGATGCACAAAGTACAACTCGGACCTGAAACAAAGAAAGAGCCAATGATTTGGGATAAAATAGAACAATTCCTGAAACTGATTCCCGAAGAATATTCAGATCCTCACAAGCAGTCCAGGTTCGAGCGCAGCTTCGGGTCGTCGTCCAGGCTGTGGTCGGAGTACCGCGAGCTGCGGCGGCGGCTGGCGGAGTGCAGCAGTCCGCTGGTGTTCGCTCACAACGACCTGCTGCTGGGGAACGTGGTGCACGACGAGCGAGCGGGGGCCGTGGCCTTCATAGACTACGAGTACGCCGGATACAACTACCAGGCCTTCGACATCGCCAACCACTTCAACGAGTACGTCGGTCTCTCGCTGGACGACATCGACTACTCCCGCTACCCGTGCGAGGAGTTCCAGCGCCGCTGGGTCCACACCTACCTGTCGGAGTTCGAGGCCCGCGAGGTCGGCGAGGAGCAGGTGTCGCGCGTGTGCGACGAGGTGCGACGCCTGGCACCCCTATCTCACTTCCTGTGGGCCGTGTGGGCGCTGGTCCAGTATCACCTCTCAGATATTCACTTCGACTTCTTAAGATACGCTGAGATACGACTCGGGAGGTACTATGAGCTGAAGGAGGAACGTGACGCGCCCTGA

Protein sequence:

>DPOGS211979-PA
MSRVCHAAGHKFIPNQIKEDDLYGGISNVLSVIRPEWPSENIKYKLFTDGITNKLVACQLKSGDEILLVRIYGNKTDLLIDRDAEIRNITLLNKEGLAPKIYGVFKNGLVYEYYPGVTLNTETVTDTKISTLVARQMAKMHKVQLGPETKKEPMIWDKIEQFLKLIPEEYSDPHKQSRFERSFGSSSRLWSEYRELRRRLAECSSPLVFAHNDLLLGNVVHDERAGAVAFIDYEYAGYNYQAFDIANHFNEYVGLSLDDIDYSRYPCEEFQRRWVHTYLSEFEAREVGEEQVSRVCDEVRRLAPLSHFLWAVWALVQYHLSDIHFDFLRYAEIRLGRYYELKEERDAP-