Monarch geneset OGS2.0

DPOGS214423
Transcript        DPOGS214423-TA   1440 bp
Protein           DPOGS214423-PA   479 aa
Genomic position  DPSCF300069 + 461616-472790
RNAseq coverage   735x (Rank: top 18%)
Annotation
Heliconius        HMEL004474         9e-163   78.39%
Bombyx            BGIBMGA011364-TA   2e-140   75.89%
Drosophila        CG2201-PB          4e-91    37.79%
EBI UniRef50      UniRef50_F4WLH3    4e-101   48.06%   Choline/ethanolamine kinase n=3 Tax=Formicidae RepID=F4WLH3_ACREC
NCBI RefSeq       XP_001603632.1     2e-102   48.40%   PREDICTED: similar to choline/ethanolamine kinase [Nasonia vitripennis]
NCBI nr blastp    gi|383849571       2e-101   48.66%   PREDICTED: choline/ethanolamine kinase-like [Megachile rotundata]
NCBI nr blastx    gi|195384718       9e-99    40.52%   GJ14204 [Drosophila virilis]
Group
Gene Ontology     GO:0016772   5.9e-88   transferase activity, transferring phosphorus-containing groups
                  GO:0016773   2.1e-70   phosphotransferase activity, alcohol group as acceptor
KEGG pathway      hsa:1120     6e-71
                  K00894 (E2.7.1.82, EKI1)   maps-> Glycerophospholipid metabolism
                  K00866 (E2.7.1.32, CHK)    maps-> Glycerophospholipid metabolism
InterPro domain   [61-477]     IPR011009   5.9e-88   Protein kinase-like domain
                  [159-414]    IPR002573   2.1e-70   Choline/ethanolamine kinase
Orthology group   MCL13135     Single-copy universal gene
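
The lengths and domain coordinates in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the transcript consists of the coding sequence plus one stop codon, and that the InterPro coordinates are 1-based, inclusive amino acid positions; neither assumption is stated in the record. The function names are hypothetical.

```python
# Values copied from the record; the helper names are hypothetical.
TRANSCRIPT_BP = 1440
PROTEIN_AA = 479
DOMAINS = {
    "IPR011009": (61, 477),   # Protein kinase-like domain
    "IPR002573": (159, 414),  # Choline/ethanolamine kinase
}


def cds_length_consistent(transcript_bp, protein_aa):
    """True if the transcript is exactly the protein's codons plus one stop codon.

    Assumes the transcript contains no untranslated regions.
    """
    return transcript_bp == 3 * (protein_aa + 1)


def domain_in_protein(start, end, protein_aa):
    """True if a 1-based, inclusive domain range fits inside the protein."""
    return 1 <= start <= end <= protein_aa


def is_nested(inner, outer):
    """True if the inner range lies entirely within the outer range."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


if __name__ == "__main__":
    print(cds_length_consistent(TRANSCRIPT_BP, PROTEIN_AA))
    for name, (start, end) in DOMAINS.items():
        print(name, domain_in_protein(start, end, PROTEIN_AA))
    print(is_nested(DOMAINS["IPR002573"], DOMAINS["IPR011009"]))
```

Under these assumptions, 3 x (479 + 1) = 1440, so the transcript and protein lengths agree, and the choline/ethanolamine kinase domain falls inside the broader protein kinase-like domain.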
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214423-TA
ATGTGCGGCACTGAAGAGGAAATGAGGGAGGTCGCTGGCAGAATTTGCAGGAATTATCTCCATGGAGCCTGGAAGACAGTCGATCCATCGGAGCTGGACTTCAAGAGAATCAGAGGAAATGAGGGAGGTCGCTGGCAGGATTTGCAGGAATTATCTCCATGGAGCCTGGAAGACAGTCGACCCATCGGAGCTGGACTTCAAGAGAATCAGAAATTACAAATGTGCGGCACTGAAGAGGAAATGAGGGAGGTCGCTGGCAGGATTTGCAGGAATTATCTCCATGGAGCCTGGAAGACAGTCGACCCATCGGAGCTGGACTTCAAGAGAATCAGTGGAGGCCTATCGAATTTCCTATACTACGTGGCGCTCCCAGACACGAGCAAGATACGGTACACCCGTGAGAGTTCCTTGGAAGCTGATGAAGCCAGGCTTCAGAGGCGACAAATCATGAGCTCCCATTCATTCTCCATGGATGAACCGAAAAAGGTACTGCTGCGTATATACGGCCAGGTTCATGGAGAAAGAGCAATGGACGCTATAGTCACCGAGTCAGTTATATTCACTCTATTATCTGAAAGAAGACTCGGTCCGAAGTTGCACGGAGTATTCTCTGGAGGAAGAATCGAACAGTACGTGCCGGCGAGGTCGCTTCTCACCAAAGAGCTCTCAGAACCCTCGCTCTCTATGAAGATAGCTGAGAAGATGGCGGCCATACACTCCATGGACGTGCCGCTGTCCAAAGAACCGAACTGGCTTTGGAAGACCATATACAAGTGGAGCAAGATAGTTAAAGAAGAGAGGCTAGATAACACAGTTGTTGGAAAGAACGATCAGGAACAGAGTATCATCAAGCATCTGCGTACGATAGACTTCGATAAGGAAATTGAATGGTTGAAAAAATTCCTGGCCACGGTCGAGTCGCCCGTGGTGTTCTGCCACAACGACATGCAAGAAGGCAACATCCTAATGTTGGAGGATGACACTCCGAACGAGGAAGAATCTACGGCCTACGTCGGCTCGTACGAGGACAAGAAGGACATCCACTACGACGACGAGGACTCCATCATAAGCCAGATATCGGACAGCGGGGAACCCAAGCTGGTGCTCATAGACTTTGAGTACTGCGCCTACAATTACAGGGGCTTCGACATTGCGAACCACTTCCAGGAGTGGTGCTACGACTACACCAACCCCGAGACGCCGTTCTACCACGAGAACCACGACAACGCGGCCACGCTGGAGCAAAAGGAGATTTTTATCAAGGAGTACCTGAAGCATTATCACTCAGCGGAGGACAGGTCACCTTCTATAGATGATGTGAATCAACTGCTAGCGGAGGTGGAGGCCTTCGCGTTGGCCAGCGACCTGTTCTGGTCTCTATGGTCCATTGTCAATGCATCCAAGAGCCAAATACCCTTCGGTTATTGGGTAAGCGGATAA

Protein sequence:

>DPOGS214423-PA
MCGTEEEMREVAGRICRNYLHGAWKTVDPSELDFKRIRGNEGGRWQDLQELSPWSLEDSRPIGAGLQENQKLQMCGTEEEMREVAGRICRNYLHGAWKTVDPSELDFKRISGGLSNFLYYVALPDTSKIRYTRESSLEADEARLQRRQIMSSHSFSMDEPKKVLLRIYGQVHGERAMDAIVTESVIFTLLSERRLGPKLHGVFSGGRIEQYVPARSLLTKELSEPSLSMKIAEKMAAIHSMDVPLSKEPNWLWKTIYKWSKIVKEERLDNTVVGKNDQEQSIIKHLRTIDFDKEIEWLKKFLATVESPVVFCHNDMQEGNILMLEDDTPNEEESTAYVGSYEDKKDIHYDDEDSIISQISDSGEPKLVLIDFEYCAYNYRGFDIANHFQEWCYDYTNPETPFYHENHDNAATLEQKEIFIKEYLKHYHSAEDRSPSIDDVNQLLAEVEAFALASDLFWSLWSIVNASKSQIPFGYWVSG-
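
The protein sequence corresponds to a translation of the nucleotide sequence, with the trailing "-" marking the stop codon. The sketch below is a minimal translator, assuming the standard genetic code and reading frame 1. The function name `translate` and the choice of "-" as the stop symbol (to match this record) are illustrative, not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides, stop_symbol="-"):
    """Translate a DNA string in frame 1 using the standard genetic code.

    Stop codons are written as stop_symbol; trailing bases that do not
    form a complete codon are ignored.
    """
    seq = nucleotides.upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 and last 18 bases of DPOGS214423-TA.
    print(translate("ATGTGCGGCACTGAAGAGGAAATGAGGGAG"))  # MCGTEEEMRE
    print(translate("TATTGGGTAAGCGGATAA"))              # YWVSG-
```

Applied to the first 30 and last 18 bases of DPOGS214423-TA, this reproduces the start (MCGTEEEMRE) and end (YWVSG-) of DPOGS214423-PA.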