Monarch geneset OGS2.0

DPOGS213532
TranscriptDPOGS213532-TA1131 bp
ProteinDPOGS213532-PA376 aa
Genomic positionDPSCF300033 - 620572-627523
RNAseq coverage473x (Rank: top 26%)
Annotation
HeliconiusHMEL0079000.086.21% 
BombyxBGIBMGA011813-TA0.083.74% 
DrosophilaPect-PB5e-14465.78% 
EBI UniRef50UniRef50_Q8IP802e-14467.96%Phosphoethanolamine cytidylyltransferase, isoform D n=18 Tax=Coelomata RepID=Q8IP80_DROME
NCBI RefSeqXP_966534.14e-15371.93%PREDICTED: similar to ethanolamine-phosphate cytidylyltransferase [Tribolium castaneum]
NCBI nr blastpgi|910787129e-15271.93%PREDICTED: similar to ethanolamine-phosphate cytidylyltransferase [Tribolium castaneum]
NCBI nr blastxgi|1571106998e-15071.23%ethanolamine-phosphate cytidylyltransferase [Aedes aegypti]
Group
Gene OntologyGO:00090583e-21biosynthetic process
GO:00038243e-21catalytic activity
GO:00167791.3e-18nucleotidyltransferase activity
KEGG pathwaytca:6549721e-152 
 K00967 (E2.7.7.14, PCYT2)maps-> Glycerophospholipid metabolism
    Phosphonate and phosphinate metabolism
InterPro domain[196-331] IPR0147294.4e-37Rossmann-like alpha/beta/alpha sandwich fold
[12-76] IPR0048213e-21Cytidyltransferase-related
[14-106] IPR0048201.3e-18Cytidylyltransferase
Orthology groupMCL14069 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213532-TA
ATGAGTGACAATAAGAACGATAATAAACAAATACGCGTGTGGTGTGATGGCTGTTACGATATGGTACATTTTGGTCATGCCAACTCGTTAAGGCAAGCAAAGTCCTTAGGAGATGTGTTGATTGTGGGAGTGCACACGGATGAAGAAATATCCAAACATAAAGGTCCACCTGTGTTCACACAGCAGGAGAGATACAAAATGGTTGGTGCAATCAAATGGGTCGACCATGTGGTGGAGGGTGCACCTTATGTGACAACATTAGAAACATTAGACAAATACCAGTGTGATTTCTGTGTGCATGGAGATGACATAACAGTAACGGCGGATGGAATCGATACATACCATTTAGTAAAAGAAGCCGGCAGATACAGCATTCTTGCTTTTACATGTGATTTAAAGTCATTATTAAATATAACCATAACTTTCAGGAGAGGTGACAAAGAATATTCAGTTGAACTGGAACATTCATCAAATCTTGGGACGGATTCCACAGCGAGGTCACCATACACCGGTTGTTCACAGTTCCTGCCTACTACACAGAAAATTATACAATTTAGCAGTGGTCTTTCACCAAAGCCCACGGATAAAGTAGTATATGTGGCTGGTGCCTTCGACCTGTTCCATGTCGGTCACCTGGACTTCCTGGAGGCGGCGCACGCGCAGGGCGACTTTCTCATAGTAGGGCTCCACACTGACCTCGAGGTCAACAGATATAAGGGCTCCAACTACCCCATCATGAACCTTCACGAGAGAGTCCTGTCCGTGCTGGCTTGTAAGTATGTGCACGAGGTGGTGATCGGGGCTCCCTACAGCGTGACGGCTGAGCTGATGGACCACTTCGGTGTGAAGGTGGTGTGTCACGGACTGACGCCCATCGCCAGCGACAAGGACGGAGCTGATCCGTACCAGGTGCCCAAGGAACGGGGCTGCTTTAAGACAATAAACTCGGGCAACACAATGACAACAGAAGACATAGTACAGAGAATAATCCGTCACCGACTGGAATTCGAAGAGAGGAATTCTAAGAAGGAACAAAAAGAAATTGCCGTTATGAAAACTATACAGAAGAAACATTTGAAACAGAACGGTAACTGTGAGAATGTCAAAGGTTATGGCGAACTAATTGAATAA

Protein sequence:

>DPOGS213532-PA
MSDNKNDNKQIRVWCDGCYDMVHFGHANSLRQAKSLGDVLIVGVHTDEEISKHKGPPVFTQQERYKMVGAIKWVDHVVEGAPYVTTLETLDKYQCDFCVHGDDITVTADGIDTYHLVKEAGRYSILAFTCDLKSLLNITITFRRGDKEYSVELEHSSNLGTDSTARSPYTGCSQFLPTTQKIIQFSSGLSPKPTDKVVYVAGAFDLFHVGHLDFLEAAHAQGDFLIVGLHTDLEVNRYKGSNYPIMNLHERVLSVLACKYVHEVVIGAPYSVTAELMDHFGVKVVCHGLTPIASDKDGADPYQVPKERGCFKTINSGNTMTTEDIVQRIIRHRLEFEERNSKKEQKEIAVMKTIQKKHLKQNGNCENVKGYGELIE-