Monarch geneset OGS2.0

DPOGS208716
TranscriptDPOGS208716-TA1194 bp
ProteinDPOGS208716-PA397 aa
Genomic positionDPSCF300043 - 37680-49314
RNAseq coverage3565x (Rank: top 3%)
Annotation
HeliconiusHMEL0152661e-17485.43% 
BombyxBGIBMGA003362-TA2e-15584.76% 
DrosophilaCct2-PA2e-11271.07% 
EBI UniRef50UniRef50_UPI0001792F2D4e-11162.82%UPI0001792F2D related cluster n=1 Tax=unknown RepID=UPI0001792F2D
NCBI RefSeqXP_395764.21e-12771.06%PREDICTED: similar to CTP:phosphocholine cytidylyltransferase 1 CG1049-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3800255519e-12771.38%PREDICTED: choline-phosphate cytidylyltransferase B-like [Apis florea]
NCBI nr blastxgi|3320270543e-12661.64%Choline-phosphate cytidylyltransferase B [Acromyrmex echinatior]
Group
Gene OntologyGO:00090581.8e-19biosynthetic process
GO:00038241.8e-19catalytic activity
GO:00167795.6e-19nucleotidyltransferase activity
KEGG pathwayame:4123034e-127 
 K00968 (E2.7.7.15, PCYT1)maps-> Glycerophospholipid metabolism
    Phosphonate and phosphinate metabolism
InterPro domain[81-214] IPR0147291.5e-34Rossmann-like alpha/beta/alpha sandwich fold
[81-148] IPR0048211.8e-19Cytidyltransferase-related
[85-211] IPR0048205.6e-19Cytidylyltransferase
Orthology groupMCL12589 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208716-TA
ATGTCATCATTAACCATGACGAATACCGTCACCAACGTGCACCTGAATGGTAAAGTGGAGACCAGCTTAAACGGGCATAGGAAGAACGGTTACTTGAAACACGAAAGCTCAGACAAGACATCACTTAGACACGCGGCGCCCTTCAGCACAGACCCGATAGCTGTTGCTGAACGCGAGAGATGCTCGTATGGAAGGATACCGAGGGCTCAGGCTGTGGCCGGCACCGCGCCCAGGAAGGTCAGGGTCTACTCCGACGGGATATACGACATGTTCCACCAGGGACACGCGAGACAGCTGCAACAAGCGAAGACGGTCTTCCCTAACGTATATCTCATTGTTGGAGTGTGCAACGACAAACTGACACACAGTCGTAAAGGTCGCACCGTTATGACTGAGGAGGAGAGGTACGAGGCTGTGAGACACTGCAGATATGTTGACGAGGTGGTCTGCGACGCGCCCTGGGAGTACGACGAGGCGTTTCTGGAGAAGCACAAGATAGACTTCCTGGCGCACGACGACATCCCCTACACCACCGAGGACTGCGAGGACACGTACGCCATGATCAAGGCGAAGGACATGTTCGTGGCGACGGAGCGGACGGAAGGCGTGTCCACATCAGACATAGTCGCTCGCATAGTCCGCGACTACGACATCTACGTGCGGCGGAACCTAGCCCGCGGCTACTCCGCCAAGGAGCTGAACGTGTCCTTCCTGAACGAGAAGAAGTTCCGGTTACAGAATAAAATGGACGAACTAAAGGATAAGGGGAAGAAGGTGATGACGAATATAGGCGAAAAGCGCGTGGACATCTTGACCAAGTGGGAGGAGAAGTCTCGCGAGCTGATAGACGCGTTCCTCCTGTTGTTCGGTCCGGACGGCAGGCTGTCCAGCATCTGGAACGAGTCCAAGGGCCGTCTGATGCAGGCCCTGTCCCAACCGCCCTCACCGCACTCGTCGTCACCGCCCAGCGAGAACGGAGACCATTCACACTCACCGTCACCTGAACCAGAACACAGAGGGTCGGCCTCCCCCCCACCACCTAAATCTCGGCGGTGTGACCTGTTGTGGGAGGAGCAAGCAGATGCATCGAGCTTTGACCGCGCTGAGTTCTCGGAAGAGGCCGCACAGCGCCAGCTGACGGACGACGGCTCGGACAGCGACGACGACTACCAGGACACCAGCCCTTACCTATAA

Protein sequence:

>DPOGS208716-PA
MSSLTMTNTVTNVHLNGKVETSLNGHRKNGYLKHESSDKTSLRHAAPFSTDPIAVAERERCSYGRIPRAQAVAGTAPRKVRVYSDGIYDMFHQGHARQLQQAKTVFPNVYLIVGVCNDKLTHSRKGRTVMTEEERYEAVRHCRYVDEVVCDAPWEYDEAFLEKHKIDFLAHDDIPYTTEDCEDTYAMIKAKDMFVATERTEGVSTSDIVARIVRDYDIYVRRNLARGYSAKELNVSFLNEKKFRLQNKMDELKDKGKKVMTNIGEKRVDILTKWEEKSRELIDAFLLLFGPDGRLSSIWNESKGRLMQALSQPPSPHSSSPPSENGDHSHSPSPEPEHRGSASPPPPKSRRCDLLWEEQADASSFDRAEFSEEAAQRQLTDDGSDSDDDYQDTSPYL-