Monarch geneset OGS2.0

DPOGS208992
TranscriptDPOGS208992-TA1812 bp
ProteinDPOGS208992-PA603 aa
Genomic positionDPSCF300009 + 1902221-1905754
RNAseq coverage379x (Rank: top 32%)
Annotation
HeliconiusHMEL0167915e-15979.08% 
BombyxBGIBMGA008116-TA0.089.92% 
DrosophilaCTPsyn-PC0.070.18% 
EBI UniRef50UniRef50_Q2M1970.070.48%CTP synthase n=18 Tax=Opisthokonta RepID=PYRG_DROPS
NCBI RefSeqNP_001135804.10.075.09%CTP synthase [Nasonia vitripennis]
NCBI nr blastpgi|3071781630.076.91%DNA replication licensing factor Mcm2 [Camponotus floridanus]
NCBI nr blastxgi|3838561280.076.70%PREDICTED: CTP synthase-like [Megachile rotundata]
Group
Gene OntologyGO:00062211.6e-223pyrimidine nucleotide biosynthetic process
GO:00038831.6e-223CTP synthase activity
KEGG pathwaynvi:1001175170.0 
 K01937 (E6.3.4.2, pyrG)maps-> Pyrimidine metabolism
InterPro domain[4-546] IPR0044681.6e-223CTP synthase
[4-276] IPR0174561.8e-122CTP synthase, N-terminal
[314-545] IPR0179262.2e-48Glutamine amidotransferase type 1
Orthology groupMCL11390 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208992-TA
ATGCCTGATATGAAATATATCCTTGTTACTGGTGGCGTTATAAGTGGCGTAGGAAAAGGTGTCATCGCTAGTTCTTTTGGCACAATTCTTAAAAGTTGTGGTATTGATGTAACATCAATAAAAATTGACCCCTATATTAATATTGATGCAGGAACCTTCTCTCCTTATGAACATGGAGAAGTTTACGTACTTGATGACGGTGGCGAAGTAGATCTTGATCTTGGCAACTATGAACGTTTCTTAGACATAACACTGCACAGGGATAACAATATTACTACAGGAAAAATATACCAACAGGTTATAGAGCGCGAGCGCCGTGGTGATTATTTAGGAAAGACAGTACAAGTTATTCCACATATTACTGATGCGATACAGGAATGGGTGCAGAGGGTTGCCCATATTCCTGTTACACCAGAGAACATGCCCCCAAGGGTTTGTATTGTAGAACTTGGTGGAACAATCGGTGACATTGAAGGAATGTCCTTTGTTGAGGCATTCAGGCAATTTCAGTTTAGGGTTAAGAGAGAAAACTTTTGTTGTGCTCATGTCTCATTAATACCTATGCCAAAATCAACTGGTGAACCAAAAACAAAGCCAACACAGTCTTCAGTGAGGGAATTGAGAGGTCTTGGTTTGTCACCGGATCTCATATTATGTAGATCAGAGAAACCTATAAACCACAATGTGAAAGAGAAAATTTCTAACTTTTGTCATGTAGCCCCAGATCAGGTGATTTGTATTCATGATTTGAGTTCAGTGTATCATGTGCCACTATTGATGGAGGCTCAAGGAGTTGTCCAGTACTTGAATGAAAGGCTACAGTTGAATATTGCTATACCTAGACCTGGCAGGTTCATGCAAAAATGGCGAAATCTAGCAAAACGTGTAGACAATTTAAGGAAAGAAGTGAATATATCTCTTGTTGGCAAGTACACTAAACTAGAAGACAGTTATGCCAGTGTCACTAAAGCTCTTCAGCATGCGAGTATAGCTGCTGGGTGCAGATTGAAGTTGACATATATTGAAGCAGTTAATTTAGAAGAACAAACGAAAATCGATAATCCAGTTAGTTATCATAAGGCCTGGCAAGAGGTTTGCAAAAGCGACGGTCTTATTGTACCTGGTGGTTTTGGTCAGAGAGGCTTAGAAGGGAAAATAGAAGCATGCCGTTGGTGTCGGGAAACTCAAAAGCCGATGCTTGCTATATGTCTGGGTTTACAAGCAGCTGTGATAGAATTTGCTAGAAATGTGTTAGGTCTTAAGGGTGCTAATAGTACAGAAGTCAATCCAGACTGTGAGGACAAGCTTGTTATTGATATGCCAGAACATCACCCTGGAAACCTCGGCGGTACCATGAGGTTGGGAAAGAGAAAGACCTATTTGGAACCCAGTATAATTTCTAAACTGTACAAGAAGGAAGTGATAGAGGAGCGTCACAGGCACCGGTATGAAGTGAACCCTGAGTATATAGAAAGACTAGAGAAGGCCGGTTTAAGATTCGTTGGTCGCGACTCATCTCGGACACGCATGGAGGTAGCAGTCATCGACTCACACCCATACTATGTGGGAGTTCAGTTTCACCCTGAGTATCTGTCTAGGCCGCTATCTCCAAGTCCACCGTTCTTAGGATTTATACTCGCTTCTCTTGGAAAACTAAAGAATTATATGTCAAAGGGGTGCCGATTCAGTCCTAGGAATCAGTCAGATGTTAGTTCAGATGATGACGACATATCAGTGTCTTCGTTGAGTCTAGTTGAAGAAAAGACCATCATAGAAAATGGTCAAGCAAAAGTGTCTAATGGAGTCCATTAA

Protein sequence:

>DPOGS208992-PA
MPDMKYILVTGGVISGVGKGVIASSFGTILKSCGIDVTSIKIDPYINIDAGTFSPYEHGEVYVLDDGGEVDLDLGNYERFLDITLHRDNNITTGKIYQQVIERERRGDYLGKTVQVIPHITDAIQEWVQRVAHIPVTPENMPPRVCIVELGGTIGDIEGMSFVEAFRQFQFRVKRENFCCAHVSLIPMPKSTGEPKTKPTQSSVRELRGLGLSPDLILCRSEKPINHNVKEKISNFCHVAPDQVICIHDLSSVYHVPLLMEAQGVVQYLNERLQLNIAIPRPGRFMQKWRNLAKRVDNLRKEVNISLVGKYTKLEDSYASVTKALQHASIAAGCRLKLTYIEAVNLEEQTKIDNPVSYHKAWQEVCKSDGLIVPGGFGQRGLEGKIEACRWCRETQKPMLAICLGLQAAVIEFARNVLGLKGANSTEVNPDCEDKLVIDMPEHHPGNLGGTMRLGKRKTYLEPSIISKLYKKEVIEERHRHRYEVNPEYIERLEKAGLRFVGRDSSRTRMEVAVIDSHPYYVGVQFHPEYLSRPLSPSPPFLGFILASLGKLKNYMSKGCRFSPRNQSDVSSDDDDISVSSLSLVEEKTIIENGQAKVSNGVH-