Monarch geneset OGS2.0

DPOGS208976
TranscriptDPOGS208976-TA1032 bp
ProteinDPOGS208976-PA343 aa
Genomic positionDPSCF300009 + 1215508-1217047
RNAseq coverage63x (Rank: top 68%)
Annotation
HeliconiusHMEL0026433e-13567.95% 
Bombyx% 
DrosophilaCG1750-PA6e-6240.27% 
EBI UniRef50UniRef50_UPI00021A737A4e-6341.25%UPI00021A737A related cluster n=2 Tax=unknown RepID=UPI00021A737A
NCBI RefSeqXP_002019989.19e-6541.56%GL13743 [Drosophila persimilis]
NCBI nr blastpgi|3504018834e-6643.42%PREDICTED: methionyl-tRNA formyltransferase, mitochondrial-like [Bombus impatiens]
NCBI nr blastxgi|3504018835e-6543.42%PREDICTED: methionyl-tRNA formyltransferase, mitochondrial-like [Bombus impatiens]
Group
Gene OntologyGO:00719512.9e-70conversion of methionyl-tRNA to N-formyl-methionyl-tRNA
GO:00044792.9e-70methionyl-tRNA formyltransferase activity
GO:00090588.9e-46biosynthetic process
GO:00167428.9e-46hydroxymethyl-, formyl- and related transferase activity
GO:00038244.5e-11catalytic activity
KEGG pathwaycfa:6107638e-66 
 K00604 (MTFMT, fmt)maps-> Aminoacyl-tRNA biosynthesis
    One carbon pool by folate
InterPro domain[34-331] IPR0155184.2e-87Methionine tRNA Formyltransferase-like
[34-333] IPR0057942.9e-70Methionyl-tRNA formyltransferase
[34-228] IPR0023768.9e-46Formyl transferase, N-terminal
[228-342] IPR0110344.5e-11Formyl transferase, C-terminal-like
[227-329] IPR0057937.8e-11Formyl transferase, C-terminal
Orthology groupMCL14843 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208976-TA
ATGTCAAATATAATTTGTAACCTCTCAATCTCAATAAATAGATTTCTTTTAAAAGAAAAATATAATTATTTACGTAAAATACACTTACGATCATATTACAACGTGCTTTTCTTTGGATCAGACGTAATTGGCCTAAACTGTTTACAAGAAATCGATAAGATCAGGAAGAATGAAAACCTTATACGTCGATTGGATTTGGTCACAGCAAATAGTAGTAAAAATAAAAATGCTATTGAGAAGTATGCCGAAACCAATAATATGCGAATCTTGCATTGGCCGAACTTGAACATCAACGAAGGTGAATACGACATTGGTTTGATAGTGGCATTCGGTCATTTAATTAAGGCCGATGTACTCAAAAAATTCCCTCTAGGTATGGTGAATGTTCATCCTAGTCTGTTACCCCGCTGGCGTGGTGCTGCACCAATTATTTACACTCTAATGCATGGTGATACCGTAGGAGGTGTATCACTTATGAAAATTAAACCAGACATATTTGATGTTGGTGAAATAATAAGTCAACAAAGAGTTCCAGTATCTGATACTATAAAGTTGCCAGAACTAACAAGACAATTTTCGGATATTGGTGCTAAAATGTTGGTGAACTGCTTGAAGAACTTACCCAGAAGTCTTGAGAATGCTCAGCCTCAGAGCAATGAAGGTGTCACCTATGCCAAGAAGATAAATAAATCGATTAGCCAAGTAAGATGGATGGAAATGAGTGCCAAAGAGGTTTACAACTTGTACCGTGCAATATATGGATTATATCCATTAACAACAATGTTTAAAGATAAGCAAGTGAAGTTGTTTAATGCATTTTTAGTAAATAATGACTTGTATAGTGATAAACCCATCGGAACCCTTGAATATTGTGAATCAACAAAATCTATTAGGATATTGTGTAAAGATAAGAAATTCATAAATTTCAAATCACTTAGAATAGTCGGTAGAAAAGAAATATCAGCTGTAGATTTTTATAATGGTTATATAAAGAATATATCTGTAGATGTAAGATTATGTGTTGCCTGTTAG

Protein sequence:

>DPOGS208976-PA
MSNIICNLSISINRFLLKEKYNYLRKIHLRSYYNVLFFGSDVIGLNCLQEIDKIRKNENLIRRLDLVTANSSKNKNAIEKYAETNNMRILHWPNLNINEGEYDIGLIVAFGHLIKADVLKKFPLGMVNVHPSLLPRWRGAAPIIYTLMHGDTVGGVSLMKIKPDIFDVGEIISQQRVPVSDTIKLPELTRQFSDIGAKMLVNCLKNLPRSLENAQPQSNEGVTYAKKINKSISQVRWMEMSAKEVYNLYRAIYGLYPLTTMFKDKQVKLFNAFLVNNDLYSDKPIGTLEYCESTKSIRILCKDKKFINFKSLRIVGRKEISAVDFYNGYIKNISVDVRLCVAC-