Monarch geneset OGS2.0

DPOGS204524
TranscriptDPOGS204524-TA927 bp
ProteinDPOGS204524-PA308 aa
Genomic positionDPSCF300297 - 414637-416011
RNAseq coverage100x (Rank: top 61%)
Annotation
HeliconiusHMEL0087324e-16585.06% 
BombyxBGIBMGA004327-TA5e-15782.47% 
DrosophilaAats-phe-PA3e-11562.18% 
EBI UniRef50UniRef50_B4MNQ63e-11362.82%GK19627 n=14 Tax=Coelomata RepID=B4MNQ6_DROWI
NCBI RefSeqXP_001848330.15e-11763.43%phenylalanyl-tRNA synthetase, mitochondrial [Culex quinquefasciatus]
NCBI nr blastpgi|1700411369e-11663.43%phenylalanyl-tRNA synthetase, mitochondrial [Culex quinquefasciatus]
NCBI nr blastxgi|1700411369e-11463.43%phenylalanyl-tRNA synthetase, mitochondrial [Culex quinquefasciatus]
Group
Gene OntologyGO:00064324.4e-108phenylalanyl-tRNA aminoacylation
GO:00055244.4e-108ATP binding
GO:00057374.4e-108cytoplasm
GO:00048264.4e-108phenylalanine-tRNA ligase activity
GO:00001664.4e-108nucleotide binding
GO:00000492.7e-29tRNA binding
GO:00430392.7e-29tRNA aminoacylation
GO:00048122.7e-29aminoacyl-tRNA ligase activity
GO:00002871.4e-26magnesium ion binding
GO:00080331.4e-26tRNA processing
KEGG pathwaycqu:CpipJ_CPIJ0064571e-116 
 K01889 (FARSA, pheS)maps-> Aminoacyl-tRNA biosynthesis
InterPro domain[2-306] IPR0045304.4e-108Phenylalanyl-tRNA synthetase, class IIc, mitochondrial
[95-197] IPR0023192.7e-29Phenylalanyl-tRNA synthetase
[212-308] IPR0051211.4e-26Phenylalanyl-tRNA synthetase, beta subunit, ferrodoxin-fold anticodon-binding
Orthology groupMCL13333 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204524-TA
ATGACGGCTCATCAAACTGAATTGCTCAGATCAGGTCTTGATAACTTCCTGATGATCGGAGATGTGTATAGAAGAGATGAAATAGATTCAACTCATTTTCCTGTTTTTCACCAAGTTGATGCTGTGAGATCTCAACTCAGAGAGCAACTTTTTGAGAATCATCCCGACCTGAACATCTTCGAACCAATATACGAACCAAACGATCCAAATGCTTTCGTAAACTCAATATCTGACCCAGGGAAACAAAGCTGTCATACCTTGGAGGCTTCAAAGTTGATGGAAACGCAGCTGAAGAATCATTTAATCGGTTTAGTCAAAGTATTGTTCGGTGAAAGTATCCAATACAGATGGGTTGAGGCGTACTTCCCATTTACTCATCCATCATGGGAATTAGAAATATATTACGAGAATAATTGGATGGAAGTTTTGGGCTGTGGTATTGTAAGGCATGAAATTATGGTCAATGCCGGATCCAATAATAGCATAGCTTATGCATTCGGACTCGGTTTGGAGAGACTGGCTATGGCGTTATACAAAATCCCAGATATAAGACTCATGTGGAGCACTGATTCCGGGTTTCTGAGCCAATTCCAGAATAAGGATGTTGATGCCAAGATAACATACAAGCCAGTCTCTGTTTACCCTCAGTGTAAAAATGATCTGTCGTTCTGGTTGCCACCAACATTAACTGTGGAAACATTTATGAACAATGATTTTTACGATCTAGTCAGAGATATAGGAGGTGACGTCATAGAACAGGTTACCTTAAAGGACAAATTTGTTCATCCGAAAACAAAGAAACATAGTTTGTGTTACAGCATCGTGTACAGACATTTAGAACGCACATTAACACAAGCAGAAGTTAATAAAATACACAAAGAAATAGAAATAGCTGCAATGAATGCATTTAATGTTGTTATTAGATAA

Protein sequence:

>DPOGS204524-PA
MTAHQTELLRSGLDNFLMIGDVYRRDEIDSTHFPVFHQVDAVRSQLREQLFENHPDLNIFEPIYEPNDPNAFVNSISDPGKQSCHTLEASKLMETQLKNHLIGLVKVLFGESIQYRWVEAYFPFTHPSWELEIYYENNWMEVLGCGIVRHEIMVNAGSNNSIAYAFGLGLERLAMALYKIPDIRLMWSTDSGFLSQFQNKDVDAKITYKPVSVYPQCKNDLSFWLPPTLTVETFMNNDFYDLVRDIGGDVIEQVTLKDKFVHPKTKKHSLCYSIVYRHLERTLTQAEVNKIHKEIEIAAMNAFNVVIR-