Monarch geneset OGS2.0

DPOGS204192
TranscriptDPOGS204192-TA5145 bp
ProteinDPOGS204192-PA1714 aa
Genomic positionDPSCF300034 + 485370-509122
RNAseq coverage289x (Rank: top 38%)
Annotation
HeliconiusHMEL0169362e-4936.67% 
BombyxBGIBMGA005117-TA0.067.58% 
DrosophilaAats-glupro-PA0.054.49% 
EBI UniRef50UniRef50_Q7PRA20.052.08%AGAP002945-PA n=3 Tax=Coelomata RepID=Q7PRA2_ANOGA
NCBI RefSeqXP_001998750.10.052.31%GI23457 [Drosophila mojavensis]
NCBI nr blastpgi|1951083390.052.31%GI23457 [Drosophila mojavensis]
NCBI nr blastxgi|1700636370.054.43%bifunctional aminoacyl-tRNA synthetase [Culex quinquefasciatus]
Group
Gene OntologyGO:00064336.1e-164prolyl-tRNA aminoacylation
GO:00055246.1e-164ATP binding
GO:00057376.1e-164cytoplasm
GO:00048276.1e-164proline-tRNA ligase activity
GO:00168765.8e-56ligase activity, forming aminoacyl-tRNA and related compounds
GO:00001665.8e-56nucleotide binding
GO:00430395.8e-56tRNA aminoacylation
GO:00048123.3e-31aminoacyl-tRNA ligase activity
GO:00064184e-27tRNA aminoacylation for protein translation
GO:00064121.5e-09translation
KEGG pathwaybfo:BRAFLDRAFT_1178850.0 
 K01885 (EARS, gltX)maps-> Aminoacyl-tRNA biosynthesis
    Porphyrin and chlorophyll metabolism
 K01881 (PARS, proS)maps-> Aminoacyl-tRNA biosynthesis
InterPro domain[1-1546] IPR0009240Glutamyl/glutaminyl-tRNA synthetase, class Ib
[1223-1714] IPR0044996.1e-164Prolyl-tRNA synthetase, class IIa, archaeal-type
[475-656] IPR0200585.8e-56Glutamyl/glutaminyl-tRNA synthetase, class Ib, catalytic domain
[354-403] IPR0147296.4e-38Rossmann-like alpha/beta/alpha sandwich fold
[1488-1620] IPR0041543.3e-31Anticodon-binding
[581-657] IPR0200614e-27Glutamyl/glutaminyl-tRNA synthetase, class Ib, alpha-bundle domain
[1627-1714] IPR0160611.1e-24Prolyl-tRNA synthetase, class II, C-terminal
[783-839] IPR0090682.9e-22S15/NS1, RNA-binding
[792-844] IPR0007381.2e-21WHEP-TRS
[1262-1436] IPR0023142.3e-21Aminoacyl-tRNA synthetase, class II (G/ H/ P/ S), conserved domain
[1621-1714] IPR0174491.9e-15Prolyl-tRNA synthetase, class II
[67-169] IPR0109871.4e-09Glutathione S-transferase, C-terminal-like
[659-709] IPR0110351.5e-09Ribosomal protein L25/Gln-tRNA synthetase, anti-codon-binding domain
[659-709] IPR0200592.6e-07Glutamyl/glutaminyl-tRNA synthetase, class Ib, anti-codon binding domain
Orthology groupMCL13519 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204192-TA
ATGAAAGTTCTGTGTAATAAAACGAATCCACCGCTTGGCGGTCTACTAGCAGTAGAATTCTACAGATCTTCGGCTAAAAATGTAGATATCGTATGGGGCAATGAATCATCTATTACACTTTTAAATTCACCAAAGCCAGTGCCATATGGAACCAGCAATGATTTGATTAGAATTCTTGAAAATAATTTTAATAAATCCGTTGGACCCCTAGAAAGGGTTACGATGAATCACTGGCTGTCCCTCAGCCTAATTCTAATTGAAGAAATGCCTAAATCTCTAGAATATCTTGATAAGACTCTTGGTCCTATTACTTATTTACATGGGGAAGCTTTATCTGTTTCCGATTTAGCTGTGTTTAGTGCTTTGTATGCGTCTAATAAATTCCAAGAATTGTCAAAAACTAAAACATATAATAATATAACGAGGTGGATGAAACTCATTCAGGCCCAAGAACCAGTAACAAAAGCACTCAAAAGTATTCCAGCTGATATACTTGAAAATTTATCTAAAATATCTTCAAGATCCACACCAGAAAACAAGGAAACAGGAATGCGTCAGGAAGGCAAATTTGTTGAATTGCCACATGCTGAGATGGGCAAAGTGGTTGTGAGATTCCCTCCCGAGGCATCTGGCTACCTACACATTGGGCATGCTAAGGCAGCTCTTCTAAATCAATATTATCAGGAGGCCTTCGAAGGAAAGTTGGTAATGAGATTTGATGACACCAACCCAGCTAAGGAGAATGCTGAGTTTGAAAAGGTGATATTAGAAGATGTTGAAATGCTTGAAATAAAACCTGACATGTTTACACACACTTCGCAATACTTTGATCTGATGCTGCAATTCTGTGAGAAGCTTATTAAGGAGGGCAAAGCATTTGTTGATGACACTCCGGCAGAACAGATGAAGAATGAACGGGAACAGAGGATTGATAGCAGGAATAAAAGCAACTCTATTATTGATATGCAGTCTGCTAATGGATGTTTGCGAGATCCAACTATATACAGATGTAAGCCAGAACCACATCCGAGAACAGGAACACAATATAAAGTGTATCCAACATATGACTTCGCTTGTCCTATCGTCGATTCCATAGAAGGTGTAACACATGTTCTTAGAACAATGGAGTATCATGACAGGGATCCACAGTTCTATTGGTTCATAGATGCTTTGGGGCTGCGGAAACCATATATTTGGGAATACAGCAGGTTAAGTATGACAAATACGGTTTTGTCAAAAAGAAAGCTGACTTGGTTCGTTGAGCAAGGTCTTGTTGATGGATGGGACGATCCCCGCATACCGACTGTACGCGGTGTCCTCCGTAGAGGCCTGACCGTGGAGGCGTTGAGAAGGTTCATACGAGCTCAAGGCTCGAGTCGCTCCGTTGTGTTCATGGAATGGGATAAGATATGGGCCATCAACAAGAAGCTTTGGGAGGAGATGAAGAAGGGTAGTGATATTGGCATCCAAAACTGTGTTAGAGCTATTATTGATATGCAGTCTGCTAATGGATGTTTGCGAGATCCAACTATATACAGATGTAAGCCAGAACCACATCCGAGAACAGGAACACAATATAAAGTGTATCCTACATATGACTTCGCTTGTCCTATCGTCGATTCCATAGAAGGTGTAACACATGTTCTTAGAACAATGGAGTATCATGACAGGGATCCACAGTTCTATTGGTTCATAGATGCTTTGGGGCTGCGGAAACCATATATTTGGGAATACAGCAGGTTAAGTATGACAAATACGGTTTTGTCAAAAAGAAAGCTGACTTGGTTCGTTGAGCAAGGTCTTGTTGATGGATGGGACGATCCCCGCATACCGACTGTACGCGGTGTCCTCCGTAGAGGCCTGACCGTGGAGGCGTTGAGAAGGTTCATACGAGCTCAAGGCTCGAGTCGCTCCGTTGTGTTCATGGAATGGGATAAGATATGGGCCATCAACAAGAAGGTCATTGACCCCGTAGCTCCTAGATTCACTGCCTTAGAATCAAAGCCGGTTCCAGTTAACCTCAAAGGCGTCACATCTGACAGTACTTTGGATGTGCCGCTACATCCCAAAAACCCGGATGTGGGCAACAAAAAAGTTTGGATATCAAAGACATTATTGATAGACCAGTGGGAAGTGCCGATGCTGGGTGAAATGGCTTTAGAGAGTGTGAAGGAAGGAGACATCATCCAGCTACAGCGACGCGGCTTCTTCCGAGTGGACGCGGCGGGGGGACGCTCGGCTCTCACTGGTCAAGTTCGCCCGCTCGTATTATTACATGTGCCAGACGGACGAGCGGAAACGCAGACGAAACCAATACAAGCAATCGCTAAGGAACCAGTCTCCTCTCCATCTTCAGGCGATGCTGAGAACTTGAATATTGAAATCACGAAGCAAGGAGACGTGGTGAGATCACTGAAAACTTCTAAAGCTGAAAAAGCCAAAATTGATGAAGCCGTAAAAACCTTGTTGGATCTTAAAGCCAAGTATAAGGAAGCTACGGGCCAGGATTGGAAGCCGGGCGCCTCACCAGCCAAGGCTAACTCTTCGCCGAGTAGTGACGTATCTGCTTTGAATTCTGAAATAACAAAGCAAGGGGATTTGGTTAGATCACTGAAAGCCTCTAAAGCCGAAAAGGGAAAAGTTGATGAAGCAGTAAAAGCATTGTTGGAGCTCAAAGCCAAGTATAAGGCAGCCACAGGCCAGGATTGGAAACCAGACGCAGCACCGGTTCAAACATCACCTGTCTCTGACGCCTCGTCTCTGAACAGCGAGATCATTAAACAAGGTGACATAGTGAGGAGTCTGAAGTCTGCAAAGGCTGAGAAGGCGAAGGTCGATGAAGCTGTCAAAGTATTACTGGACTTGAAGAACAAATATAAGGCTGCTACCGGACAGGATTGGAAGCCAGGAAAAACGCCTTTTGTAACTCCGCTATCAAACTTAAACAGCGAGATCATTAAACAAGGTGACTTAGTGAGGAGTCTAAAGTCTGCGAAGGCTGAGAAGGCGAAGGTCGATGAAGCTGTTAAATTGTTACTGGAATTAAAGAACAAATATAAGGCTGCTACCGGCCAGGATTGGAAGCCGGAAGGAACACAAGCATCATCTAACAATTCATCAGCTATGGCATCAAATGAAGCTTCGTCCTTGAACAGCGAGATCATCAAGCAGGGAGACTTAGTGAGGAGTCTGAAGTCTGCGAAGGCTGAGAAGGCGAAGGTCGATGAAGCTGTCAAAGTATTACTGGACTTGAAGAACAAATATAAGGCTGCTACCGGACAGGATTGGAAGCCGACACAAGAAGTCAAGGTCGATGATGCCAAAGTGACGGATATTTTGAATGAAATAACGTCCCAAGGAGATAAAATCCGCACTCTCAAGACAGAAAAGGCTGACAAATCTGTTATAGACGCAGAGGTTAAAAATCTTCTAAATCTTAAAGCTCAATATAAGAATTTAACTGGTAGTGAATGGACAGCAAATGCCGCTGCAAAACAAGATAATAAGAAGTCTGAGAAGAAGTCGGACTCACAGGCAAGTGCTGGAAAAGCTAATAAGGCAGATAAAAAAGAGAAGAAGCCAAAGGAAAACAAACCTAAAGAGCAGGTTAAACCGAAAGAGGAAAGTGGCTCCGGAGTTAAGAAAGTAACCCGCCTCGGTATGGAAGCGAACAAGGAGACCGACCTTCCAGAGTGGTACTCACAAGTTATAACTAAGTCGGAAATGATAGACTACTACGACATATCCGGTTGTTATATTCTACGTCCGTGGTCATTCAGTATATGGGAGGGTATACGGAGCTTCCTTAGCGCTCAATTCAAGAAAATGGGAGTCAAAGACGGTTATTTCCCAATATTTGTGTCGAAGGCGGCTCTAGAACGTGAGAAGACCCACATCTCGGACTTCGCCCCCGAGGTGGCATGGGTGACGCACTCGGGGTCCTCGGAGCTGGCGGAGCACGTGGCCGTGAGACCCACCTCGGAGACCGTCATGTACCCCGCGTACGCCAAGTGGATACAGAGCCACAGAGACTTACCGCTCAGGATCAACCAGTGGAACAATGTTGTGAGGTGGGAGTTCAAACAGCCTCAGCCTTTCCTTCGCACGCGTGAGTTCCTCTGGCAGGAAGGACACACGGCCTTCCGCACTAAGGAGGAAGCGGACAAGGAGGTTCTACATGTACTCGATCTGTACGCCCAAGTGTATGAAGATCTCTTGGCTATACCGGTTGTGAAGGGAAGGAAGACGGAGAAGGAGAAATTCGCGGGAGCTGATTACACCACCACGGTAGAAGCGTACATACCTGCTAGCGGCAGGGGCATACAGGGCGCTACGAGTCATCACCTCGGACAGAACTTCTCGAAGATGTTCGAAATAGTCTACGACGACCCCGACACCCAAGAGAAGAAGTTCGTGTATCAGAACTCATGGGGTGTGATGGTGTTAGTCCACGGCGACGACCGCGGCCTGGTGCTTCCTCCGAGGATAGCCAGCATTCAAGTGATAGTGGTGCCGTGTGGCATAACCGCATCCAGCACAGACGAGGAGAGGAAGTCACTCATAGACGCCTGCAAGCAACTAGCCGAAGAACTGTCTGCGTCCGGCATCAGAGCTGAAGGAGACTATAGAGATAACTACTCCCCTGGGTGGAAGTTCAATCACTGGGAACTTAAGGGTGTCCCGATCCGAGCTGAGCTGGGTCCCAAAGATCTCTCTCGTGGCGAGGTAGTGTGCGTGTCGCGCGTGTCGTCATCTCGGAGCACCCTCAAGAGGGACGGAGCCCCGGCCGCCATCGCTGATATGTTGGAACAAATACACAAAGATATGCTGGCTAAGGCTACTAAGGAACGTGATGAACGGTTCTCCATGGTGACTGAGTGGAATGACTTCACCGAGGCTTTGGAACGGAAGCATCTTTTATATGCACCATTCTGTGGAGATATTCCATGCGAGGATAACATCAAAACTGACAGCGCTCGCACTGAAGATGACCCTACCACAGAAGTGAAAGGTCCGGCGATGGGTGCTAAATCACTGTGTATACCTTTCTCTCAACCGCGGCCTTTGAAGCCCGAAGATAAATGCATACATCCGCTTTGCAATAATAAACCCCAATACATTACACTCTTTGGCAGAAGCTATTAA

Protein sequence:

>DPOGS204192-PA
MKVLCNKTNPPLGGLLAVEFYRSSAKNVDIVWGNESSITLLNSPKPVPYGTSNDLIRILENNFNKSVGPLERVTMNHWLSLSLILIEEMPKSLEYLDKTLGPITYLHGEALSVSDLAVFSALYASNKFQELSKTKTYNNITRWMKLIQAQEPVTKALKSIPADILENLSKISSRSTPENKETGMRQEGKFVELPHAEMGKVVVRFPPEASGYLHIGHAKAALLNQYYQEAFEGKLVMRFDDTNPAKENAEFEKVILEDVEMLEIKPDMFTHTSQYFDLMLQFCEKLIKEGKAFVDDTPAEQMKNEREQRIDSRNKSNSIIDMQSANGCLRDPTIYRCKPEPHPRTGTQYKVYPTYDFACPIVDSIEGVTHVLRTMEYHDRDPQFYWFIDALGLRKPYIWEYSRLSMTNTVLSKRKLTWFVEQGLVDGWDDPRIPTVRGVLRRGLTVEALRRFIRAQGSSRSVVFMEWDKIWAINKKLWEEMKKGSDIGIQNCVRAIIDMQSANGCLRDPTIYRCKPEPHPRTGTQYKVYPTYDFACPIVDSIEGVTHVLRTMEYHDRDPQFYWFIDALGLRKPYIWEYSRLSMTNTVLSKRKLTWFVEQGLVDGWDDPRIPTVRGVLRRGLTVEALRRFIRAQGSSRSVVFMEWDKIWAINKKVIDPVAPRFTALESKPVPVNLKGVTSDSTLDVPLHPKNPDVGNKKVWISKTLLIDQWEVPMLGEMALESVKEGDIIQLQRRGFFRVDAAGGRSALTGQVRPLVLLHVPDGRAETQTKPIQAIAKEPVSSPSSGDAENLNIEITKQGDVVRSLKTSKAEKAKIDEAVKTLLDLKAKYKEATGQDWKPGASPAKANSSPSSDVSALNSEITKQGDLVRSLKASKAEKGKVDEAVKALLELKAKYKAATGQDWKPDAAPVQTSPVSDASSLNSEIIKQGDIVRSLKSAKAEKAKVDEAVKVLLDLKNKYKAATGQDWKPGKTPFVTPLSNLNSEIIKQGDLVRSLKSAKAEKAKVDEAVKLLLELKNKYKAATGQDWKPEGTQASSNNSSAMASNEASSLNSEIIKQGDLVRSLKSAKAEKAKVDEAVKVLLDLKNKYKAATGQDWKPTQEVKVDDAKVTDILNEITSQGDKIRTLKTEKADKSVIDAEVKNLLNLKAQYKNLTGSEWTANAAAKQDNKKSEKKSDSQASAGKANKADKKEKKPKENKPKEQVKPKEESGSGVKKVTRLGMEANKETDLPEWYSQVITKSEMIDYYDISGCYILRPWSFSIWEGIRSFLSAQFKKMGVKDGYFPIFVSKAALEREKTHISDFAPEVAWVTHSGSSELAEHVAVRPTSETVMYPAYAKWIQSHRDLPLRINQWNNVVRWEFKQPQPFLRTREFLWQEGHTAFRTKEEADKEVLHVLDLYAQVYEDLLAIPVVKGRKTEKEKFAGADYTTTVEAYIPASGRGIQGATSHHLGQNFSKMFEIVYDDPDTQEKKFVYQNSWGVMVLVHGDDRGLVLPPRIASIQVIVVPCGITASSTDEERKSLIDACKQLAEELSASGIRAEGDYRDNYSPGWKFNHWELKGVPIRAELGPKDLSRGEVVCVSRVSSSRSTLKRDGAPAAIADMLEQIHKDMLAKATKERDERFSMVTEWNDFTEALERKHLLYAPFCGDIPCEDNIKTDSARTEDDPTTEVKGPAMGAKSLCIPFSQPRPLKPEDKCIHPLCNNKPQYITLFGRSY-