Monarch geneset OGS2.0

DPOGS210187
Transcript        DPOGS210187-TA  2961 bp
Protein           DPOGS210187-PA  986 aa
Genomic position  DPSCF300283 - 205377-214140
RNAseq coverage   398x (Rank: top 30%)
Annotation
Heliconius        HMEL009524        1e-129  50.20%
Bombyx            BGIBMGA003259-TA  0.0     65.71%
Drosophila        CG15100-PA        0.0     49.95%
EBI UniRef50      UniRef50_A1ZBE9   0.0     49.95%  CG15100 n=17 Tax=Bilateria RepID=A1ZBE9_DROME
NCBI RefSeq       XP_002061477.1    0.0     50.24%  GK20695 [Drosophila willistoni]
NCBI nr blastp    gi|195426788      0.0     50.24%  GK20695 [Drosophila willistoni]
NCBI nr blastx    gi|194757892      0.0     50.34%  GF13746 [Drosophila ananassae]
Group
Gene Ontology     GO:0005524  1.2e-156  ATP binding
                  GO:0000166  1.2e-156  nucleotide binding
                  GO:0006431  1.2e-156  methionyl-tRNA aminoacylation
                  GO:0004825  1.2e-156  methionine-tRNA ligase activity
                  GO:0005737  1.2e-156  cytoplasm
                  GO:0006418  3.7e-136  tRNA aminoacylation for protein translation
                  GO:0004812  3.7e-136  aminoacyl-tRNA ligase activity
KEGG pathway      dwi:Dwil_GK20695  0.0
                  K01874 (MARS, metG) maps-> Aminoacyl-tRNA biosynthesis
                                             Selenoamino acid metabolism
InterPro domain   [253-796]  IPR014758  1.2e-156  Methionyl-tRNA synthetase
                  [252-645]  IPR015413  3.7e-136  Aminoacyl-tRNA synthetase, class I (M)
                  [429-645]  IPR014729  5.3e-76   Rossmann-like alpha/beta/alpha sandwich fold
                  [638-809]  IPR009080  8.2e-36   Aminoacyl-tRNA synthetase, class 1a, anticodon-binding
                  [61-176]   IPR010987  1.1e-15   Glutathione S-transferase, C-terminal-like
                  [884-928]  IPR000738  1.1e-15   WHEP-TRS
                  [876-926]  IPR009068  4e-13     S15/NS1, RNA-binding
Orthology group   MCL11901 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210187-TA
ATGAAGATTTATACTAACGAGAACAACACGGCTACATTGAAACTGCTTATAGCGGCGAAGTTAGCTGGGAAAGACGTTGAATTGCTGAAGGGAACTCACGAAGAATCACCAGGTCCAGCAGTATTACCGCGTCTGGAAGTCCACGAGGAGCTCAGCTTCTTTAGCAGTAACGCAGCAGTTCAGTATCTATTCCCCGTACTAGATCTTTCCCAGAATGGACAATGTCTACAGATGTTAGAATGGGAAGCGACTCGTCTTTATCCCGTCGTGTCGACCGTCCTGACCTCTAAGACCGTTTCATCCGAGTTGAAGGAAGCTTTAAACACCTCGTTACTGATTGCCGACAATCTTCTGGCTAAACATCAGTACATACTTGGGGACAAACTGAGTCCTGTAGATGTATCAATATTCAGCACACTATATCCATTGTGCTGCACAGATCTCAAGGATACCTATCTTAAAGAGTACAGTCATGTTCTCCGATGGTCCGGAGATATCGGGAACTCTGAGGCCGTCCAGGAAGCTGTAAAACAGTGGGGCGGATCCCCTAACAGTCCACCATCAGCCTCATCGCTGCTGGGTACTCCACAAGTCGTCATACAAACACCGACCGGATCCCCCGATGAAGTGCCAGAGAAGCTGTCCGCCGAGGAGTTGGAAATGGCGAGAGACAACTTCCTCAACGGAATCAACAAGCTGCAGCCGCCGTTAAAGAGAGAAGGAGTCGTCCTACCGGATAAGGATCGCAGGAACGTCCTGATAACCTCCGCCCTGCCGTACGTCAACAACGTGCCTCACCTCGGCAACATTATAGGCTGCGTCCTCTCCGCGGATATATTCTCCAGGTATTGTCGTCTGTGTGGCTTCAACACGCTGTTCGTGTGCGGCACGGACGAGTACGGCACGGCCACTGAGACGAAGGCGTTGGAGGAAGGCGTCACACCGCGTCAGATATGTGATAAGTATTTCGCTATCCACGACGCTGTGTATCGCTGGTTCAACATAGACTTTGATTACTTCGGAAGGACCAGCACCGAGCAACAGACAAGGATAGCCCAGGACCTGTTCAAGAAACTGAACGCCAACGGCTTCGTCAGCAAGCAGACGGTGGAGCAGTTGTACTGTGAGAAGTGTGACAGATTCCTCGCTGACAGGTTCGTGGAGGGTACCTGTCCCCACCCCGGCTGTTTGTACGACAACGCCCGCGGGGACCAGTGCGATAAGTGCGGGAAGCTCATCAACGCTGTCGAGCTCCGCGAGGCGAGGTGCAAGGTGTGCTCCAGCTCGCCCGCCGTCAGGAACAGCGACCAGCTGTTCATAGAACTACCTCAGTTGGAGCCCTCGCTCCGTTCGTGGGCGTCGCGGGCGGAGGCCGGGTGGTCTGGTCCAGCTCGCGCCGTGCTCCGGGCCTGGATGAGGGACAAGCTGAGATGTAGGGCCGTCACTAGAGATCTCAAGTGGGGTGTCCCTGTGCCTATAACCGGCTTTGAGAATAAAGTATTCTACGTGTGGTTCGACGCACCGATCGGCTACCTCAGTATAACGGAGTGCGCGACCGGGAACTACGAGAAGTGGTGGAAACGGTCGCCGGACTACGACGTGAAGCTCTACCAGTTCATGGCCAAGGACAACGTTCCGTTCCACGTGATAATGTTCCCAGCTACGGTCATCGGGGTCAACGAGGGTCACCTGCTGGTGGACCACATCTACGCCACAGAATATCTGAACTATGAAGACACTAAGTTCTCCAAGTCCCGCGGCGTGGGCGTGTTCGGGACGGACGCTCGGGACACGGGCATACCGTCCGACGTGTGGCGCTTCTACCTGGCCATGATCAGGCCAGAGACCTCCGACTCCAGCTTCAGCTGGGCGGACCTCGCCACCAGGAACAACTCGGAGCTGCTCAACAACCTGGGTAACTTCTGCCACCGGAGCCTGAGCTTCTGTTACAGCTCGTTCTCCGCAGCCGTTCCCGACACGCAGCTCACGCCCACGGACCT
GGAGATCATAGCCGGAGTCAACAGGGACGTGGTCGCGTATGTCCAGCACCTGGAGCGAGGTCGGCTGCGGGACGCGCTCCGCCACGTGCTGCGAGTGTCCCGCGCCGGCAACCTGTACATGCAGGACACGCAGCCCTGGGCGCTGCTGAAGGGCGGGACACAGGACAGGGTGAAGGCTGCAACAACGATAGGTGTCTGTTGTGAGCTGGTGGCTCTGCTGGCAGCCCTGCTGGCCCCGTACATGCCCGACACCAGCAAACGGCTCTGCACACAACTGAACATAGACCAGAGCGAGCTAAGGATCAATCCGACGGAGCCCTGTATGGTGAGGTTCCTGGGGCCGGGACACACGATCAACAAGCCGGAGCCGCTCTTCACCAAGATAGAGCAGCAGACGGTCGACGAGTTGCGGAGGAAGTACGCCGGTACACAGGCGGACAGGCGGAAGTCGAACGGAGACTGCAAGAAGCTGAGTGCCGCTGAGTTAGAGGCGGCTATATCCGCTCAGGGTGAAAAAGTTAGAAAATTGAAATCGTCTACAAAGGACAAGGCGGTTTGGCAACCGGAAGTAGACGTACTGCTGGCTCTGAAGAAACAACTCACCCTCGCGCACACACACGCCGACCAGCAGACGGGCAGCGCGGCGGAGCTGGAGAGAGCCGTCGCTGAACAAGGCGATAAAGTGAGAAAACTGAAGGCATCGACGAAAGATAAAACCGTTTGGCAGCCGGAGGTCAATAAACTGTTGGCGCTGAAAAAACAACTCGCGGAACACACGGACAGACAGACGGGGAACCACTCCCCGGGCAGCGTGGAGCAGCTGGAGAAGGCTATAGCTGAACAGGGAGATAAGGTCAGGAAGCTGAAAGCATCCACAAAGGACAAGTCAGTTTGGCAGCCAGAGGTCAACGTACTCTTGGACCTAAAAAAACAGCTGACAGCATTACAAGCCAATAAATAA

Protein sequence:

>DPOGS210187-PA
MKIYTNENNTATLKLLIAAKLAGKDVELLKGTHEESPGPAVLPRLEVHEELSFFSSNAAVQYLFPVLDLSQNGQCLQMLEWEATRLYPVVSTVLTSKTVSSELKEALNTSLLIADNLLAKHQYILGDKLSPVDVSIFSTLYPLCCTDLKDTYLKEYSHVLRWSGDIGNSEAVQEAVKQWGGSPNSPPSASSLLGTPQVVIQTPTGSPDEVPEKLSAEELEMARDNFLNGINKLQPPLKREGVVLPDKDRRNVLITSALPYVNNVPHLGNIIGCVLSADIFSRYCRLCGFNTLFVCGTDEYGTATETKALEEGVTPRQICDKYFAIHDAVYRWFNIDFDYFGRTSTEQQTRIAQDLFKKLNANGFVSKQTVEQLYCEKCDRFLADRFVEGTCPHPGCLYDNARGDQCDKCGKLINAVELREARCKVCSSSPAVRNSDQLFIELPQLEPSLRSWASRAEAGWSGPARAVLRAWMRDKLRCRAVTRDLKWGVPVPITGFENKVFYVWFDAPIGYLSITECATGNYEKWWKRSPDYDVKLYQFMAKDNVPFHVIMFPATVIGVNEGHLLVDHIYATEYLNYEDTKFSKSRGVGVFGTDARDTGIPSDVWRFYLAMIRPETSDSSFSWADLATRNNSELLNNLGNFCHRSLSFCYSSFSAAVPDTQLTPTDLEIIAGVNRDVVAYVQHLERGRLRDALRHVLRVSRAGNLYMQDTQPWALLKGGTQDRVKAATTIGVCCELVALLAALLAPYMPDTSKRLCTQLNIDQSELRINPTEPCMVRFLGPGHTINKPEPLFTKIEQQTVDELRRKYAGTQADRRKSNGDCKKLSAAELEAAISAQGEKVRKLKSSTKDKAVWQPEVDVLLALKKQLTLAHTHADQQTGSAAELERAVAEQGDKVRKLKASTKDKTVWQPEVNKLLALKKQLAEHTDRQTGNHSPGSVEQLEKAIAEQGDKVRKLKASTKDKSVWQPEVNVLLDLKKQLTALQANK-