Monarch geneset OGS2.0

DPOGS200863
Transcript: DPOGS200863-TA (3147 bp)
Protein: DPOGS200863-PA (1048 aa)
Genomic position: DPSCF300071 + 376368-386161
RNAseq coverage: 620x (Rank: top 21%)
Annotation
Heliconius: HMEL012639  0.0  80.57%
Bombyx: BGIBMGA009851-TA  0.0  74.89%
Drosophila: Aats-val-PB  0.0  62.94%
EBI UniRef50: UniRef50_P49696  0.0  55.18%  Valine--tRNA ligase n=24 Tax=cellular organisms RepID=SYVC_TAKRU
NCBI RefSeq: XP_002049233.1  0.0  64.21%  GJ20865 [Drosophila virilis]
NCBI nr blastp: gi|195380978  0.0  64.21%  GJ20865 [Drosophila virilis]
NCBI nr blastx: gi|195380978  0.0  64.21%  GJ20865 [Drosophila virilis]
Group
Gene Ontology:
    GO:0005524  1.7e-291  ATP binding
    GO:0004832  1.7e-291  valine-tRNA ligase activity
    GO:0000166  1.7e-291  nucleotide binding
    GO:0006438  1.7e-291  valyl-tRNA aminoacylation
    GO:0005737  1.7e-291  cytoplasm
    GO:0006418  1.5e-192  tRNA aminoacylation for protein translation
    GO:0004812  1.5e-192  aminoacyl-tRNA ligase activity
KEGG pathway: dvi:Dvir_GJ20865  0.0
    K01873 (VARS, valS) maps-> Aminoacyl-tRNA biosynthesis
                               Valine, leucine and isoleucine biosynthesis
InterPro domain:
    [53-1049]  IPR002303  0  Valyl-tRNA synthetase
    [96-741]  IPR002300  1.5e-192  Aminoacyl-tRNA synthetase, class Ia
    [720-739]  IPR014729  7.4e-135  Rossmann-like alpha/beta/alpha sandwich fold
    [295-446]  IPR009008  1.1e-59  Valyl/Leucyl/Isoleucyl-tRNA synthetase, class Ia, editing domain
    [753-989]  IPR009080  7.7e-49  Aminoacyl-tRNA synthetase, class 1a, anticodon-binding
    [788-938]  IPR013155  4.9e-36  Valyl/Leucyl/Isoleucyl-tRNA synthetase, class I, anticodon-binding
Orthology group: MCL10098  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200863-TA
ATGACTGATAACGAATCCCAAAACGGTCCGGCTGGGGACCAGCCTCAGGAAAAAACAGCAAAACAACTTGAGAAAGAAGCGAAAAAGGCAGCGAAGCTGGAGAAACTTAAAGCAAAATTGGACAAAAAGTCTTCTGCGCCAGCTGGGCAAAAAGATAAGCCAGAGAAAAAAACCAAAGAAGTAAAAGAGAGCGCCTTGTATACAGCGAACACTGCACCAGGTGATAAGAAGGACATTTCAGTGGTTATGCCCGACTCATATAGTCCCAGGTTTGTGGAGGCGGCCTGGTATTCATGGTGGGAGAAGCAGGGTTTCTTTAAACCAGAATACGGGAGGAATTCTGTATTGGATCCGAATCCTAAGGGCAAGTTTGTGATGGTGATCCCACCACCAAATGTTACAGGAACACTGCATCTTGGACATGCCCTAACAAATTCCGTGGAGGATGCTATCACCAGGTGGCACAGGATGAATGGTCGCACTACTCTCTGGAATCCAGGCTGCGATCACGCTGGCATTGCAACCCAAGTGGTGGTGGAAAAGAAGCTATGGAGGGAGGAAAAGAAGACTCGGCATGAGCTCGGACGAGATGAGTTCATCAAAAGAGTGTGGGAGTGGAAGGATGAGAAAGGGGTCAGGATCTACGAGCAACTTCGTTCTCTCGGTTCATCCTTGGACTGGAGTCGTGTTCGGTTCACCATGGACCCTTCAATGTGTAGGGCTGTCAACGAGGCCTTCATACGGTTGCATGACAGTGGTGACATCTACCGTGCTAACAGACTAGTCAACTGGTCCTGTGCTCTGAAGAGCGCTATATCTGATATAGAGGTGGATAAAATTGAACTGCCGGGAAGAACATTCCTTGCTATACCTGGTTACGAAGCGAAGGTGGAATTTGGTGTGCTGGTGTACTTCGCGTACAGGGCTGAGGATACCGACGAAGAAATAGTGGTGGCGACCACCAGAGTCGAGACCATGCTGGGGGATGTGGCAGTAGCTGTCAACCCTAACGACATCAGATACAAACACCTCATCGGGAGGAACCTCCTCCATCCGTTCATCAAACGGAAACTTCCAGTGATAGCTGATGAATACGTTGACATGAACTTTGGAACAGGTGCTGTTAAAATAACCCCCGCCCACGATCCGAACGACTACGAGATCGGCAAGCGACACAAGCTGCCCTTCATCACAGTGTTTGATGATGAGGGAAGGATGATGGACAACTGCGGTTACTTCTCTGGTAAGAAGCGCTTCGAGGTCCGCCGCGAGATCATCCAGAGCCTCGAGCACCTCAAGCTCTACAAGGAGACCAAGGATCATGCTATGGTGGTCCCACTCTGCAGCCGCTCCAAGGATGTAGTGGAACCAATGCTGAGACCACAGTGGTACATTCGTTGCGGCAACATGGCCGCTGAGGCGATAAAGGCTGTTAAGAGCGGACAGTTGAAGATAATACCGGATGTGCACGAAAAGTTGTGGCACCACTGGATGGACAACATCCGGGACTGGTGCATCTCCAGGCAACTGTGGTGGGGACACAGGATACCAGCCTACAAGCTGCTGGTCACCGAGGCGGTCCCTCCGCCGCGCTGTCGCCGCCGCCGCACTCGCAGCGGGAAGAGGAAGCGAGCGCGCGCCCGCGCTCTGCCGCCCCGCGAGGAAGAGTACTGGGTGTCAGCGCACTCTGAGGAAGAAGCAATCGAAAAAGCATCGGCGAAACTGAACCTTCCCGTGGAGGAGATCAAACTGACGAGAGATGAAGATGTACTGGACACGTGGTTCTCATCCGGACTCTTCCCCTTCGCCATCTTCGGCTGGCCCGATAACACTGAAGATCTACAGGCTTTCTACCCTGGCACATTACTGGAGACTGGCCATGATATTTTGTTCTTCTGGGTCGCCAGGATGGTGTTCTTTGGTCAACGGCTGCTAGGAAAATTGCCATTTAAGGAAGTTTACCTTCATCCTATGGTGCGAGATGCTCACGGCCGTAAGAT
GTCAAAGTCTCTGGGGAATGTGATCGATCCAGTGGACGTGGTGAGGGGCATCACGCTGGAACAGCTTCACCAACAGCTGGCGGACTCCAACCTTGACCCTCGGGAGGTGGACCGCGCCAAGAAGGGACAGGCCCAGGATTATCCCAACGGTATACCGGAATGTGGCACGGATGCTTTAAGGTTCGCCCTGTGCGCTATGACCGCCGGCCGCGACCTCAACCTGGACATCCAGCGCGTCCAGGGCTACCGCTTCTTCTGCAACAAGCTGTGGAACGCTACCAAGTTCGCCATGATGTACTTCCCACAGGACACCCTGTATAAATGTCACAGCTCCCCGCTCTCCCCCGCCCTGTCCCCCCTGGATCTTTGGATGCTGTCCCGTGTATCTCTGGCGGTGCAGAAGGTGAACTCCGGCTTCCAGAACTACGACTTCCCATCAGCTACAACCCACTGCTACAACCTGTGGCTCTATGACTTGTGTGATGTGTACTTGGAGTATCTGAAACCGGTGTTCACTCAGGGCAGCCCTGAGGCTCAGACTGCGGCCAGGCAGACTCTATACACAACCCTAGAGCTTGGATTGAAGCTGCTATCACCATTCATGCCCTTCATTACCGAGGAACTCTACCAGAGATTGCCACGAGAAGATAAATCTTGCCCCTCTATATGTGTAGCAGATTACCCCACTGATGAAACCACAGCGTGGAGAAATGAAGAACTAGAGGCGAATGTAGAAACAGCTCTCAAAATCGTTCACCTCATTCGTTCGACTCGCTCCGAATACAATTTGACGAACAAACAGCGGACCACCGTCCACGTGCTGACCGAAATGAATGAAGTCAAGGAACTGTTCAGGAGCCTGCAGACGCTCGCCAACAGCGAGCTGAGCGACGACTCCCCGCCCATGGGCTGTTCCATACTCACGGTGTCTGATAAGATCGAAGTGCATTTGGTACTGAAGTCATTGGAAAGACTTAAAGAGAATCTGTCACAAACCATAAACAAACTTCAGCAGGCCATGACGGCCGACGACTACACCGCCAAGGTACCGGCCGATGTGCGGAAGCTCAACTCGGAGAAGCTCGCCACCTCCCAAGGAGAAATAGAGAGACTACAGTCCGCCATAGAAACGTTGAAACTTATGTAA

Protein sequence:

>DPOGS200863-PA
MTDNESQNGPAGDQPQEKTAKQLEKEAKKAAKLEKLKAKLDKKSSAPAGQKDKPEKKTKEVKESALYTANTAPGDKKDISVVMPDSYSPRFVEAAWYSWWEKQGFFKPEYGRNSVLDPNPKGKFVMVIPPPNVTGTLHLGHALTNSVEDAITRWHRMNGRTTLWNPGCDHAGIATQVVVEKKLWREEKKTRHELGRDEFIKRVWEWKDEKGVRIYEQLRSLGSSLDWSRVRFTMDPSMCRAVNEAFIRLHDSGDIYRANRLVNWSCALKSAISDIEVDKIELPGRTFLAIPGYEAKVEFGVLVYFAYRAEDTDEEIVVATTRVETMLGDVAVAVNPNDIRYKHLIGRNLLHPFIKRKLPVIADEYVDMNFGTGAVKITPAHDPNDYEIGKRHKLPFITVFDDEGRMMDNCGYFSGKKRFEVRREIIQSLEHLKLYKETKDHAMVVPLCSRSKDVVEPMLRPQWYIRCGNMAAEAIKAVKSGQLKIIPDVHEKLWHHWMDNIRDWCISRQLWWGHRIPAYKLLVTEAVPPPRCRRRRTRSGKRKRARARALPPREEEYWVSAHSEEEAIEKASAKLNLPVEEIKLTRDEDVLDTWFSSGLFPFAIFGWPDNTEDLQAFYPGTLLETGHDILFFWVARMVFFGQRLLGKLPFKEVYLHPMVRDAHGRKMSKSLGNVIDPVDVVRGITLEQLHQQLADSNLDPREVDRAKKGQAQDYPNGIPECGTDALRFALCAMTAGRDLNLDIQRVQGYRFFCNKLWNATKFAMMYFPQDTLYKCHSSPLSPALSPLDLWMLSRVSLAVQKVNSGFQNYDFPSATTHCYNLWLYDLCDVYLEYLKPVFTQGSPEAQTAARQTLYTTLELGLKLLSPFMPFITEELYQRLPREDKSCPSICVADYPTDETTAWRNEELEANVETALKIVHLIRSTRSEYNLTNKQRTTVHVLTEMNEVKELFRSLQTLANSELSDDSPPMGCSILTVSDKIEVHLVLKSLERLKENLSQTINKLQQAMTADDYTAKVPADVRKLNSEKLATSQGEIERLQSAIETLKLM-
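
The transcript and protein lengths reported above can be cross-checked: a 1048 aa protein plus one stop codon needs 3 x 1049 = 3147 nucleotides, which matches the transcript length. The following is a minimal sketch of that check and of a standard-code translation, assuming the transcript is coding sequence only and that the trailing "-" in the protein record marks the stop codon. The function names `expected_cds_length` and `translate` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def expected_cds_length(protein_length: int) -> int:
    """Nucleotides needed for a protein of this length plus one stop codon."""
    return 3 * (protein_length + 1)


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon with the standard code.

    Stop codons are written as `stop_symbol` (the record above uses "-").
    Trailing bases that do not fill a whole codon are ignored, and codons
    containing characters other than A, C, G, T are written as "X".
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


if __name__ == "__main__":
    # Length check using the values reported in the record.
    print(expected_cds_length(1048))
    # First nine codons of DPOGS200863-TA and its last three codons.
    print(translate("ATGACTGATAACGAATCCCAAAACGGT"))
    print(translate("CTTATGTAA"))
```

Applied to the full DPOGS200863-TA sequence (with line breaks removed), `translate` should give a string comparable to DPOGS200863-PA, provided the assumptions above hold.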