Monarch geneset OGS2.0

DPOGS210182
TranscriptDPOGS210182-TA2532 bp
ProteinDPOGS210182-PA843 aa
Genomic positionDPSCF300393 + 148006-153804
RNAseq coverage1x (Rank: top 94%)
Annotation
HeliconiusHMEL0127508e-13465.88% 
BombyxBGIBMGA014150-TA0.065.72% 
DrosophilaCG5414-PB0.043.25% 
EBI UniRef50UniRef50_UPI00017922710.044.20%UPI0001792271 related cluster n=1 Tax=unknown RepID=UPI0001792271
NCBI RefSeqXP_001949755.10.044.20%PREDICTED: similar to CG5414 CG5414-PA [Acyrthosiphon pisum]
NCBI nr blastpgi|1936367170.044.20%PREDICTED: isoleucyl-tRNA synthetase, mitochondrial-like [Acyrthosiphon pisum]
NCBI nr blastxgi|1936367170.043.95%PREDICTED: isoleucyl-tRNA synthetase, mitochondrial-like [Acyrthosiphon pisum]
Group
Gene OntologyGO:00048226e-286isoleucine-tRNA ligase activity
GO:00055241.5e-204ATP binding
GO:00064281.5e-204isoleucyl-tRNA aminoacylation
GO:00001661.5e-204nucleotide binding
GO:00057371.5e-204cytoplasm
GO:00064181.2e-134tRNA aminoacylation for protein translation
GO:00048121.2e-134aminoacyl-tRNA ligase activity
KEGG pathwayapi:1001667260.0 
 K01870 (IARS, ileS)maps-> Aminoacyl-tRNA biosynthesis
    Valine, leucine and isoleucine biosynthesis
InterPro domain[1-842] IPR0235856e-286Isoleucyl-tRNA synthetase, type 1
[1-764] IPR0023011.5e-204Isoleucyl-tRNA synthetase
[1-574] IPR0023001.2e-134Aminoacyl-tRNA synthetase, class Ia
[337-572] IPR0147294.7e-96Rossmann-like alpha/beta/alpha sandwich fold
[143-336] IPR0090086.7e-38Valyl/Leucyl/Isoleucyl-tRNA synthetase, class Ia, editing domain
[587-841] IPR0090803.9e-35Aminoacyl-tRNA synthetase, class 1a, anticodon-binding
[619-744] IPR0131553.3e-21Valyl/Leucyl/Isoleucyl-tRNA synthetase, class I, anticodon-binding
Orthology groupMCL11857 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210182-TA
ATGGGGCATGCTGTTAACAAGATCATTAAGGATATAAACAACAGAAGTCAGATTCTTCAAGGCAATAAGGTCCATTATGTACCAGGCTGGGACTGCCATGGACTTCCAATAGAGTTTAAAGCTCTCCAGAAAACAAAAAGCAAAAATGAACCCACCGATCCAGTACAAACAAGACAAATAGCAAGAAGTTTTGCACTCGAAACAGTTAAGAGCCAAAAAGAAGCATTCGAAAGCTGGGGTATAATGGCCGATTGGGAAAAACAATGTTATTTAACACTCGATAAAAACTATGTCCAGAGCCAGCTGAGATTGTTCTATAAAATGTACAAATCTGGCTTAATTTACCAAGCCCTTAAACCGGTTTATTGGTCACCATCTTCAAGGACAGCATTAGCTGAAGCGGAATTGGAATATGATCCTAACTTTAAAAGCAAAGAGGTTTATTTTAAATTTCCCATGGAGAAAGTTCCTGATTTGGTAAAAAATGTCTGTTCTGACCAACAAATATTTGCCCTCATCTGGACAACAACTCCTTGGACATTGGTTGCTAACAAAGCGATAGCTTACAACCCCAGCATGGTGTACTGTGTCGCCAAAATGAGTAAAAGATCTGAGCTATTCCTTATTGCTAAAGATCAGATTCAAGAGCTAGAGAGAGTATTGGACTGTGGAATATCCATTGTGGTGGAGTTTGAGGGTCAACATTTATCATCCACAACTTATAACGGTCTGTCACACACTATGCCGCTAATACCAGGACCACACGTGACCAGTGGCAAGGGCACCGGCTTAGTTCATACAGCACCCGCACATGGACCGGATGACTTTGTCGTTGCTCTCAATAATAATATTACTGTGGAGTGCAATGTGGACGAACACGGCCGCTATATAAACCTCGGTCCCGATTTGGACGGGCTATACGTGCTGCAGGAAGGCCAGGAAACTGTGATGAAGAAGCTTCGAGACTCGATTATATATGAGGGTATATTCACCCATTCCTATCCATTGGATTGGAGAACTAAAAAACCGGTTATCCTAAGAGCCAGCCATCAGTGGTTCATAGACACGAACGCGTTGAAACAAACGGCCCTGGGAGCTCTAGATAAAGTGGCTATCCTACCACCATCCACGGCGGACCAGTCTAGACAGGGCTTCCGCGCTCAATTGGAGAAGAGGCCCTACTGGTGTATATCGAGACAGAGAGCCTGGGGCGTGCCCATACCTGCTCTATATAGGGGTAACGAGATCATTGTTGATGAGGAAATCATAGAGAATATCTGTTCTCTCATAGACAAGGACGGCACAGACGTTTGGTGGACTTGCGATGTGAAGGATCTCATACCGAAGAAGATTTCTGAGAAATTTAATTGTGAAGAAATTACCAAAGGAAAGGATATAATGGACATCTGGCTGGACTCTGGTCTTTCCTGGCACACCCTGGATCGTAAGGCTCACCTGTATTCTGAAGGTGTTGACCAACTCACCGGTTGGTTCCAAGCTTCCTTGCTAACATCTCTCGCTCTGAATGGTGAAGCGCCTTACGAATCTATATTCGTACACGGCTTCGTAGTTGACGACAAGAAACGTAAAATGTCTAAATCCATAGGCAATGTCATTGACCCGAAAACCATAATATTCGGTGACAAGAAGAACGCCGCTTACGGTGTCGACACCTTGAGGTGGTGGGTCGCGAGTCACTCCACTCAACATTCCCAAATAGTCATCAGCAAGAAACTTCTAGAGGACTGTCAGAACGAAGTGATAAGGATACGAAACATAATGAAATACCTGCTCGGCGTGATCAGCGATTTAGAGAAGACGGATTTCTACAAGAATCCAACATTAAATTTCTTCGACCGATACATGGTCACGGAATGTCATAGTTTTGTGAACGAAACTAATCACCATTACGATAATTTTAGATACAATCATGTGGCGCAGAATGTATTATATTTTATAAGTAATAAGGTGTCCGGGTTGTATTGTCACTGTATTAAAGACAGGTTGTACTGTTCAATGAGAAATTCTAAAGAGAGACTCGCCGCTCAGCTTGTGATACATACGATTCTGGTCTCTCTGTGTAAGGGTTTAGGGCCAATTTTGCCTCATCTGATCGAAGAAGTGTGGCAGTATCATCCGTTGTATGATGAACCGTTTTATTTCACCAAAGATCTGCCAGTCTTGAAGCCGTCTGATGTTGATTCGTCGTTAATGGAGGCCATATTGGATATCAAAAGAAACGTTATACTAAAAACTAAAAATGAACATTTGAAGAAATTCGAACTAAATTTAACAATAAATTCAGAGTTATATAATAAATTAGATGATTTAAACCACACAGATGGCATCAACGATAGTGTGTTATGTGAAATTCTAGAACTGTCATCTGTCAGATTGAATAATGGCGGGGAAAATATGCTAGTAGATTTGACACAGAGTAAAAAAGATCAATGTTTGAGATGCAGGAAATATAATGCGATAGATAATAGTGACAAGTGTTTGAGATGTGAAAAAGTTTTAGCTATGTATTGA

Protein sequence:

>DPOGS210182-PA
MGHAVNKIIKDINNRSQILQGNKVHYVPGWDCHGLPIEFKALQKTKSKNEPTDPVQTRQIARSFALETVKSQKEAFESWGIMADWEKQCYLTLDKNYVQSQLRLFYKMYKSGLIYQALKPVYWSPSSRTALAEAELEYDPNFKSKEVYFKFPMEKVPDLVKNVCSDQQIFALIWTTTPWTLVANKAIAYNPSMVYCVAKMSKRSELFLIAKDQIQELERVLDCGISIVVEFEGQHLSSTTYNGLSHTMPLIPGPHVTSGKGTGLVHTAPAHGPDDFVVALNNNITVECNVDEHGRYINLGPDLDGLYVLQEGQETVMKKLRDSIIYEGIFTHSYPLDWRTKKPVILRASHQWFIDTNALKQTALGALDKVAILPPSTADQSRQGFRAQLEKRPYWCISRQRAWGVPIPALYRGNEIIVDEEIIENICSLIDKDGTDVWWTCDVKDLIPKKISEKFNCEEITKGKDIMDIWLDSGLSWHTLDRKAHLYSEGVDQLTGWFQASLLTSLALNGEAPYESIFVHGFVVDDKKRKMSKSIGNVIDPKTIIFGDKKNAAYGVDTLRWWVASHSTQHSQIVISKKLLEDCQNEVIRIRNIMKYLLGVISDLEKTDFYKNPTLNFFDRYMVTECHSFVNETNHHYDNFRYNHVAQNVLYFISNKVSGLYCHCIKDRLYCSMRNSKERLAAQLVIHTILVSLCKGLGPILPHLIEEVWQYHPLYDEPFYFTKDLPVLKPSDVDSSLMEAILDIKRNVILKTKNEHLKKFELNLTINSELYNKLDDLNHTDGINDSVLCEILELSSVRLNNGGENMLVDLTQSKKDQCLRCRKYNAIDNSDKCLRCEKVLAMY-