Monarch geneset OGS2.0

DPOGS210181
TranscriptDPOGS210181-TA2748 bp
ProteinDPOGS210181-PA915 aa
Genomic positionDPSCF300393 + 125205-131473
RNAseq coverage0x (Rank: top 98%)
Annotation
HeliconiusHMEL0127501e-17767.13% 
BombyxBGIBMGA014150-TA0.063.95% 
DrosophilaCG5414-PB0.042.34% 
EBI UniRef50UniRef50_UPI00017922710.043.77%UPI0001792271 related cluster n=1 Tax=unknown RepID=UPI0001792271
NCBI RefSeqXP_001949755.10.043.77%PREDICTED: similar to CG5414 CG5414-PA [Acyrthosiphon pisum]
NCBI nr blastpgi|1936367170.043.77%PREDICTED: isoleucyl-tRNA synthetase, mitochondrial-like [Acyrthosiphon pisum]
NCBI nr blastxgi|1936367170.043.37%PREDICTED: isoleucyl-tRNA synthetase, mitochondrial-like [Acyrthosiphon pisum]
Group
Gene OntologyGO:00055243.8e-205ATP binding
GO:00064283.8e-205isoleucyl-tRNA aminoacylation
GO:00001663.8e-205nucleotide binding
GO:00057373.8e-205cytoplasm
GO:00048223.8e-205isoleucine-tRNA ligase activity
GO:00064182.9e-141tRNA aminoacylation for protein translation
GO:00048122.9e-141aminoacyl-tRNA ligase activity
KEGG pathwayapi:1001667260.0 
 K01870 (IARS, ileS)maps-> Aminoacyl-tRNA biosynthesis
    Valine, leucine and isoleucine biosynthesis
InterPro domain[30-914] IPR0235850Isoleucyl-tRNA synthetase, type 1
[56-837] IPR0023013.8e-205Isoleucyl-tRNA synthetase
[78-654] IPR0023002.9e-141Aminoacyl-tRNA synthetase, class Ia
[435-652] IPR0147297.7e-105Rossmann-like alpha/beta/alpha sandwich fold
[241-434] IPR0090087.8e-38Valyl/Leucyl/Isoleucyl-tRNA synthetase, class Ia, editing domain
[667-913] IPR0090802.9e-30Aminoacyl-tRNA synthetase, class 1a, anticodon-binding
[699-818] IPR0131551.2e-15Valyl/Leucyl/Isoleucyl-tRNA synthetase, class I, anticodon-binding
Orthology groupMCL11857 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210181-TA
ATGTTCAAAACATACAATTTAAGAACTTTTCATAATAATTATAGAAAATCAAACATAAAATGGACCTCCTTCTTTTGTACGAGAGAGGAAAGTTCAACAAAACCAAAAACAAAAACATACTCTCACACAGTTTTGTTCCCTAAAACAAATTTTCCGGCTAGATCTAATATCAACAAAGAGAAAATTCAAAAGACTGCAAAGTTTTCTGAACTCTATGAATGGCAGCGGGAACATTTAACTGGTCCAGAATTTATTTTACATGACGGGCCACCTTATGCAAATGGAGCTTTACATATGGGGCATGCTGTTAACAAGATCATTAAGGATATAAACAACAGAAGTCAGATTCTTCAAGGCAATAAGGTCCATTATGCACCAGGCTGGGACTGCCATGGACTTCCAATAGAGTTTAAAGCTCTCCAGAAAACAAAAAGCAAAAATGAACCCACCGATCCAGTACAAACAAGACAAATAGCAAGAAGTTTTGCACTCGAAACAGTTAAGAGCCAGAAAGAAGCATTCGAAAGCTGGGGTATAATGGCCGATTGGGAAAAACAATGTTATTTAACACTCGATAAAAACTATGTCCAGAGCCAGCTGAGATTGTTCTATAAAATGTACAAATCCGGCTTAATTTACCAAGCCCTTAAACCGGTTTATTGGTCTCCATCTTCAAGGACAGCATTAGCTGAAGCGGAATTGGAATATGATCCTAACTTTAAAAGCAAAGAGGTTTATTTTAAATTTCCCATGGAGAAAGTTCCTGATTTGGTAAAAAATGTCTGTTCTGACCAACAAATTTTTGCCCTCATCTGGACAACAACTCCTTGGACATTAGTTGCTAACAAAGCGATAGCTTACAACCCCAGCATGGTGTACTGTGTTGCCAAAATGAGTAAAAGATCTGAGCTATTCCTTATTGCTAAAGATCAGATTCAAGAGCTAGAGAGAGTATTGGACTGTGGAATATCCATTGTGGTGGAGTTTGAGGGTCAACATTTATCATCCACAACTTATAACGGTCTGTCACACACTATGCCGCTAATACCAGGACCACACGTGACCAGTGGCAAGGGCACCGGCTTAGTTCATACAGCACCCGCACATGGACCGGATGACTTTGTCGTTGCTCTCAATAATAATATTACTGTGGAGTGCAATGTGGACGAACACGGCCGCTATATAAACCTCGGTCCCGATTTGGACGGGCTATACGTGCTGCAGGAAGGCCAGGAAACTGTGATGAAGAAGCTTCGAGACTCGATTATATATGAGGGTATATTCACCCATTCCTATCCATTGGATTGGAGAACTAAAAAACCGGTTATCCTAAGAGCCAGCCATCAGTGGTTCATAGACACGAACGCGTTGAAACAAACGGCCCTGGGAGCTCTAGATAAAGTGGCTATCCTACCACCATCCACGGCGGACCAGTCTAGACAGGGCTTCCGCGCTCAATTGGAGAAGAGGCCCTACTGGTGTATATCGAGACAGAGAGCCTGGGGCGTGCCCATACCTGCTCTATATAGGGACAAGGACGGCACAGACGTTTGGTGGACTTGCGATGTGAAGGATCTCATACCGAAGAAGATTTCTGAGAAATTTAATTGTGAAGAAATTACCAAAGGAAAGGATATAATGGACATCTGGCTGGACTCTGGTCTTTCCTGGCACACCCTGGATCGTAAGGCTCACCTGTATTCTGAAGGTGTTGACCAACTCACCGGTTGGTTCCAAGCTTCCTTGCTAACATCTCTCGCTCTGAATGGTGAAGCGCCTTACGAATCTATATTCGTACACGGCTTCGTAGTTGACGACAAGAAACGTAAAATGTCTAAATCCATAGGCAATGTCATTGACCCGAAAACCATAATATTCGGTGACAAGAAGAACGCCGCTTACGGTGTCGACACATTGAGGTGGTGGGTCGCGAGTCACTCCACTCAACATTCCCAAATAGTCATCAGCAAGAAACTTCTAGAGGACTGTCAGAACGAAGTGATAAGGATACGAAACATAATGAAATACCTACTAGGCGTGATCAGCGATTTAGAGAAGACGGATTTCTACAAGAATCCAACATTAAATTTCTTCGACCGATACATGGTCACGGAATGTCATAGTTTTGTGAACGAAACTAATCACCATTACGATAATTTTAGATACAATCATGTGGCGCAGAATGTGTCCGGATTGTATTGTCACTGTATTAAAGACAGGTTGTACTGTTCAATGAGAAATTCTAAAGAGAGACTCGCCGCTCAGCTTGTGATACATACGATTCTGGTCTCTCTGTGTAAGGGTTTAGGGCCAATTTTGCCTCATCTGATCGAAGAAGTGTGGCAGTATCATCCGTTGTATGATGAACCGTTTTTTTTCACCAAAGAACTGCCAGTCTTGAAGCCGTCTGATGTTGATTCGTCGTTAATGGAGGCCATATTGGATATTAAAAGAAACGTTATACTAAAAACTAAAAATGAACATTTGAAGAAATTCGAATTAAATTTAACAATAAATTCAGAGTTATATAATAAATTAGATGATTTAAACCACACAGATGGCATCAACGATAGTGTGTTATGTGAAATTCTAGAACTGTCATCTGTCAGATTGAATAATGGCGGGGAAAATATGCTCGTAGATTTGACACAGAGTAAAAAAGATCAATGTTTGAGATGCAGGAAATATAATGCGATAGATAATAGTGACAAGTGTTTGAGATGTGAAAAAGTTTTAGCTATGTATTGA

Protein sequence:

>DPOGS210181-PA
MFKTYNLRTFHNNYRKSNIKWTSFFCTREESSTKPKTKTYSHTVLFPKTNFPARSNINKEKIQKTAKFSELYEWQREHLTGPEFILHDGPPYANGALHMGHAVNKIIKDINNRSQILQGNKVHYAPGWDCHGLPIEFKALQKTKSKNEPTDPVQTRQIARSFALETVKSQKEAFESWGIMADWEKQCYLTLDKNYVQSQLRLFYKMYKSGLIYQALKPVYWSPSSRTALAEAELEYDPNFKSKEVYFKFPMEKVPDLVKNVCSDQQIFALIWTTTPWTLVANKAIAYNPSMVYCVAKMSKRSELFLIAKDQIQELERVLDCGISIVVEFEGQHLSSTTYNGLSHTMPLIPGPHVTSGKGTGLVHTAPAHGPDDFVVALNNNITVECNVDEHGRYINLGPDLDGLYVLQEGQETVMKKLRDSIIYEGIFTHSYPLDWRTKKPVILRASHQWFIDTNALKQTALGALDKVAILPPSTADQSRQGFRAQLEKRPYWCISRQRAWGVPIPALYRDKDGTDVWWTCDVKDLIPKKISEKFNCEEITKGKDIMDIWLDSGLSWHTLDRKAHLYSEGVDQLTGWFQASLLTSLALNGEAPYESIFVHGFVVDDKKRKMSKSIGNVIDPKTIIFGDKKNAAYGVDTLRWWVASHSTQHSQIVISKKLLEDCQNEVIRIRNIMKYLLGVISDLEKTDFYKNPTLNFFDRYMVTECHSFVNETNHHYDNFRYNHVAQNVSGLYCHCIKDRLYCSMRNSKERLAAQLVIHTILVSLCKGLGPILPHLIEEVWQYHPLYDEPFFFTKELPVLKPSDVDSSLMEAILDIKRNVILKTKNEHLKKFELNLTINSELYNKLDDLNHTDGINDSVLCEILELSSVRLNNGGENMLVDLTQSKKDQCLRCRKYNAIDNSDKCLRCEKVLAMY-