Monarch geneset OGS2.0

DPOGS214171
TranscriptDPOGS214171-TA2904 bp
ProteinDPOGS214171-PA967 aa
Genomic positionDPSCF300014 - 233571-236474
RNAseq coverage529x (Rank: top 24%)
Annotation
HeliconiusHMEL0068150.086.66% 
BombyxBGIBMGA006216-TA0.078.87% 
DrosophilaAats-ala-PB0.067.71% 
EBI UniRef50UniRef50_P218940.082.94%Alanine--tRNA ligase, cytoplasmic n=38 Tax=Opisthokonta RepID=SYAC_BOMMO
NCBI RefSeqNP_001037452.10.082.94%alanyl-tRNA synthetase, cytoplasmic [Bombyx mori]
NCBI nr blastpgi|1129842240.082.94%alanine--tRNA ligase, cytoplasmic [Bombyx mori]
NCBI nr blastxgi|1129842240.082.94%alanine--tRNA ligase, cytoplasmic [Bombyx mori]
Group
Gene OntologyGO:00001663e-264nucleotide binding
GO:00057373e-264cytoplasm
GO:00055245e-227ATP binding
GO:00064195e-227alanyl-tRNA aminoacylation
GO:00048135e-227alanine-tRNA ligase activity
GO:00168763.5e-15ligase activity, forming aminoacyl-tRNA and related compounds
GO:00430393.5e-15tRNA aminoacylation
GO:00036762.7e-11nucleic acid binding
KEGG pathwaytca:6549310.0 
 K01872 (AARS, alaS)maps-> Aminoacyl-tRNA biosynthesis
InterPro domain[9-945] IPR0023183e-264Alanyl-tRNA synthetase, class IIc
[9-597] IPR0181645e-227Alanyl-tRNA synthetase, class IIc, N-terminal
[259-490] IPR0181629.4e-65Alanyl-tRNA synthetase, class IIc, anti-codon-binding domain
[597-762] IPR0181633.5e-40Threonyl/alanyl tRNA synthetase, class II-like, putative editing domain
[695-754] IPR0129473.5e-15Threonyl/alanyl tRNA synthetase, SAD
[890-956] IPR0031562.7e-11Phosphoesterase, DHHA1
Orthology groupMCL12078 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214171-TA
ATGAATAACAACATGAGCGGGAGTGAAATTAGAAAAACCTTCATTGATTTTTTCATTAATAAAGGGCATAAGTATGTCCATTCTTCATCCACAATTCCTTTAGATGATCCTACTTTGCTATTTGCAAATGCTGGCATGAATCAATTTAAACCCATTTTCTTAGGTTCAGTTGATCCAAACTCTGATATGGCTCAGTATGTGAGAGTTGTTAATACCCAAAAGTGTATAAGAGCAGGAGGAAAACATAATGATTTGGATGATGTTGGAAAAGATGTCTACCATCATACATTTTTTGAAATGTTGGGAAATTGGTCCTTTGGTGACTACTTTAAAAAAGAAATCTGTAGCTGGGCTTGGGAGCTTCTCACCGAAGTATATAAACTACCCGCTGATCGTTTATACATAACATACTTTGGAGGAGACAAAGCTTCTGGACTTGAACCTGATTTAGAATGTAAGAATATATGGTTGAGTTTAGGTGTTCCTGAATCACACATCCTACCAGGAAGTATGAAAGACAACTTTTGGGAAATGGGTGAAACGGGCCCTTGTGGGCCATGCTCTGAACTACATTATGATAGAATAGGAGGAAGGGATGCTGCACATCTTGTAAATATGGATGATCCTGATGTCCTTGAAATTTGGAATTTGGTATTTATTCAATTTAATAGGGAATCAGATGGATCACTTAAGTTACTACCAAAAAAACACATTGATTGTGGCTTAGGATTAGAAAGATTGGTTTCTGTGATTCAAAATAAGAAGGCTAATTATGACACTGATTTCTTTATGCCAATATTTAAGGCCATTGAAAATGGCACTAAGGTAAGACCTTATACTGGTCATGTTGGTGCAGAAGACACAGATGGTATTGATATGGCATACAGGGTATTAGCTGACCATGCCCGTACATTGACTATTGCCTTGGCAGATGGAGGTTACCCAGATAATACAGGAAGAGGGTATGTTTTACGAAGAATTTTGAGAAGAGCAGTTCGTTTCGCTTCAGAAAAACTAAATGCTAAACCAGGTTTCTTCAGCTCATTGGTAAACACTGTTGTAGAATTATTGGGTGATGTATTTCCTGAAATTAAAAAGGATCCCGAGTCTATTATTCAAATTATAAATGAGGAAGAAATTCAATTCCTAAAAACTCTTACAAGAGGCCGAAATCTACTAAACAGAACTATTGAAAAATTAGGCGATTCTAAAATTATACCAGGTGACGTTGCTTGGCGAATGTATGATACCTACGGCTTCCCTATCGATTTGACTCAGCTCATGTGTGAAGAGAAGGGCTTAAATATTGATATGGATGCTTATGAAAAGGCGAAAAAAGAATCCCAACTGTCATCACAAGGTAAAACTGCTGGACAGGAAGATTTGCTGGCTCTAGACATTCACGCCATAAGCCATTTACAAGAGACAGGTGTTAACACAACAGACGATTCACCAAAATACAATTATACACCATCCTCGACAGATAAAGATGCTGAATATAAATTTGCTCCATGTGAAGGTGTTATTACAGCGCTGCGCAGAAATAAACAATTTGTAGATGAAGTAAGTTCTGGTCAAGAATGTGGTATTATACTAGATCGCACAAATTTTTACGCTGAACAAGGAGGTCAAATATTCGACGAGGGTTACATGGTAAAGCTAGACGATGAAAGCGTAGAATTCACAGTTAAAAGCGTTCAAGTTAAGGGAGGATATGTTCTTCATATTGGAAAAGTTGAAGGAACTCTCAAAGTTGGAGATAGAGTTTCTCTCCATATCGACACTGAAAGGAGGAGATTAGTGATGAATAATCATACAGGAACACACATTCTGAACAACGTGCTGAGGAAAGTACTCGGAAATGATTCTGATCAACGTGGATCGCTCGTAATGCCGGACCGTCTGAGATTTGACTTCACAAATAAAGGACCAATGACTGTGAAACAGATAAAAGATGCGGAAGATCAAATTAAGGAAATAATAAACAAAAACAAAGTTGTATATGCTAAACATACCAGCTTAAGTGAAGCTAAAAAAATTAAGGGCTTGCGTGCTATGTTTGATGAACAGTACCCTGATCCAGTAAGAGTGGTTTCAGTTGGAATACCTGTTGAAGATTTGGAAAATAATCCCGATGGATTGTCTGGATTTGAAACTTCCGTTGAATTTTGCGGTGGTTCTCATTTGCATCAAACCGGTCATATTGGTGATTACGTTATTGTAAGCGAAGAAGGTATAGCAAAAGGAATTCGGAGAATTGTAGCATTAACAGGTCCCGAGGCTATCAAAGCCTTAAACAAAATGAGTTCTTTAGAAAACGAAGTAAATGCTATTGCAAATTTAATAAAGGAACAGGGTGAAAATGTTAATCAAAAAGAAGTTGTTAAAAATATAGTAGACCTGACTAATGACGTATCTCATGCTCAAATAGCTTACTGGAAAAAAGAGGAACTAAGAACACTGCTGAAAAATCTAAAAAAACAATTGGACGATAAGGAAAGGGCTGCAAAGGCGATGACAGCTAATATCGTTATAGAGAAGGCTAAAGAATTATGTTTAAAAGATGAAAATGCTATAGTTATTGTTGAAGAATTAAAGGCTTACAATAACACTAAGGCCCTGGATGGTGCTTTGAAACAAGTCAAACAGATGTTACCGAACACCGCTGCTATGTTTTTCTCCGTTGACGACGATTCAAATAAAATCTTCTGTCTAGCTGCTGTGCCTAAGATTTTAAATGAAAAAGGTTTATTGGCGTCTGAATGGGTCCAATCAGTCGTGCCTGTGATGGGAGGTAAAGGTGGGGGAAAGGCTGAATCCGCTCAGGCTTCTGGAAACAGTCCTAATAAACTACAGGATGCAATTAAAATTGGTCGTGATTTTGCTAATTCCAAACTGTCGTAA

Protein sequence:

>DPOGS214171-PA
MNNNMSGSEIRKTFIDFFINKGHKYVHSSSTIPLDDPTLLFANAGMNQFKPIFLGSVDPNSDMAQYVRVVNTQKCIRAGGKHNDLDDVGKDVYHHTFFEMLGNWSFGDYFKKEICSWAWELLTEVYKLPADRLYITYFGGDKASGLEPDLECKNIWLSLGVPESHILPGSMKDNFWEMGETGPCGPCSELHYDRIGGRDAAHLVNMDDPDVLEIWNLVFIQFNRESDGSLKLLPKKHIDCGLGLERLVSVIQNKKANYDTDFFMPIFKAIENGTKVRPYTGHVGAEDTDGIDMAYRVLADHARTLTIALADGGYPDNTGRGYVLRRILRRAVRFASEKLNAKPGFFSSLVNTVVELLGDVFPEIKKDPESIIQIINEEEIQFLKTLTRGRNLLNRTIEKLGDSKIIPGDVAWRMYDTYGFPIDLTQLMCEEKGLNIDMDAYEKAKKESQLSSQGKTAGQEDLLALDIHAISHLQETGVNTTDDSPKYNYTPSSTDKDAEYKFAPCEGVITALRRNKQFVDEVSSGQECGIILDRTNFYAEQGGQIFDEGYMVKLDDESVEFTVKSVQVKGGYVLHIGKVEGTLKVGDRVSLHIDTERRRLVMNNHTGTHILNNVLRKVLGNDSDQRGSLVMPDRLRFDFTNKGPMTVKQIKDAEDQIKEIINKNKVVYAKHTSLSEAKKIKGLRAMFDEQYPDPVRVVSVGIPVEDLENNPDGLSGFETSVEFCGGSHLHQTGHIGDYVIVSEEGIAKGIRRIVALTGPEAIKALNKMSSLENEVNAIANLIKEQGENVNQKEVVKNIVDLTNDVSHAQIAYWKKEELRTLLKNLKKQLDDKERAAKAMTANIVIEKAKELCLKDENAIVIVEELKAYNNTKALDGALKQVKQMLPNTAAMFFSVDDDSNKIFCLAAVPKILNEKGLLASEWVQSVVPVMGGKGGGKAESAQASGNSPNKLQDAIKIGRDFANSKLS-