Monarch geneset OGS2.0

DPOGS211403
TranscriptDPOGS211403-TA1497 bp
ProteinDPOGS211403-PA498 aa
Genomic positionDPSCF300115 - 163631-165127
RNAseq coverage170x (Rank: top 51%)
Annotation
HeliconiusHMEL0098580.070.14% 
BombyxBGIBMGA010863-TA0.063.98% 
DrosophilaCG31739-PA1e-13349.71% 
EBI UniRef50UniRef50_B4KJR27e-13651.15%GI14195 n=6 Tax=Neoptera RepID=B4KJR2_DROMO
NCBI RefSeqXP_966901.23e-14650.67%PREDICTED: similar to aspartyl-tRNA synthetase [Tribolium castaneum]
NCBI nr blastpgi|1892378916e-14550.67%PREDICTED: similar to aspartyl-tRNA synthetase [Tribolium castaneum]
NCBI nr blastxgi|2700066993e-13950.29%hypothetical protein TcasGA2_TC013060 [Tribolium castaneum]
Group
Gene OntologyGO:00055249e-193ATP binding
GO:00064189e-193tRNA aminoacylation for protein translation
GO:00001669e-193nucleotide binding
GO:00048129e-193aminoacyl-tRNA ligase activity
GO:00057379e-193cytoplasm
GO:00168749e-193ligase activity
KEGG pathwayphu:Phum_PHUM1372903e-136 
 K01876 (DARS, aspS)maps-> Aminoacyl-tRNA biosynthesis
InterPro domain[1-497] IPR0045249e-193Aspartyl-tRNA synthetase, class IIb, bacterial/mitochondrial-type
[1-497] IPR0181509e-193Aminoacyl-tRNA synthetase, class II (D/K/N)-like
[61-470] IPR0043643.7e-107Aminoacyl-tRNA synthetase, class II (D/K/N)
[133-145] IPR0023123.5e-21Aspartyl/Asparaginyl-tRNA synthetase, class IIb
[1-47] IPR0160277e-08Nucleic acid-binding, OB-fold-like
[2-47] IPR0123409.2e-07Nucleic acid-binding, OB-fold
[224-287] IPR0041152.7e-06GAD domain
Orthology groupMCL11811 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211403-TA
ATGAAAGATTTACCATTAGAGACTATTGTTCAGATTGAAGGGACTGTAGTGGCACGACCGTCTACGATGGTGAATAAAGAAATGATTACTGGAAAAGTGGAAGTCGTTATTGACAAACTGAAAGTGTTGAACGAAGTAACGAAGTTGCCCTTTAATTTAAGGGATTATCAGAAACCAAAAGAACAGTTACGTCTTCAATATCGCTATATAGATTTAAGGTTTCCTGAGATGCAGTCAATTTTAAGAAAGCGGTCTGCAATGTTGCATGCTATGAGAAAGTTTCTCATAGAAGAACACAATTTCGTAGAAGTAGAAACACCAACGCTGTTCTGTCGCACACCCGGTGGAGCGAGAGAGTTTGTTGTCCCCACTCATCATTCCGGACTGTTCTATTCACTGGTTCAAAGTCCTCAGCAGTTCAAACAGATGTTGATGGCGGGTGGGGTGGACAGGTACTTCCAAGTGGCTCGTTGTTACAGAGATGAAACCACGCGGCCTGACAGACAGCCAGAGTTCACACAGCTCGATATAGAACTATCATTTACAAGTTTAGAGAATGTTCTCTCTCTGATAGAGCAATTGTTGTATGACACATATCTCATACATCTACCGAAACCACCATTCAGAAGAATTACACACAGAGAGGCATTAGAAAAATATGGAAGTGACAAACCTAACATAACGTATGACTTGTTATTGAGGGATGTTACAGAATTATTCCAAACAAATACTGACTCAAACTTTGGAGCATTTGTTCTGCCATATCCCAGTGAAGTTGGTAAACTTACAAGTAAACATAAAGAAAAAATAAAAGAGCTAAGAAAAAAATATAATGTGAAGGTCGTCTTGAATGAAAACATATCAAAAGAAGTAGGTAGTGATTTAAACAGTCAAATAACTAGTGAAGATTGTCATAACGTTCTGTGTCTTGGTGATAGGGACGATGTTTGTATGTGTTTGGGTGATTTAAGGGAAGATTTGGCAACTTTGTTAAAATCTAGAAACCTTCTATCAGTGAAAAAGAGCTCCGAGCCACTGTGGGTGGTTGATTTTCCCTTATTTAATAAAGGAGATGAGGGTTTGCAAACTTGTCATCATCCGTTCACCGCCCCACATCCTGATGACATACACTTACTACATACAGATCCACTGAAAGTGAGATCACTTGCCTATGATATAGTGTTAAATGGCAATGAAGTCGGCGGTGGATCCATAAGAATCCATAATGGGGATTTACAGGAAAGGATCTTGGCAATGTTGGATATAGATCCGCAGCCTCTATCACACTTTATAAGTGCATTAAGAAGTGGTTGTCCTCCGCACGGAGGCATTGCTTTAGGTATTGATAGACTGATGGCGATCGCTTGTGACGCCGAATCTATAAGAGAGGTCATAGCATTCCCAAAATCCCACTTAGGCAAAGATCCACTCTCAGGAGCTCCCAACCTTTTAAGTGAAGACGATAAGAAGTATTATCATATTAAGACAGAGTCATGA

Protein sequence:

>DPOGS211403-PA
MKDLPLETIVQIEGTVVARPSTMVNKEMITGKVEVVIDKLKVLNEVTKLPFNLRDYQKPKEQLRLQYRYIDLRFPEMQSILRKRSAMLHAMRKFLIEEHNFVEVETPTLFCRTPGGAREFVVPTHHSGLFYSLVQSPQQFKQMLMAGGVDRYFQVARCYRDETTRPDRQPEFTQLDIELSFTSLENVLSLIEQLLYDTYLIHLPKPPFRRITHREALEKYGSDKPNITYDLLLRDVTELFQTNTDSNFGAFVLPYPSEVGKLTSKHKEKIKELRKKYNVKVVLNENISKEVGSDLNSQITSEDCHNVLCLGDRDDVCMCLGDLREDLATLLKSRNLLSVKKSSEPLWVVDFPLFNKGDEGLQTCHHPFTAPHPDDIHLLHTDPLKVRSLAYDIVLNGNEVGGGSIRIHNGDLQERILAMLDIDPQPLSHFISALRSGCPPHGGIALGIDRLMAIACDAESIREVIAFPKSHLGKDPLSGAPNLLSEDDKKYYHIKTES-