Monarch geneset OGS2.0

DPOGS202414
TranscriptDPOGS202414-TA876 bp
ProteinDPOGS202414-PA291 aa
Genomic positionDPSCF300233 + 87179-89482
RNAseq coverage136x (Rank: top 55%)
Annotation
HeliconiusHMEL0074155e-13784.76% 
BombyxBGIBMGA003440-TA9e-13784.85% 
DrosophilaCG4573-PA9e-11165.53% 
EBI UniRef50UniRef50_Q9VV591e-10865.53%CG4573 n=25 Tax=Endopterygota RepID=Q9VV59_DROME
NCBI RefSeqXP_002012241.14e-11163.45%GI16866 [Drosophila mojavensis]
NCBI nr blastpgi|1951356418e-11063.45%GI16866 [Drosophila mojavensis]
NCBI nr blastxgi|910819271e-10764.18%PREDICTED: similar to GA18268-PA [Tribolium castaneum]
Group
Gene OntologyGO:00168769e-147ligase activity, forming aminoacyl-tRNA and related compounds
GO:00055249e-147ATP binding
GO:00001669e-147nucleotide binding
GO:00430399e-147tRNA aminoacylation
GO:00057379e-147cytoplasm
GO:00064249.2e-104glutamyl-tRNA aminoacylation
GO:00048189.2e-104glutamate-tRNA ligase activity
KEGG pathwaydmo:Dmoj_GI168661e-110 
 K01885 (EARS, gltX)maps-> Aminoacyl-tRNA biosynthesis
    Porphyrin and chlorophyll metabolism
InterPro domain[26-290] IPR0009249e-147Glutamyl/glutaminyl-tRNA synthetase, class Ib
[27-290] IPR0045279.2e-104Glutamyl-tRNA synthetase, class Ib, bacterial/mitochondrial
[27-290] IPR0200581.2e-88Glutamyl/glutaminyl-tRNA synthetase, class Ib, catalytic domain
[217-266] IPR0147296.9e-47Rossmann-like alpha/beta/alpha sandwich fold
Orthology groupMCL13406 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202414-TA
ATGTTTTCACTTGCTTCAAGAGGTATTTATGTAGTGTCTAAGCGGTATTTTTATCGCAATTTGAAGTCATTAGTGCCGAGAGTTAGGTTTGCTCCCAGTCCAACAGGATATCTTCACTTAGGAGGCTTAAGAACTGCATTGTATAATTATTTATTTGCTAAATCACGAGGCGGTGTTTTTATTCTACGCATAGAAGACACAGATCAAACTAGGAAAGTGGAGGGAGCCGTCGATGCCTTAATAAGTGATTTAGAGTGGGCAGGAATTGAATGTCACGAAGGTCCTACCAGAGGTGGGACTTGTGGGCCCTATATTCAGAGTGAAAGATTGGATATTTACCAGGAGCATATAAGTAAGCTTCTAGCAAATGGCTCTGCTTATAAATGTTTTTGTACTGAGAGAAGATTGAATATTCTTCGTCGGGATGCTGTTAAGAGTCAGAGGATACCGAAGTATGATAACAAATGTCGCAATCTATCACATCAGGATATAAAAGACAAGATTAAAGCGGGGATTCCATATTGTATTAGATTTAAGCTGTCGTCAGATATACAATCGTACGAGGACCTTATATTTGGAGGTATAGCCTATGATGTATCTTTAAACGAGGGTGATCCAGTGCTTATGAAGTCTGATGGCTACCCGACCTACCACTTTGCGAATGTTGTGGATGACCATCTCATGGGAGTGTCACATGTATTGAGAGGAGTTGAATGGCAGATCTCGACCACTAAACATCTCTTGATATATAAGGCCTTCGGTTGGACCCCTCCTGAATTTGGCCATTTGCCGTTGATAGTGAATTCCGACGGAACAAAACTCAGTAAGAGACAAAATGATGTCAAAGTTGAAGACTACAGGAATAAAGGTGATTGA

Protein sequence:

>DPOGS202414-PA
MFSLASRGIYVVSKRYFYRNLKSLVPRVRFAPSPTGYLHLGGLRTALYNYLFAKSRGGVFILRIEDTDQTRKVEGAVDALISDLEWAGIECHEGPTRGGTCGPYIQSERLDIYQEHISKLLANGSAYKCFCTERRLNILRRDAVKSQRIPKYDNKCRNLSHQDIKDKIKAGIPYCIRFKLSSDIQSYEDLIFGGIAYDVSLNEGDPVLMKSDGYPTYHFANVVDDHLMGVSHVLRGVEWQISTTKHLLIYKAFGWTPPEFGHLPLIVNSDGTKLSKRQNDVKVEDYRNKGD-