Monarch geneset OGS2.0

DPOGS210384
TranscriptDPOGS210384-TA1230 bp
ProteinDPOGS210384-PA409 aa
Genomic positionDPSCF300025 + 926814-930598
RNAseq coverage162x (Rank: top 52%)
Annotation
HeliconiusHMEL0051313e-4482.29% 
BombyxBGIBMGA011601-TA9e-15564.65% 
DrosophilaCG10802-PA4e-7438.95% 
EBI UniRef50UniRef50_D6W6E42e-9747.69%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W6E4_TRICA
NCBI RefSeqXP_971062.13e-9847.69%PREDICTED: similar to Alanyl-tRNA synthetase domain containing 1 [Tribolium castaneum]
NCBI nr blastpgi|910943837e-9747.69%PREDICTED: similar to Alanyl-tRNA synthetase domain containing 1 [Tribolium castaneum]
NCBI nr blastxgi|3323758875e-9646.04%unknown [Dendroctonus ponderosae]
Group
Gene OntologyGO:00001663.6e-26nucleotide binding
GO:00168761.4e-08ligase activity, forming aminoacyl-tRNA and related compounds
GO:00055241.4e-08ATP binding
GO:00430391.4e-08tRNA aminoacylation
GO:00057371.4e-08cytoplasm
KEGG pathway 
InterPro domain[96-249] IPR0181633.6e-26Threonyl/alanyl tRNA synthetase, class II-like, putative editing domain
[195-233] IPR0129471.4e-08Threonyl/alanyl tRNA synthetase, SAD
Orthology groupMCL17896 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210384-TA
ATGGTATTTAAATGTCAGGAGGACAGTTTTCTCAGAGAGTATACTTCTAAAGTCTTGAAGTGTGAGAAAACAAATGAATCGATTGTTGAATATGGAAAAGTTTCAAAATTTGATGGATATCAAATTACGTTGGAGAATACAATACTATTTCCAGCAGGAGGCGGCCAGCCTCATGATAAAGGCTGGTTGAATGACACAGAAGTGCTACAAGTATTACGTAAAGGTGACGAAGCCCTCCACTTCACACTTGAACCTATAACAGAAGGATCGGAGGTAGTACAGAAGATTGATTGGCAAAGAAGGTTTGATCATATGCAGCAACATACAGGACAGCATCTCTTATCAGCAATTCTTGAGAAAGAACACAATCTACCAACAACGAGTTGGTGGTTGGGAGCCGAGGAATGTTTTGTGGAACTCGATTCAACGACTATCAAATATGAAAACATAAAAACTGCCGAAGAAAGATGTAACAAACTAATCAGTGATGGGATTTCTGTTAATGTAAAATTTTTCAAAGCAAATGATCCCGCTTTAAATGAGGCACATACACGAGGTCTACCTAAAGATTGCATCGATACCATCAGAGTCATATGTATTGGAGATGTTGATGAGAACATGTGTTGTGGAACTCATGTTACTAATTTATGCCAGCTGCAAATGATAAAACTCTTAGGCACGGAACCGGGAAAGAAAGGAAAGACAAATCTAAGGTTTATTGTAGGTAATAGAGTTGTTAAGACATTCCAGAAAATGTTGGATAGAGAGAAAGCTCTGACAGGTCTATTAAAAAATGAACCCAGCAAACACGAAGAGCTCGTATCAAAAATGCAGAAAAACATGAAATTAAGTAATAAGAATCTACAAAATGTATTGGCGGAGTTGGCACAGTGTCAAATAGATAACATAAAAAATAGCAACCCCAAGCCTAAGTATTTCTGTTTGTTCAAGAGAGAGAGTACGCCGGAGTTTAATAGAATTATATGTAAGGGGCTTGAATCGGATGATATATTTATATTACTAGCATCTGAAGACCCCGATAAGACGAAGGAGGGCCAAATTATTTTACAAGGTCCAGAAGTTCATTGTAATGCTTTGGGACAACAAATAATGGATACATTGAAAGGCAAGGGTGCTTTCAAGAATGGAAAGTTTCAGGGGAAGGCCGGTGATGTCAGTAACTTCAATAAATGCACCAAGATAATTGAGGAATACTTTAATAATGTATGA

Protein sequence:

>DPOGS210384-PA
MVFKCQEDSFLREYTSKVLKCEKTNESIVEYGKVSKFDGYQITLENTILFPAGGGQPHDKGWLNDTEVLQVLRKGDEALHFTLEPITEGSEVVQKIDWQRRFDHMQQHTGQHLLSAILEKEHNLPTTSWWLGAEECFVELDSTTIKYENIKTAEERCNKLISDGISVNVKFFKANDPALNEAHTRGLPKDCIDTIRVICIGDVDENMCCGTHVTNLCQLQMIKLLGTEPGKKGKTNLRFIVGNRVVKTFQKMLDREKALTGLLKNEPSKHEELVSKMQKNMKLSNKNLQNVLAELAQCQIDNIKNSNPKPKYFCLFKRESTPEFNRIICKGLESDDIFILLASEDPDKTKEGQIILQGPEVHCNALGQQIMDTLKGKGAFKNGKFQGKAGDVSNFNKCTKIIEEYFNNV-