Monarch geneset OGS2.0

DPOGS203927
Transcript        DPOGS203927-TA   1590 bp
Protein           DPOGS203927-PA   529 aa
Genomic position  DPSCF300005 - 508186-511395
RNAseq coverage   267x (Rank: top 40%)
Annotation
Heliconius      HMEL013509        0.0     69.65%
Bombyx          BGIBMGA002032-TA  4e-179  62.97%
Drosophila      wnd-PB            2e-123  50.13%
EBI UniRef50    UniRef50_Q7PP29   1e-127  54.25%  AGAP006461-PA n=5 Tax=Metazoa RepID=Q7PP29_ANOGA
NCBI RefSeq     XP_316502.4       2e-128  54.25%  AGAP006461-PA [Anopheles gambiae str. PEST]
NCBI nr blastp  gi|158295878      4e-127  54.25%  AGAP006461-PA [Anopheles gambiae str. PEST]
NCBI nr blastx  gi|158295878      5e-122  54.41%  AGAP006461-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology   GO:0016772  7.8e-76  transferase activity, transferring phosphorus-containing groups
                GO:0004672  2.4e-62  protein kinase activity
                GO:0006468  2.4e-62  protein phosphorylation
                GO:0004713  1.7e-61  protein tyrosine kinase activity
                GO:0005524  4.2e-60  ATP binding
                GO:0004674  4.2e-60  protein serine/threonine kinase activity
KEGG pathway    api:100166541  2e-124
                K04422 (MAP3K13, LZK)  maps-> MAPK signaling pathway
InterPro domain  [61-340]  IPR011009  7.8e-76  Protein kinase-like domain
                 [77-311]  IPR001245  2.4e-62  Serine-threonine/tyrosine-protein kinase
                 [72-311]  IPR020635  1.7e-61  Tyrosine-protein kinase, catalytic domain
                 [72-313]  IPR002290  4.2e-60  Serine/threonine-protein kinase domain
Orthology group  MCL11331  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203927-TA
ATGGATCAATTAGCTCAATTTGAGTTTAATGCTGCTCGCTACACAATGAAGGACGATTATCAGGGCTATGGATCTCCAGCTTTGGAGCGTGATAAAAAGACTATGTTTTGGATGAGTGGTGTACTCGATTGTTTTTCTACCGTTGTATCCTTGTTCAAGACTTCTGAATTTGATGTATCTCAAGAGGATGACTGGGAAGTACCCTTCGAGGCTATTACTGATATGGTATATCTTGGATCTGGAGCACAGGGTGTGGTATTTGGTGGAAATTTAAAAGGAGAAATGGTGGCAGTGAAAAAATTAAGAGATAAAAGCGAAGCTAATATAAAACATCTGAGGAAACTAAATCATGATAATATTGTGAGGTTTAGGGGCGTGTGTACAGTGGCTCCATTCTACTGTATTGTAATGGAGTATTGTCAGTATGGGCCCCTATTTGACTTTCTCCATAGTGGAGTTTCTTTCACACCTAAGCAAATAATCAGATGGGGGAGGGATATTGCTCTTGGCATGAGCTATCTTCACACACATAAGATCATTCATCGTGATCTGAAAAGTCCTAATATATTGATTGCAGACAATCTGGTGGTCAAAGTAAGTGATTTTGGTACTAGTCGTGAATGGAATGATGTCAGTGCTATAATGAGTTTCACTGGAACTGTAGCCTGGATGGCTCCAGAAGTAATCCGTCATGAGCCCTGTTCTGAGAGGGTTGATGTTTGGTCTTATGGGGTTGTCTTGTGGGAACTTCTAACACAGGAAGTTCCTTATAAAAATCTTGAAACTCATGCTATAATGTGGGGCGTGGGAACTGATACTATAACCCTTCCAATACCAACTACTTGCCCCAGCAGTTTGCAATTGCTGATAAATCAATGCTGGAATCGTACACCTCGCAGCAGACCACCATTCAAGATAATTGCCGCTCATCTGGATATGGCGGGTGAAGATTTGTGCTCTATGGACATGGAAAGTTTTAATAACACTCAAGCTCGCTGGCGCCAAGAAGTTCATCAGGCTATGGAAAGGCTCTACGCTAAACATGACAAAACAGCACCAGATTCTGTCGCACAACGTCGTGAGCATTTGAAACACGCTCGTGACGTTCGTTATGTATACGAGCAGCAACTGTCACGAGCCAACGAGCTCTATATGGAGGTGTGTGCTGTGCGCCTTCAGTTAGAACAGCGCGAGAGGGTCATTGCTGAGCGTGAGAATGCTTTGAGCAGTTGCCGTTGCGGCATTCGCAAGACGTTCAAATATTTCCATCGTCAGACGTCGTCCTCGTCCGACAGCATGAGGAATCTTCCGGCCAACTTCGACAGCCGCCGCAGAAGGAGGAAGATCGAGGCTGAAAAGAAGAACTTATCACAAATGATGGTAAACTACTCCAAGGATGACGGACAGACTGTGACCGTCGCAGTGGACGACAGCGTGTCGTGCTTGTGCGAGGAGAATGGAAACGTTACCGTCTCCATCAAAGACACCGTCATCGAGGACAATGGAAACGTCGTTGTAACTGATAACACGCTCCACAAGGATACACTCGCACTCAACGATAACTACATGCCTGATATGGCACATGTTTAA

Protein sequence:

>DPOGS203927-PA
MDQLAQFEFNAARYTMKDDYQGYGSPALERDKKTMFWMSGVLDCFSTVVSLFKTSEFDVSQEDDWEVPFEAITDMVYLGSGAQGVVFGGNLKGEMVAVKKLRDKSEANIKHLRKLNHDNIVRFRGVCTVAPFYCIVMEYCQYGPLFDFLHSGVSFTPKQIIRWGRDIALGMSYLHTHKIIHRDLKSPNILIADNLVVKVSDFGTSREWNDVSAIMSFTGTVAWMAPEVIRHEPCSERVDVWSYGVVLWELLTQEVPYKNLETHAIMWGVGTDTITLPIPTTCPSSLQLLINQCWNRTPRSRPPFKIIAAHLDMAGEDLCSMDMESFNNTQARWRQEVHQAMERLYAKHDKTAPDSVAQRREHLKHARDVRYVYEQQLSRANELYMEVCAVRLQLEQRERVIAERENALSSCRCGIRKTFKYFHRQTSSSSDSMRNLPANFDSRRRRRKIEAEKKNLSQMMVNYSKDDGQTVTVAVDDSVSCLCEENGNVTVSIKDTVIEDNGNVVVTDNTLHKDTLALNDNYMPDMAHV-
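
Consistency check (illustrative):

The transcript and protein lengths listed above (1590 bp and 529 aa) are related by the usual coding-sequence arithmetic: 1590 / 3 = 530 codons, one of which is the stop codon. The sketch below is not part of the database record. It assumes the standard genetic code and that the transcript is a complete coding sequence beginning at the first base. The names `translate` and `expected_protein_length` are hypothetical helpers introduced only for this illustration. The code writes the stop codon as `*`, whereas the record above writes it as `-`.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: the
# record uses the standard nuclear code, which is not stated in the source).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(transcript_bp: int) -> int:
    """Residue count for a complete CDS: codon count minus the stop codon."""
    codons, remainder = divmod(transcript_bp, 3)
    if remainder:
        raise ValueError("transcript length is not a multiple of 3")
    return codons - 1


# First and last codons of DPOGS203927-TA, copied from the record above.
print(translate("ATGGATCAATTAGCTCAATTTGAG"))
print(translate("GCACATGTTTAA"))
print(expected_protein_length(1590))
```

Passing the full DPOGS203927-TA sequence to `translate` and comparing the result (with the trailing `*` replaced by `-`) against DPOGS203927-PA would extend the same check to the whole record.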