Monarch geneset OGS2.0

DPOGS201944
TranscriptDPOGS201944-TA1494 bp
ProteinDPOGS201944-PA497 aa
Genomic positionDPSCF300224 + 152129-189934
RNAseq coverage776x (Rank: top 17%)
Annotation
HeliconiusHMEL0022192e-11183.40% 
BombyxBGIBMGA011854-TA0.089.92% 
DrosophilaEip63E-PG0.066.16% 
EBI UniRef50UniRef50_Q7KM030.066.16%Ecdysone-induced protein 63E, isoform G n=39 Tax=Coelomata RepID=Q7KM03_DROME
NCBI RefSeqXP_624994.20.074.32%PREDICTED: similar to Ecdysone-induced protein 63E CG10579-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3071850350.074.74%Serine/threonine-protein kinase PFTAIRE-1 [Camponotus floridanus]
NCBI nr blastxgi|3071850350.074.74%Serine/threonine-protein kinase PFTAIRE-1 [Camponotus floridanus]
Group
Gene OntologyGO:00055245.2e-92ATP binding
GO:00046745.2e-92protein serine/threonine kinase activity
GO:00064685.2e-92protein phosphorylation
GO:00167726.2e-83transferase activity, transferring phosphorus-containing groups
GO:00046721.6e-67protein kinase activity
GO:00047131e-08protein tyrosine kinase activity
KEGG pathway 
InterPro domain[182-469] IPR0022905.2e-92Serine/threonine-protein kinase domain
[168-497] IPR0110096.2e-83Protein kinase-like domain
[183-469] IPR0174421.6e-67Serine/threonine-protein kinase-like domain
[182-469] IPR0206351e-08Tyrosine-protein kinase, catalytic domain
Orthology groupMCL13161 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201944-TA
ATGAGTCAGCTAACAGGCGTCGGTATTGTTAACTTCGTCAACGTGCTGCCCACGATATCTGAAGGCGTGACAATGAGAGAAAAACGCGGAGGTGCCATGAGCAAAATGCAGAAGCTCAGAAAAAGATTGTCCCTGAGCTTTGGTAGATTGTCAACGAAAGACGAATCAGAAGTTGACATCGAGTGCCGTGGAAGGCAGAACGGCGGCGGTGGCGCTCGCGGTAAACTGCCGTACAACGGTTACTCAGAAGAGTGTCTAGATAGACTCGAACCCAATGGAAATATACCAAACGATAAGGACTCCCATTATGATTGGAGCGGCGGTGGTAATGGTGAGTATCGTGTTCGGAGACAGCTCTCTGTGTCATCTGACTCCAAGCTGTTAGACGAGGGAGCCCGGGAGGACGCGAGGGTAGTCATGAGGCCTAAGCGACCGCCGAGGCCTAAGTCCGAGGCTTTCTTAGGACCACAAGAGAATTCCAATCGACGTACCAAGCGGTTTAGTGCCTTCGGGGGTGACTCTCCTTTTGGGAAGTCTGAGGCGTATATAAAATTGGAGCAACTCGGTGAGGGGTCTTATGCTACAGTCTATAAAGGATATAGCAATTTAACACAACAAGTTGTGGCCTTGAAAGAGATCAGGCTTCAGGAAGAGGAAGGCGCTCCTTTCACAGCGATAAGAGAAGCTTCCTTACTCAAAGAGTTGAAACACGCCAACATAGTCACGTTACACGACATTGTGCACACCAGGGAGACATTAACGTTTGTGTTCGAATTTGTGGACACAGATCTATCTCAGTACATGGAGCGACATCCCGGTGGACTTAACAGGCACAATGTGAGATTATTTATGTATCAATTGTTGAGAGGACTCGCGTACTGCCATCGAAGGAGAGTACTGCACAGAGATGTGAAGCCACAGAATCTCCTGATCAGTTCGAGTGGTGAACTCAAGTTGGCTGACTTCGGTTTGGCTCGAGCCAAGTCCGTACCGAGTCACACGTACTCACATGAAGTAGTCACCCTCTGGTACAGGCCGCCAGACGTATTGCTCGGTAGTACTGAGTACTCCACCTCGCTGGACATGTGGGGTGTGGGATGTATCTTCGTCGAGATGCTCTGCGGCGTGCCAACCTTCCCTGGAGTAAGAGACACTAACGACCAGCTTGACAAAATATTCAAGGTCATCGGCACTCCAACGGAAGAATCATGGTCAGGCGTGACTCGTCTGCCTGGGCTGTCTACACACGTGTCCAGGTGGGGCGCTGTGCCGTCTCGACCGCTGGCTGCTTCATTTCCGCGTCTGCGGGACGCCGGTCGCGACGCTCAGCGGCTGGCAGCGGCCTTACTCCAGCCAGATCCAGCCCGTAGACTGCCAGCACACCGCGCCCTCGCTCATGATTACTTCAATTGTCTGCCAGCACGACTCGCCAGTCTGCCGGACGAGGTGTCTATCTTCACAGTGGAGGGAGTCTGCCTTCATCCTGAGGACTAG

Protein sequence:

>DPOGS201944-PA
MSQLTGVGIVNFVNVLPTISEGVTMREKRGGAMSKMQKLRKRLSLSFGRLSTKDESEVDIECRGRQNGGGGARGKLPYNGYSEECLDRLEPNGNIPNDKDSHYDWSGGGNGEYRVRRQLSVSSDSKLLDEGAREDARVVMRPKRPPRPKSEAFLGPQENSNRRTKRFSAFGGDSPFGKSEAYIKLEQLGEGSYATVYKGYSNLTQQVVALKEIRLQEEEGAPFTAIREASLLKELKHANIVTLHDIVHTRETLTFVFEFVDTDLSQYMERHPGGLNRHNVRLFMYQLLRGLAYCHRRRVLHRDVKPQNLLISSSGELKLADFGLARAKSVPSHTYSHEVVTLWYRPPDVLLGSTEYSTSLDMWGVGCIFVEMLCGVPTFPGVRDTNDQLDKIFKVIGTPTEESWSGVTRLPGLSTHVSRWGAVPSRPLAASFPRLRDAGRDAQRLAAALLQPDPARRLPAHRALAHDYFNCLPARLASLPDEVSIFTVEGVCLHPED-