Monarch geneset OGS2.0

DPOGS207324
TranscriptDPOGS207324-TA4377 bp
ProteinDPOGS207324-PA1458 aa
Genomic positionDPSCF300188 - 389571-403638
RNAseq coverage239x (Rank: top 43%)
Annotation
HeliconiusHMEL0088620.060.06% 
BombyxBGIBMGA008626-TA3e-9039.56% 
DrosophilaPEK-PA7e-9033.24% 
EBI UniRef50UniRef50_UPI00020646983e-10336.72%UPI0002064698 related cluster n=4 Tax=unknown RepID=UPI0002064698
NCBI RefSeqXP_001604278.13e-10341.26%PREDICTED: similar to eukaryotic translation initiation factor 2-alpha kinase 3 (pancreatic eif2-alpha kinase) [Nasonia vitripennis]
NCBI nr blastpgi|3504179101e-10639.59%PREDICTED: eukaryotic translation initiation factor 2-alpha kinase 3-like [Bombus impatiens]
NCBI nr blastxgi|3504179101e-10640.13%PREDICTED: eukaryotic translation initiation factor 2-alpha kinase 3-like [Bombus impatiens]
Group
Gene OntologyGO:00167722.8e-69transferase activity, transferring phosphorus-containing groups
GO:00055242.6e-37ATP binding
GO:00046722.6e-37protein kinase activity
GO:00064682.6e-37protein phosphorylation
GO:00046747e-34protein serine/threonine kinase activity
GO:00047135.5e-10protein tyrosine kinase activity
KEGG pathwaynvi:1001206659e-103 
 K08860 (EIF2AK)maps-> Protein processing in endoplasmic reticulum
InterPro domain[980-1435] IPR0110092.8e-69Protein kinase-like domain
[1240-1432] IPR0174422.6e-37Serine/threonine-protein kinase-like domain
[994-1439] IPR0022907e-34Serine/threonine-protein kinase domain
[994-1437] IPR0206355.5e-10Tyrosine-protein kinase, catalytic domain
[48-256] IPR0110476.3e-09Quinonprotein alcohol dehydrogenase-like
Orthology groupMCL13357 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207324-TA
ATGTCGGCTTGGTATTGGGGAGTGATTGCCTTAAAAGTGGTTTTTGCATTCACTTGTTTTGTTTCTTCACTACGTGCTGATGATACGGTGCAAAAATTGCCTTTCTGTAATCCCACAACCTCGAAAGACGCACCTAATTTCAACGACTTGCTGATTGTTAGCACTTTAGATGGAAAGATCTCAGCTTTTGCCACCGAAAATGGTATTAAGGCCTGGGATTTGGAGACTCAGCCATTACTGTCTTCTAATTTACATAATGTCGAGCTAACATCGGATGGAAAATGGGTGCGTCTAGTGCCGTCCCTCCGTGGCACTTTGTACAGTCTGAGCGGAGATTCGATTGAGCCCTTGCCATTTAGTGCGGAACAATTATTGTCTTCATCATTTAAATATTCTGATGATATGGTCATTGCTGGTGCTCGAGAGACATTATGGCTTGGCGTTGAAGCTCAATCAGGAAGTGTCATATATGAGTGCAGTTCGAGCGGATGTAATTCTGAACAACAGACAGCTGGAGCCGGCCGGGATATGATAGTGTTGAGGAGATACTCCACCACTGTGAGAGCTTTGGATCCTCGGTCTGGCAGTGAGAAATGGAACTTCAGTGTGGCTGAACACCAGTTGACTCTGAGCCGTAGGGAATGTGCTGATAATAGTAAAGCTCGTGTGGTCGCTGTAGCGGTTGCTCTGCCGGATGGTGAGGTTGTTGTCAAGGATCCGGAAACCAAAATGGCCATTTGGCAACACAAGTTAGAGGCTCCCGTAGTGAACATGTGGCGTCTTCAGGGAGGTTTATTGGAAAACTTGGATGTTTTTTTGGAAGCCTCCCAGGCTCTCGTTGAAGTCAACGCGCCCGCATATCCGTCATTGTATTTGGGAATCCATAACACACAACTCTACATTCAAGAGAGCGCCATCTATGCGCGGAAGTTGGAAACAGCTGTAGTGACGAAGCCGACACCTTGGAAGTTGAAGAAGTCCAGACCGTTGCTAACAGATGGCAGCACAGCTTTAACAGCTGTAGACGAAGATAACTCGCTATCGGTGGTTAGCATACCGGGGAATATTGGGAGTACAGAAAACAACGGCTACTTCCTCTACCTCCAAGACACGTGTGACAAGTCCTTGCAAGTTGACGAGGAGATGATGCCGGACATTGTAGCTCCCAACAGCAGCGGTGATACAATGCACCACCATGTACACGTCCACGTATACTCCTTGTGGTTCTGGTGGAAGGAGGTACTGGTGATAGCTGTTAGTTCAGCATTATTGCTGAACCTGTTAATATGGCCGAGATTTTTCCCGCCAAAACAGTTAGCCCCCGCACCAAGAGAGAAGAGAGAGTTCGTTGTCGTCAGACACACGCATTTCGAACAGAAACCGACCACTGAATATTCGGGACGATACGAAAACGACTTCACCACGTTGAAGTATCTTGGTAAAGGTGGCTTCGGGGTCGTGTTTGAAGCTAGGAATAAGATAGACCATTGTTCGTATGCCGTTAAGAGGATCACGCTGCCGAGACGTGAATCTCAGCGTGAGCGTGTACTCCGTGAGGTCCGGGCCCTCGCTAAACTGGAACACGAGCATATTGTGAGGTATTTCAACGCTTGGGTCGAGGAACCTCCTCCGCACTGGCAGGAGATGAGGGATAAGCAGCTTATGCACGACCTGGGAGGTGTGTCACTAGCGATGTCCGACGACTACACGTCCCCCACCTCGCCGCCCGCGCCGCACATGCTGTCCAAACCGTCCAAGGGCGACGTCATGCTGAACCTTCAAAAGTCCATCGATGATATGGACTACGGAAAAAAGTTGCCGGAACCAAGGAGGCTGAGATCGCAAAGCTGCAACGATTCCTTCACCGTTGAATTCGATGATGGAGCTTCGAAGCCGCATTTGACATCGGAAAGTTCTAGAAGGGGTGACGTCACAAGTACAATCTCCAAGCACGCGGCCGGTGACAACGACGACTCGTTCATAGTGTTCGCTAATAGCAACGCGGAACGTTCTGGACTGTCCGAAGCTAGACCGGATGTGTCCAATAGGATATTGGAGAAAATCAAGGAGTCTGAAGGATATAGAACTAGCAGCTTAGTAAGCCAAAACAACGGCTACTTCCTCTACCTCCAAGACACGTGTGACAAGTCCTTGCAAGTTGACGAGGAGATGATGCCGGACATTGTAGCTCCCAACAGCAGCGGTGATACAATGCACCACCATGTACACGTCCACGTATACTCCTTGTGGTTCTGGTGGAAGGAGGTACTGGTGATAGCTGTTAGTTCAGCATTATTGCTGAACCTGTTAATATGGCCGAGATTTTTCCCGCCAAAACAGTTAGCCCCCGCACCAAGAGAGAAGAGAGAGTTCGTTGTCGTCAGACACACGCATTTCGAGCAGAAACCGACCACTGAATATTCGGGACGATACGAAAACGACTTCACCACGTTGAAGTATCTTGGTAAAGAGAGCGCCATCTATGCGCGGAAGTTGGAAACAGCTGTAGTGACGAAGCCGACACCTTGGAAGTTGAAGAAGTCCAGACCGTTGTTAACAGATGGCAGCACAGCTTTGACAGCTGTAGACGAAGATAACTCGCTATCGGTGGTTAGCATACCGGGGAATATTGGGAATACAGAAAACAACGGCTACTTCCTCTACCTCCAAGACACGTGTGACAAGTCCTTGCAAGTTGACGACGAGATGATGCCGGACATTGTAGCTCCCAACAGCAGCGGTGATACAATGCACCACCATGTACACGTCCATGTATACTCCTTGTGGTTCTGGTGGAAGGAGGTACTGGTGATAGCTGTTAGTTCAGCATTATTGCTGAACCTGTTAATATGGCCGAGATTTTTCCCGCCAAAACAGTTAGCCCCCGCACCAAGAGAGAAGAGAGAGTTCGTTGTCGTCAGACACACGCATTTCGAACAGAAACCGACCACTGAATATTCGGGACGATACGAAAACGACTTCACCACGTTGAAGTATCTTGGTAAAGGTGGCTTCGGGGTCGTGTTTGAAGCTAGGAATAAGATAGACCATTGTTCGTATGCCGTTAAGAGGATCACGCTGCCGAGACGTGAATCTCAGCGTGAGCGTGTACTCCGTGAGGTCCGGGCCCTCGCTAAACTGGAACACGAGCATATTGTGAGGTATTTCAACGCTTGGGTCGAGGAACCTCCTCCGCACTGGCAGGAGATGAGGGATAAGCAGCTTATGCACGACCTGGGAGGTGTGTCACTAGCGATGTCCGACGACTACACGTCCCCCACCTCGCCGCCCGCGCCGCACATGCTGTCCAAACCGTCCAAGGGCGACGTCATGCTGAACCTTCAAAAGTCCATCGACGATATGGACTACGGAAAAAAGTTGCCGGAACCAAGGAGGCTGAGATCGCAAAGCTGCAACGATTCCTTCACCGTTGAATTCGATGATGGAGCTTCGAAGCCGCATTTGACATCGGAAAGTTCTAGAAGGGGTGACGTCACAAGTACAATCTCCAAGCACGCGGCCGGTGACAACGACGACTCGTTCATAGTGTTCGCTAATAGCAACGCGGAACGTTCTGGACTGTCCGAAGCTAGACCGGATGTGTCCAATAGGATATTGGAGAAAATCAAGGAGTCTGAAGGATATAGAACTAGCAGCTTAGTAAGCCGTTCCAGGAAAAAGAAAGGACACACACGTCATTGGTCCCTAGACATGGTGGTCAACAGCGAGCACAGCAAAATGTACCTCTACATACAGATGCAACTCTGCAAACGGGACAGTCTTCACGACTGGCTGAGGAATAACAGCACTTGGGACTCCAGGAGGGACATGGTGAAGCCGCTCTTTAGTCAGATAGTATCAGCGGTGGAGTATGTGCACCGAGCTGGTCTCATCCACAGAGACCTTAAACCCAGGAACATATTCTTCGCTCTGGATGGAAAGGTCAAGGTCGGGGATTTTGGTCTCGTGACCACAATGACCGATAACACCACAGACACACCAACCGAATTGAATGCAAACGCCCTGCACACACATAAAGTTGGCACCCATCTTTATATGTCTCCGGAACAACTCCAAGGCAGATCATACAGCTACAAGGTCGATATATTCTCACTGGGGCTGGTGTTGTTCGAAATGCTACATCCGTTTGGAACTGATACGGAGAGAATCAAATGCCTGATGAATGCGAGATGCGGCAAATATCCGCAGGACTTCGTCACCGACTACCCCAATGAGACGGAAGTGCTGAAGCTGATGCTGAGTGAAGATCCTAACCAGCGTCCGACAGCAAGAGGAGTGCGAGCTCGAGCTCCGTTATACCACTGTGCAGACGAACATGTGACACAGAAAACACATTACAGTGTCAACCTTCCATAG

Protein sequence:

>DPOGS207324-PA
MSAWYWGVIALKVVFAFTCFVSSLRADDTVQKLPFCNPTTSKDAPNFNDLLIVSTLDGKISAFATENGIKAWDLETQPLLSSNLHNVELTSDGKWVRLVPSLRGTLYSLSGDSIEPLPFSAEQLLSSSFKYSDDMVIAGARETLWLGVEAQSGSVIYECSSSGCNSEQQTAGAGRDMIVLRRYSTTVRALDPRSGSEKWNFSVAEHQLTLSRRECADNSKARVVAVAVALPDGEVVVKDPETKMAIWQHKLEAPVVNMWRLQGGLLENLDVFLEASQALVEVNAPAYPSLYLGIHNTQLYIQESAIYARKLETAVVTKPTPWKLKKSRPLLTDGSTALTAVDEDNSLSVVSIPGNIGSTENNGYFLYLQDTCDKSLQVDEEMMPDIVAPNSSGDTMHHHVHVHVYSLWFWWKEVLVIAVSSALLLNLLIWPRFFPPKQLAPAPREKREFVVVRHTHFEQKPTTEYSGRYENDFTTLKYLGKGGFGVVFEARNKIDHCSYAVKRITLPRRESQRERVLREVRALAKLEHEHIVRYFNAWVEEPPPHWQEMRDKQLMHDLGGVSLAMSDDYTSPTSPPAPHMLSKPSKGDVMLNLQKSIDDMDYGKKLPEPRRLRSQSCNDSFTVEFDDGASKPHLTSESSRRGDVTSTISKHAAGDNDDSFIVFANSNAERSGLSEARPDVSNRILEKIKESEGYRTSSLVSQNNGYFLYLQDTCDKSLQVDEEMMPDIVAPNSSGDTMHHHVHVHVYSLWFWWKEVLVIAVSSALLLNLLIWPRFFPPKQLAPAPREKREFVVVRHTHFEQKPTTEYSGRYENDFTTLKYLGKESAIYARKLETAVVTKPTPWKLKKSRPLLTDGSTALTAVDEDNSLSVVSIPGNIGNTENNGYFLYLQDTCDKSLQVDDEMMPDIVAPNSSGDTMHHHVHVHVYSLWFWWKEVLVIAVSSALLLNLLIWPRFFPPKQLAPAPREKREFVVVRHTHFEQKPTTEYSGRYENDFTTLKYLGKGGFGVVFEARNKIDHCSYAVKRITLPRRESQRERVLREVRALAKLEHEHIVRYFNAWVEEPPPHWQEMRDKQLMHDLGGVSLAMSDDYTSPTSPPAPHMLSKPSKGDVMLNLQKSIDDMDYGKKLPEPRRLRSQSCNDSFTVEFDDGASKPHLTSESSRRGDVTSTISKHAAGDNDDSFIVFANSNAERSGLSEARPDVSNRILEKIKESEGYRTSSLVSRSRKKKGHTRHWSLDMVVNSEHSKMYLYIQMQLCKRDSLHDWLRNNSTWDSRRDMVKPLFSQIVSAVEYVHRAGLIHRDLKPRNIFFALDGKVKVGDFGLVTTMTDNTTDTPTELNANALHTHKVGTHLYMSPEQLQGRSYSYKVDIFSLGLVLFEMLHPFGTDTERIKCLMNARCGKYPQDFVTDYPNETEVLKLMLSEDPNQRPTARGVRARAPLYHCADEHVTQKTHYSVNLP-