Monarch geneset OGS2.0

DPOGS201500
TranscriptDPOGS201500-TA3429 bp
ProteinDPOGS201500-PA1142 aa
Genomic positionDPSCF300006 + 886850-897326
RNAseq coverage236x (Rank: top 43%)
Annotation
HeliconiusHMEL0155040.066.90% 
BombyxBGIBMGA002591-TA0.064.01% 
DrosophilaPask-PA8e-7250.38% 
EBI UniRef50UniRef50_E3X9T62e-14836.28%Putative uncharacterized protein n=4 Tax=Pancrustacea RepID=E3X9T6_ANODA
NCBI RefSeqXP_001865012.12e-14737.55%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|3123736028e-14836.28%hypothetical protein AND_17229 [Anopheles darlingi]
NCBI nr blastxgi|3123736023e-14536.55%hypothetical protein AND_17229 [Anopheles darlingi]
Group
Gene OntologyGO:00167722.7e-51transferase activity, transferring phosphorus-containing groups
GO:00055241.9e-37ATP binding
GO:00046721.9e-37protein kinase activity
GO:00064681.9e-37protein phosphorylation
GO:00046742.2e-33protein serine/threonine kinase activity
GO:00047131.3e-05protein tyrosine kinase activity
KEGG pathway 
InterPro domain[806-1005] IPR0110092.7e-51Protein kinase-like domain
[820-974] IPR0174421.9e-37Serine/threonine-protein kinase-like domain
[820-1003] IPR0022902.2e-33Serine/threonine-protein kinase domain
Orthology groupMCL17481 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201500-TA
ATGGAATCTGCTAGAGTACTGAACTTAAATAGAATCGCTGCTACAGATTCATTGGGTTTTACACCCATAAAACCGAGACTACGAAGATTTGTGGATGTTTATTCTCCAGAACATCTGAATTTGACGCCTAAGCCTAAACAGAAGCTTGATTTGGTATCCAATGTTGGTAAATTGTATTCAACGCCCGATAAACCCTTAAGCATTGATAAAGAATGCAGGACAGTCAAAGCTCATCCAAATGTTTGCAATGCCTGGGGGAATTGTTCTTTTGAAGGAAATCAAAGCTTTCCAAGATTAAGTAGAAAACTAAGGCCTTTAAGAATGGATCTTGAGACTCCAACGAAAGCAAAGCCAGCTGTTGATTTAGTTAGAGTTGATGGGCCAAATGGATTGCGTTTCAATAGCTCCTTGGCAACTGGAGATGCAGCATCCACACCTACGCAGATGGACAGATTACAGCAATCTATCAACCATAGTAAAGCAGTCTTCACCATTGAACCAAACACTTCAAAGATTCTCATTGTGAACAACAAGGCTTGCTCTCTACTGGGTTACTCTTCCGGGGAATTATGTGACCTAAGATTCTCAGATCTTCTGAGGAAGAGGAATAGCAAGGCTTTTAGCATTCATGAGGCAGAAGATGGTGATATCTCAGAAGATGGTACAATGATTTTGTTAAGTGGCAAGGTTGTAGAATTGCTGACGAAGGACGGCAGTTCAGTTCAAGTGTCTCTCTGGATACGACAATTAGACAGTGATGGTCCCTGTCTGGTGGTTGCTGAGCCTATAATCTGTAAAAATGTTGTGCTGACTATAGACGGCGATGGTATAATAACATCCTGTGAGGGAGAAGATGGAGCTTTGTTGTTCCAAGCTGAATCATCGGACAAGCTGATAGGCCTCCCGGTCTCATCTTTGATACCTTCCATACATCTTCCACACGACGTTCCCATGTCTAAGAGTGTTTCAAAGCAGAAGGCCACAGGTAGGACCCTTGACGGTGGCTCATTCCCACTGTGTCTTTGGATATCAAAGGCCCATGTGGAAAACACGTGGTCGTCGTTGAAGACTAAAGACAAACCTGCTAATAAAGAGAAGCCTGTATTTATAGTGAATGTCAGGATAACATATAACGTAAGCGGACTTCTTGTTGTCGACGAAAGCGGAGTGATAACAGCATGCAACCAACACTTTGCCATGTTGACCTTCGGCAAAGCACAGTCAGAGGTGATAGGTCATCAAATAGAAGATGTCATACAGAACTTCTGTCGGGAATCCGATCTGGTGAAGATTCAAGATAGAAATAGAAATATGACTCTATCACCTGTCAACAATGATAACAATGATTCGGCTTCAGAAACCGGCGAGGACTCGTGTGGAGCGTTCAACGGAAGTCAGAAATCTGCATGCACATCCCTAAATGTACAACAGTCGATGTTATCCACAACTAGAGAGAAATCCTCTAGCGCGCTGTGCCTCGACAAATCATACAGCATGGTGACACACACACCGACGCCCACACAGGATATGGTGTCAAGCATAAGCACGACGGAGCAGAGGAATGATATATCAGCCCTCCCAGACGTCACTTCCGGCATGTCAGGAATTTCCATAGACGATGAAAACTACTGCCAAAGCAGCATATCCAAGTCGAGGTCCGAAAACATCCTTCGATCTGAACAGGCTATTATAAAGCAGTCGCCCAAAGTCAACGAGAAATCTGACTCGATTTATTACACGTCCCAACACTCCCAGGAGGTCACGCCGACGGGAAACTCATCCAGAACTAGATTAAATGACACGTCACTAAGACTATCGTTCGACTTCACCAAATGTAAAACGATCAAAGTCAAAGAGGACAAAAGCAGTGCGTCCCTAGATTTTTGTGACTCCAACGAGACGAGCGCTGATTTCTTGACCCCCATCAACGAAATGCCGCCACCGGGCTGTGAAATTGAGGATTTACCGAAACATAACGGAAATGACGAGAGCATGGACAGTTTAAGCAATGACAATGATCTGGAAACACAGACGGAATCCGCACCCAGGAGGAAGTTTGACGACGAGCCAGAGACTCCTTGTATGACGAAGCGCCTGGTCCGCGCCCACGTGACTTCGACACCGGCACAATCACGCCGCGACCTGGACGACTGGCGCCGCGCCGACGGCACATACAGCGGCATTGTGCGGCACAAGGACGGCACTGAGCTCAACGTAGTGTACACGGTGTCCAGTATGCAGCTGTCATCTCGTCGTATCGTGAGGTGTGTGTGGCTCGGAGTGAGGGCGCACGAGCCGCGACATACAACACTCGCCTCCAGTCTCGCCTCCACGCTGGCGTCAACTGCTGATAACTCACTGGTCGCGGGTAACAAGTCAGCGAGCAGCAGACATCAATCCGTGTCTCTGATGAGCCAGTGTGGCGAAGAACAAATTGCGGGGGAGTACACCAAGCATTACGTCACTCTTAAACAAATAGGTAAAGGAGCTTATGGGTGTGTCAAAATGGCGTACCGAAGATCAGATAGATTACTAGCTGTTGCAAAATTCATTTTAAAGGAAAAGGTTGGAGCTGCATTCTGGAGCGACGCTCCTGACGGGAGGAGGGTGCCGCTAGAACTAAGCCTGTTGATGACCCTCTCTCATCCTAATATAGTGTCCGTCATTGATGTGTTCGAAAACGACAAATACTTCCAGATGGTAATGGAAAAGCATGGAGCCGGTATGGACCTGTTTGAGTTTATAGAACGGCGACCGAGGATGGATGAACCGCTCGTCAGTTATATATTCAGACAGATAGGCCAGGCGGTGGAGTATCTTCATTCCCTTAACATCCTCCATCGTGACATCAAGGACGAAAACGTCATAATTGACAACAAGTTTCACGTGAAACTCATCGACTTCGGCTCAGCCACGTTCATGAACCTGGAACAGCTGTTACGTCGCATGCTCTGCAAGGAGCCAGGAGCCCGCTGCACAGTGCCACAGCTGATGGCACACCCCTGGATTCGACAACCTGTAAATCTCGCCTCATACAACTTCCAGGAGATCGTGGATTGTGATCGTCATGAAGCAAATCCCGAGATGTACTTCAGCGGCAGTCTAGAATCTCCGAGAAGTAACTCTCCCGTATCCTTAGCCGATCCTCTCGCTAAAGAGCGGTCGATGCGTTCGGAGGCGGAAGCGGCAGGAAGGTCGGAGAAGGTGTCTAAATCAGACGCGCGCCGTCCAGCACACCAGCTGTCCGACAACTACAGCCTGCGATCGTCCGCTGACATACTAGATATATCTTCGAAGCCGGTGTCGGAAGCGGCATTGACGGATATAAGTTCAGACGCGACCGGCCACGCGGCCTGCGACATCGACTACGACTGTGACCAGTACGAGTGCGACAGCTGGGATGAGTGCGAGCAGGACAGCTTCTCATAG

Protein sequence:

>DPOGS201500-PA
MESARVLNLNRIAATDSLGFTPIKPRLRRFVDVYSPEHLNLTPKPKQKLDLVSNVGKLYSTPDKPLSIDKECRTVKAHPNVCNAWGNCSFEGNQSFPRLSRKLRPLRMDLETPTKAKPAVDLVRVDGPNGLRFNSSLATGDAASTPTQMDRLQQSINHSKAVFTIEPNTSKILIVNNKACSLLGYSSGELCDLRFSDLLRKRNSKAFSIHEAEDGDISEDGTMILLSGKVVELLTKDGSSVQVSLWIRQLDSDGPCLVVAEPIICKNVVLTIDGDGIITSCEGEDGALLFQAESSDKLIGLPVSSLIPSIHLPHDVPMSKSVSKQKATGRTLDGGSFPLCLWISKAHVENTWSSLKTKDKPANKEKPVFIVNVRITYNVSGLLVVDESGVITACNQHFAMLTFGKAQSEVIGHQIEDVIQNFCRESDLVKIQDRNRNMTLSPVNNDNNDSASETGEDSCGAFNGSQKSACTSLNVQQSMLSTTREKSSSALCLDKSYSMVTHTPTPTQDMVSSISTTEQRNDISALPDVTSGMSGISIDDENYCQSSISKSRSENILRSEQAIIKQSPKVNEKSDSIYYTSQHSQEVTPTGNSSRTRLNDTSLRLSFDFTKCKTIKVKEDKSSASLDFCDSNETSADFLTPINEMPPPGCEIEDLPKHNGNDESMDSLSNDNDLETQTESAPRRKFDDEPETPCMTKRLVRAHVTSTPAQSRRDLDDWRRADGTYSGIVRHKDGTELNVVYTVSSMQLSSRRIVRCVWLGVRAHEPRHTTLASSLASTLASTADNSLVAGNKSASSRHQSVSLMSQCGEEQIAGEYTKHYVTLKQIGKGAYGCVKMAYRRSDRLLAVAKFILKEKVGAAFWSDAPDGRRVPLELSLLMTLSHPNIVSVIDVFENDKYFQMVMEKHGAGMDLFEFIERRPRMDEPLVSYIFRQIGQAVEYLHSLNILHRDIKDENVIIDNKFHVKLIDFGSATFMNLEQLLRRMLCKEPGARCTVPQLMAHPWIRQPVNLASYNFQEIVDCDRHEANPEMYFSGSLESPRSNSPVSLADPLAKERSMRSEAEAAGRSEKVSKSDARRPAHQLSDNYSLRSSADILDISSKPVSEAALTDISSDATGHAACDIDYDCDQYECDSWDECEQDSFS-