Monarch geneset OGS2.0

DPOGS214006
TranscriptDPOGS214006-TA903 bp
ProteinDPOGS214006-PA300 aa
Genomic positionDPSCF300313 - 41297-42418
RNAseq coverage158x (Rank: top 52%)
Annotation
HeliconiusHMEL0138131e-8677.66% 
BombyxBGIBMGA004055-TA2e-12168.11% 
DrosophilaCG7028-PA2e-5553.72% 
EBI UniRef50UniRef50_Q16YF12e-5656.38%Prp4 n=3 Tax=Culicidae RepID=Q16YF1_AEDAE
NCBI RefSeqXP_001653292.13e-5756.38%prp4 [Aedes aegypti]
NCBI nr blastpgi|3123844803e-5656.32%hypothetical protein AND_02062 [Anopheles darlingi]
NCBI nr blastxgi|1892418933e-5751.29%PREDICTED: similar to CG7028 CG7028-PA [Tribolium castaneum]
Group
Gene OntologyGO:00167721.9e-42transferase activity, transferring phosphorus-containing groups
GO:00055241.2e-23ATP binding
GO:00046721.2e-23protein kinase activity
GO:00064681.2e-23protein phosphorylation
GO:00046745e-15protein serine/threonine kinase activity
KEGG pathway 
InterPro domain[36-298] IPR0110091.9e-42Protein kinase-like domain
[115-294] IPR0174421.2e-23Serine/threonine-protein kinase-like domain
[50-294] IPR0022905e-15Serine/threonine-protein kinase domain
Orthology groupMCL30217 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214006-TA
ATGTTTTCAGAAGAGGACTTTACACTTGATAATATTTCCGATAAAATTACGCAAGACAGTAAAGAAAATTCAGAATTAAAAGACAACTGGGATGATTCGGAAGGTTACTACAAAATTAATGTTGGTGATATGATATATAACTCAAGATATACAATAAAAAATTTACTAGGACAAGGCGTTTTCTCAAATGTTGTAAAAGCACAAGATAACAACCAGGACAAACTTGAAGTTGCTATTAAAATTTTAAGAAATAATGATTTGATGCACAAAACGGGATTGAAGGAGTTAAAAATGTTGAGGGAGATAAATGATGCCGACCCAAATGATAAATTTCATTTGAACGAAAAGAAGAATGTCTTGAAGCTTTGCGATTTTGGTTCGGCTTCAAAAATCAAAGAACATGAACCCACACCATACTTAGTTTCTAGATTTTATAGAGCACCTGAAATAATTCTAGGCATACCGTACCTGCATAGTGTTGATGTGTGGTCGGCTGCTTGTACAATTTACGAAATGGCAACAGGAAAAATACTTTTCATGGGTGGATCTAATAATAAAATGCTCAAATGTTTTATGGACTTAAAAGGAAGATTTCCGAGTAGGATAATAAGAAGAGCGAAATTTAAGGATCAACATTTCAATTACAACAACAATTTTCTGCTACACAAAACCGATGAGTTCACAGGGAAGGATAAATTTGTTGAAATAAGCAATATTATAACTAATAGAGATTTGTATAAAGAATTAAAGAAAGGTTACAAAAATCCATCGAGTTATGAAGAAGAGAAGATAACACAATTAAAAGAAATGCTGGAGAAAATGCTTACATTAGATTCTAACTTTAGAGTTTCTGCCACCGACTGTTTGAAACATCCATTCATTCAGGATGTACTGAAAAAATAA

Protein sequence:

>DPOGS214006-PA
MFSEEDFTLDNISDKITQDSKENSELKDNWDDSEGYYKINVGDMIYNSRYTIKNLLGQGVFSNVVKAQDNNQDKLEVAIKILRNNDLMHKTGLKELKMLREINDADPNDKFHLNEKKNVLKLCDFGSASKIKEHEPTPYLVSRFYRAPEIILGIPYLHSVDVWSAACTIYEMATGKILFMGGSNNKMLKCFMDLKGRFPSRIIRRAKFKDQHFNYNNNFLLHKTDEFTGKDKFVEISNIITNRDLYKELKKGYKNPSSYEEEKITQLKEMLEKMLTLDSNFRVSATDCLKHPFIQDVLKK-