Monarch geneset OGS2.0

DPOGS209401
Transcript:   DPOGS209401-TA   1422 bp
Protein:   DPOGS209401-PA   473 aa
Genomic position:   DPSCF300346 - 243053-250408
RNAseq coverage:   852x (Rank: top 15%)
Annotation
Heliconius:   HMEL022161   2e-145   69.80%
Bombyx:   BGIBMGA007107-TA   2e-143   59.63%
Drosophila:   spag-PA   4e-29   32.04%
EBI UniRef50:   UniRef50_Q2F643   3e-122   57.50%   TPR-repeat protein n=1 Tax=Bombyx mori RepID=Q2F643_BOMMO
NCBI RefSeq:   NP_001040307.1   5e-123   57.50%   TPR-repeat protein [Bombyx mori]
NCBI nr blastp:   gi|114051568   1e-121   57.50%   TPR-repeat protein [Bombyx mori]
NCBI nr blastx:   gi|114051568   4e-117   57.25%   TPR-repeat protein [Bombyx mori]
Group
Gene Ontology:   GO:0005488   1.1e-30   binding
   GO:0005515   1.2e-06   protein binding
KEGG pathway:   smm:Smp_090860   2e-16
   K04460 (PPP5C, PP5)   maps-> MAPK signaling pathway
InterPro domain:   [114-233]   IPR011990   1.1e-30   Tetratricopeptide-like helical
   [121-146]   IPR001440   1.2e-06   Tetratricopeptide TPR-1
   [151-184]   IPR019734   9.2e-06   Tetratricopeptide repeat
Orthology group:   MCL13026   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209401-TA
ATGGACAAAGCTTTTGAGATTCAGAAAAAAGCTCGGGATAATGTGAAGACATTGCATACATATCTTACCGATTTACAGAACTGGGAAATTGAAATGAAACGTAAAGAGGCCGCTCTTAACGGTGACTTAGAGCAGGAACTCCCACCAGTGAGGAGTAAGGTGAAGAGAGAAAGACCTGTTCAGGTAAAGCAGAAACCAGAGAAAAAAATAGTTGCTTCAGACTATCAAGCATGGGAAAAATTTGATGCAGAGAAAGCATGCGAAGAGGTTGATATGGCTGATATAGGGCCGGTGTCACTTGACAGTAAGAAGAGCAACAAAATTAAGACTGAGAAGCTGAAGGAGGAAGCGCAGTATGAGAAGGAGAGGGGCAATAGTTTTGTTAAACAAGAAAAGTGGGATGAGGCGATAGCATGCTACAACAGGGCCATTGAGTTAGTTCAAGATGATGCTATTTACTACGCCAATAGAGGATTGTGTTATCTTAAAAAAGATAGTTTACACCAAGCGGAATCCGACTGCACAGAGGCCATACGGTTAGATCCTACGTACGTGAAGGCATTTCAACGTAGAGCGTCTGCTAGAGAGAAGCTCGGCTCTCTAAGGGCAGCTTCACACGACCTGAGCGAAGTCATTAGATTGGAACCACACAACATGCTGGCCAGACAACAGTTAGAATGCATCAAAAACAGAATGGGAACTAAGGGGTCGAAATCAAAGTCATCTCCCACAACACCGCCCGGCGAAACGAAAACAGCTCCGGAAAAAAAAAGCAAAATAGTTGAACTGCCCGATAGTAAAACTGAATTGAGTCCTTTAGAGAAATGGAGAGACGGAGTCCACGAGAACATTACAGTCATAAAGCCAGTCAAGAAGCCACCTCATCTGAGGTCTAAGCGAGCATTGAAACACATCACAATACAGGAGATACCCCTCGGTAGAGCCATCGATAAGTCCAACGACACGAACAACAACAAATCTACAAATGTGACTCTGGCAGCGGATATCCAGGAGAAAATGGTGTTCAATGTGAACGAGAAGCACAAGGAGAGTATGGTGGTGCCCACGAACAGTGTGCAGTTCATGTCAGAGTGGAAATACTTGAAGGGTAACGATAGAGCCAGGGGGGATTACTTGAGTATAATACCCCCGGACCTCCTGCCATCCATCTTTGAGAACGCCCTCGAGAGTGATGTGTTGTCTACAGTACTGAGAACCATCCACAACAACGTGGACCGCTTCCCCCACTCAGTGGCCGCTTACCTCAAGAATATATGTCGCGTGAAGCGCTTCTCAGCTTTAGCTATGTTCCTTAGCGCTATAGATAAAGAATTAATTAACAATATGCTCAAGCACTGCAGGGACGTGGAAAATCTCAGTGAGAGTGAAATAACTGATTTGATGAACAAATATGAACTCTAA

Protein sequence:

>DPOGS209401-PA
MDKAFEIQKKARDNVKTLHTYLTDLQNWEIEMKRKEAALNGDLEQELPPVRSKVKRERPVQVKQKPEKKIVASDYQAWEKFDAEKACEEVDMADIGPVSLDSKKSNKIKTEKLKEEAQYEKERGNSFVKQEKWDEAIACYNRAIELVQDDAIYYANRGLCYLKKDSLHQAESDCTEAIRLDPTYVKAFQRRASAREKLGSLRAASHDLSEVIRLEPHNMLARQQLECIKNRMGTKGSKSKSSPTTPPGETKTAPEKKSKIVELPDSKTELSPLEKWRDGVHENITVIKPVKKPPHLRSKRALKHITIQEIPLGRAIDKSNDTNNNKSTNVTLAADIQEKMVFNVNEKHKESMVVPTNSVQFMSEWKYLKGNDRARGDYLSIIPPDLLPSIFENALESDVLSTVLRTIHNNVDRFPHSVAAYLKNICRVKRFSALAMFLSAIDKELINNMLKHCRDVENLSESEITDLMNKYEL-
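
Length check:

The record lists a 1422 bp transcript and a 473 aa protein. A minimal sketch of how the two lengths can be cross-checked follows. It assumes the listed transcript is coding sequence only (no UTRs) and ends with one stop codon. The function name `cds_length_matches` is hypothetical and not part of any database tooling.

```python
def cds_length_matches(transcript_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """Check that a coding sequence length agrees with a protein length.

    Assumption: the transcript contains only coding sequence, so its length
    is three bases per residue, plus three for the stop codon if present.
    """
    expected_bp = 3 * protein_aa + (3 if has_stop else 0)
    return transcript_bp == expected_bp


# Values taken from the record above: 3 * 473 + 3 = 1422
print(cds_length_matches(1422, 473))
```

Under these assumptions the two lengths agree, which fits the nucleotide sequence starting with ATG and ending with the stop codon TAA.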