Monarch geneset OGS2.0

DPOGS201081
TranscriptDPOGS201081-TA3669 bp
ProteinDPOGS201081-PA1222 aa
Genomic positionDPSCF300185 + 79382-86733
RNAseq coverage274x (Rank: top 39%)
Annotation
HeliconiusHMEL0178850.069.20% 
BombyxBGIBMGA001390-TA0.064.47% 
Drosophila% 
EBI UniRef50UniRef50_E2C2S87e-6432.13%Tumor suppressor p53-binding protein 1 n=2 Tax=Formicidae RepID=E2C2S8_HARSA
NCBI RefSeqXP_321661.45e-5037.91%AGAP001466-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3071960382e-6332.13%Tumor suppressor p53-binding protein 1 [Harpegnathos saltator]
NCBI nr blastxgi|3071960382e-6724.76%Tumor suppressor p53-binding protein 1 [Harpegnathos saltator]
Group
Gene OntologyGO:00056221.5e-23intracellular
KEGG pathway 
InterPro domain[949-1117] IPR0013571.5e-23BRCT
Orthology groupMCL20676 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201081-TA
ATGGAGGATTCCAGTGATAACATTATCACTTTTAATTCATCAGCAGGAGATTCAGAGACGAAGACAGAGGTGCCCGTAAGTCCCGACAAACCCAAGGATATATCTCAGAACCTCTTCCCCGAGAGTCCGGTGTATGAACCTGATGATGAAGTAAAATGCATTGAGGATGACAGCAATATGAATGATCAGGTCGAAAAAGATGACAACCCCGTCCACGCTAAACGGAAACTCTCAGAGGAAACATGTGATAACAACGCTAAAATACTTAAACTGGATCTGACGGAAGACGAAGTTACTGTGACGACGCAGACCTTGGACGACGAGGTGTCGGAAATCAATGATGTGGGAAACCAGAATGAGGAACTACCCTCGACATCGCGCGCGCGGACGCCAGTGCGGTCGAGTCCTGGGAGGAACAAGAGAAGCAACAGCGTCGAAGTCACCCCAAAAACCCCAAAATCTCGATCCGCCAGTGTCGAAGTCGTCGCCAAGAACCCAAGAACCCCAAAATCTAGATCCGCCAGTGTCGATGTCGCCCCTAAGACCCCAAAAACCCCAAAATCCAGGTCCGCCAGTGTTGACTTATCTAAAGCTCAGACGCCCCGCAGAGTACCGTTAGATTTATTAGCAGAAAAAGACGATGACGTCATTATACATTCCGATGACAGTTCAAGCCGACTGAGCGTGGAGTTCGTTAGAGAACTACCAGCGCGGAAACTACCGGACACCATCCCAGAAGAAAGCGGCTCCGAACTCAACGATTCGCAGAACTTCCATTTAATTCTGTCGCCGGTCGAGAACTATGACCCGAATGATAACGAGAAATCGAATGCAGACACGAACTACGATACTGAGAAAGTGACCGGCAAAGAGAGGGAAGTTATTGATGCTGGGAAGGTTGCGACTAGAAGCGGCTACAACTACTCCGAGATGAGCAGCGTCACATCTCAGGCACCAAACGACTGTAAGGAACGTGCGACGGACCAGGAGAGCACGGATAGCATAGGGTCGATGGGCCTCGACAGCCCCGAACCAGTTGCTAGCGTCGCTTCCAAGCTCATATCCAAGCTGTCTAACGGAAACTCATCCACACCGACGGACGTCAACGATCCGGCTCAAGAAATAACCCCCGATATGTGCAAGCTGAATGGGAACAAGGGGAAACGGGGGAAAAACGAATCGTTCAGGGTCAGCAACACGACGACGCCGTCCACCATCTCGCCTTTGCAGAACGGACATTCCTTGCCTATAAGCACACCCCTGATACCTGTTTTCGACGTACATGTCAGTCACAACGAGGACTGCGAGTTCCTGTCGCTGTACGTCGTTAGAACCGACAACGACGTGGGCATGGACATGTGCAGGGAGTACCAAAGAATATCGAAGAGGTTCACCATAGATCCTTACTTAGGCGACGTGTCGGTCAGTAACTCCCCGTCCAGCGTCACGAGCGGCGGACTCATGAGTTTGCCGAACAGAACCTCCTTCGCCTCGACTATTAGTTCAACGTCATCATCCAGCACTCGCACCAGTGACGGAGCCTTCGTGGTCCCCCCACCTCCGAGGAAATCCGTATCCAACCCTACAACCACCGTAAAAGGTTACGAGGCACTGATGAAAAAACTGCAAGACATATTTTCGCACATCAGAGACGCGTCCATAGAAGCGAACCGATCTCTCAACGACGACAAAATATCCGTCGGCATTCAGGCTTCCATATCCGAAGCTACCTTCAGCAACGGCAACGCGAGTCCAGAAGAGGTCAGCAAGTGTGACAAAGCGACGCCGAAGAGCTCGCTCAAAAAGACACGAGTGAGAGGACGAAGGCCGATAGCGGGGAAAACTAAGAGAGCCTTGCTGCCCACGCAACACGAAGAAGCGGAGTACATGCAGGGAATGAACTCACCGGAAATGATTCCCAGCAACGGAGACACCGGGAAGATATCACCGAAAGAGGAAAAGCCCGCTGTTGTCGGAACACCGAAGTCGGTTAGCAAGTTAAAACAAAAACGTAGACCTCCCTCCCCGCGGCCGGCGACTCCGGTCGAGAAGGCGATCGCGAAGCCCGAGTACCCCGGCTTTGCACCGGACACGGTAGTCCTGGCCAAATGGGTGGACAAGAGATACTATTCCGGAAAAGTACTAGAGATCACCGAACCCAACAAGTATCTGATCAAGTTCGACGACGGTCAGAGCAAAGTCCTCCTGGACGACTTCATAATATTCGGCGACATGAAGAAGCTGCCGCTGCAAGGACAGTCGGTGTACGCGCTGGTCGACGAGGAGTTGAACTACGAACCGGGACTGGTGCTGGGGGTGGAGGAGAACGGTAGCGGCACGGTCACCTACAGATGCACCACCGACGGGGACACGATAGTAGTGGTGACGGCGAGCGAGTTATATCTCACCGAAGACCAGGCCAGGTCGCTCAAGGAGTCCAGGGCCAGGTCACCAGCAACGCCGACCACGCCCAGGCGGAGACATCACAGAGAGCTAGACCTCGATAATATTATACAGGGTCCTCGCAGTGCAAGAAGTCGAGACAAAGGCAGCTCCAGTGCAAGAAAACGAGTGGCGTCACCCAAAAGTCCCAAAGCATCTACCTCAGGTGTTAAAACGAAGAGCATAGCTCGCAAGCGTCTGGCTAGCGAAAGTAGCGAGTTGAGTGAGAACAGCAACTCGGCGCCGGCCAGGATCGAGGAGGTCGCTGGGGTGGAGCCCGAGGTGCAGCGGACGCCGAGGAAGATAGACGGAGTTAAGGCCGGACCCCTTCAGTTGAAGGGAGCGGCCAAACAGAACATTGGGAAGAAGAATTCTAAGCTGACGAAGTTTGAAAACGATGAAGATACTATCTCAGCGCTGGGGCCCATCCCCACCGACAGCAAGATGTTCGCTGGCTATTGTTTCCTTCTAACATGTACGGAACCACCGAAGAAGAATAGAGTGACGGACAGGAAGGAGAAACAGATGAACCAGGACAGCCGGCATTACTCCTCGGAGGAAGACGGCGAGAGCACAGCCGCTGGGACGGACACGGAGGACCTGGTGTTCTGTGAACGACCCTACAACAAGGAACGACTGCGGGAACAGCTGGAAACAGCTGGAGGAGTTGTTTACAGTCATTTCGACGACGTGCCAAAGACGAAGTACCCGCAATGCTACCTGATATCGCCCCGTCCCTGCCTCACCGCTAAGTACATCTCCTGCCTGGCCGCGGCGATAAAGGCCGTGTCCCACGACTGGGTGATACAATCTTGCATGGTGGGTCACCTGCTGGATGTGGACTCGTTCGTGCTGCCCACCGGCTGGAGCTTAAAGAAGTCATCATTCGTTAATTGGACGACATCATCTGGCAAAAGAAACACGACCTTCAAGGACAAGATAATACTCCTGTGCGGAGATCAAGATACATTTGTTAAGTTCTGGGAGCGCGTCTGCACGTTGGCCGGCGCTACGACAAGAATTGTCAATGAAGATAACTTAAATATGACCGGGGCCATTGCCCTGGTGACCGAGTGGGACTGTCCTCATGAAGTACAGAATAAAGCGAACCAGGATAACATACCGCTGGTGTCGACGACCTGGGTGGTCCAGTGCCTGATTGAGGGCAAGGTCGTCGCCCCCACCGCCTTGGACAAGTTCTCATTTATGTACGCGGAGCCCGAATGA

Protein sequence:

>DPOGS201081-PA
MEDSSDNIITFNSSAGDSETKTEVPVSPDKPKDISQNLFPESPVYEPDDEVKCIEDDSNMNDQVEKDDNPVHAKRKLSEETCDNNAKILKLDLTEDEVTVTTQTLDDEVSEINDVGNQNEELPSTSRARTPVRSSPGRNKRSNSVEVTPKTPKSRSASVEVVAKNPRTPKSRSASVDVAPKTPKTPKSRSASVDLSKAQTPRRVPLDLLAEKDDDVIIHSDDSSSRLSVEFVRELPARKLPDTIPEESGSELNDSQNFHLILSPVENYDPNDNEKSNADTNYDTEKVTGKEREVIDAGKVATRSGYNYSEMSSVTSQAPNDCKERATDQESTDSIGSMGLDSPEPVASVASKLISKLSNGNSSTPTDVNDPAQEITPDMCKLNGNKGKRGKNESFRVSNTTTPSTISPLQNGHSLPISTPLIPVFDVHVSHNEDCEFLSLYVVRTDNDVGMDMCREYQRISKRFTIDPYLGDVSVSNSPSSVTSGGLMSLPNRTSFASTISSTSSSSTRTSDGAFVVPPPPRKSVSNPTTTVKGYEALMKKLQDIFSHIRDASIEANRSLNDDKISVGIQASISEATFSNGNASPEEVSKCDKATPKSSLKKTRVRGRRPIAGKTKRALLPTQHEEAEYMQGMNSPEMIPSNGDTGKISPKEEKPAVVGTPKSVSKLKQKRRPPSPRPATPVEKAIAKPEYPGFAPDTVVLAKWVDKRYYSGKVLEITEPNKYLIKFDDGQSKVLLDDFIIFGDMKKLPLQGQSVYALVDEELNYEPGLVLGVEENGSGTVTYRCTTDGDTIVVVTASELYLTEDQARSLKESRARSPATPTTPRRRHHRELDLDNIIQGPRSARSRDKGSSSARKRVASPKSPKASTSGVKTKSIARKRLASESSELSENSNSAPARIEEVAGVEPEVQRTPRKIDGVKAGPLQLKGAAKQNIGKKNSKLTKFENDEDTISALGPIPTDSKMFAGYCFLLTCTEPPKKNRVTDRKEKQMNQDSRHYSSEEDGESTAAGTDTEDLVFCERPYNKERLREQLETAGGVVYSHFDDVPKTKYPQCYLISPRPCLTAKYISCLAAAIKAVSHDWVIQSCMVGHLLDVDSFVLPTGWSLKKSSFVNWTTSSGKRNTTFKDKIILLCGDQDTFVKFWERVCTLAGATTRIVNEDNLNMTGAIALVTEWDCPHEVQNKANQDNIPLVSTTWVVQCLIEGKVVAPTALDKFSFMYAEPE-