Monarch geneset OGS2.0

DPOGS201248
Transcript: DPOGS201248-TA (3216 bp)
Protein: DPOGS201248-PA (1071 aa)
Genomic position: DPSCF300037 + 123564-131917
RNAseq coverage: 917x (Rank: top 14%)
Annotation
Heliconius: HMEL003206, 3e-124, 83.96%
Bombyx: BGIBMGA012470-TA, 0.0, 64.56%
Drosophila: no hit listed
EBI UniRef50: UniRef50_UPI00021A8382, 9e-177, 38.49%, UPI00021A8382 related cluster n=3 Tax=unknown RepID=UPI00021A8382
NCBI RefSeq: XP_001605185.1, 4e-161, 41.79%, PREDICTED: similar to Ccar1 protein [Nasonia vitripennis]
NCBI nr blastp: gi|350405875, 8e-177, 38.45%, PREDICTED: cell division cycle and apoptosis regulator protein 1-like [Bombus impatiens]
NCBI nr blastx: gi|332024767, 0.0, 40.16%, Cell division cycle and apoptosis regulator protein 1 [Acromyrmex echinatior]
Group
Gene Ontology: GO:0003676, 2e-08, nucleic acid binding
KEGG pathway: none listed
InterPro domain: [552-594] IPR003034, 2e-08, DNA-binding SAP
Orthology group: MCL15633, Multiple-copy universal gene

Nucleotide sequence:

>DPOGS201248-TA
ATGCAGACTGGTGGTGGAGCTAAAAATCCGCCATGGGCACGATCAAATGTGAATACAAACATGACCGGCATGAATCCACAAATAATGGATCCGAATGCCGCTATGATGTCACAGCAAGGTATGATGCCATTCCAACAAACTCAAGCTGTATTTAATCCCAGTATGATGCAGGCACAAGGAGGCATGGCTATGCCAATGCCCGGACAAATGCCAATGAATCAAATGGCACAAGGTCAGATGTATCCAGGTAATGTTGTTGCCTACCCCACGCCACGAGCTATGAATCCTAACATGTATCAGAACAGTACACCGAACCAGAATCAGCAATCAAATGAACAGCGGGTCTTCACGGGCACTGTAACAAAGACTCATAACGATTTCGGCTTTGTTGATCACGATGTTTTCTATCAAACATCGGTTTGTGCAAAAGGCGCTATACCAAAAGTAAACGATAGAGTACTCGTCGAAGCCACCTACAATCCTAATATGCCTTTTAAATGGAATGCAACTCGAGTACAAGTGCTCCCAAAGGGGGGACCTAACCAAAAAATGAACCAACCTAAGTCAAACAATTACAACGCAGTTCCACCTCCTTCCAACAAAAAGTCAAACATCATATCAAGACGAGACGATCGTCGGGATGATCGTAGTTCGCGAAGAGATGACCGGAAAAGATCCCGGACAAGGAGTCGATCTCGTTCACGTGATCGTCGTGATAGTCGTAATAGGAGGGACTCGCCTCCTCGCAAGAGACCTGTGATACACCAACCATCTCCTGTCAAATATGTAGTCCGAGTACCGAGAATGCCCCTCGATATATCAAACCTCGATGTCCCAACCTTATCACAGAGGTACAGCAATCTCTATATACCGTCAGATTTCTTCAATGCATATGTCAAATGGGGGGAGACGTTCCCGCCACAGTCACCGTTTTCTTTAAACAATCCATGTGCATACCACATAATGAGCAAAGATGTTCCAAATCCCAACCCAAATGAGGCTGTCTTGGAGCCCCCCGATGCTGATTACAGATTTTCAGCTAAGGTGATGCTTATAAGCATGCCATCTCTAGAGACACTCTATCAAAAGTGTGGACTCACAAAAGTAGACGAGAAAGACAAACGGACGAGCTCAAAAACGCCACTTCATCCAACTCGCCTTATAAAGTTCCTCGTGGGGCAGAAGGGTAAAGGGGGGGAGAACTTCGCTATAGGCGGGCCGTGGAGCCCCTCGCTAGACGGAGAACATCCGGAGACAGACCCCGGCGTGCTGGTGAAGACGGCTATCAGAACATGCAAGGCTCTGACCGGCGTAGACTTGTCGAACTGCACCCAGTGGTACCGCGTGGTGGAGTTTTACTACTGGCGGGAGGGCGGCGGCAGGTCGCGACTCGAGTGTGTGGTGTTGTTCCTGCCGGACGTGTGGTCGGCTCGGCCGTCGCGCGTCGAGTGGACCACGGTGCAGGACCAGTATAAGGCGGCCCGCGACGCGGCGCTGCGCCGGCTCCTCGGCGGGGAGTCCCCGCGCCGCTCCGACGACTCGCCGGACCGATCACCAATTGAAAATTTGGATGCGAACGCTAGCACTATAACAATAGACGAGAATGATGACGATGACGACTGCAAGCCCGAAGCCACGCATTACTCCAACATTGATCTGAGGACCATAAAGGTTGACCAGCTGCGACAGGAATTACGGGCGCGGAATGTCAGCTGTAAAGGGCTCCGATCACAGTTAGTTTCACGACTATCAAAACTAATAAAGGCTGAAGAAGAGAAAGATACTAAGAACGAGGATGTCATGGAAGTGGTGGACGATGAACAGGAGGACAAGAAAGACACTACAGACACGGTCGAGATTACAGATGACACGACTAATGATAAAGAGAAGCCAGTTGAAGATAAAATAGAGAAAAACGATGCCAATGACTCAAAACCAAATGATAAAAGTAAAGACGGCGAGAGCAAAGAGAGTGACGGCGTGAGCGAGGAGAGAAAGGA
CAGACCGAAAACGGAGAAGGAGATTGAAGAGGAGAAGAAAAGATTGGAGCGCGAGCGGCAGTCGCTGATGACGCGCTACGAGCTGCCGGCGTCTCCGCATGTGGTGGTGCACGCGTCGGGCTCGGCGCGCGCGGGCCGCTTCGCGTGCAGTGTGGCCTCACTGTCACTGTTGTTGGACTACAGAGTCACGGACAACAAGGAACACAGCTTTGAGTTGTTCGTGTTCGCGGAGCTCTTTAATGAGATGTTAATGCGCGATTTCGGTTTCTACGTGTACAAAACTCTGTACACGTTACCGGAGAAGGCTGAGGAAGCTAAAGACAAAGACAGAGATAAAGATAAAAGTGTAGAAAAGACTGACAAGAAAGAAGAAAAGAAGACGGAGCCGGAGAAGAAAGACGACAAGAAAGAAGATAAGAAAGATGACAAACGTGATGCACGACGCAGTCACAAGAAGGAATGTATGAGTGATGAAGAGCGCGGTTGGTCTCCTCGCTACCGCGGCGGCCGGGGGGTGGAGCTTCCCCCCGACCCGTACCTACTCCTGTCCCTGGCTTACTTCGACACCGCCCGCGCCGGCGTCATCTCCAAGAAAGACCTCCAGAGCCTGTTTGTGAGCCTGGGACTACAGCTGTCACGATCACAGATCAGGACCGTATTAGATAAAGTCTGCATCAGAGATAACTTCAGCTACAAAACTCTCATCCAAGCCATCAAAGACCTGGCCTCCGGAGCGCCTGAAGCCATACAGGACTTGCCTCTGGACAGTACTATTGACAGCAACGAAGTTCCACCACACATCGCAGAACTCGAGGCGACCATAGCGGCCGGCAACAGGGAACTGCTGCCCATGTTCAACAGAGACGGCGGCGCGGGCGGCGGCGGCACCAGGGACGTGTCCGACAACGGTATGGTGCTGTACAAGGGTCGCGTGGTGGACGCGAGTTCTAAAATCTTCCATTCGGCGTTGAAGAGTATACAGACCAAGGTGGAGGCGGTGGTTAACATCAAGTACGAGGACGATGACGTCGTGGAGGTCGTGAACGGGCCTTCCAAAGAGGAGAATGTTACAAAGGAAGAAAAGTCTGAACCTGGAACAGAGGAAAGGAAGGAAAACAAAAGTGATGTTAAAAACGAGGAACTCATTAAGATAGATGACGCGAGAGACGCCAAGGACAGCGACGCGGCCGACAACATGGACATAGACAGGGAGTAG

Protein sequence:

>DPOGS201248-PA
MQTGGGAKNPPWARSNVNTNMTGMNPQIMDPNAAMMSQQGMMPFQQTQAVFNPSMMQAQGGMAMPMPGQMPMNQMAQGQMYPGNVVAYPTPRAMNPNMYQNSTPNQNQQSNEQRVFTGTVTKTHNDFGFVDHDVFYQTSVCAKGAIPKVNDRVLVEATYNPNMPFKWNATRVQVLPKGGPNQKMNQPKSNNYNAVPPPSNKKSNIISRRDDRRDDRSSRRDDRKRSRTRSRSRSRDRRDSRNRRDSPPRKRPVIHQPSPVKYVVRVPRMPLDISNLDVPTLSQRYSNLYIPSDFFNAYVKWGETFPPQSPFSLNNPCAYHIMSKDVPNPNPNEAVLEPPDADYRFSAKVMLISMPSLETLYQKCGLTKVDEKDKRTSSKTPLHPTRLIKFLVGQKGKGGENFAIGGPWSPSLDGEHPETDPGVLVKTAIRTCKALTGVDLSNCTQWYRVVEFYYWREGGGRSRLECVVLFLPDVWSARPSRVEWTTVQDQYKAARDAALRRLLGGESPRRSDDSPDRSPIENLDANASTITIDENDDDDDCKPEATHYSNIDLRTIKVDQLRQELRARNVSCKGLRSQLVSRLSKLIKAEEEKDTKNEDVMEVVDDEQEDKKDTTDTVEITDDTTNDKEKPVEDKIEKNDANDSKPNDKSKDGESKESDGVSEERKDRPKTEKEIEEEKKRLERERQSLMTRYELPASPHVVVHASGSARAGRFACSVASLSLLLDYRVTDNKEHSFELFVFAELFNEMLMRDFGFYVYKTLYTLPEKAEEAKDKDRDKDKSVEKTDKKEEKKTEPEKKDDKKEDKKDDKRDARRSHKKECMSDEERGWSPRYRGGRGVELPPDPYLLLSLAYFDTARAGVISKKDLQSLFVSLGLQLSRSQIRTVLDKVCIRDNFSYKTLIQAIKDLASGAPEAIQDLPLDSTIDSNEVPPHIAELEATIAAGNRELLPMFNRDGGAGGGGTRDVSDNGMVLYKGRVVDASSKIFHSALKSIQTKVEAVVNIKYEDDDVVEVVNGPSKEENVTKEEKSEPGTEERKENKSDVKNEELIKIDDARDAKDSDAADNMDIDRE-
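
The record lists a 3216 bp transcript and a 1071 aa protein, which is consistent with a coding sequence of 1071 codons plus one stop codon (3 x 1072 = 3216). The sketch below shows one way to check that a transcript and protein pair from a record like this agree. It is a minimal illustration, not the database's own pipeline: the function names parse_fasta, translate, and cds_matches_protein are hypothetical, and it assumes the standard genetic code and that the transcript is the coding sequence from start to stop codon. The protein above ends in "-", so a trailing "-" is treated as the stop marker.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# by base in the order T, C, A, G. '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def parse_fasta(text):
    """Return {record_name: sequence}, joining sequences wrapped over lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {k: "".join(v) for k, v in records.items()}


def translate(cds):
    """Translate a coding sequence codon by codon; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


def cds_matches_protein(cds, protein, stop_chars="*-"):
    """Compare a translated CDS with a protein, ignoring trailing stop markers."""
    return translate(cds).rstrip("*") == protein.rstrip(stop_chars)
```

For instance, translate("ATGCAGACTGGTGGTGGAGCTTAG"), built from the first seven codons of DPOGS201248-TA followed by its TAG stop codon, returns "MQTGGGA*", which matches the start of DPOGS201248-PA. To check the full record, pass the two FASTA blocks above through parse_fasta and call cds_matches_protein on the "DPOGS201248-TA" and "DPOGS201248-PA" entries.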