Monarch geneset OGS2.0

DPOGS206023
TranscriptDPOGS206023-TA4008 bp
ProteinDPOGS206023-PA1335 aa
Genomic positionDPSCF300028 - 1780369-1790738
RNAseq coverage119x (Rank: top 58%)
Annotation
HeliconiusHMEL0109030.090.73% 
BombyxBGIBMGA000550-TA0.080.31% 
DrosophilaMED23-PB0.053.89% 
EBI UniRef50UniRef50_Q16HH90.061.09%Mediator of RNA polymerase II transcription subunit 23 n=17 Tax=Coelomata RepID=MED23_AEDAE
NCBI RefSeqXP_395793.20.066.54%PREDICTED: similar to CRSP complex subunit 3 (Cofactor required for Sp1 transcriptional activation subunit 3) (Transcriptional coactivator CRSP130) (Vitamin D3 receptor-interacting protein complex 130 kDa component) (DRIP130) (Activator-recruited cofactor 130... isoform 1 [Apis mellifera]
NCBI nr blastpgi|3838532630.067.01%PREDICTED: mediator of RNA polymerase II transcription subunit 23-like [Megachile rotundata]
NCBI nr blastxgi|3071739920.067.01%Mediator of RNA polymerase II transcription subunit 23 [Camponotus floridanus]
Group
KEGG pathway 
InterPro domain[1-1299] IPR0216290Mediator complex, subunit Med23
Orthology groupMCL12208 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206023-TA
ATGACGGATTCTCAAGTAGCAAACATTGTAAACGAAATTTTGCGGGTAGAGGCTGTGGAGGAAGCATTCAGTTGTTTTCTGGTTTACAAACCAGAACAGGAAAGTGAGAGGTTATCCATATATCAGAAAAAGCTATGTTCCATCATGAGCAGTCCCAGTGCGGAGGTGCAAGAATCAGCTATCCGTCAGTATCTTACTTTGACTGCAGTCCTCACAAACAGATATAAAATGAAACAGCTGCTGGGTTTACTTGAGAATTTAGTAAACACTAATATACTTCAAGCAAGAATGCTCTGCGATTGTATATTAACAAGCGAGAAACTGGTTTACAAAAATAGTGATTATTGGATTGAATGCTTCAATCTAGTTAGACGAGTTATTGGTGGTGTTGACTACAAGGGTGTTAGAGAAATTATGAAAGGTTGCAGAGAAAAAGCACAAACTTTACCAGTTAGACTCAACTCCAGCACAATGCCTCAAATGAGAGCATTGTGTAATGTCATTGAGTATATTTTTGATCGTAATGCATCACTCTTGCCTGCATATTTCATAGTCACTGAAATTCAAAAGGACTATCCAGATAATAATCATTGGCCACACTGGCAGCTGGCTAAACTACTTACCACATTTGTTGAAAGTTTCAAGCCTTGCGCTCAAATGGTATCAATTATTGGTCATTCACAAATGTTGCCCGTGGTAGAGTTTTCTGGATATGCCGACCATTTGGTAAACCCATGGCGGTTAGATCCAACAACACTTAAATTCTCTCTCAAAGGCAATTTGCCTTATGATGATGAATTACTTAAGCCTCAAATATCATTACTAAGGCATGTTCTTCAGCAGCCCTACTCCAGGGATATGATGTGTTCTATGCTAGGCTTGCAAAAACAGCACAAACAAAGATGTATAGCATTAGAAGACCAGCTTGTGGAATTAATGATTTTACCCATGGAAAGGAGTGAGCAAGAGAATGAGGATGATGAAATGCCATCTACGCATTGGTGCTGGTTGCATTTATCCTCACAAGTGATATATTTAATCCTGATTGGGTTCGCATCATTCCCTAGCATAGTAATGGGTCTACATAATAAACTTGTTGGGTGTGATCTGAAGAAAGGACGGGATCACCTTATGTGGGTTCTTTTACAATTTATATCTGGAAGCATACAACGGAATCCGCTGTCTAACTTTTTACCCATAATTAAATTGCATGAACTGCTTTATCCAGAGAAGGAAGCATTGCCAATGCCTGATTATACAAAAGCTCATTGCACTCATCAGATGGCTGTTGTGTGTATTTGGATGCATTTGTTAAAAAAAGCTGAATCTGAACACAAAACTATGATAATGCCACAAAACTTGAAGGTACAATATGAATTCCTGCAACACCTGATGACATCAAACAACACACCAACTTTAATGGGTGCAGACTATCGCATTGCTCTGCTATGTAATGCATATTCAACTAACCAAGAGTATTATGCCCGACCTATGGGGATCATCATAGAAACACTATTTGGCAATCAAAAGCCCATGCCAAATGGAAACCCTGCGGCTCCTCTGCCAACAGTACCTTTATCGATGTGTATATTGGATAGTCTTACATTGCATTGCAAAATGTCATTAATACATTCCATAGTGACACATGTGGCTAAGCTGGCTCAAAATAAAACCACAATACCTGGCAGTAATATGATGGCACCAGCATTAGTTGAGACATATAGCCGTCTTCTTGTTTACACTGAAATTGAGTCACTTGGAATTAAAGGATTTATTAATCAACTTCTGCCGAATGTATTCAAATCCCACGCGTGGGGTATACTTCACAACTTGCTGGACATGTTCTCATACAGGATACATCACATTCAACCTCACTATAGAGTTACCTTACTGTCTAATATCAACTCACTCGCAGCTTACCCTCAGGCTAACCAGACTCAGTTGCAATTATGTTTCGAGAGTACAGCGCTCCGTCTAATAACGAGTCTGGGCAGTTCGGGCGTTCAGCTGCAGATGTCGCGTGTGGTTTCCGAGCCCAAGTCCTGTGTAGTTGTCAGCAGCGAGAGTGAAGAACTAAACAGAGTGCTGGTCCTCACTCTCGCCAGAGGAATCTATATGACAGGAACAGGTAATGATGGAGCAGCAGTGAAGGAGATACTAACGACCATAATGACGAACACTCCCCATATGTGGTCTCAACACACGCTGCAATGCTTCCCTCCTGTACTAGTCGAATTCTTCTCACAGAATCCAGCCCCAAAAGAAAACAAGCAACTTTTAAAAAAATCGGTAGAAGAAGAATACAGGAAGTGGACTTCAATGGCGAACGACAATGATATTATATCACATTTCTCTGTTCCTGGAACACCGCTGTTCCTATGCTTGCTATGGAAAATGATATTTGAAACGAACAGAATTAATCCCATTGCTTTCAAGATCCTTGAACGTATTGGTGCTAGAGCTTTATCGGCTCACCTACGCAAGTTCTGCGACTATTTAGTATTTGAGGTAACAAATCCAGCTGGGGGTCCCCACATCAACAAGTGTGTGGACGCAATCAACGATATCATATGGAAATACAACATCGTCACCATTGACAGACTTGTTCTCTGCCTTGTACTGCGCCCAAACCCGGACGGCAACGAGAGTCAAGTGTGCTTATATATCATTCAATTACTACTCCTAAAAGGATCGGAACTTAGAAACAGAGCACAGGATTTCGTAAAAGAAAACTCTCCAGAACATTGGAAGCAAAACAACTGGTATGAACGCCACTCAGCCTTCCACCGGAAGTATCCAGAGAAGTTCGCACCTGAAGAAACGAGCGGAGCGTACGGCGGCCCTATTCCTGTTTACCTCAGCAACGTGTGTCTCAGGTTCATACCTGTGTTGGACATCGTAGTTCACAGACATCTTGAGATACCGCAAGTCAGTAAGAACCTGGAACAACTGCTGGAACATCTCGGTTATTTGTACAAATTTCACGACCGGCCAGTAACATTTCTATACAACACGCTTCACTACTACGAGAGTAAGCTACGAGATAAGCCATTACTGAAACGCAAGTTGGTGACAGCTGTATTAGGCTCCCTTAAAGAAGTCCGGGCCCCAGGTTGGGCTACAACAGAAGCCTTCCAGACTTTCCTTGGCAGCGAAACTGAAGCGTGGAGTCCAGACCTTAACTACTATCTAAGTCTGATCAACCGTATGGTTGATACTATGATGGGCAGTTCTCATTTCCCGAACACCGATTGGCGCTTCAATGAATATCCGAATCCTTCAGCGCATTGTCTATACGTGACATGTGTTGAACTGATGTCCCTGCCGGTGCCCCCGAATACGGTCGGAAACAACCTTCTCGATGTGGTCACAAAAGGTTTCGTTGTGATCCCTGCAAACAAAATACAACTCTGGATTAATGCAATAGGGATCATAATGGCAGCGCTTCCCGATCCTTACTGGACCGTGGTCCATGACAGGCTATTAGAACTCATAACTGGGACAGAGATGGTAGAATGGACGTACCAGCATACGCCGTTCCAATTATTCAATTTGACCAAGACTAATGAATGTATGTTAGAGAATAAATACAGTCTGACGTTGGCATTAGCGCATGCGGTATGGTACCACGCTGGCCCCGGACAGATTATGCAAGTTCCAACGTTCGTCAAAGAGAAGATGTCCCCGGAAATTAGGACGGAAGTTCAGCTGATATTCCTGTGTCATCTGATGGGTCCTTACTTACAGCGTTTTAACACTGATCTGTCAAGAGCTGTCATGGATATAACTATAGTGTTGTATGAGTTACTGGCTCATATAGACAAGTCACAGACCCACTTGCAGTACATTGATCCTATATGTGATCTGCTATATCATATAAAATACATGTTCGTTGGCGATACAATGAAGAATGAAGTGGAAAACGTGATACGAAAGCTACGACCGGCTTTACAAATGAGACTACGTTTCATTACTCACTTAAACGTAGAACAAATAAATACTGCCTAG

Protein sequence:

>DPOGS206023-PA
MTDSQVANIVNEILRVEAVEEAFSCFLVYKPEQESERLSIYQKKLCSIMSSPSAEVQESAIRQYLTLTAVLTNRYKMKQLLGLLENLVNTNILQARMLCDCILTSEKLVYKNSDYWIECFNLVRRVIGGVDYKGVREIMKGCREKAQTLPVRLNSSTMPQMRALCNVIEYIFDRNASLLPAYFIVTEIQKDYPDNNHWPHWQLAKLLTTFVESFKPCAQMVSIIGHSQMLPVVEFSGYADHLVNPWRLDPTTLKFSLKGNLPYDDELLKPQISLLRHVLQQPYSRDMMCSMLGLQKQHKQRCIALEDQLVELMILPMERSEQENEDDEMPSTHWCWLHLSSQVIYLILIGFASFPSIVMGLHNKLVGCDLKKGRDHLMWVLLQFISGSIQRNPLSNFLPIIKLHELLYPEKEALPMPDYTKAHCTHQMAVVCIWMHLLKKAESEHKTMIMPQNLKVQYEFLQHLMTSNNTPTLMGADYRIALLCNAYSTNQEYYARPMGIIIETLFGNQKPMPNGNPAAPLPTVPLSMCILDSLTLHCKMSLIHSIVTHVAKLAQNKTTIPGSNMMAPALVETYSRLLVYTEIESLGIKGFINQLLPNVFKSHAWGILHNLLDMFSYRIHHIQPHYRVTLLSNINSLAAYPQANQTQLQLCFESTALRLITSLGSSGVQLQMSRVVSEPKSCVVVSSESEELNRVLVLTLARGIYMTGTGNDGAAVKEILTTIMTNTPHMWSQHTLQCFPPVLVEFFSQNPAPKENKQLLKKSVEEEYRKWTSMANDNDIISHFSVPGTPLFLCLLWKMIFETNRINPIAFKILERIGARALSAHLRKFCDYLVFEVTNPAGGPHINKCVDAINDIIWKYNIVTIDRLVLCLVLRPNPDGNESQVCLYIIQLLLLKGSELRNRAQDFVKENSPEHWKQNNWYERHSAFHRKYPEKFAPEETSGAYGGPIPVYLSNVCLRFIPVLDIVVHRHLEIPQVSKNLEQLLEHLGYLYKFHDRPVTFLYNTLHYYESKLRDKPLLKRKLVTAVLGSLKEVRAPGWATTEAFQTFLGSETEAWSPDLNYYLSLINRMVDTMMGSSHFPNTDWRFNEYPNPSAHCLYVTCVELMSLPVPPNTVGNNLLDVVTKGFVVIPANKIQLWINAIGIIMAALPDPYWTVVHDRLLELITGTEMVEWTYQHTPFQLFNLTKTNECMLENKYSLTLALAHAVWYHAGPGQIMQVPTFVKEKMSPEIRTEVQLIFLCHLMGPYLQRFNTDLSRAVMDITIVLYELLAHIDKSQTHLQYIDPICDLLYHIKYMFVGDTMKNEVENVIRKLRPALQMRLRFITHLNVEQINTA-