Monarch geneset OGS2.0

DPOGS209621
Transcript: DPOGS209621-TA, 5130 bp
Protein: DPOGS209621-PA, 1709 aa
Genomic position: DPSCF300015 + 618909-633018
RNAseq coverage: 761x (Rank: top 17%)
Annotation
Heliconius: HMEL017012, E-value 0.0, 78.63% identity
Bombyx: BGIBMGA006686-TA, E-value 0.0, 61.78% identity
Drosophila: skd-PE, E-value 4e-125, 45.83% identity
EBI UniRef50: UniRef50_D6WYH5, E-value 0.0, 46.10% identity, Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D6WYH5_TRICA
NCBI RefSeq: XP_973676.2, E-value 0.0, 45.80% identity, PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastp: gi|270011427, E-value 0.0, 46.10% identity, hypothetical protein TcasGA2_TC005449 [Tribolium castaneum]
NCBI nr blastx: gi|189240457, E-value 0.0, 41.47% identity, PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene Ontology: GO:0006357, E-value 1.8e-80, regulation of transcription from RNA polymerase II promoter
    GO:0016592, E-value 1.8e-80, mediator complex
    GO:0001104, E-value 1.8e-80, RNA polymerase II transcription cofactor activity
KEGG pathway: (none listed)
InterPro domain: [1176-1700] IPR009401, E-value 1.8e-80, Mediator complex, subunit Med13
    [70-268] IPR021643, E-value 1.6e-42, Mediator complex, subunit Med13, N-terminal, metazoa/fungi
Orthology group: MCL12062, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species
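
The lengths and coordinates in the record above can be cross-checked with a little arithmetic. The sketch below makes two assumptions that the page does not state: the transcript length covers the coding sequence including the stop codon, and all coordinates are 1-based and inclusive. The constant and function names are illustrative only.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 5130
PROTEIN_AA = 1709
GENOMIC_START, GENOMIC_END = 618909, 633018
DOMAINS = {"IPR009401": (1176, 1700), "IPR021643": (70, 268)}


def span_length(start, end):
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1


def expected_cds_bp(protein_aa):
    """Coding length implied by a protein: three bases per residue plus one stop codon."""
    return 3 * (protein_aa + 1)


# Does the protein length account for the whole transcript?
cds_matches = expected_cds_bp(PROTEIN_AA) == TRANSCRIPT_BP

# Size of the genomic interval the gene model occupies on the scaffold.
genomic_span = span_length(GENOMIC_START, GENOMIC_END)

# Length of each InterPro hit, and whether every hit lies inside the protein.
domain_lengths = {name: span_length(s, e) for name, (s, e) in DOMAINS.items()}
domains_fit = all(1 <= s <= e <= PROTEIN_AA for s, e in DOMAINS.values())

print(cds_matches, genomic_span, domain_lengths, domains_fit)
```

Under those assumptions, 1709 residues plus a stop codon account for all 5130 bp, and both InterPro hits fall within the protein.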

Nucleotide sequence:

>DPOGS209621-TA
ATGTTAGTAGCGCTGACGTTCGCATGCGTGTCGTGTCGCGCGACAAACACGGGCACGTGCACCTCGACCGGAGAGTTACGTCACGGGGTGCTCCGCCTCCCGCGGACCCTGGGAGCCCCGCGGGACAGGGACGGGGCGAGCCACGGCGGGGCGGAGACGGCGCCCGCCGCGTATTGTTTTCAAAAAGAAACGCCGGTGATAGGACGAGAAGGCGATCAAGGTTCCTGGGAGAGCGGGCTGTCGTACGAGTGTCGCTCGCTGCTGTTCAAGGCGTTGCATAATTTGATCGAAAGATGTCTTCTGTCTCGCGACTTCGTCCGTCTCGGCAAGTGGTTCGTGCAGCCGTACGACGGAGACGAAGAAGATGTCGGCAAAAGCAGTCCTTGGCACCTGTCGTTCTCGTTCGCGTTCTTCCTTCACGGCGAGAGTACGGTCTGCGCCTCAGTAGACGTGCGACAACACCCGCCCGTCCGCACCCTCACCGCCAAGCGCCTCGCTCGTCTGCATACGCCGGCCGCCGACCGGCGGGGGGACAACAGGGTGATCCTGGCACCCTTCGGACTGGAGGGGCGCGTCACGGGCCGAGAGTGGGGCGAGAACGACCCCGGCACGGCGCGTCTGCTGGACGCCTGGCGGCAGCTGTATCCGCTGGAGCAGGGCTGCGGAGCCGTAGAGGTGGAGTGCGGTGGAGTACGCATGCCCTATCCCGCGCCCTACGCCCTACTCACCGAGATGGAGACCTGTCGGGACGCCGCGCCGCCCGCCGCAGAGCTGGCCGCCCGGGCACTGGCCGCCCTCGCCACACCACACCCGCACCAGACGGCGGAGGTGTGCTACTCGGAGACGTCCGGCGACGCTAACGACGACTTCCTAGATCCCACCAGAAAGACACCCTGTACCTGCGCCAATGTGTTCCATCGGGGTTGGGCGCGGTCGGTGACGCCAGGCCTGGCAGGCCGGAGTGGCTGCGCCGCGACCCCGTCGGTGACCGACCTCTCCCCGCCTTGCAGAGCTAGCGGATGCGGCGCGGGGTGCGCGGGCGGAGGGGGCGCCTGCTCCCCCGGCAGCTGCGGGGCCTCACCCGCCGCCGCTACCTCGCCCGCCGCCCTTACACACGCGCCACACGGACAACTCACCGCCCTGACGACGCACCAGCAGGTCAGCGTGCCGACGACGTTACACACGCCGGCCCCGACGCCCGACCCCCTCGCGCCGCCCACACCCGCCCCACCCTCCGCGCCGCCTCACAAGCTGTACGCGCAGGGAGAGTCCCCGGCGGGAGCGGGCGGGCCGTGCGCGGGCCAGCCGCCGCAAGAGGCGCTCCGACGACCGACCCTGCCGCCGCCCGAGAAGAGGCTCGACCCCCTGGAGGACGAGCACTCGCTCCACATGCTGTACGACTACACCACCGTGCACGCCTGGCTGGAGCATCCTGTGAAGCGCTTCAAGAGCGGCGGCGAAGCCCCCCGCGAGTCCTCCCCCGCCGGCGACCTGTACGCCGGACACGACCACACCCGCCTGGCCGCCACGCCCGCACACGTCACCCTCAAGATAGAGAAACAGGATCAGGAGGATGTGAAGGAGTACAAGAATTTGTTCACGTCGGACGGTCTGTGTCCGACGCTAAAAGATCTCGACCAAATATTCGACAATTCGGACGACGCGGCCAGCGGCGATGAGACGCTGCAGGTCCAGACGCCGCCCGACTCCAACAAGTCCAGCGAGGAGGCGCGCGGTGTCAGCGGTCGCTGCGTGCGGGCCGAGGAGCTGAGCAAGATGTTCCCCACGCCACCCAGCATGGAGGCCCACGCCCAGCCCTCGCCGGGGTTCTCACCGCCCGACGACCACACACACCTTCACCCACACGCCCTGCGGCCACACGGCTCCCCCCCGCCCGATCCCGTTATAGAGGACTGGTCGTACGTGTTCAAGCCGCCCACGATATGCAAGTACGTGGGCTCGTCCAAGTACGCGCCGCTCACCGCGCTGCCGAGCCAGCT
GCTGCCTCCCGTGGCGCTGCCGTCCACCGCCGTGTACCGTCCGCGGTGGCAGCGCGGCGAAGAGCGAGACGACCACGCGCAGACGCATCGCTCGGAGTCCGTCAGCGCCACCTCCGGATCGTCCACCAACACCACCACCACCACGACGAGCGGCGTCAAGCGATCGTCTTCCAACACGGAGCGAGTGACGACCAGCAGCAGTAGTGGCGCGGCCGAGTCCCGGCCGTCCCCGCGCCGGCTGCCCGGAGCCCCGGCCTGCCCGCCGCCCGTGTCCCCCCGCGCCCCGGCCCCCGCCTGTCCTCTGCTTCTCAACGTCCTCCTCGCCGATACCGTGCTCAACGTCTTCCGAGACCACAACTTCGACAGCTGCACGCTCTGCGTGTGTAACGCCGGAGCGAGGACCGTCGGTAACATCCGCGGAGCGGACGCGGCCACGTATCTCCCCGGTGGCGACTGGGGCGGCGGACCGGACGACGAGCCCTCGCGCTGCTCCTGCGGGTTCAGCGCTATCGTCAATCGCCGGCTGGCGCACAGAGCGGGACTTTTTTATGAGGACGAGATGGAGATCACGGGCATAGCGGAGGAGCCCGGCGGCGGCAGCGGCGGGGGGCTGGCGGACGTGGCAGGGGTGGTGGTGGCGGCCTGCGCCGCGCCCGCGGGGGGAGCTGCCTCCGCCCTGGCTAGGGCGGCGCGGGGGGCCGCCGCGCCACCCCTGGCCGACCACCTGCGACTCAACCTGCTAGAGTACTCGGACGGCGGGGCGGCGGCGGCGCGCGCGCTGCGGGCGGCGGCCGGGGGCTCCGGCTCGTCCCTGCCGACGCCCAACATCGCCACTGGTGCTAACAGCGCCGTTCATCGCTGGCCTTTCATCGGGGCTCGCGCTCCGAGGTCCTCCAGAGACGTTGTCAGATTGATGCGTCGCCTGCGACCGCTCCTGCAGGACGCCATCCAGAAGCGTTGCTGCGGTGCCCGCATGTGGGACGGAGTGTCGGGTCCCCTGACGTGGCGGCAGTTCCACCGCCTGGCCGGGCGAGGCAACGAGGACCTGTGCGAGCCGCAGCCCGTGCCGCCGCTGCTGGTGGGCCACGACCGAGACTGGATCTCGCTCTCGCCCTATGCCCTGCGGCACTGGGAGCGACTGTCGCTGGAGCCCTACTCGTACGCCAGGGACGTGGCCTACGTGGTGCTGGCGGCGGACGGGGAGGCGCTGTCCGAGCCTCTCAAGACTTTCTTTCGGGAAGTGTCGGCGTCCTACGAGGCGTGTCGACTCGGCAGGCATCAACCGATCACCAGGATAGCGAGGGACGGGATTGTCAGGACCGCGCCCGCCGAAGGGGACCGCGACGCGAATTACGAAGAATGGACCGGCGAGTTACCACCAGGAAGATTAGGAGAATATATGAGATCGTACGCTGAAACGTTACGGAGCAATCTCGTTCCGCAACTTGCCGCGCTAAACGTTGACCGGACATTGTTCGAGAGAACGGCCGCCTGGGAGGAGGACGAGGCCGGGCCTCCTGCACTCGTCATATACGTTGTCGAGCCGCCCGCCGCACACGCGCACAGGCCACGAGCACTGGCCGCCCTGCTGCGACTCGCATCGCAGGTCGCCAACCACCTTCACCATCACAACCCACTCGTCAAGATAATATCTCTGGACGGCGTGTGTGAGTGGTGGGGCGCGCGCGGAGGCTGGGCGGCGGAGGCGCGCGCGCTGGCGTTCAGCGTGTTCGGTGGAGCGAGGCGACTCATGCAGCACTCGCCCGCCGGGAAGAGTCTCACCGGCTTCGGCACAGCCGCCACGGCTAACCTATTCATCAACAACAAGGATGAAAAGAACCGTGCGCCATACAAGCTGTTCTCGCCGGCGTGGGTGTTGTCCCCGCCGCGGGCGTTGAAGGAGGTGGCGGAGACGTGGGGCACGGCATGCGGCGACCAGTCCAGCGTGCTGTTCCTGGCGTACTGTCTGTCGCACGACCAGCGCTGGCTGCTGGCCGCCG
CCACGGACGGCCGGGGCGAGCTTCTCGACACCGCCGCCATCAACATACACGTGCCTGCAAGATCCAGAAGGAAGAGAGGCGGACCGGCGAGGAGGCTGGGCCTGACCAAACTTATGGATTTCACGTTGGGCGTAATGTCACAGTCCGCTCAGCCCTGGAGACTGGTCGTCGGACGCGTGGGGAGGATCGGCCACGGAGAACTCAAAGGCTGGAGCTGGTTGCTGTCGCGGCCAAACTTGAACCGGGCGTCCAACATGCTCCGTGAGATGTGCGGCAGTTGTTCCCTGCTGTACCCGACCGGCGCGCCCTGCATACTGTCAGCCTGCTTGGTGTCCACGGAGCCCGACTCCTGTCTCCGGCTCATGGCCGACCGCTTCACGCCGGACGAGCGCTTCTCTCAGGCCTCCATACAGTCACACCTCCACACGCCGAGGGACGTCACCGCCACGCATATCCTCGTCTTCCCGACCTCCGCGACCACACAGTCGAATCAAATGCCGTTCGAGCCTCCGAATGCAAACGGCGAGGAGAACGACATGCTGCTGGGACTGGAGATGGTGGACGAGGACATGGGCGAGGACAACATGACGGACCTGCTCATAGGAGACATGTTCATGTGGCCGGCCGGCAGTCCGCGCGCCTCGCCCCGCAGGCACGACGACGAGGCCAGCCGCGAGGGCAGTCCGGCCAGGAACGCGCATATGAACAACACCGGCGCCTACACCAGGGACGACGACCACCATCCGGAGCAAGTGGGGACGGTTCTGCAACAACCGCTGGCTCTCGGCTACCTGGTGTCGACGGCCCCGCTGGGCGCCATGCCGGGCTGGTGGTGGGCGAGCTGCTGGCACCTCCGCGACGCGTGTCCCGCCTTCCTCAAGAACGCTCTCCACCTGCAGTGCGCCGTCACCGCTCACGACGACTACACCTCGCTCACCCAACGCCGCGACCACAACGCGCATCCGCTGGACTCCACCACCACCACGGACGTGCTGAGGTACGTCCTGGAGGGTTACAACGCGCTGTCGTGGCTGGCGCTGGACGGCTCCACCCACGACCGCCTCTCGTGCCTGCCGCTCCACGTGCAGGTCCTCATGCGGCTGTACCACACGGCCGCGGCGCTCGGCTAG

Protein sequence:

>DPOGS209621-PA
MLVALTFACVSCRATNTGTCTSTGELRHGVLRLPRTLGAPRDRDGASHGGAETAPAAYCFQKETPVIGREGDQGSWESGLSYECRSLLFKALHNLIERCLLSRDFVRLGKWFVQPYDGDEEDVGKSSPWHLSFSFAFFLHGESTVCASVDVRQHPPVRTLTAKRLARLHTPAADRRGDNRVILAPFGLEGRVTGREWGENDPGTARLLDAWRQLYPLEQGCGAVEVECGGVRMPYPAPYALLTEMETCRDAAPPAAELAARALAALATPHPHQTAEVCYSETSGDANDDFLDPTRKTPCTCANVFHRGWARSVTPGLAGRSGCAATPSVTDLSPPCRASGCGAGCAGGGGACSPGSCGASPAAATSPAALTHAPHGQLTALTTHQQVSVPTTLHTPAPTPDPLAPPTPAPPSAPPHKLYAQGESPAGAGGPCAGQPPQEALRRPTLPPPEKRLDPLEDEHSLHMLYDYTTVHAWLEHPVKRFKSGGEAPRESSPAGDLYAGHDHTRLAATPAHVTLKIEKQDQEDVKEYKNLFTSDGLCPTLKDLDQIFDNSDDAASGDETLQVQTPPDSNKSSEEARGVSGRCVRAEELSKMFPTPPSMEAHAQPSPGFSPPDDHTHLHPHALRPHGSPPPDPVIEDWSYVFKPPTICKYVGSSKYAPLTALPSQLLPPVALPSTAVYRPRWQRGEERDDHAQTHRSESVSATSGSSTNTTTTTTSGVKRSSSNTERVTTSSSSGAAESRPSPRRLPGAPACPPPVSPRAPAPACPLLLNVLLADTVLNVFRDHNFDSCTLCVCNAGARTVGNIRGADAATYLPGGDWGGGPDDEPSRCSCGFSAIVNRRLAHRAGLFYEDEMEITGIAEEPGGGSGGGLADVAGVVVAACAAPAGGAASALARAARGAAAPPLADHLRLNLLEYSDGGAAAARALRAAAGGSGSSLPTPNIATGANSAVHRWPFIGARAPRSSRDVVRLMRRLRPLLQDAIQKRCCGARMWDGVSGPLTWRQFHRLAGRGNEDLCEPQPVPPLLVGHDRDWISLSPYALRHWERLSLEPYSYARDVAYVVLAADGEALSEPLKTFFREVSASYEACRLGRHQPITRIARDGIVRTAPAEGDRDANYEEWTGELPPGRLGEYMRSYAETLRSNLVPQLAALNVDRTLFERTAAWEEDEAGPPALVIYVVEPPAAHAHRPRALAALLRLASQVANHLHHHNPLVKIISLDGVCEWWGARGGWAAEARALAFSVFGGARRLMQHSPAGKSLTGFGTAATANLFINNKDEKNRAPYKLFSPAWVLSPPRALKEVAETWGTACGDQSSVLFLAYCLSHDQRWLLAAATDGRGELLDTAAINIHVPARSRRKRGGPARRLGLTKLMDFTLGVMSQSAQPWRLVVGRVGRIGHGELKGWSWLLSRPNLNRASNMLREMCGSCSLLYPTGAPCILSACLVSTEPDSCLRLMADRFTPDERFSQASIQSHLHTPRDVTATHILVFPTSATTQSNQMPFEPPNANGEENDMLLGLEMVDEDMGEDNMTDLLIGDMFMWPAGSPRASPRRHDDEASREGSPARNAHMNNTGAYTRDDDHHPEQVGTVLQQPLALGYLVSTAPLGAMPGWWWASCWHLRDACPAFLKNALHLQCAVTAHDDYTSLTQRRDHNAHPLDSTTTTDVLRYVLEGYNALSWLALDGSTHDRLSCLPLHVQVLMRLYHTAAALG-
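
The protein record appears to be the conceptual translation of the transcript. A minimal sketch of how the two FASTA records could be compared follows. It assumes the standard genetic code and that the trailing "-" in the protein marks the stop codon; neither is stated on the page. `read_fasta` and `translate` are hypothetical helpers written for this illustration, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First ten and last six codons of DPOGS209621-TA, copied from the record above.
print(translate("ATGTTAGTAGCGCTGACGTTCGCATGCGTG"))
print(translate("GCCGCGGCGCTCGGCTAG"))
```

Given the full records, `translate(read_fasta(nucleotide_text)["DPOGS209621-TA"])` could be compared directly with the `DPOGS209621-PA` entry; `nucleotide_text` here stands for the FASTA text shown above.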