Monarch geneset OGS2.0

DPOGS211788
TranscriptDPOGS211788-TA4371 bp
ProteinDPOGS211788-PA1456 aa
Genomic positionDPSCF300107 + 340023-345193
RNAseq coverage95x (Rank: top 62%)
Annotation
HeliconiusHMEL0079450.054.81% 
BombyxBGIBMGA004095-TA7e-11744.13% 
DrosophilaCG7099-PA2e-2226.55% 
EBI UniRef50UniRef50_Q7QCJ49e-4122.04%AGAP002656-PA n=4 Tax=Anopheles gambiae RepID=Q7QCJ4_ANOGA
NCBI RefSeqXP_312268.41e-4021.91%AGAP002656-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3479683033e-4022.04%AGAP002656-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3838494061e-3528.04%PREDICTED: general transcription factor 3C polypeptide 1-like [Megachile rotundata]
Group
KEGG pathway 
InterPro domain[123-197] IPR0073091.6e-11B-block binding subunit of TFIIIC
Orthology groupMCL24984 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211788-TA
ATGAAAATAAGATTCTGGAATTACATCCTCGCATCAGAATCAATTTCAGTTTATGAGTTAAAAACCTCACCTCCCTTCATTGAAATATTAGATCGTTTTACGATTATTGATGAAAACACTGGGAATCTTTTGGATCCTCCAGATTACCTAGATGGACCATATGAATTTCAAAGTATTGATAATGAATATGGATCTTGCATTAATTTTAACACCAGAAAACAATTGCCCAAGGATGCATTAAAGATGATTAGCTATGAAGAAACAGTTTCGAAATATGGAGACAAATTAATCCTTGTTGCCTCTTTAGAAGAACGCTGGAAAGCTTTAGCACCCCATCTACCAATAATGTATTTGAACCAACTAACTCCAGTACACTTCTGTTTGTTGGAACTTATTGGAAAATCAAGACATAATGGCCAAATGACTGTTGGGAAAACTAATTTAAGCAAAGTAGTGAAAGATGCAAAACTGTTGTTTTATAATAGAACTGTCTTACAAAAATTTGATCTCATAAAAGTGCAATACTCTACACAAGTTGTGGGCGGCAGAGCTTTAAAGAGTATATTACTAAGATTGAATAGGTTCTACCAACCTACATTGTGTACTCAGCCTAAGATTGGCAAACTTCATAATATCATTAACTATCTGTTAGAAGCACCAGATTATTCTGAACAGACTGACGTTATGATCAAAAAAGGTTTATTAACTCCACCACAAAGCAAGAGATTACAAAAAACTATTAACATATTTAACTTTGAAGAAAAAGACATAATAATAAACCGTGGTGAAAATTCTAAAATGGAACCTTCAAGACACACTGTGAAAAGGAAATGTATATCTCTCAACTGTCAGAGTGATGAGTCCCAATCAGATGAGGAAAGTAATGAAAATGATAGCAATTTAAAATGCCAGTATAAAGTGGGAGTTAATTTGGGGAGGCAGGCATATGAATGTTTCTTGGCAGCAGGTCTTAAAGGTCTGACCCAAATTGATATAGCACAGCTTTTGGGTATAGAGTTTTATACAAGCCGGACCATATGTAGAGTTTTCAAAGCAAGAAAAATTGTGAGAGAGTTCCTCGAAGATAAGGGAAGACAGAGAACCGCTAGATATATCGCAATAGCTGCTACTAAATATATTGACAAACAGTATGCCAAAGAAAAAAGCAAATTCTTGGAACACTGTAATAAAAGAAGTAAAGATAAAAATAAAAGCAAAGGTGATTTAAGTTCTGACTCCGAAAGCGAAGTACCATTGAAAAAGATAAAGAAAGCCAAAGAAAACACTCCAGTGAACGAAACTAAGCAGGATATTACAGAAGTTAAAATCATGGAAGGATGCGAAAACGTGACAGAGTCTTTATTAAACCAAAAAAAGAATCCTACTCTAAGGCAGTTGAAATTCGCAAACGGTATTATAAAAGTCGTGCAAGATAGGAAATATGTGATGGGGTTCCAGGTGCTAAGCAATTTGGTTGCCAAAGAAACGGGCGAACCGCCGATGGATACGAAGGCGCTCAAAATATTTACCCAAAAGCTAGTAGCTGATGGGCAGCTCAAGATATTAAAGATAAAAAGGCCAAGCGTCAACGGCTTAACCTGCAATATATTTATTTGCGCGCCACACATTAAAGCCTCCAACCCCATCATACGAGCAAAATACAAGGAGATCTGCGCCAGATCGGAAATTAGCAAGAAAGCTAAACTGCACAAATCGACACCTAGGAATATAAGGCCGGTGACACACTTCACATATCCTAGGTACATGAAAATTCAGAAACTCCACGAGTTCTTGACACAGCTGGTTTACTTCAATGAGAGTAGCTGTAAAGATAGTATTAATGGTATCTCCTCACTGATGAGTATTATACCGGAAATGACAGTTGAACTGGCTTTGGGTCATATAAGCCATATGGACGAAACTGATATAAATTATATCAAAACCGCAGCCTTTTCACTGGACGATAAGGTCAAAGACGCTCCGATACGACTGAATCGAATAATATTACAATCGAGGAGCTTAATAAATTCCTTGAAGGTTGTCTTAAAAGTTCTAGCGCTATTAGGTCTTATTCAATTAGTCCATCAGACTACAGTGCCTATCAACAGAGATGCTACTGACGTTTTCTATGTTAATCGTCACGCGAAGATTATAAACACCATGGGAAAATGGCCGCAACCGAATATCGATAAAACTGTTTTAGAAAAATCATATCATTTCAAAAAGTTCGAAGACGTTCAGGAATATTGGAGCGATGTACACGATATAAGCATCAACACTACGATAAATGTACCGAGGAGGCAGCAATATAAATTATGTATACCAGTGAGAAACGAACAGAACCTCACTAATTACGACACAGGGGAAATATTTGGTGATGGTTTGGGACCTTGTGGTTTCGACTCTAGTATATACATGGATATAGCCAGGCTATGGCGATCATTTATGATAAAAAATCACACTAAGAAAAAATTGCGCAAACCAAAAGTCAAAAAGGTTAAGAAGAAATCATCCGCTCCTGCGGAAAGTAAAACTAAAATAAAAAGTATAAAGAACGAAAATAATGCAATTAAAATAAAAGATTTCATGAATATACGGAAAAGACAGCTCGATTCGCATATAAAGTGGTCCAACTTCGAAGACAGGATATTAATGATGTGTAAAGCGGCTGTCACTATAATGAGTCCAGTGTCACAACCAGGATGCTTAAGGGTTAGGAATATTGTCGCCAAAGACTTGCTTTCAATATTCGATCCGAGAAAAACAGCAGCCATTTGTCACAAAAGAGCCATCATCCTAGAAACGAATTCAACTTTGGCACACGAGAAAGACTGTATCCTAAACGAATTAAGATGTCATAGAAACCTGATACAAAAGTACGAAGGTCTCTTAAGAGTTTTAAGACTTAGAAATTACGCGAATCTTTCAAAATATGTAAACGAATCAAGATTGCCGCTGTTGCAATTGATCTGGATTATAGCTCAGATCGCGGAGACTAAGCCTTTTAATAGGAGAATGCCGTGCATGGCAATCGACCTCAAAGATTTTAATAAGAAATACAACATATCACCGTCGTCGGCGAACAGAGCTTACAACTTATATAGAACGCCGACGTCTTCGAGACCCGAATTCGCTATATTAAAAGAGGCCATCATTATGACCATAATGCTGTCATTGGACGAGCCAGTAAATAAGAAAATGGGGAAAAAGATATTTTCAATTTTTAATGTTTACGACGAGCCCATGCTTAGAAGTGCCATACAACAGCTGAGAAAAAGTGGAGCGATCGCTGCCAGAGACAAAATGATCAACAGCCAATTGAAGAAAGAACATTGCGAAGACATAGTCCAAACTATGTATAAAATAGCCATGATGTACAGAAGAAAATGGATGAGCCATCTGGAGCCAGAGTTTTTGGACAGTCTTTCAGAGTTACTTAGCAGCAAGATACCGCACAACGGTTTGAAAGGATCCTGCGATATCAATTGTCTGTTATGCGAAACGTACAGCCATGATGTTATAGATATCGTCTCTGACACGATACCCGTAGTGACGGGTTCCGCGGGATCTATATTGCAGGAGGAACAGTTAAATGTCATCGACATTGAAACTAAATTCAAATTAAAGTCCGGCCTCCTTGGATGGAAAAACAAATCTAACGTAGAAACATTTTCTGAACTATATCATAATATCGGCTATGATGGGATATTTGAAAACTTAATAAGGAAATCTATTGTGGATTACTCAAAGACTGAGGAAATTGATGAGATGGACAATATAATACAATTTTTGGAAGAAAAGGAATCGAAGGGATGCACATTTAAAGAATTGGATGAGAAAATACCTGATGACCAGAACACATTAGTTAAAAGATTATTAGATCTTGAGAGCAAAGAAATCGTAAAGAGGGTTGGGCTTTACGAAAACGTGATTGTGCTGAAGAAATACGCGAAACCCTACTTATTGTTTGTACCTAAAAACCTTTACATGATACCAACGCCCTGGCTGACATTAAAGGCCGAGATTCAGAAGGACGTATTCTTCAAATGGGCAGGCGTTATAATGAACAAAGTATTCGAGTGCCCCGGCTGCTCAGTAGCATTCATCAGCAATAGTATAGAATACATAACATATAGAAGCGTCCAAGAAGTATGTATGTTCCTAGAACAGACGAAATGCGTGGAACTGGTCGGTGTGGAGAAGAAGGACACAGATCTGTTCACAGAGGAAGATTACAACTGGGTACCGGAGTTATACGAGTTTAATCCTTACGAATCACCAGATAACATTCTAGTAGTCCCCGTTAAGGATTGTTTAAGCAAATATGCTTATATAAGAAGCAAGATATTAAACGACATTGAACTGTCATAA

Protein sequence:

>DPOGS211788-PA
MKIRFWNYILASESISVYELKTSPPFIEILDRFTIIDENTGNLLDPPDYLDGPYEFQSIDNEYGSCINFNTRKQLPKDALKMISYEETVSKYGDKLILVASLEERWKALAPHLPIMYLNQLTPVHFCLLELIGKSRHNGQMTVGKTNLSKVVKDAKLLFYNRTVLQKFDLIKVQYSTQVVGGRALKSILLRLNRFYQPTLCTQPKIGKLHNIINYLLEAPDYSEQTDVMIKKGLLTPPQSKRLQKTINIFNFEEKDIIINRGENSKMEPSRHTVKRKCISLNCQSDESQSDEESNENDSNLKCQYKVGVNLGRQAYECFLAAGLKGLTQIDIAQLLGIEFYTSRTICRVFKARKIVREFLEDKGRQRTARYIAIAATKYIDKQYAKEKSKFLEHCNKRSKDKNKSKGDLSSDSESEVPLKKIKKAKENTPVNETKQDITEVKIMEGCENVTESLLNQKKNPTLRQLKFANGIIKVVQDRKYVMGFQVLSNLVAKETGEPPMDTKALKIFTQKLVADGQLKILKIKRPSVNGLTCNIFICAPHIKASNPIIRAKYKEICARSEISKKAKLHKSTPRNIRPVTHFTYPRYMKIQKLHEFLTQLVYFNESSCKDSINGISSLMSIIPEMTVELALGHISHMDETDINYIKTAAFSLDDKVKDAPIRLNRIILQSRSLINSLKVVLKVLALLGLIQLVHQTTVPINRDATDVFYVNRHAKIINTMGKWPQPNIDKTVLEKSYHFKKFEDVQEYWSDVHDISINTTINVPRRQQYKLCIPVRNEQNLTNYDTGEIFGDGLGPCGFDSSIYMDIARLWRSFMIKNHTKKKLRKPKVKKVKKKSSAPAESKTKIKSIKNENNAIKIKDFMNIRKRQLDSHIKWSNFEDRILMMCKAAVTIMSPVSQPGCLRVRNIVAKDLLSIFDPRKTAAICHKRAIILETNSTLAHEKDCILNELRCHRNLIQKYEGLLRVLRLRNYANLSKYVNESRLPLLQLIWIIAQIAETKPFNRRMPCMAIDLKDFNKKYNISPSSANRAYNLYRTPTSSRPEFAILKEAIIMTIMLSLDEPVNKKMGKKIFSIFNVYDEPMLRSAIQQLRKSGAIAARDKMINSQLKKEHCEDIVQTMYKIAMMYRRKWMSHLEPEFLDSLSELLSSKIPHNGLKGSCDINCLLCETYSHDVIDIVSDTIPVVTGSAGSILQEEQLNVIDIETKFKLKSGLLGWKNKSNVETFSELYHNIGYDGIFENLIRKSIVDYSKTEEIDEMDNIIQFLEEKESKGCTFKELDEKIPDDQNTLVKRLLDLESKEIVKRVGLYENVIVLKKYAKPYLLFVPKNLYMIPTPWLTLKAEIQKDVFFKWAGVIMNKVFECPGCSVAFISNSIEYITYRSVQEVCMFLEQTKCVELVGVEKKDTDLFTEEDYNWVPELYEFNPYESPDNILVVPVKDCLSKYAYIRSKILNDIELS-