Monarch geneset OGS2.0

DPOGS202410
TranscriptDPOGS202410-TA2748 bp
ProteinDPOGS202410-PA915 aa
Genomic positionDPSCF300233 - 16235-29697
RNAseq coverage69x (Rank: top 66%)
Annotation
HeliconiusHMEL0036897e-5893.01% 
BombyxBGIBMGA003305-TA3e-14274.89% 
DrosophilaTfIIFalpha-PA2e-10670.06% 
EBI UniRef50UniRef50_B4N9832e-13553.21%GK12178 n=5 Tax=Eumetazoa RepID=B4N983_DROWI
NCBI RefSeqXP_002070644.13e-13653.21%GK12178 [Drosophila willistoni]
NCBI nr blastpgi|1954461366e-13553.21%GK12178 [Drosophila willistoni]
NCBI nr blastxgi|3479685382e-17160.47%AGAP002779-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00056349.5e-150nucleus
GO:00036779.5e-150DNA binding
GO:00458939.5e-150positive regulation of transcription, DNA-dependent
GO:00038249.4e-54catalytic activity
GO:00063679.4e-54transcription initiation from RNA polymerase II promoter
KEGG pathwaydwi:Dwil_GK121789e-136 
 K03138 (TFIIF1)maps-> Basal transcription factors
InterPro domain[5-496] IPR0088519.5e-150Transcription initiation factor IIF, alpha subunit
[8-170] IPR0110399.4e-54Transcription Factor IIF, Rap30/Rap74, interaction
[429-497] IPR0119912.4e-31Winged helix-turn-helix transcription repressor DNA-binding
Orthology groupMCL11506 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202410-TA
ATGACGACACCAGGAACTTCACAACCGGCCACGGTACAAGAATTCAAAATCAGGGTGCCAAAGAACGTGAAGAAGAAATACCACGTGATGAGATTCAACGCGACCCTCAACGTTGACTTCGCGAAGTGGACTCACGTGAAGATGGAGAGAGAGAACAACATTAAGGAGTTCAAGGGAACGGAAGAGGAAATGCCAAAGTTCGGCGCTGGTTCAGAATACGGCAGGGATGTGAGGGAAGAGGCTCGACGGAAGAAATTTGGTATCATCTCGCGGAAATACAAACCTGAAGATCAACCCTGGATACTGAAAGTAGGCGGGAAAACTGGCAAGAAGTTCAAAGGTATCCGCGAGGGCGGTGTCTCTGAGAACGCAGCCTACTACGTCTTCACGCACGCCGCTGACGGAGCTATCGACGCCTACCCTCTACAAGAATGGTACAATTTCCAACCGATCCAGCGCTACAAGGCGCTTTCCGCCGAAGAAGCGGAACAGGAATTTGGAAGACGTAACAAGGTGATCAATTACTTCTCACTGATGTTCCGTAAACGTATGAGAGGGGACGACGCGGCCGACGAAGACGATCCCGATGACAAGAAAACCAAGGGGGCGAAAGCTAAGAAGGATCTGAAGATATCTGAAATGGACGAGTGGATAGATTCCGACGACGAGTCCTCGGATTCAGAAGGAGACAAAGACAAAGAGAAGGAGGACAGCGACTCCGGCACCAAAAAGAAGAATAAGAAGAAAGCGGTGCCAAAGAAGAAGAAGAAGGTCAATGATGAGGCGTTTGAAGAGAGTGATGATGGAGACGAGGAGGGCAGGGAGAGAGATTATATATCAGACTCATCGGAGAGTGAGTCCGACCATGAGACGAAAGCCAACAAGGAGCTGAAGGGAGTCGCCGAGGAAGACGCTCTGAGGAAACTGCTGACATCGGACGAGGGTACGGACTCGGAGCAGGAACAGAAACAAGAGTCGGAGGGAGAAGACGAGCCCACCAAGGAGGGGGAGGAGAGAGCGAGCAAACTCACCAAGAAGAAGAAGAAGGAAGACGCCAAGAGAGACACCAGCAGCGACTTCAGCTCAGACTCCGACACCGACCCCGAGAACAGCAGCAAGAAACAGAAGAAAGGAAAGAACAATGACGCGAAAAACAACAACGCGGGTGGTAGCGCGAGCACGTCTCGCTCGTGCACTCCCACACCTTCAAACGCGATGTCCGCCGTGGCCGCAGCCAACGCCGCCAACAACCAGCCCGCCAAGAGAGCGAAGCTGGACCCCTCGTATACGGAGTGCGGCGTCACGGAGGAGGCCGTGCGCCGTTACTTGACTAGGAAGCCGATGACCACCACGGAGCTGCTGACCAAGTTCAAGTCCAAGAGGAGCGGCGTGTCCTCCGAGAGGCTCGTGGAGACCATGACGCAGATCCTCAAGAGGATCAACCCCGTCAAACAGAACATCAACGGCAAGATGTATCTTAGCATCAAACAGACGTTCAAAGGTATCCGCGAGGGCGGTGTCTCTGAGAACGCAGCCTACTACGTCTTCACGCACGCCGCTGACGGAGCTATCGACGCCTACCCTCTACAAGAATGGTACAATTTCCAACCGATCCAGCGCTACAAGGCGCTTTCCGCCGAAGAAGCGGAACAGGAATTTGGAAGACGTAACAAGGTGATCAATTACTTCTCACTGATGTTCCGTAAACGTATGAGAGGGGACGACGCGGCCGACGAAGACGATCCCGATGACAAGAAAACCAAGGGGGCGAAAGCTAAGAAGGATCTGAAGATATCTGAAATGGACGAGTGGATAGATTCCGACGACGAGTCCTCGGATTCAGAAGGAGACAAAGACAAAGAGAAGGAGGACAGCGACTCCGGCACCAAAAAGAAGAATAAGAAGAAAGCGGTGCCAAAGAAGAAGAAGAAGGTCAATGATGAGGCGTTTGAAGAGAGTGATGATGGAGACGAGGAGGGCAGGGAGAGAGATTATATATCAGACTCGTCGGAGAGTGAGTCCGACCATGAGACGAAAGCCAACAAGGAGCTGAAGGGAGTCGCCGAGGAAGACGCTCTGAGGAAACTGCTGACATCGGACGAGGGTACGGACTCGGAGCAGGAACAGAAACAAGAGTCGGAGGGAGAAGACGAGCCCACCAAGGAGGGGGAGGAGAGAGCGAGCAAACTCACCAAGAAGAAGAAGAAGGAAGACGCCAAGAGAGGTAACACTAACACAGCAGACATACCCGCACAAATCAAAGACGTCACCTGGGACATGAAGATGGATTCTTACAGGCATCTTGTGGTTACAGACACCAGCAGCGACTTCAGCTCAGACTCCGACACCGACCCCGAGAACAGCAGCAAGAAACAGAAGAAAGGAAAGAACAATGACGCGAAAAACAACAACGCGGGTGGTAGCGCGAGCACGTCTCGCTCGTGCACTCCCACACCTTCAAACGCGATGTCCGCCGTGGCCGCAGCCAACGCCGCCAACAACCAGCCCGCCAAGAGAGCGAAGCTGGACCCCTCGTATACGGAGTGCGGCGTCACGGAGGAGGCCGTGCGCCGTTACTTGACTAGGAAGCCGATGACCACCACGGAGCTGCTGACCAAGTTCAAGTCCAAGAGGAGCGGCGTGTCCTCCGAGAGGCTCGTGGAGACCATGACGCAGATCCTCAAGAGGATCAACCCCGTCAAACAGAACATCAACGGCAAGATGTATCTTAGCATCAAACAGACGTGA

Protein sequence:

>DPOGS202410-PA
MTTPGTSQPATVQEFKIRVPKNVKKKYHVMRFNATLNVDFAKWTHVKMERENNIKEFKGTEEEMPKFGAGSEYGRDVREEARRKKFGIISRKYKPEDQPWILKVGGKTGKKFKGIREGGVSENAAYYVFTHAADGAIDAYPLQEWYNFQPIQRYKALSAEEAEQEFGRRNKVINYFSLMFRKRMRGDDAADEDDPDDKKTKGAKAKKDLKISEMDEWIDSDDESSDSEGDKDKEKEDSDSGTKKKNKKKAVPKKKKKVNDEAFEESDDGDEEGRERDYISDSSESESDHETKANKELKGVAEEDALRKLLTSDEGTDSEQEQKQESEGEDEPTKEGEERASKLTKKKKKEDAKRDTSSDFSSDSDTDPENSSKKQKKGKNNDAKNNNAGGSASTSRSCTPTPSNAMSAVAAANAANNQPAKRAKLDPSYTECGVTEEAVRRYLTRKPMTTTELLTKFKSKRSGVSSERLVETMTQILKRINPVKQNINGKMYLSIKQTFKGIREGGVSENAAYYVFTHAADGAIDAYPLQEWYNFQPIQRYKALSAEEAEQEFGRRNKVINYFSLMFRKRMRGDDAADEDDPDDKKTKGAKAKKDLKISEMDEWIDSDDESSDSEGDKDKEKEDSDSGTKKKNKKKAVPKKKKKVNDEAFEESDDGDEEGRERDYISDSSESESDHETKANKELKGVAEEDALRKLLTSDEGTDSEQEQKQESEGEDEPTKEGEERASKLTKKKKKEDAKRGNTNTADIPAQIKDVTWDMKMDSYRHLVVTDTSSDFSSDSDTDPENSSKKQKKGKNNDAKNNNAGGSASTSRSCTPTPSNAMSAVAAANAANNQPAKRAKLDPSYTECGVTEEAVRRYLTRKPMTTTELLTKFKSKRSGVSSERLVETMTQILKRINPVKQNINGKMYLSIKQT-