Monarch geneset OGS2.0

DPOGS211177
Transcript          DPOGS211177-TA    816 bp
Protein             DPOGS211177-PA    271 aa
Genomic position    DPSCF300007 + 408065-409598
RNAseq coverage     323x (Rank: top 35%)
Annotation
Heliconius          HMEL012416          6e-134    90.44%
Bombyx              BGIBMGA003166-TA    3e-129    84.94%
Drosophila          TfIIFbeta-PA        4e-98     66.04%
EBI UniRef50        UniRef50_P41900     5e-96     66.04%    General transcription factor IIF subunit 2 n=34 Tax=Coelomata RepID=T2FB_DROME
NCBI RefSeq         XP_623868.1         5e-109    74.81%    PREDICTED: similar to Transcription initiation factor IIF subunit beta (ATP-dependent helicase TfIIF-beta) (TFIIF-beta) isoform 1 [Apis mellifera]
NCBI nr blastp      gi|332376103        1e-108    70.72%    unknown [Dendroctonus ponderosae]
NCBI nr blastx      gi|380020787        5e-112    72.96%    PREDICTED: general transcription factor IIF subunit 2-like [Apis florea]
Group
Gene Ontology       GO:0005524    3.6e-101    ATP binding
                    GO:0005674    3.6e-101    transcription factor TFIIF complex
                    GO:0006367    3.6e-101    transcription initiation from RNA polymerase II promoter
                    GO:0003824    4e-34       catalytic activity
KEGG pathway        ame:551470    1e-108
                    K03139 (TFIIF2)    maps-> Basal transcription factors
InterPro domain     [13-271]     IPR003196    3.6e-101    Transcription initiation factor IIF, beta subunit
                    [7-131]      IPR011039    4e-34       Transcription Factor IIF, Rap30/Rap74, interaction
                    [187-254]    IPR011991    8.5e-29     Winged helix-turn-helix transcription repressor DNA-binding
Orthology group     MCL14429    Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211177-TA
ATGAGTAATTCCAATACAGTGCCCCATGTCGACCGTGAATTAGATTTGTCAAATACGGGAAGAGGTGTCTGGCTGGTAAAGGTGCCGAAATATATTGCCAACAAATGGGACAAAGCTTCAGGTGATATAGAAGTAGGTAAATTAAAGATATCTCGAGTTCCTGGTCAACGTGCTCAAGTCCAGCTGTCGCTGTCGGAAGCTGTTTTATGTTTAAACGAACCCGGAGAACAAGCTTTACCTAAGGAGCATCGCCTCGATGTATCTAATGTAACACGACAATCTTTAGGAGTTTTTTCTCATGCTGTTCCATCAAATACGGATACAGTGGTTCCTGAATCTGAAAAACTTTATATGGAAGGTAGAATAGTGCAAAAATTAGAATGTAAGCCATATGCAGACCCAACTTATTACAAGCTTAAGTCTGAATCAATAAGGAAGGCTTTGATGCCACAAAGACAAGTACAACAACTGAAAGGAATTGTGCAAAATTTCAAACCTGTGTCTGACCACAAACATAATATTGACTATCAAGTAAAGAAGAAAGCGGAAGGTAAAAAGGCTCGTGATGACAAGGAATCGGTGCTCAATGTTTTGTTTGCAGCATTTGAGAAACACCAGTATTATAATATTAAGGATTTGCAAAATATAACAAGACAACCTATAGTATATTTGAAGGAAATATTAAAGGAGGTCTGCAATTATAATTTAAAGAATCCCCATAAGAATATGTGGGAATTGAAACCAGAGTACAGGCACTACAAACAGGAGGCCCCTGTTGAGACCAAAGAGGACCCTCAAAGTTCGGACAGCGACTAG

Protein sequence:

>DPOGS211177-PA
MSNSNTVPHVDRELDLSNTGRGVWLVKVPKYIANKWDKASGDIEVGKLKISRVPGQRAQVQLSLSEAVLCLNEPGEQALPKEHRLDVSNVTRQSLGVFSHAVPSNTDTVVPESEKLYMEGRIVQKLECKPYADPTYYKLKSESIRKALMPQRQVQQLKGIVQNFKPVSDHKHNIDYQVKKKAEGKKARDDKESVLNVLFAAFEKHQYYNIKDLQNITRQPIVYLKEILKEVCNYNLKNPHKNMWELKPEYRHYKQEAPVETKEDPQSSDSD-
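
The transcript and protein lengths in this record are consistent with each other: 816 bp is 272 codons, and dropping the terminal stop codon leaves 271 aa. A minimal sketch of that check follows, assuming the standard nuclear genetic code and the record's convention of writing the stop codon as "-". The names translate and expected_protein_length are hypothetical helpers introduced for illustration and are not part of the database or of any library.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are written as stop_symbol, matching the "-" that ends
    the protein sequence in this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS that ends in one stop codon (assumption)."""
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # First 21 nt and last 18 nt of DPOGS211177-TA, copied from the record.
    print(translate("ATGAGTAATTCCAATACAGTG"))   # MSNSNTV
    print(translate("AGTTCGGACAGCGACTAG"))      # SSDSD-
    print(expected_protein_length(816))         # 271
```

Passing the full DPOGS211177-TA sequence to translate should give a string that can be compared directly with the DPOGS211177-PA entry.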