Monarch geneset OGS2.0

DPOGS214644
TranscriptDPOGS214644-TA1464 bp
ProteinDPOGS214644-PA487 aa
Genomic positionDPSCF300050 + 712745-717562
RNAseq coverage337x (Rank: top 34%)
Annotation
HeliconiusHMEL0219303e-8279.49% 
BombyxBGIBMGA005062-TA2e-13563.16% 
DrosophilaBrf-PB8e-8050.70% 
EBI UniRef50UniRef50_B0WL023e-9245.47%Transcription factor IIIB 90 kDa subunit n=1 Tax=Culex quinquefasciatus RepID=B0WL02_CULQU
NCBI RefSeqNP_001037055.11e-14764.45%TFIIB-related factor [Bombyx mori]
NCBI nr blastpgi|1603338892e-14664.45%TFIIB-related factor [Bombyx mori]
NCBI nr blastxgi|1603338894e-16060.28%TFIIB-related factor [Bombyx mori]
Group
Gene OntologyGO:00063553.7e-110regulation of transcription, DNA-dependent
GO:00082703.7e-110zinc ion binding
GO:00063523.7e-110transcription initiation, DNA-dependent
GO:00056345.8e-23nucleus
GO:00458935.8e-23positive regulation of transcription, DNA-dependent
GO:00064131.4e-14translational initiation
GO:00037431.4e-14translation initiation factor activity
KEGG pathwaypis:Pisl_16672e-07 
 K03124 (TFIIB)maps-> Basal transcription factors
InterPro domain[7-488] IPR0008123.7e-110Transcription factor TFIIB
[5-116] IPR0137631.1e-26Cyclin-like
[263-355] IPR0116655.8e-23Brf1-like TBP-binding
[12-85] IPR0131501.4e-14Transcription factor TFIIB, cyclin-related
Orthology groupMCL10411 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214644-TA
ATGTGTCGTGATTTTTATGATCCATGTCTGTACATAATACGTTTCGCATCCCAGCTGCAGTTCGAGGAGAAGCAGCATGAGGTGAGCATGACGGCCCTGAGACTGGTGCAGAGGATGAAGAGGGACTCCATACACTCAGGGAGGCGACCCTCGGGCGTCTGTGGGGCTGCACTCCTTATAGCTGCTCGACTGCACGATTTCTCACGAACGCCGACCGATGTGGTCCGTATCGTGAAGATTCATGAAACAACGCTTCGGAAAAGGCTCTTGGAGTTTGGGGAGACGCCGTCCAGCGCCCTCACCTTGGACGAGTTTATGACAGTGGACCTCGAGGAGGAACAGGACCCACCAGCCTTCAAGATAGCCAGAAGGAGGGACAAAGATAGACTGCAGAAGTTAATGGAAGAGGAAGATGGCGAAAGAGAACTCACAGAGCTGCAAAGAGAAATAGACGCTCAGTTAGATAAAGACAAGAAAAGACGAAAGCCCACAAAGATGGTCGCGCCTATACTGCCTGCATCCGTTGACGACGATACGTCAGAAGACGCTGAGGCGAGTAGATTCGCAACAGAGGATACCTTATCACTTATAGGTGATATAGCGAGAGACGTGCAGCCGACACCCGACGGTGCCAATAACGACAAACTCAAGCTGGAAAAAGGACTTGGTCCAGAGATAGCCTTTATGGGTCTGAATCCACCCGAAGAGAGGGACAAGTCGAAGCCGGAGTCCAAGCAGTTTACCAAGGACCTGCAGGCGGATGAACTACATCTGGATGATGATGATGAACAGTACCTCGACTCGCTGATCATGACGGACGAAGAAGCCAGACATAAGACTCTGCTGTGGCATAACATTAACGCTGGATATTTGAAGGAACAAAAAATTAAAGAAGAAATACGCGCTAAAGAACGGGAAGAGGGTAAAGACAAAAAGAAGAAAACCCGTGGATCATACAAGAAGAAGGTGGCCATTACTGGAGCGACCGCTGGCGAGGCTGTGGGGAAGATGCTGGCGGAGAAGAAGATGAGCTCCAAGATTAATTACGACATATTGAAGAGCCTGGATCATCCTGGGTCGCCCAGTGTACCGCCGGTGAATGTTGTGAAGGCTGTAGAAGAAACTAGTACAATCCAACCGCCAATACTCGAGACGGTGCCGCCGTCGCCGGTGCCGAAGAAGCGGAAACGGAAGGAGAAGACAGCTCCCTCCCTGGCTCCTGCTACAGTTTCTGCGAGGCCGTCACAGGGAGAACCCTCCCCGGCAGGACCCCTCCAGTCCCCCACAGAACCCCTGCCCTCTCCTGCGGACCCCTTGCCCTCCCCTGCACCGGAAACGCCAGCACAGAATGCTGATGATTATGAAGATGACTTTGAGGATCCGGTCGACAGCCGAGAGATGTCGCTAGCAGCGCTCTTACAGAACGGAAACGACGACGAATATTACGACTACGAAGAATATTAA

Protein sequence:

>DPOGS214644-PA
MCRDFYDPCLYIIRFASQLQFEEKQHEVSMTALRLVQRMKRDSIHSGRRPSGVCGAALLIAARLHDFSRTPTDVVRIVKIHETTLRKRLLEFGETPSSALTLDEFMTVDLEEEQDPPAFKIARRRDKDRLQKLMEEEDGERELTELQREIDAQLDKDKKRRKPTKMVAPILPASVDDDTSEDAEASRFATEDTLSLIGDIARDVQPTPDGANNDKLKLEKGLGPEIAFMGLNPPEERDKSKPESKQFTKDLQADELHLDDDDEQYLDSLIMTDEEARHKTLLWHNINAGYLKEQKIKEEIRAKEREEGKDKKKKTRGSYKKKVAITGATAGEAVGKMLAEKKMSSKINYDILKSLDHPGSPSVPPVNVVKAVEETSTIQPPILETVPPSPVPKKRKRKEKTAPSLAPATVSARPSQGEPSPAGPLQSPTEPLPSPADPLPSPAPETPAQNADDYEDDFEDPVDSREMSLAALLQNGNDDEYYDYEEY-