Monarch geneset OGS2.0

DPOGS207630
TranscriptDPOGS207630-TA948 bp
ProteinDPOGS207630-PA315 aa
Genomic positionDPSCF300199 - 42952-45434
RNAseq coverage693x (Rank: top 18%)
Annotation
HeliconiusHMEL0098273e-14782.86% 
BombyxBGIBMGA006014-TA0.099.05% 
DrosophilaTfIIB-PA3e-16789.21% 
EBI UniRef50UniRef50_P290524e-16589.21%Transcription initiation factor IIB n=23 Tax=Eukaryota RepID=TF2B_DROME
NCBI RefSeqXP_395432.19e-16991.11%PREDICTED: similar to Transcription initiation factor IIB (General transcription factor TFIIB) isoform 1 [Apis mellifera]
NCBI nr blastpgi|3838623152e-16891.75%PREDICTED: transcription initiation factor IIB-like [Megachile rotundata]
NCBI nr blastxgi|3838623159e-16291.75%PREDICTED: transcription initiation factor IIB-like [Megachile rotundata]
Group
Gene OntologyGO:00063554.2e-188regulation of transcription, DNA-dependent
GO:00082704.2e-188zinc ion binding
GO:00063524.2e-188transcription initiation, DNA-dependent
GO:00064135.5e-27translational initiation
GO:00037435.5e-27translation initiation factor activity
KEGG pathwayame:4119652e-168 
 K03124 (TFIIB)maps-> Basal transcription factors
InterPro domain[5-315] IPR0008124.2e-188Transcription factor TFIIB
[115-215] IPR0137632.6e-38Cyclin-like
[120-190] IPR0131505.5e-27Transcription factor TFIIB, cyclin-related
[13-54] IPR0131378.6e-13Zinc finger, TFIIB-type
Orthology groupMCL13602 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207630-TA
ATGGCGAGCACGTCGAGAATCGAAGCTAATAAGGTGGTGTGTTATGCTCATCCCGATGCCCCGCTTATTGAGGATTATCGAGCGGGTGACATGATATGTTCAGAATGCGGCCTCGTTGTTGGCGACAGAGTTATAGATGTAGGTTCAGAATGGCGTACCTTCAGCAATGAGAAATCTGGAGTCGATCCATCTCGTGTGGGAGGTCCGGAAAATCCTTTGCTATCAGGAGGGGATCTCTCCACTATCATTGGTCCCGGAAGAGGCGATGCTTCCTTTGACAGTTTTGGGGTATCCAAATATCAAAACCGAAGAAATATAAGCAGCACAGACAGAGCGCTTATAAATGCGTTTAGAGAAATAAACACTATGGCTGATAGAATTAATCTGCCTAAAACTATAGTGGACAGAGCCAATAATTTGTTTAAACAGGTACACGATGGAAAAAATTTAAAAGGCAGAGCAAACGATGCAATAGCCTCAGCATGTCTATATATAGCTTGCCGGCAGGAAGGAGTGCCGAGAACATTCAAAGAAATCTGTGCGGTTAGTAAAATTAGTAAGAAAGAAATTGGAAGATGTTTCAAACTGATCCTTAAGGCTTTGGAAACGTCAGTGGACTTGATAACAACAGCAGATTTCATGTCTCGGTTTTGTGCCAATCTGGGTTTACCAAACTCAGTGCAACGAGCTGCAACACATATTGCAAGGAAAGCCGGGGAATTGGACATTGTGTCTGGAAGAAGTCCCATATCTGTGGCAGCCGCTGCCATATATATGGCTTCACAGGCGTCCGAGGATAAGCGTAGTCAGAAAGAGATTGGTGATATAGCCGGTGTAGCGGATGTAACGATCAGACAGTCATACAAATTGATGTATCCATGTGCCGCAAAATTGTTCCCCGAAGACTTCAAGTTCGCTACACCCATTGAATTCCTACCGCAAATGTAA

Protein sequence:

>DPOGS207630-PA
MASTSRIEANKVVCYAHPDAPLIEDYRAGDMICSECGLVVGDRVIDVGSEWRTFSNEKSGVDPSRVGGPENPLLSGGDLSTIIGPGRGDASFDSFGVSKYQNRRNISSTDRALINAFREINTMADRINLPKTIVDRANNLFKQVHDGKNLKGRANDAIASACLYIACRQEGVPRTFKEICAVSKISKKEIGRCFKLILKALETSVDLITTADFMSRFCANLGLPNSVQRAATHIARKAGELDIVSGRSPISVAAAAIYMASQASEDKRSQKEIGDIAGVADVTIRQSYKLMYPCAAKLFPEDFKFATPIEFLPQM-