Monarch geneset OGS2.0

DPOGS205111
TranscriptDPOGS205111-TA876 bp
ProteinDPOGS205111-PA291 aa
Genomic positionDPSCF300172 - 31933-34327
RNAseq coverage186x (Rank: top 49%)
Annotation
HeliconiusHMEL0146963e-15290.14% 
BombyxBGIBMGA005878-TA2e-13588.01% 
DrosophilaTfIIEbeta-PA3e-11169.07% 
EBI UniRef50UniRef50_O968814e-10969.07%IP01109p n=32 Tax=Arthropoda RepID=O96881_DROME
NCBI RefSeqXP_001864405.14e-12472.03%transcription initiation factor IIE subunit beta [Culex quinquefasciatus]
NCBI nr blastpgi|1700572598e-12372.03%transcription initiation factor IIE subunit beta [Culex quinquefasciatus]
NCBI nr blastxgi|1187863631e-12274.83%AGAP005382-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00056732.2e-139transcription factor TFIIE complex
GO:00063672.2e-139transcription initiation from RNA polymerase II promoter
KEGG pathwaycqu:CpipJ_CPIJ0134751e-123 
 K03137 (TFIIE2)maps-> Basal transcription factors
InterPro domain[1-283] IPR0166562.2e-139Transcription initiation factor TFIIE, beta subunit
[58-139] IPR0119915.3e-39Winged helix-turn-helix transcription repressor DNA-binding
[64-138] IPR0031662.6e-29Transcription factor TFIIE beta subunit, DNA-binding domain
Orthology groupMCL13587 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205111-TA
ATGGATCCAGCACTTCTGAGAGAGCGGGAGGCGTTCAGGAAAAAGGCACTTGCAACGCCGAGTGTTGAGAAAAAGAAGAGAGATGATTCCTTTAAAGATGATAGCAAGAAGAAATCGAAATCATCAAGCTCAGCAAATGCTGCCCCAAAAATTGATGCCACCAATTACAAGACGATGGCTGGCAGTTCCTCCTATCGTTTTGGTGTGTTGGCTCGCATCGTCAGACACATGAAGTCCAGACACCAAGAAGGGGATGACCATCCGCTGTCGATTGATGAAATTTTAGATGAAACAAATCAGTTAGATGTTGGAAGTAAAATTAAACAGTGGCTTCAGACTGAAGCGTTACAGAATAATCCTAAAATAGAGCACACATTTGACGGAAAGTTTATATTTAAACCAGTTTATAAAATTAAGGACAAGAAATCATTACTGAGATTACTGAAGCAACATGATTTAAAAGGTCTAGGGGGGATTTTTCTAGAGGACGTCCAAGAATCATTGCCGCACTGCGACAGAGCATTGAAAAGTTTAGCGCAAGAAATATTATACATAACAAGACCCTCAGATAAAAAGAAAATTTTGTTTTATAATGACAAAACCGCCACTTTAGATGTTGACGAAGAGTTTGTGAAATTATGGCGAGCCACGGCCGTAGATGCCATGGACGACGCCAAGATAGAGGAATACTTGGAGAAGCAAGGCATCAAATCAATGCAGGACCACGGCCCAAGGAAACCTGTGGTGCCCAAACGGAAGAAGGTCACCCAGAAGAGGAGGCAGTTCAAGAAACCCAGGGACAATGAACATCTGGCTGACGTACTCGAAACGTACGAAGATAATACGCTGACACAAAAAGGTGTCAGTATTAAATGA

Protein sequence:

>DPOGS205111-PA
MDPALLREREAFRKKALATPSVEKKKRDDSFKDDSKKKSKSSSSANAAPKIDATNYKTMAGSSSYRFGVLARIVRHMKSRHQEGDDHPLSIDEILDETNQLDVGSKIKQWLQTEALQNNPKIEHTFDGKFIFKPVYKIKDKKSLLRLLKQHDLKGLGGIFLEDVQESLPHCDRALKSLAQEILYITRPSDKKKILFYNDKTATLDVDEEFVKLWRATAVDAMDDAKIEEYLEKQGIKSMQDHGPRKPVVPKRKKVTQKRRQFKKPRDNEHLADVLETYEDNTLTQKGVSIK-