Monarch geneset OGS2.0

DPOGS211027
TranscriptDPOGS211027-TA2412 bp
ProteinDPOGS211027-PA803 aa
Genomic positionDPSCF300004 + 1649491-1654506
RNAseq coverage390x (Rank: top 31%)
Annotation
HeliconiusHMEL0071466e-10352.87% 
BombyxBGIBMGA006506-TA2e-15577.67% 
DrosophilaEloA-PA1e-7742.12% 
EBI UniRef50UniRef50_E1ZWV81e-8844.90%Transcription elongation factor B polypeptide 3 n=2 Tax=Formicidae RepID=E1ZWV8_CAMFO
NCBI RefSeqXP_396851.32e-8640.99%PREDICTED: similar to Elongin A CG6755-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3800120748e-8943.67%PREDICTED: LOW QUALITY PROTEIN: transcription elongation factor B polypeptide 3-like [Apis florea]
NCBI nr blastxgi|3504184662e-13437.67%PREDICTED: transcription elongation factor B polypeptide 3-like [Bombus impatiens]
Group
Gene OntologyGO:00056345.8e-28nucleus
GO:00063555.8e-28regulation of transcription, DNA-dependent
GO:00160215.8e-28integral to membrane
GO:00036772.3e-16DNA binding
GO:00063512.3e-16transcription, DNA-dependent
KEGG pathway 
InterPro domain[624-724] IPR0106845.8e-28RNA polymerase II transcription factor SIII, subunit A
[1-93] IPR0179232.3e-16Transcription factor IIS, N-terminal
[4-78] IPR0036173.3e-07Transcription elongation factor, TFIIS/CRSP70, N-terminal, sub-type
Orthology groupMCL15828 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211027-TA
ATGGCGTCTGTTTTAGATTTAGTTAAACATTACCAACGATCTATAGAAAAATATCCCAATGACGAACAGAAAATATTAAAGAGTATAGATAAGTTATACCACTTGAAAGTAACTGTACAGCATTTGCAAGATACTGGTGTTGGCCGCACAGTCAATGCTCTGCGAAAGGAACCAGGAGAAATTGGACAAGCTGCTAGAGCTCTTGTGTTAAAATGGAAGGTTATGGTGGCTGCGGAAGAAAGTGATCATGAAGATCATAATGACGACACCCAAAACTACAGTAGTCATGATAATGGCAGGGATTATGACAGTAACCCAAGCAAATCTACAAGTAAACATGATACATCTGAGAAATCAAATAGAAGATCAAAGACTGAAGAGAAGTACCATAAGCAAACAAATGGGGATTATAGTGGAAATAAGAGAAAATATCAAAGTAGTGAGGAGGAAGACCACGACAATACAAAAAAATCTAAATACTCACAAGATAACGGCTATAATATAAAATCAGAATCAAGAAAGAAAATAGAGTCATCCGAAAGTGAAAACAGTGAGGATGAATCATCACAAAGTGATAGTGGCAGTGAAGATACAAAGTCAGAAGATGAAGAAGAAGAGATCCCAGATACTGAAGCAAAGGTAGTGAAAAATCAACACAAAGAGTCATATAAACCATCCCATTTGAGACAGTCATCTAGTTCACATCAAAGCAAACATAAGCATGAATCAAGACATGATAGAGAAGATTCAAAGCAAAGCAAGGAACATAATGACAGTTCTGATAAAAGACATTCATCTGACAAACCCAAAGACAAGTCCCACAGTTCTCACTCCCACAAAAGTTCTAAAGGGCATGAAAAAAGTTTAGACAAAAAAGACAAAGAAAATAGACATTCAGACAAAGAGAAGCAAATAAAAGAAGACTCAAGTAAACACAGTACTAAACATAGATCTGGTAGCAGTTCAGATAAAACCAAGTCAAGCGGAACTGACAAACACAAGTCTAGTGAGAGTTCACACAAACATAAGTCCAGCAAGAGTGAAGATAAACACAGATCAAGTAGCAGTTCAGAGAAAAATAAAAAACTTGTACCCGAATATCATAAAAGTCACTCAGAGAGAACATCAAGTAGTACAGACAAACAATCCTCACAGATATCTGATAAACACAAAAAATCTTCTCATAAAGATGTTTCTAATGATAAACATAAATCTAAAGAACATGATGATAAAAAAGATGAACAAGTCAAAGAAAAGTCAAAAGAAAAACATAGTTCCTCAAAAGACAAAGAAAGTCACAAAAGTGAGAAGAAACATTCTTCTAAAGAATCAGGTGACAGCAAACGAAAATCTGATAGCAATCATAACAGCGATAGTTCCAAGAAAAGTAAACATAAGTCTAGTTCTAGTAAACAATCTAACAAGTCAAGGGAGGACAAGGAGAAACAACCAAAAAGAACAGAAGATAGCGATGATGGTATAGATTGTGGCTCAGGTGCCAGTTTTGCCGAAGCACTTGGCATGATAAGTCCGTCAAAGCCAAAGAAAAAATCTATATTTTCTAAAGATAATATGCAATCTCCACGCTCTCCTAGTGACAATCTCAACCCTCCTAATTTGCTAGCACCTAGTGCTAAATTGGCGCCATTGCCCTCTTTAGAAATATCTGCCTTACCAGAGATATCACCCAATTATCGGCCACGCCCACCTCCGAAATTCCTACCACACTTCAGCGATGAAGACGCTATGAGTAGCGCAATATCGTCGAAAAATCAGAGAACAAAAGTCTATTCTGGCAACAAAGTTATAGGAAAAATCACAACATTATATGAAATGTGTGTCCATGTCCTGCAAGAACATATTGATGCCCTTGAATACACTGGTGGGGTTCCATATGAAATATTAAAACCAGTTGTGGATAAAGCAACTCCACAGCAGTTATTTGTTTTGGAACATTACAACCCATACCTCATGGACGACACTGATCATTTGTGGCAGAAATTCTGTGAGAAAAGTTTTAGGAACAAGAAACGACAGGAAATGGAGACTTGGAGGGAAATGTATATTCGATGCCAAGAAGAACAAGAAATTAAGCTTAAATCACTCACTGCCAACATCAAAATGACTCAAGAGGCAAAGAAGGCGCCCATAAAGCAAACTAAAATGGCCTATGTTGATACTGTAGTGAAACCACCTCGTAATGTTGCAAAGAAACAGGCACAACACGGTACAGCATTTGCTGCTACTGCCAGCCCTGCTGCTAGGGTTGCCTCTCTTTCTGCAGCACCTAATGTATTAAAAGGTGGCAGGGCTGCCCCAGCCCCGGTTATAACAAACTCATCGAACTTCAAGCCCAAGAAAGCACCGCTTATGCAAAAAGCACTGCAATTTATGCGCGGAAGAAAACGATGA

Protein sequence:

>DPOGS211027-PA
MASVLDLVKHYQRSIEKYPNDEQKILKSIDKLYHLKVTVQHLQDTGVGRTVNALRKEPGEIGQAARALVLKWKVMVAAEESDHEDHNDDTQNYSSHDNGRDYDSNPSKSTSKHDTSEKSNRRSKTEEKYHKQTNGDYSGNKRKYQSSEEEDHDNTKKSKYSQDNGYNIKSESRKKIESSESENSEDESSQSDSGSEDTKSEDEEEEIPDTEAKVVKNQHKESYKPSHLRQSSSSHQSKHKHESRHDREDSKQSKEHNDSSDKRHSSDKPKDKSHSSHSHKSSKGHEKSLDKKDKENRHSDKEKQIKEDSSKHSTKHRSGSSSDKTKSSGTDKHKSSESSHKHKSSKSEDKHRSSSSSEKNKKLVPEYHKSHSERTSSSTDKQSSQISDKHKKSSHKDVSNDKHKSKEHDDKKDEQVKEKSKEKHSSSKDKESHKSEKKHSSKESGDSKRKSDSNHNSDSSKKSKHKSSSSKQSNKSREDKEKQPKRTEDSDDGIDCGSGASFAEALGMISPSKPKKKSIFSKDNMQSPRSPSDNLNPPNLLAPSAKLAPLPSLEISALPEISPNYRPRPPPKFLPHFSDEDAMSSAISSKNQRTKVYSGNKVIGKITTLYEMCVHVLQEHIDALEYTGGVPYEILKPVVDKATPQQLFVLEHYNPYLMDDTDHLWQKFCEKSFRNKKRQEMETWREMYIRCQEEQEIKLKSLTANIKMTQEAKKAPIKQTKMAYVDTVVKPPRNVAKKQAQHGTAFAATASPAARVASLSAAPNVLKGGRAAPAPVITNSSNFKPKKAPLMQKALQFMRGRKR-