Monarch geneset OGS2.0

DPOGS207841
TranscriptDPOGS207841-TA5217 bp
ProteinDPOGS207841-PA1738 aa
Genomic positionDPSCF300042 + 1137031-1151376
RNAseq coverage613x (Rank: top 21%)
Annotation
HeliconiusHMEL0153120.083.16% 
BombyxBGIBMGA005523-TA0.082.84% 
DrosophilaSpt6-PA0.058.08% 
EBI UniRef50UniRef50_E3WRH90.059.47%Putative uncharacterized protein n=3 Tax=Pancrustacea RepID=E3WRH9_ANODA
NCBI RefSeqXP_967189.20.061.73%PREDICTED: similar to Spt6 CG12225-PA [Tribolium castaneum]
NCBI nr blastpgi|3504126180.062.49%PREDICTED: transcription elongation factor SPT6-like [Bombus impatiens]
NCBI nr blastxgi|3504126180.051.80%PREDICTED: transcription elongation factor SPT6-like [Bombus impatiens]
Group
Gene OntologyGO:00327840regulation of transcription elongation, DNA-dependent
GO:00063570regulation of transcription from RNA polymerase II promoter
GO:00037231.2e-11RNA binding
GO:00055152.3e-09protein binding
KEGG pathway 
InterPro domain[1-1725] IPR0170720Transcription elongation factor Spt6
[1128-1280] IPR0230977.1e-41Tex RuvX-like domain
[1065-1071] IPR0233231.1e-12Tex-like domain
[1371-1424] IPR0030291.2e-11Ribosomal protein S1, RNA-binding domain
[1371-1432] IPR0160271.8e-11Nucleic acid-binding, OB-fold-like
[1370-1428] IPR0123401.2e-10Nucleic acid-binding, OB-fold
[1476-1565] IPR0009802.3e-09SH2 motif
Orthology groupMCL10793 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207841-TA
ATGGCAGATTTTTTGGATTCAGAAGCTGAGAGCGAGGTAGAATCTGAAGAGGATGAGAAACCAACTGAGCGTAAAAAGCCAAAGCGTAAAGCTGCCGTTCAAAGTGATGATGAAGACGAAGAGGAAGAGGATGATGAAGAAAGGCTACGCGAGGAGTTGAAGGATCTTATAGATGATGCACCCATAGAAGAATCTGGAAGTGATGGTGAGGATTCAGATGCCAGTGTTGGCCCAAAGAAAAGAAAAAAAAGTGATGATGAGCTTGATGATAGATTGGAAGACGAAGATTATGACCTCATCGAGGAAAATCTTGGTGTTAAAGTTGCTAGAAATAAATTTAAGAGATTGAGACGTTTGGAGGATGATGACAGTGACAATGAAGGGGCTGATGACCCTGATTTGGAGAGGGAGGTTATAGCTGAGAAACTCTTTGTTGGTGGCTCCGATGAGGAGGATGAAAATCGTTCAGAATCAGCTGCACCGAGGGAAATAGAATATGACGATGAGAATGAAGATATGGAGTCGGATGCTGATGACTTCATTGTTGACGACGACGGGAGACCCATAGCTGAAAGAAAGAAAAAACGCAAACCTATTTTTACTGATGCGTCATTACAAGAGGGTCAGGATATATTTGGTGTAGATTTTGATTATGATGAGTTTGAGAAGTATGGTGAAGAAGACTATGAAGATGAAGATGATGAGGATTTGGATGAATATATTGAGGACGAGGAAGAAGAAGGCGGAGAGAGAAGAAGGACTAAAAAGGGTAAAGCGAAACGTCCAAGTAAAAAGTCAATATTTGAAATATATGAGCCTAGTGAGCTGAAGCGGAGTCACTTCACCGATTTAGATAATGAGATAAGAAAAACTGATGTGCCGGAGCGTATGCAAATCCGTGAGGTGCCTATAACACCAGTCGAAGAAGGCAGTACGGAGTTGGAAGATGAAGCTGAATGGATTTATAAACAGGCGTTCCTTAAAGCGCCCGTGTCCAAAGCTGATTCACAGGAAGCGAGGGAACGAACTCGGCGTACCGGAGGTACAGTGATAAAGATTCGTCAGGCCTTAGATTTTATGAGAAACCAGACATTGGAAGTTCCTTTTATAGCGTTTTATCGTAAGGAGTATGTGCAGCCCGAATTAAGTATTAATGATCTGTGGAAGGTTTATAAGTATGATGCCAAGTGGTGTCAATTGAAACAGCGCAAGGAGAATTTATTGCGCCTTATAGAAAACATGAGGGAGTTTCAGCTTGATAAAGTTATGGCTGATCCGGACGCCCCCATACCCGACACCATGAGACTTATTAAGGATGAAGATATTGATAGACTGAAGAACATACAGACGCCGGAGGAGTTGCGTGATGTCCACACACATTTCCTCCTGTACTATTCACACGACTTACCTGAGATGCAGAAGGTACAAAGAAGTAAGGAGAGGAAGAAGGAGCTAGAAGAGAAAAATACCGGAGGTACAGTGATAAAGATTCGTCAGGCCTTAGATTTTATGAGAAACCAGACATTGGAAGTTCCTTTTATAGCGTTTTATCGTAAGGAGTATGTGCAGCCCGAATTAAGTATTAATGATCTGTGGAAGGTTTATAAGTATGATGCCAAGTGGTGTCAATTGAAACAGCGCAAGGAGAATTTATTGCGTCTTATAGAAAACATGAGGGAGTTTCAGCTTGATAAAGTTATGGCTGATCCGGACGCCCCCATACCCGACACCATGAGACTTATTAAGGATGAAGATATTGATAGACTGAAGAACATACAGACGCCGGAGGAGTTGCGTGATGTCCACACACATTTCCTCCTGTACTATTCACACGACTTACCCGAGATGCAGAAGGTACAAAGAAGTAAGGAGAGGAAGAAGGAGCTAGAAGAGAAAAAGAAGTTAGCACGTGAAGAAGCAGAGAAGAACGGCGAGGATCCAGAAGAGGCGGCGGCTGCTGTGGTCGCCGCTGAACCTGATGATGAAGAGGCCACAGAGGTTAAATATGCTGTCCGCTCTGGACCATACGAACTCTGCAGGAAGGCTGGCATCGAGCCTCTGGTGAAGAAATTCGGTCTCTCCCCGGAGCAGTTCGCTGAGAACGTCCGAGACAACTACCAGCGTCACGAGGTGGAACAACAACCTGTACCGCCGCTAGAGGCCGCCGCCGAATATGTGTCTCCGGGCGGGTCCTGCAGTGAGGTGGTCCGTCGCGCCGTGTATATGTGTGGTGTGCAGTTGGCTCGGGAACCTCTACTACGTGCGACGCTTAGGGATGCTTTGAGAGAGAGGGCCACCGTCACAACCAAACCTACCCCGCGCGGTCTGAAGGAAATCGATGAGGGACATCCCTGCTACAGCATGAAATATTTGAAAAAGAAACCCGTTCGTGACCTCACCGGGGACCAGTTCCTGAGGTTGACGATAGCGGCGGACGACAAGCTGCTAGAGTTGACGATCAGCGAACAGATCGAGGGAAACACGAGCCCCAGCTACCTAGAGGAACTGAAACAGTTGTATCAAAAAGACGAGTTTGCGGCCACAGTTCAGGCGTGGAACGAGCTACGGGCGCAGGCCGTGACACTGGCGCTCACTAAGATCGTTATACCAGAACTGAGACGAGAACTACACGCGGTGCTGTTGCAAGAAGCTAAGGATTATGTGCTCAAGTGTTGTCGACGTCGTCTGTATGATTGGCTCAAGGTGGCTCCTTATGAGTCCAGGGTGTCTGATGAGGACGACGAAGAATGGGACTCTTCAAATGGTCTCCGTGTCATGTCTATCGCTTACGTGCCGGAGCGCACTCAGTGCGCGTTCGCCACGGTGGTCGCGGCGGCCGGGGAGGTGGTCGACCATCTGAGACTGCCGCATCTCCTGCTGAGGAGGAACGCCTGGGACGCCGCGGAGAGACGGAACAAGGAGGCTGACATGACGTCCCTCAGGCGGTTTATCCAGCGTAAGAAGCCCCATGTGATCGTGATCGGCGGCGAGTCTCGTGAGGCGTTGTCGGTGAAGGCGGACGTCGCGGAGTGCGTGGCGCAGCTGGTGGATGATGAACAGTTCCCGAGGATACCAGTCGAGATAGCCGACAACACCATCAGCAAAATATACAGCAATAGTATAAGAGGACGGAACGACTTCAGAGAATATCCGGAAACGTTGAGGCAAGCGATTTGTCAGGCGCGTCTCTTACAAGATCCCCTGATGGAGATATCGCAGCTCTGTGGCCCCGACGAAGAAATACTTTGTCTACGATACCATCCGCTACAAGACCAGTTGCCTAAAGAGGACCTGTTAGAAGGAATCGAGTTGGAGTTCGTTAACAGGATTAATGAAGTCGGCGTAGACGTTAATGAAGCGGTTCTGACCGGGCGAGGGACTGAGTTGCTACAATTCGTTTGTGGTTTAGGGCCGAGGAAGGCGCAGGCTCTCATAAAGTTATTCAAACAAACTAACCAGAAGCTAGAGAACAGGACCCAATTGGTCACGGTCTGTCACATGGGGCCGAAGGTTTTCATCAATTGCTCCGGTTTTATAAGGATCGATACCAGCAGTCTCGGTGACAGCACGGAGGCGTATATAGAAGTATTGGACGGTTCCAGAGTTCATCCGGAAACGTATGAATGGGCGAGAAAAATGGCCGTAGACGCTCTGGAGTACGAGGACGAAGACGCGAACCCAGCCGGAGCCCTGGAGGAGATTCTGGAAGCGCCCGAAAGGTTGAAAGACTTGGATTTAGACGCGTTCGCAGAGGAATTAGAAAGGCAGGGTTTCGGTAATAAGAGCATAACGCTCTACGACATCAGAGCCGAATTGAACTCTAGATACAAAGATCTCAGAGTCGCATATCGATCGCCGACACCCGAGGAATTATTCGATATGTTGACCAAAGAGACCCCGGAAACTCTGTATGTTGGTAAAATGGTGCTGGCAACCGTCATAGGTATCTCGCATAGGAAACCTCAAAGGGAAATGTTAGATCAGGCGAATCCCGTCAGAAATGACGAAACCGGATTGTGGGAATGTCCCTTCTGCCACAAGAACGACTTTCCCGAACTCTCTGAGGTATGGAATCATTTCGACGCAGGCGCGTGTCCGGGTCAAGCGACGGGGGTTAGGGTCAGATTAGACAACGGCCTGTCGGGCTACATACATATAAAGAACCTCTCAGACAGACACGTCACTGACCCCACAGAGAGGGTCAGAATAGGCCAGACCATACATTGTAGGATATTAAAGATTGATGTTGAAAGATTTTCGGTAGATTGTTCCTCTAAATCGTCTGACTTGTTGGATAAGAATAATGAGTGGAGACCACCAAAAGATCCTTACTACGATCAAGAATCTGAGGACAAAGATGTCCGAAAAGACACAGAGACTAAGCAGACTAAAGAGCGAATGCAGTATGTGAAACGAGTTATAGTACATCCAGCATTCCACAATATAAGTTTCGCTGAAGCCGAAAAGCTCATGGAGAACATGGCACAAGGGGAAGTCATAGTTAGGCCTAGCAGTAAGGGATCTGATCACCTGACAGTCACCTGGAAGGTGGCAGACGGTATATGCCAGCACATTGACGTGCGGGAAGAGGGCAAAGAGAATGCCTTTTCTTTAGGTCGTAGTCTTTGGATACAGGGATCAGAGTTTGAAGACTTGGATGAAATTATCGCGAGATACGTTACACCTATGGCGGGTCATGCGAGAGACCTCATTGCTTACAAATACTACAAGAACCTTGGCGGCATGAGAGACAAGGCTGAAGAGATTCTCAAAGACGAGAAGTCGAAGAATCCCAATAAGATTCACTACCCTGACTTCTTGACGCCAGCCATCCCTCGCCATAAACCTCAACCTTCCGTGTACACCGAGCCATCAGACTGGCAGAAGGCGGCGGAGGACTGGGTGCGGCACCGAGCCCGGGACGAGAGTACCTCGACTCCTCGACACGCAACGCGTACACCCCGCCACGACGCTCGCGCCACCCCCCGCCACGACTCTCGCGGCACCCCTCGCCACGACTCACGCGGCACACCGCGGCACGACGCCCGCTCGTCGGTCCACAGCACGCCACACAGCACGCCTCACGCCACGCCACACGGCCACACGTCCTCCGGCCGGTCGTCACGCTCGCACTCCGCACGTTCCACGCCGCACACGAACACGTCGCCGCGCTCTATGTCATTAGGGGACGCGACACCGCTCTACGACGAGAATTAG

Protein sequence:

>DPOGS207841-PA
MADFLDSEAESEVESEEDEKPTERKKPKRKAAVQSDDEDEEEEDDEERLREELKDLIDDAPIEESGSDGEDSDASVGPKKRKKSDDELDDRLEDEDYDLIEENLGVKVARNKFKRLRRLEDDDSDNEGADDPDLEREVIAEKLFVGGSDEEDENRSESAAPREIEYDDENEDMESDADDFIVDDDGRPIAERKKKRKPIFTDASLQEGQDIFGVDFDYDEFEKYGEEDYEDEDDEDLDEYIEDEEEEGGERRRTKKGKAKRPSKKSIFEIYEPSELKRSHFTDLDNEIRKTDVPERMQIREVPITPVEEGSTELEDEAEWIYKQAFLKAPVSKADSQEARERTRRTGGTVIKIRQALDFMRNQTLEVPFIAFYRKEYVQPELSINDLWKVYKYDAKWCQLKQRKENLLRLIENMREFQLDKVMADPDAPIPDTMRLIKDEDIDRLKNIQTPEELRDVHTHFLLYYSHDLPEMQKVQRSKERKKELEEKNTGGTVIKIRQALDFMRNQTLEVPFIAFYRKEYVQPELSINDLWKVYKYDAKWCQLKQRKENLLRLIENMREFQLDKVMADPDAPIPDTMRLIKDEDIDRLKNIQTPEELRDVHTHFLLYYSHDLPEMQKVQRSKERKKELEEKKKLAREEAEKNGEDPEEAAAAVVAAEPDDEEATEVKYAVRSGPYELCRKAGIEPLVKKFGLSPEQFAENVRDNYQRHEVEQQPVPPLEAAAEYVSPGGSCSEVVRRAVYMCGVQLAREPLLRATLRDALRERATVTTKPTPRGLKEIDEGHPCYSMKYLKKKPVRDLTGDQFLRLTIAADDKLLELTISEQIEGNTSPSYLEELKQLYQKDEFAATVQAWNELRAQAVTLALTKIVIPELRRELHAVLLQEAKDYVLKCCRRRLYDWLKVAPYESRVSDEDDEEWDSSNGLRVMSIAYVPERTQCAFATVVAAAGEVVDHLRLPHLLLRRNAWDAAERRNKEADMTSLRRFIQRKKPHVIVIGGESREALSVKADVAECVAQLVDDEQFPRIPVEIADNTISKIYSNSIRGRNDFREYPETLRQAICQARLLQDPLMEISQLCGPDEEILCLRYHPLQDQLPKEDLLEGIELEFVNRINEVGVDVNEAVLTGRGTELLQFVCGLGPRKAQALIKLFKQTNQKLENRTQLVTVCHMGPKVFINCSGFIRIDTSSLGDSTEAYIEVLDGSRVHPETYEWARKMAVDALEYEDEDANPAGALEEILEAPERLKDLDLDAFAEELERQGFGNKSITLYDIRAELNSRYKDLRVAYRSPTPEELFDMLTKETPETLYVGKMVLATVIGISHRKPQREMLDQANPVRNDETGLWECPFCHKNDFPELSEVWNHFDAGACPGQATGVRVRLDNGLSGYIHIKNLSDRHVTDPTERVRIGQTIHCRILKIDVERFSVDCSSKSSDLLDKNNEWRPPKDPYYDQESEDKDVRKDTETKQTKERMQYVKRVIVHPAFHNISFAEAEKLMENMAQGEVIVRPSSKGSDHLTVTWKVADGICQHIDVREEGKENAFSLGRSLWIQGSEFEDLDEIIARYVTPMAGHARDLIAYKYYKNLGGMRDKAEEILKDEKSKNPNKIHYPDFLTPAIPRHKPQPSVYTEPSDWQKAAEDWVRHRARDESTSTPRHATRTPRHDARATPRHDSRGTPRHDSRGTPRHDARSSVHSTPHSTPHATPHGHTSSGRSSRSHSARSTPHTNTSPRSMSLGDATPLYDEN-