Monarch geneset OGS2.0

DPOGS209572
Transcript        DPOGS209572-TA  978 bp
Protein           DPOGS209572-PA  325 aa
Genomic position  DPSCF300015 - 1136494-1137890
RNAseq coverage   343x (Rank: top 34%)
Annotation
Heliconius      HMEL017053        5e-160  78.77%
Bombyx          BGIBMGA006632-TA  2e-95   53.27%
Drosophila      thoc6-PA          1e-56   36.76%
EBI UniRef50    UniRef50_E2BKD8   2e-87   48.64%  THO complex subunit 6-like protein n=8 Tax=Formicidae RepID=E2BKD8_HARSA
NCBI RefSeq     XP_001121632.1    2e-90   49.39%  PREDICTED: similar to thoc6 CG5632-PA [Apis mellifera]
NCBI nr blastp  gi|383852266      3e-90   50.61%  PREDICTED: THO complex subunit 6 homolog [Megachile rotundata]
NCBI nr blastx  gi|383852266      2e-91   50.61%  PREDICTED: THO complex subunit 6 homolog [Megachile rotundata]
Group
Gene Ontology    GO:0005515  5.2e-29  protein binding
KEGG pathway 
InterPro domain  [8-317]    IPR011046  5.2e-29  WD40 repeat-like-containing domain
                 [8-290]    IPR015943  3e-25    WD40/YVTN repeat-like-containing domain
                 [145-182]  IPR001680  1.5e-06  WD40 repeat
Orthology group  MCL11776  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209572-TA
ATGCTTGATAAGATATTTTATAACACGGTTTTATGCCAAACATATTCTCCGTGTGGTAAATATTTGGTAGCCGGTAATATTTATGGACAACTAGCAGTGTTTGATCTCGATAACATATTTAATCCTGTGATAGAACTACTTACACCAGATTATAACAAGCCAAAACACATTCATACTTTAGAGTCAGAGAATCAAGTATGTAGTTTAGTGAGCACAGAGAATTTTTTAATTGTTGGCTCAGTAAATGAAATATTGGGATGGAACTGGAAATCAGTAATTCATCCAAAATTAGGTAAACCTGCATGGACAATAAGAATACAGCCAAAGTCGTTCATTGAGAAATGTGATATTAATTATCTGTGGTATTGTGAAGAAGAAGGAAAACTATATGTAGGATGTGGTGATAATAATATATATATATACAATTTAGAAGATGGTAAGCTTGTGTCTACCTTGGAAGGCCACTCTGATTATATACACTGTTTACATGGCAATGGACATCAACTTATTTCTGCTGGTGAAGATGGCAAGGTCCTTCTCTGGGACACAAGAATGAAAAAAAGTCATAACAAAATCGAACCATACAATAACAGTAAAGTTGCGAGACCAGATATTGGTAAATGGATGGGAGCCGCTGCTTTGGGAGATGATTGGATTGTATGTGGAGGTGGTCCCAGATTGGCTCTTTGGCATCTACGCTCCTTGGATGTTGTGACGGTGTTTGATATTCCTGATCATGGAATTCATGTGTCCTTCTTTCATGATGACTGCGTGTTTGCCGGCGGGGCTGCCAAGCACTTGTATCAGCTCAGTTACTCGGGAGATATAAGAGTTGAGCTGCCAGTGTCATCAACCACAGTATACTCCGCGGTGCTGAGAACAAGCCCACATAAAGTCCTAACAATAGCTGGTTCCAGCCCCGAAATTGACTTGTGTACCACATTCAACTATAGAGACCAAGTCTTGCATTTCAGATGA

Protein sequence:

>DPOGS209572-PA
MLDKIFYNTVLCQTYSPCGKYLVAGNIYGQLAVFDLDNIFNPVIELLTPDYNKPKHIHTLESENQVCSLVSTENFLIVGSVNEILGWNWKSVIHPKLGKPAWTIRIQPKSFIEKCDINYLWYCEEEGKLYVGCGDNNIYIYNLEDGKLVSTLEGHSDYIHCLHGNGHQLISAGEDGKVLLWDTRMKKSHNKIEPYNNSKVARPDIGKWMGAAALGDDWIVCGGGPRLALWHLRSLDVVTVFDIPDHGIHVSFFHDDCVFAGGAAKHLYQLSYSGDIRVELPVSSTTVYSAVLRTSPHKVLTIAGSSPEIDLCTTFNYRDQVLHFR-
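
Length check:

The reported lengths agree with each other if the transcript is read as a coding sequence that includes its stop codon: 978 bp is 326 codons, which gives 325 residues plus the stop (shown as the trailing "-" in the protein sequence). Below is a minimal Python sketch of that check. The function name is hypothetical, and the coding-sequence-only reading is an assumption inferred from the nucleotide sequence starting with ATG and ending with TGA; the record does not state it.

```python
def expected_protein_length(cds_bp: int) -> int:
    """Residue count for a CDS of cds_bp nucleotides, excluding the stop codon.

    Assumption (not stated by the record): the transcript is coding
    sequence only, with no UTRs, and ends with a stop codon.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_bp // 3 - 1


# Lengths reported for DPOGS209572-TA (978 bp) and DPOGS209572-PA (325 aa).
print(expected_protein_length(978))  # prints 325
```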