Monarch geneset OGS2.0

DPOGS207766
TranscriptDPOGS207766-TA2070 bp
ProteinDPOGS207766-PA689 aa
Genomic positionDPSCF300042 - 300573-303466
RNAseq coverage354x (Rank: top 33%)
Annotation
HeliconiusHMEL0175570.077.02% 
BombyxBGIBMGA005317-TA0.072.87% 
Drosophilathoc5-PA4e-5929.76% 
EBI UniRef50UniRef50_F4WXM16e-14743.10%THO complex subunit 5-like protein n=7 Tax=Formicidae RepID=F4WXM1_ACREC
NCBI RefSeqXP_001661802.11e-14341.15%fms interacting protein [Aedes aegypti]
NCBI nr blastpgi|3320206852e-14643.10%THO complex subunit 5-like protein [Acromyrmex echinatior]
NCBI nr blastxgi|3320206856e-14343.23%THO complex subunit 5-like protein [Acromyrmex echinatior]
Group
KEGG pathway 
InterPro domain[96-467] IPR0191635.4e-102THO complex, subunit 5
Orthology groupMCL13534 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207766-TA
ATGGGTAAGGACGATACCTCAACGAAAAAACGACGTAAACTGACTACTACTTCATCGAGTGATAATAACACTAAGCAGACCCCGGTCGATATTTATAAGAAAGTCGTCGAATTCGAAGAAGCTGAGGCGCAGTTACGTTCAGCCGATAAGGATGCAGCGTTGTTTAAAAAGATATGTCAAGATGTTCGCCAATTATTTGCCGAAATAGCAGAATTAAAAGAAAAAGGCACTGATGAGGCAAAAGAAAAAATCAATGTAAAAAGAGTAGAGGCATCATTGCATTTAGTAGCATTGAAAAAGTTAAACAGATTGGAAAAGGTTCGTACAAGAGCTGGAAGAGAGGCTCTGCACAAAGAAAAGCAGAGAGTTGATTCAACACATCTCCTCTTGCAAAATCTTCTCTATGAAGCTGATCATCTTAATAAAGAAGTGACAAAATGCTTACAATTTAAATCAAAAGATGAAGAAATAGAATTAATACCACTAGAAGAATTTTACAAGGAAGCACCAAGTGAAATCTCTCGGCCGGAAGTAACAAAAGCAGATGAGCATCAACTTCAATTGGCAAGGCTTGAGTGGGAATTACGTCAGCGAAGGGAACTTGCTGGGGCCTGTAGTGAGTTGGTGGCTTCAAAGGAATGTGTGGCAGCAGCTATAGCTGCAGCACGGTCGAGGCTGAATGCACTCTCACCGCATTTGAAAGATGTTCTGAAGGCTACAAAACCACTTCAAGAATGTCTAGCTCTTAGATTGGATGAAAAGAGAGATGAGACGAGAGCAGCATCACTTCTCCCACCTCCTTTATTTTTGCTCTACGCCAATGCCAGTGCATATTCTGATGCTCTTGGTGCTAGCAATGTTGTTGTTGGAATATCTGGAGATGAAGATGAAGCGAAAAGATTGGATCAGTTAAGCAATGTTGAAAGTGAACTTGTAGTATCAAACGATTCAGACTCTGACCAGGAAAATAACTATGAAGAACCAAGAGATAAGAAAAAGAGACACCACCGAGGTACAAAAATATCAAGAGAAGAGAAAGCCGAGGCCAAAAAGAAAGAAGTACTTAAAAGACATCCTCTTAATGTTAAAGTTACTGTGAAAATACCAGACGGGACTGCATTGAATCTTATATTCTCATACATGGTTCATTTAAAAATTATTGTAGTCAAAAACACTCTGGACCTGTTTAAACCTATAACAGGAGTTTCAGCTGCCGATGTATTGAATGGAGACTGTATACTTAACGAACTTTACATTGGTGACAACGGCAATGACTCTCCACATCCAGCCACCACCTATTTACTTAATGCAGCTGGCATTGTGGAAGATTTTCACTATTTTATTCCTGAAGTTGGTAGACCTTACATATGGGCTCAGAGAATGTGCGGATTGGATTTCATGGCAGTGACGGGTGAAGAAAAAAAGTCCAATATTATTCAGCCGAGTCAAAGTCTCAGTGTTGTCAGTGTTGAAAATTTTATTTTTACTCTGAAGAAAAGATTGAAATCGAGAGTGGAACTTATGAAAGAATTGCAAGATTTGGAAAGTGGTAAAATTATACCGGAAAAAGGCGTGGGATGTCCCTTGAGACTATCAGGTTCGTTGACCCAGTGGCAGTCAGTGGGATGGAATGAATATAGCCAATCGACTTCAACATCATTCCTGATATCGGAAGGCCAAGTGAATCCAGAGAATATGTTGTACCGCGCTATAATCACAAGACAATCAGCTAAGCTCGTAGCATTGGTTGCCGTGAGTAGTGATTATCCGAAAAAGGCACCGCTGTTTTCATTGACATTACATTGGAACGGTACACACACCGCAGGAACAAACGATGACATAAGAGACATCGAGAGAATCATCAATACGAACTGGACTAATGATGGCAATAAGTCTACTCTCACCGCACAGATGACGAAGTTACTCACTTGTCTAGATATTCTCCTGGAGACCACAGGGTCATCAGAATTTCCTCCCGACAAAGTAATGTTCCAGTCAGTGAGAGGAAGAAATCGGATGAAACCTTACCGTTTTATAAAACAAGGTACAGGCGTGTTTGTACAATATTGA

Protein sequence:

>DPOGS207766-PA
MGKDDTSTKKRRKLTTTSSSDNNTKQTPVDIYKKVVEFEEAEAQLRSADKDAALFKKICQDVRQLFAEIAELKEKGTDEAKEKINVKRVEASLHLVALKKLNRLEKVRTRAGREALHKEKQRVDSTHLLLQNLLYEADHLNKEVTKCLQFKSKDEEIELIPLEEFYKEAPSEISRPEVTKADEHQLQLARLEWELRQRRELAGACSELVASKECVAAAIAAARSRLNALSPHLKDVLKATKPLQECLALRLDEKRDETRAASLLPPPLFLLYANASAYSDALGASNVVVGISGDEDEAKRLDQLSNVESELVVSNDSDSDQENNYEEPRDKKKRHHRGTKISREEKAEAKKKEVLKRHPLNVKVTVKIPDGTALNLIFSYMVHLKIIVVKNTLDLFKPITGVSAADVLNGDCILNELYIGDNGNDSPHPATTYLLNAAGIVEDFHYFIPEVGRPYIWAQRMCGLDFMAVTGEEKKSNIIQPSQSLSVVSVENFIFTLKKRLKSRVELMKELQDLESGKIIPEKGVGCPLRLSGSLTQWQSVGWNEYSQSTSTSFLISEGQVNPENMLYRAIITRQSAKLVALVAVSSDYPKKAPLFSLTLHWNGTHTAGTNDDIRDIERIINTNWTNDGNKSTLTAQMTKLLTCLDILLETTGSSEFPPDKVMFQSVRGRNRMKPYRFIKQGTGVFVQY-