Monarch geneset OGS2.0

DPOGS205605
Transcript: DPOGS205605-TA    1677 bp
Protein: DPOGS205605-PA    558 aa
Genomic position: DPSCF300167 + 138123-154771
RNAseq coverage: 2277x (Rank: top 5%)
Annotation
Heliconius: HMEL017459    0.0    85.57%
Bombyx: BGIBMGA007210-TA    8e-147    90.41%
Drosophila: CG4538-PA    1e-169    62.00%
EBI UniRef50: UniRef50_Q16HN4    0.0    68.49%    ATP-dependent clp protease atp-binding subunit clpx n=6 Tax=Metazoa RepID=Q16HN4_AEDAE
NCBI RefSeq: XP_001605256.1    0.0    64.51%    PREDICTED: similar to ATP-dependent clp protease atp-binding subunit clpx [Nasonia vitripennis]
NCBI nr blastp: gi|340729247    0.0    64.90%    PREDICTED: ATP-dependent Clp protease ATP-binding subunit clpX-like, mitochondrial-like [Bombus terrestris]
NCBI nr blastx: gi|345488179    0.0    64.19%    PREDICTED: ATP-dependent Clp protease ATP-binding subunit clpX-like, mitochondrial-like [Nasonia vitripennis]
Group
Gene Ontology:
    GO:0006457    1.9e-254    protein folding
    GO:0005524    1.9e-254    ATP binding
    GO:0051082    1.9e-254    unfolded protein binding
    GO:0000166    5.1e-11    nucleotide binding
    GO:0017111    5.1e-11    nucleoside-triphosphatase activity
KEGG pathway: nvi:100121647    0.0
    K03544 (clpX, CLPX)    maps-> Cell cycle - Caulobacter
InterPro domain:
    [12-547]    IPR004487    1.9e-254    Clp protease, ATP-binding subunit ClpX
    [224-439]    IPR013093    5.1e-37    ATPase, AAA-2
    [446-517]    IPR019489    6e-14    Clp ATPase, C-terminal
    [223-374]    IPR003593    5.1e-11    ATPase, AAA+ type, core
Orthology group: MCL11969    Single-copy universal gene

Nucleotide sequence:

>DPOGS205605-TA
ATGAGTAGCGTTCGACTTAGTTTCGTATCAGTGGGACGGATTGCGGTAAGGAGAAATTCTCAGTTCACATCGGTAGCGTCAGGTGTTCAGACGTGTCGCGCCATCCGTCGCTCCTGTGCCCGCAGCATCAGCACCGGCGTCTCCCTCAGGAAGGGCACCGGGGAACTGCCGCCATCACACAACAAGGATGGGAATACATCCGGAACATCAGGTAAGAAAGGCGGCAGCGGAGTCCTAACATGTCCTAAGTGCGGGGACCCTTGTACCCACGTTGAGACGTTCGTCAGCTCGACACGTTTCGTGAAGTGCGACAAGTGCCACCACTTTTTCGTGGTCCTCAGCGAGGTGGACACCAAGAAGAGCATCAAGGACAACACTGAGAACAAGTCCGGCTTCTACAGGAAACCTCCTCCTCCACCAAAGAAGATCTTCGAATATCTGAACAAGCACGTGGTGGGTCAGGAATACGCCAAGAAGGTGTTGTCAGTTGCCGTCTACAACCACTACAAGCGTATATACAACAACGTGACGAGCGGCGCGCCCGCCGACGGACAACATCCGCTACATCACACGCACAGAGATCTGCTCCACGTGAACCAGGGCCAGAGCCCCGGCGCGGGGGGAGGGGCGGAGGTGCTGGAGAGGAACAACCACGAGCTCAGGCTCGACAAGAGCAATGTGCTGCTGCTGGGACCCACCGGCAGCGGTAAGACCCTCCTCGCTCAAACCATCGCCCAGTGCCTGGACGTGCCGTTCGCCATCTGTGACTGCACCACCCTCACCCAGGCCGGGTACGTGGGCGAGGACATCGAGAGCGTCATCGCTAAACTGCTACAGGACGCCAACTTCAATGTTGAACGAGCACAGACCGGTATAGTGTTTTTGGACGAAGTCGACAAAATAGGAGCCGTGCCCGGGATACACCAGCTGAGGGATGTTGGAGGCGAGGGAGTTCAGCAGGGTATGCTGAAGATGTTGGAGGGCGCGCTGGTGTCCGTCCCCGAGAGGAACTCGCGCAAGCTGAGAGGAGACGCCGTGCAGGTCGACACCACCAACATACTGTTCGTGGCCAGCGGCGCTTATAACGGACTGGACAGACTGATCCAGCGCCGCAACAACGAGAAGTACCTCGGCTTCGGTGCCTGGGACCCTCGCTCGGGGCGCCGCGCAGCCCTGGCCGCCGCCGCCGCCGACGCCTCGCCCCTGGACAGCGCCACGGACGAGGCGGGCGAGAGGGACCACTGGCTGCGAGCCGTGCAGGCCCGGGACCTTATCGACTTCGGCATGATACCGGAGTTCGTCGGCCGCTTCCCGGTGCTGGTGCCCTTCCACAGTCTGAACCAAGATCTGCTGGTTAGGATACTCACGGAACCCAAGAACGCCATTGTGGCTCAGTACAAGTTGTTGTTCGCGATGGACAAGTGCGAGCTGTCGTTCAGCGACGAAGCCTTACGAGCGGTGGCCGCACTCGCCATGGAGAGGAAGACGGGCGCCAGGGGATTGCGGGCTATCATGGAGAATCTGTTACTGGAGGTGATGTTCGAGATCCCCGGCTCAGACATAACCTGCGTTCACATACACGAGGGCTGCGTGCAACGAGCGGAGCCGCCCACCGTGAGGCGGAGGGAGAGGGAGAGGCAACCCTGGAGCTCGGTGCGACTCTCTAATTGTACGTAG

Protein sequence:

>DPOGS205605-PA
MSSVRLSFVSVGRIAVRRNSQFTSVASGVQTCRAIRRSCARSISTGVSLRKGTGELPPSHNKDGNTSGTSGKKGGSGVLTCPKCGDPCTHVETFVSSTRFVKCDKCHHFFVVLSEVDTKKSIKDNTENKSGFYRKPPPPPKKIFEYLNKHVVGQEYAKKVLSVAVYNHYKRIYNNVTSGAPADGQHPLHHTHRDLLHVNQGQSPGAGGGAEVLERNNHELRLDKSNVLLLGPTGSGKTLLAQTIAQCLDVPFAICDCTTLTQAGYVGEDIESVIAKLLQDANFNVERAQTGIVFLDEVDKIGAVPGIHQLRDVGGEGVQQGMLKMLEGALVSVPERNSRKLRGDAVQVDTTNILFVASGAYNGLDRLIQRRNNEKYLGFGAWDPRSGRRAALAAAAADASPLDSATDEAGERDHWLRAVQARDLIDFGMIPEFVGRFPVLVPFHSLNQDLLVRILTEPKNAIVAQYKLLFAMDKCELSFSDEALRAVAALAMERKTGARGLRAIMENLLLEVMFEIPGSDITCVHIHEGCVQRAEPPTVRRRERERQPWSSVRLSNCT-