Monarch geneset OGS2.0

DPOGS215092
TranscriptDPOGS215092-TA2673 bp
ProteinDPOGS215092-PA890 aa
Genomic positionDPSCF300187 + 263008-272113
RNAseq coverage601x (Rank: top 21%)
Annotation
HeliconiusHMEL0105430.073.71% 
BombyxBGIBMGA007194-TA0.069.68% 
DrosophilaCG8798-PC0.055.60% 
EBI UniRef50UniRef50_Q7KUT20.055.60%Lon protease homolog, mitochondrial n=8 Tax=cellular organisms RepID=LONM_DROME
NCBI RefSeqXP_973021.10.064.00%PREDICTED: similar to AGAP010451-PA [Tribolium castaneum]
NCBI nr blastpgi|910772060.064.00%PREDICTED: similar to AGAP010451-PA [Tribolium castaneum]
NCBI nr blastxgi|910772060.057.97%PREDICTED: similar to AGAP010451-PA [Tribolium castaneum]
Group
Gene OntologyGO:00065082.1e-207proteolysis
GO:00055242.1e-207ATP binding
GO:00041762.1e-207ATP-dependent peptidase activity
GO:00042522e-71serine-type endopeptidase activity
KEGG pathway 
InterPro domain[477-886] IPR0048152.1e-207Peptidase S16, ATP-dependent protease La
[680-886] IPR0082692e-71Peptidase S16, Lon C-terminal
[704-890] IPR0205685.9e-53Ribosomal protein S5 domain 2-type fold
[112-378] IPR0031113.8e-44Peptidase S16, lon N-terminal
[110-222] IPR0159471.6e-14PUA-like domain
[477-599] IPR0039591.9e-12ATPase, AAA-type, core
Orthology groupMCL11213 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215092-TA
ATGCACATAGCGAGTGTATTAGTGCGTAATACCGCACTTCTTAATCCCTCGATTAGGCCTTCATCGCAAACTGTTCGTAATGTAACCAAAATTGCATCATATTGCAAGCCGGTAGGAAATCGTTTTTTTAATGGACACAATTTGTACGGAACTCGCAACGCTCGGATATGCTCGTATAACCAAGAATATGCAGCCGTGAAAAAGGTACAGAACATTAGACATTATTCGAAGAAGCTTAATCCGGAAGAAGAGGAATCAGCTGATATTAAAGAGGACCCGCCATTGTTCTCAAGCCAGCTACCAGCAACTGTGGCTGTGCCTGAAGTGTGGCCGCAAGTGCCCGTTATTGCCATTAATAGGAACCCCGTTTTTCCAAGATTTATTAAATTAATTGAGATATCAAACCCAGCTTTAATAGATCTAATAAGGCGTAAAGTGAAACTGAATCAGCCGTATGTTGGTATATTTTTGCGTAAGAAAGAAGACGAGAAATCAGATGTTGTGTCGAGTTTGGACGATCTTCATGATGTGGGGGTGTTCGCTCAGATCCACGAGATGCAGGATATGGATTACAAGCTACGTCTAGTCGTTATGGCACACAGAAGAATAAAAATCACCGGCCAGTTTATAGAAGACGAGATCGAAACTGGCCCAGCCGAAATGAAGCTAAAGTTTCCCGTATTTAACGTGGAATTTAACGTTACCCGCGAAGAATCAGACGCTGAGCGACGTAGGAGGAAATATCGTAACACGAGACGGCAACGTAACGACTCGGACGCGGAACACGAGAAGGAGGTGCAGGAACCAAAGGAAGCTAAGAAACCTCCGCCGGACCAGCTTATGATGGTCAAAGTGGAGAATATGATGCATGACAAGTTCCAGCAGAACGAGGAGGTGAAAGCGTTGACGCAGGAGATCATCAAGACTATCAGGGATATCATCAATATGAACCCCCTGTATAGAGAATCTCTGCATCACATGCTAGCTCAAGGTCAGCGTGTTGTGGACGATCCCGTGTACCTCGCGGATTTAGGCGCCGCCTTAACCGCAGCTGAGCCCAAGGACCTACAGCCGGTTCTTGAGGAGATGGATATTCCGAAACGACTGTTACTATCATTATCACTGCTGAAGAAGGAATATGAACTGTCCAAATTGCAGCAGAAAATCGGTAAGGAAGTTGAAGAAAAGGTGAAACAGCAGCACAGGAAATACATTCTGCATGAACAACTCAAGGTTATAAAAAAAGAATTAGGTCTTGAGAAGGATGACAAAGACGCCATTGGTGAGAAATTCCGCGAGAGACTGGCTGATAAAGTGGTACCACCCTCTGTTCAGACGGTCATTGACGAGGAGCTCAACAAACTGAACTTCCTAGAGAGTCATAGCTCAGAGTTCAAGTTAGTATGGTCGATAACGTTCAATAAAACCCGTTCCATAGCCAGAGCGTTGAACCGTAAGTATTTTAGGTTCTCAGTGGGCGGTATGACGGATGTGGCGGAGATAAAGGGACACAGACGTACATACGTGGGCGCTATGCCCGGGAAGCTGGTGCAGTGCTTGAAGAAGACGAACACAGAGAACCCATTGGTCCTTATAGATGAAGTGGATAAGATCGGGAAAGGTGTCCACGGTGATCCGTCATCAGCTCTTCTGGAACTGCTGGATCCAGAACAGAACGCGAATTTCCTGGACCACTACTTGGATGTTCCGGTGGACCTGTCTCGAGTGCTCTTCATCTGCACAGCGAACGTACTCGACCTTATACCGGAACCTCTGAGGGACAGGATGGAACTTATAGAAATGTCAGGATATGTGGCAGAAGAGAAGCTAGCCATAGCCCAGCAGTACTTGATACCGACAGCCCTCAAGAACTGTGGTCTCACAGACGAAAAAATCAATATAACACCGGAGGCATTACACACACTCATAAGGTCATACTGCAGGGAGAGCGGAGTCAGGAATCTACAGAAACATATTGAGAAGATTGCACGTAAGGTAGCCTACAAGCTTGTAAAGAAAGAGACGTCTTCCTTATCTGTGACGGACGCTAATTTATCGGAACTGGTTGGGAAGCCGACCTTCAAACACGACCGCATGTATGACGTCACACCACCCGGAGTGGTGATGGGCCTAGCGTGGACCGCCATGGGTGGTAGTACGTTATACATAGAAACAGCTGTACGGAACACTATGAAGGGTGAGAAGCAATCCGGCTCGCTGGAGCTGACCGGGCACCTGGGTGACGTCATGAAGGAGTCGGCCCGGATCGCGCTCACCGTGGCCCGCAACTACCTCAAGGAGTCCCAGCCGGACAACGACTTCCTTAACACCAGTCACCTCCACCTCCACGTGCCCGAGGGCGCGACTCCCAAGGACGGTCCATCAGCGGGCGTGACCATCGCCACCGCTCTCCTGAGCCTAGCGCTCCAACGACCAGCCAACACCCTCGCTATGACCGGGGAGCTCACCCTCACTGGACGAGTGCTGCCCGTTGGAGGGATCAAGGAGAAGATTATAGCGGCTAAGCGTGTCGGAGTGACTTGCGTGATTCTCCCCGAGGACAACAGGCGCGACTTCGACGACCTGCCCTCCTTCATCAGGGACGGTATCGACGTGCACTTCGTCAATGTGTATGATGACGTGTTCAAGATAGTCTTCGACGGAAAGGTTTAA

Protein sequence:

>DPOGS215092-PA
MHIASVLVRNTALLNPSIRPSSQTVRNVTKIASYCKPVGNRFFNGHNLYGTRNARICSYNQEYAAVKKVQNIRHYSKKLNPEEEESADIKEDPPLFSSQLPATVAVPEVWPQVPVIAINRNPVFPRFIKLIEISNPALIDLIRRKVKLNQPYVGIFLRKKEDEKSDVVSSLDDLHDVGVFAQIHEMQDMDYKLRLVVMAHRRIKITGQFIEDEIETGPAEMKLKFPVFNVEFNVTREESDAERRRRKYRNTRRQRNDSDAEHEKEVQEPKEAKKPPPDQLMMVKVENMMHDKFQQNEEVKALTQEIIKTIRDIINMNPLYRESLHHMLAQGQRVVDDPVYLADLGAALTAAEPKDLQPVLEEMDIPKRLLLSLSLLKKEYELSKLQQKIGKEVEEKVKQQHRKYILHEQLKVIKKELGLEKDDKDAIGEKFRERLADKVVPPSVQTVIDEELNKLNFLESHSSEFKLVWSITFNKTRSIARALNRKYFRFSVGGMTDVAEIKGHRRTYVGAMPGKLVQCLKKTNTENPLVLIDEVDKIGKGVHGDPSSALLELLDPEQNANFLDHYLDVPVDLSRVLFICTANVLDLIPEPLRDRMELIEMSGYVAEEKLAIAQQYLIPTALKNCGLTDEKINITPEALHTLIRSYCRESGVRNLQKHIEKIARKVAYKLVKKETSSLSVTDANLSELVGKPTFKHDRMYDVTPPGVVMGLAWTAMGGSTLYIETAVRNTMKGEKQSGSLELTGHLGDVMKESARIALTVARNYLKESQPDNDFLNTSHLHLHVPEGATPKDGPSAGVTIATALLSLALQRPANTLAMTGELTLTGRVLPVGGIKEKIIAAKRVGVTCVILPEDNRRDFDDLPSFIRDGIDVHFVNVYDDVFKIVFDGKV-