Monarch geneset OGS2.0

DPOGS214561
Transcript: DPOGS214561-TA (2262 bp)
Protein: DPOGS214561-PA (753 aa)
Genomic position: DPSCF300266 + 137194-145487
RNAseq coverage: 1331x (Rank: top 10%)
Annotation
Heliconius      HMEL016113        0.0    80.86%
Bombyx          BGIBMGA010088-TA  3e-94  41.19%
Drosophila      CG3499-PC         0.0    69.91%
EBI UniRef50    UniRef50_F3YDF1   0.0    69.91%  MIP17311p n=16 Tax=Endopterygota RepID=F3YDF1_DROME
NCBI RefSeq     XP_970259.1       0.0    60.53%  PREDICTED: similar to GA17483-PA [Tribolium castaneum]
NCBI nr blastp  gi|91086165       0.0    60.53%  PREDICTED: similar to GA17483-PA [Tribolium castaneum]
NCBI nr blastx  gi|91086165       0.0    60.27%  PREDICTED: similar to GA17483-PA [Tribolium castaneum]
Group
Gene Ontology:
  GO:0016020  3.2e-154  membrane
  GO:0004222  3.2e-154  metalloendopeptidase activity
  GO:0030163  3.2e-154  protein catabolic process
  GO:0005524  1.3e-41   ATP binding
  GO:0006508  4e-40     proteolysis
  GO:0000166  1.7e-21   nucleotide binding
  GO:0017111  1.7e-21   nucleoside-triphosphatase activity
KEGG pathway 
InterPro domain:
  [255-722]  IPR005936  3.2e-154  Peptidase M41, FtsH
  [337-467]  IPR003959  1.3e-41   ATPase, AAA-type, core
  [529-636]  IPR000642  4e-40     Peptidase M41
  [333-470]  IPR003593  1.7e-21   ATPase, AAA+ type, core
Orthology group: MCL12854, Single-copy universal gene

Nucleotide sequence:

>DPOGS214561-TA
ATGTTCTCATTGAACTCCATAAACACCCAGAACCAGATTCTAATAAGTTTTAGTCAACTATCGTCTCGATATTCCAACGCATTTAGAAATAGAAGGAGTAGCAATGTTAAGAACAAAGATTTAATAAGAAAAGAGGCCACAACAGCTCCTTACAACGCATCAATATGTCCAGAAAGTTTTGATGAGGCCCTGCGTCATTTTGATAAGAATGTATTAACGGAGATGCGTAACATTGATCTGAGATATGTGACATCTTTAGCCGGATCCTCTAGTAATATTATGAGAAATGTAGCCGGCTTAGCATCGTCCAAAAAAGTATCATTCGTATCTACGGATTCGTTTGAGAAAAACAAAAATGGTTGGGTCGCTACACCTACGATCACCGTAGACCTACGCCACAAAAATACTAAACTTAGTTTGACCGACAACTTTGTTGGTCTGCTGTCGAAGAGCATAAGGGATAATTGCTTTAATTATAATGTACAAGTAAGAGGTTTTAAGACAGAAAGAGGTATACATGCTGATCTGAAGAGAAACCCCAATTTATTTAATAGACTACGTCTTCATGTATCCGACACGTCTGATAACAAATCCAACCTCGGTCCAGATGTTGCTCCACGATTGGAGAAGTTGCTCAGTGAAGACTCGACTAATCTTACACACCAACAGAAGGATAAGATTAAAATTGCCTTTGCCGAGGGATATTTGGCAGGATCTCACCCTGACAATGCCCGAGGGACGAAGGCCTCTAAGTACTTAAAGCTAGTACAACAACTCCTCACTATAGTACTATTCCTGGCTATATTTGTCAGTCTGATGGCATCTGTTAGCGGCACTGTTTTCCGGATCCAGTTGGGTAACCAGGTGGAGGTGGATCCTGAAGATATAACGGTCACCTTCGATGACGTCAAGGGTGCTGACGAAGCCAAGCAGGAGCTCAAGGATGTGGTGGAGTTCCTGAAATCCCCGGAGAAGTTTTCATCTTTGGGGGGCAAATTGCCTAAGGGTGTGTTACTGGTGGGTCCCCCTGGCACCGGGAAGACGTTATTGGCTCGAGCTGTGGCTGGTGAGGCGAGGGTGCCGTTCTTCCACGCAGCCGGACCAGAGTTCGATGAGATCCTCGTGGGACAGGGCGCTAGGCGCGTCAGGGATCTATTTAAGGCGGCCAAGGAGCGAGCCCCCTGCGTCATATTCATTGACGAGATAGATTCAGTGGGTGCCAAACGTACCAACAGCGTGCTGCATCCGTACGCCAACCAGACAATAAACCAACTCTTATCAGAGATGGATGGATTCCATCAGAACGAGGGTGTGATAGTGCTGGGCGCCACCAACAGGAGAGACGACCTGGACCAGGCCCTGCTGAGACCTGGAAGATTCGATGTCGAAGTGTCCGTACCAACACCAGACTACGGTGGTCGTCTGGAGATACTGCGGATGTACGTGTCGAGGGTCGCGGCTGCCCCGGGGCTGGACGTGGAATCCCTAGCCAGGGGTACCACGGGATTCACGGGCGCTGACCTCGAGAGTATGGTCAACCAGGCAGCACTCAAGGCAGCCATCGAGGGCGCCAAAACTGTGAGCATGTACCACCTGGAGGAGGCGAGGGACAAGGTGCTCATGGGCCCGGCGAGAAGAGCGAGGCTACCCGACGACGAGGCCAACGCTATCACCGCCTGCCACGAGGGCGGGCATGCGGTGGTAGCGTACTACACTAAGGATTCTCACCCACTTCACAAGGTCACCATAATACCACGCGGACCTTCCCTGGGACACACAGCCTACATACCGGCCAAAGAAAGATATCACGTGACGAAACAACAACTCCTGGCTATGATGGACACCATGATGGGCGGGCGGGCGGCCGAGGAACTGGTCTACGGACCGGATAAGATTACATCGGGTTTAGGAGGTGCAAGTGATCAAGGCCGCCATCACACCGAATCTACAGGTGGCATAGCCAGCGGTTGGCGTACTTCCGCCTTCTTCCACGGTTTCAC
CTGGAGAATGGAAAATGCTGTGGATGCTGAAATAAAGAAAATTCTATCAGAGAGCTACGAAAGAGCTAAGGCCATACTGAGGACGCACGCTAAAGAACATAAGGCTCTGTCGGAAGCCTTATTAAAATACGAGACTCTGGACGCTGACGACATCAAAGCGATCATGTCAGGAGACAAGGTGAAGATGGACCGAGGTAGAAGCAGCAACACTAATAAGGAGCCCTCGCCCGCCACGCTGCTGCCGCACACGATGCCCGCTTAG

Protein sequence:

>DPOGS214561-PA
MFSLNSINTQNQILISFSQLSSRYSNAFRNRRSSNVKNKDLIRKEATTAPYNASICPESFDEALRHFDKNVLTEMRNIDLRYVTSLAGSSSNIMRNVAGLASSKKVSFVSTDSFEKNKNGWVATPTITVDLRHKNTKLSLTDNFVGLLSKSIRDNCFNYNVQVRGFKTERGIHADLKRNPNLFNRLRLHVSDTSDNKSNLGPDVAPRLEKLLSEDSTNLTHQQKDKIKIAFAEGYLAGSHPDNARGTKASKYLKLVQQLLTIVLFLAIFVSLMASVSGTVFRIQLGNQVEVDPEDITVTFDDVKGADEAKQELKDVVEFLKSPEKFSSLGGKLPKGVLLVGPPGTGKTLLARAVAGEARVPFFHAAGPEFDEILVGQGARRVRDLFKAAKERAPCVIFIDEIDSVGAKRTNSVLHPYANQTINQLLSEMDGFHQNEGVIVLGATNRRDDLDQALLRPGRFDVEVSVPTPDYGGRLEILRMYVSRVAAAPGLDVESLARGTTGFTGADLESMVNQAALKAAIEGAKTVSMYHLEEARDKVLMGPARRARLPDDEANAITACHEGGHAVVAYYTKDSHPLHKVTIIPRGPSLGHTAYIPAKERYHVTKQQLLAMMDTMMGGRAAEELVYGPDKITSGLGGASDQGRHHTESTGGIASGWRTSAFFHGFTWRMENAVDAEIKKILSESYERAKAILRTHAKEHKALSEALLKYETLDADDIKAIMSGDKVKMDRGRSSNTNKEPSPATLLPHTMPA-
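
The transcript and protein entries above can be checked against each other: 2262 bp is 754 codons, which is 753 amino acids plus one stop codon. The sketch below shows one way to run that check. It assumes the transcript entry is a plain coding sequence starting at its first base and that the standard genetic code applies. The helper names `read_fasta` and `translate` are hypothetical and not part of any MonarchBase tooling, and the `-` stop symbol simply follows the protein entry in this record.

```python
# Minimal sketch: translate a coding sequence with the standard genetic code.
# Assumption: the sequence is in frame from its first base.

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def read_fasta(text):
    """Parse FASTA text into {record_id: sequence}, joining wrapped lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


def translate(cds, stop_symbol="-"):
    """Translate codon by codon; unknown codons become 'X', stops become stop_symbol."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First nine codons and last six codons of DPOGS214561-TA, copied from the record.
print(translate("ATGTTCTCATTGAACTCCATAAACACC"))  # MFSLNSINT
print(translate("CACACGATGCCCGCTTAG"))           # HTMPA-
```

To check the whole record, the two FASTA entries can be passed through `read_fasta` and the result of `translate` on the DPOGS214561-TA sequence compared with the DPOGS214561-PA sequence.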