Monarch geneset OGS2.0

DPOGS208369
TranscriptDPOGS208369-TA2358 bp
ProteinDPOGS208369-PA785 aa
Genomic positionDPSCF300146 - 145924-154602
RNAseq coverage618x (Rank: top 21%)
Annotation
HeliconiusHMEL0072360.078.26% 
BombyxBGIBMGA012223-TA0.090.49% 
DrosophilaCG2658-PA0.061.11% 
EBI UniRef50UniRef50_O768670.061.11%EG:100G10.7 protein n=18 Tax=Eumetazoa RepID=O76867_DROME
NCBI RefSeqXP_001813433.10.066.62%PREDICTED: similar to paraplegin [Tribolium castaneum]
NCBI nr blastpgi|1892354340.066.62%PREDICTED: similar to paraplegin [Tribolium castaneum]
NCBI nr blastxgi|1892354340.066.91%PREDICTED: similar to paraplegin [Tribolium castaneum]
Group
Gene OntologyGO:00160207.6e-176membrane
GO:00042227.6e-176metalloendopeptidase activity
GO:00301637.6e-176protein catabolic process
GO:00065088.1e-70proteolysis
GO:00055248.1e-70ATP binding
GO:00001662.4e-19nucleotide binding
GO:00171112.4e-19nucleoside-triphosphatase activity
KEGG pathway 
InterPro domain[249-738] IPR0059367.6e-176Peptidase M41, FtsH
[537-737] IPR0006428.1e-70Peptidase M41
[339-473] IPR0039591.2e-39ATPase, AAA-type, core
[334-476] IPR0035932.4e-19ATPase, AAA+ type, core
Orthology groupMCL12117 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208369-TA
ATGTTATTAATCAAGCGACTACCAGCGCTTTCAGCATCAAAAATTAATTTAAAAGTACCGTATCTTGAAAAAAATGTGATCCTCGAGAGGCAAATTAGTAAACACTGCCACTATAATTTGTTAAAGTTATGTAAATCTGCATCAGTACTACATAATTTAACCCACAAATCTAAAAATCCACAACTTCGGCAGTTCCATGCAGAATATAAGGCAGCCTTTGCTTTGCTGCAGAGATCAAATTTTTTAGGATTTGGTTCCTTTTTAAATACAAATTCAAGAAGGTTGCATACAAATTCCCCAAATCAAAACAGGAAAAATGAAGACAATGACGATAAGGAGAAAAAGAAAGAAAACGATAAAGGGACAATGCCATCATTATTACTGAAGGCGGCATTTTGGATGCTCACAACATTTACCTTAATAATGCTCAGCTCATTTCTTGTACCAGGAGACAACACACAAAATGAGTTAATCCGTTACGTGTCGTGGAACGAGTTCGTATATTCGATGTTATCAAAGGGGGAAGTTGAGGAATTGATAGTCCGACCAGATTTAGAGGTGGTCACCATTATATTACACGAAGGTGCTGTCATAAAAGGCAAAAGATCAAACCATCGAGTTTTCCACATGAATGTTGGTGACATTCATCGGTTTGAAGAGAAACTGAGAGAGACAGAACTCGGCCTAGGGGTTAAGGAAGGAGTTCGTGTAATATATGATAGGAATGGAAGTGTCGCTGGGAAAATTATTACAAGCCTTCTGATAGCCGCTATTATTATGTCCTTCCTCTATTCAACTAAGTCAATGAGGATGAATATAAACTTGGGTGGATTTAGTCAGTTGAGGCGTGCCAAATTCACTCTAGTTGATTCTATGAGCGGTCAAGGGAAGGGTGTTAAGTTTGAGGACGTGGCCGGCCTGAAGGAAGCCAAGATAGAGGTTATGGAGTTTGTTGACTACTTGAAGCGACCAGAGCATTACAGAAGCTTAGGTGCTAAGGTGCCCAAGGGAGCTCTGTTACTTGGTCCACCGGGTTGTGGTAAGACGTTACTTGCTAAGGCTGTGGCTACGGAAGCTAATGTACCATTTCTCTCTATGAACGGATCAGAGTTCATCGAAATGATCGGAGGCTTGGGAGCGGCGAGAGTCAGGGATCTGTTCAAAGAGGCGAGCTCGAGAGCACCCTGTATAATATATATCGACGAAATGGATGCCGTTGGTCGTGCGAGGTCATCCGGCACTTCCTCCTGGGGTCCCGGCGGTGGGGAGGGGGAACAGACCCTCAATCAGTTGCTGGTAGAAATGGACGGCATGAAAAGCAGGGAGGGGGTCGTTGTATTAGCCAGCACCAACAGAGCTGATGTACTAGATAAGGCGCTACTCCGTCCGGGACGGTTCGACAGACACATCCTCATAGATTTACCGACTTTGTTAGAACGAGAAGAAATCTTCGAGAGGCATTTGAAGAACATAGTACTTGAGAAGTTGCCACCTTATTATGTTAAACGTCTTGCGTATTTAACGCCTGGATTCAGTGGCGCTGATATAGCTAACGTTTGTAACGAGGCGGCCTTACACGCTGCTAGATTCAAGCAAAGTATAGTGAAGGCTTCGGATCTGGAATACGCCGTCGAGAGGGTCGTCGGTGGTACGGAAAAACGAAGTCACGCTATTTCACCGGCTGAGAAGCGTGTCATAGCTTACCATGAGGCGGGACACGCTCTGGTCGGCTGGCTGCTAGAACATACGGACGCCTTGCTCAAGGTCACGATCGTGCCGCGTACCAATAAAGCATTGGGCTTCGCTCAATACACGACATCAGATCAAAAACTGTACTCCAAGGAAGAGTTGTTCGATCGCATGTGTATGGCGTTGGGCGGTCGGGCGGCCGAGGCGATAACATTCAACTCTGTAACCAGCGGAGCCCAGAACGACCTTGAGAAGGTCACCAAAATAGCATACGCACAGGTCCGTGTTTTCGGCATGTCGCCGAGCGTAGGGTTGGTTTCTTTCCCCGATGTCAAAGAGCACCAGAGGAGTCCATTCAGCAAGGCTCTGAAGAACCTCATAGATATGGAGGCGAGACAGCTGATCGCTAAAGCCTACTACAGGACCGAGGAGCTCTTGAAACGGAATGAGAACAAACTGAAATTACTCGCCGAGGAACTTATAAAGAAAGAAACGCTCAACTACAAGGACGTTGAGGCTATTCTAGGCAAGCCTCCGTTCGCCAAGAAATTCATAGATCCGATAGAGTTCGAACAGAATCTAAGGAACATGGAACACGTCGCTAAAACCGGAGACGATGACGTCGGCGCGGCCTCAGCCAAACCCACCGCAAACAACGGCCTTCACTAG

Protein sequence:

>DPOGS208369-PA
MLLIKRLPALSASKINLKVPYLEKNVILERQISKHCHYNLLKLCKSASVLHNLTHKSKNPQLRQFHAEYKAAFALLQRSNFLGFGSFLNTNSRRLHTNSPNQNRKNEDNDDKEKKKENDKGTMPSLLLKAAFWMLTTFTLIMLSSFLVPGDNTQNELIRYVSWNEFVYSMLSKGEVEELIVRPDLEVVTIILHEGAVIKGKRSNHRVFHMNVGDIHRFEEKLRETELGLGVKEGVRVIYDRNGSVAGKIITSLLIAAIIMSFLYSTKSMRMNINLGGFSQLRRAKFTLVDSMSGQGKGVKFEDVAGLKEAKIEVMEFVDYLKRPEHYRSLGAKVPKGALLLGPPGCGKTLLAKAVATEANVPFLSMNGSEFIEMIGGLGAARVRDLFKEASSRAPCIIYIDEMDAVGRARSSGTSSWGPGGGEGEQTLNQLLVEMDGMKSREGVVVLASTNRADVLDKALLRPGRFDRHILIDLPTLLEREEIFERHLKNIVLEKLPPYYVKRLAYLTPGFSGADIANVCNEAALHAARFKQSIVKASDLEYAVERVVGGTEKRSHAISPAEKRVIAYHEAGHALVGWLLEHTDALLKVTIVPRTNKALGFAQYTTSDQKLYSKEELFDRMCMALGGRAAEAITFNSVTSGAQNDLEKVTKIAYAQVRVFGMSPSVGLVSFPDVKEHQRSPFSKALKNLIDMEARQLIAKAYYRTEELLKRNENKLKLLAEELIKKETLNYKDVEAILGKPPFAKKFIDPIEFEQNLRNMEHVAKTGDDDVGAASAKPTANNGLH-