Monarch geneset OGS2.0

DPOGS212039
Transcript: DPOGS212039-TA, 1434 bp
Protein: DPOGS212039-PA, 477 aa
Genomic position: DPSCF300054 + 12667-14876
RNAseq coverage: 668x (Rank: top 19%)
Annotation
Heliconius        HMEL018064        0.0   97.48%
Bombyx            BGIBMGA010088-TA  0.0   98.92%
Drosophila        CG6512-PA         0.0   83.91%
EBI UniRef50      UniRef50_Q9Y4W6   0.0   80.22%   AFG3-like protein 2 n=104 Tax=Eukaryota RepID=AFG32_HUMAN
NCBI RefSeq       XP_969110.2       0.0   87.02%   PREDICTED: similar to AGAP006949-PA [Tribolium castaneum]
NCBI nr blastp    gi|189235957      0.0   87.02%   PREDICTED: similar to AGAP006949-PA [Tribolium castaneum]
NCBI nr blastx    gi|189235957      0.0   87.02%   PREDICTED: similar to AGAP006949-PA [Tribolium castaneum]
Group
Gene Ontology
GO:0016020   6.6e-187   membrane
GO:0004222   6.6e-187   metalloendopeptidase activity
GO:0030163   6.6e-187   protein catabolic process
GO:0006508   1.5e-73    proteolysis
GO:0005524   1.5e-73    ATP binding
GO:0000166   8.8e-23    nucleotide binding
GO:0017111   8.8e-23    nucleoside-triphosphatase activity
KEGG pathway 
InterPro domain
[1-423]     IPR005936   6.6e-187   Peptidase M41, FtsH
[219-421]   IPR000642   1.5e-73    Peptidase M41
[25-156]    IPR003959   1.3e-43    ATPase, AAA-type, core
[20-159]    IPR003593   8.8e-23    ATPase, AAA+ type, core
Orthology group: MCL12190, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212039-TA
ATGGAATTTGTTAATTTTCTTAAAAATCCCCAGCAGTATATAGACTTAGGAGCTAAAATACCCAAAGGTGCATTACTAACAGGACCTCCTGGTACAGGTAAAACACTTCTCGCTAAAGCCACAGCCGGCGAAGCCAACGTTCCTTTCATAACTGTCTCTGGTTCTGAATTTTTGGAGATGTTTGTCGGTGTTGGTCCTTCAAGGGTTCGTGATATGTTTTCTATGGCTCGGAAACATGCACCATGCATCCTGTTCATCGATGAAATTGATGCTGTTGGTAGAAAGAGAGGAGGTCGCAGTTTTGGAGGTCACTCTGAACAGGAAAACACACTTAACCAACTTTTAGTTGAAATGGATGGTTTTAATACAACAACCAATGTTGTTGTGTTAGCTGCAACCAATCGAGTTGATATTTTGGATAAAGCTCTGTTAAGACCCGGTAGATTTGATCGTCAGATCTTCGTCCCGGCTCCAGACATAAAGGGTAGGGCTTCCATATTCAAAGTACATCTTACACCACTGAAGACTACTTTGAATAAGGAAAATTTGGCTCGTAAAATGGCTGCATTGACACCAGGATTTACTGGAGCTGATATTGCCAATGTATGCAATGAAGCAGCGCTGATTGCAGCTAGAGAATTAGCTAACGACATCACAATGAAAAACTTTGAACAAGCAATTGAGAGAGTTGTCGCTGGCATGGAAAAGAAATCCAATGTCCTACAACCTGATGAGAGGAAGATTGTAGCATACCATGAAGCGGGTCACGCGGTTGCTGGGTGGTTTCTACAACATGCTGACCCACTACTGAAGGTTTCCATCATCCCTAGAGGCAAAGGTCTAGGTTACGCACAGTACTTACCAAAAGAACAGTATTTGTACAGTAAGGAACAACTCTTTGACAGAATGTGCATGACCCTGGGTGGCAGAGTTAGTGAAGAAATTTTCTTTGGTAGAATCACTACTGGTGCTCAGGATGACTTGAAGAAGATAACGCAGAGTGCTTACGCTCAAATTGTGCATTACGGTATGAATGCTAAAGTTGGAAATGTGTCATTCGAAATGCCCCAACCAGGCGAAATGGTTATCGATAAACCGTACTCAGAAAAAACGGCGGAGTTGATTGATTCAGAAGTAAGAGATTTGATCAACTCAGCCCACAAACATACAACTGAACTTTTGATAAAACACAAACCGAACATCGAAAAAGTTGCGGAGAGACTCTTGAAACAAGAAATATTGAGCAGGGACGACATGATCGAACTCTTAGGCCCAAGACCCTTCCCAGAGAAGAGTACTTATGAAGAATTTGTTGAAGGCACTGGATCGCTGGAAGAAGACACGACTTTGCCCGAGGGTTTAAAGAATTGGAACAAAGAGAAACAGCCAACAATACCTCCACCTCAATCTATACCAGGAGCGAGTAAAAATTAA

Protein sequence:

>DPOGS212039-PA
MEFVNFLKNPQQYIDLGAKIPKGALLTGPPGTGKTLLAKATAGEANVPFITVSGSEFLEMFVGVGPSRVRDMFSMARKHAPCILFIDEIDAVGRKRGGRSFGGHSEQENTLNQLLVEMDGFNTTTNVVVLAATNRVDILDKALLRPGRFDRQIFVPAPDIKGRASIFKVHLTPLKTTLNKENLARKMAALTPGFTGADIANVCNEAALIAARELANDITMKNFEQAIERVVAGMEKKSNVLQPDERKIVAYHEAGHAVAGWFLQHADPLLKVSIIPRGKGLGYAQYLPKEQYLYSKEQLFDRMCMTLGGRVSEEIFFGRITTGAQDDLKKITQSAYAQIVHYGMNAKVGNVSFEMPQPGEMVIDKPYSEKTAELIDSEVRDLINSAHKHTTELLIKHKPNIEKVAERLLKQEILSRDDMIELLGPRPFPEKSTYEEFVEGTGSLEEDTTLPEGLKNWNKEKQPTIPPPQSIPGASKN-