Monarch geneset OGS2.0

DPOGS215726
TranscriptDPOGS215726-TA2073 bp
ProteinDPOGS215726-PA690 aa
Genomic positionDPSCF300041 + 315735-320573
RNAseq coverage515x (Rank: top 24%)
Annotation
HeliconiusHMEL0096520.079.12% 
BombyxBGIBMGA005816-TA0.077.47% 
DrosophilaCG7791-PA0.057.53% 
EBI UniRef50UniRef50_Q8T8Q10.057.68%SD07787p n=32 Tax=Coelomata RepID=Q8T8Q1_DROME
NCBI RefSeqXP_001656860.10.061.39%mitochondrial intermediate peptidase [Aedes aegypti]
NCBI nr blastpgi|1571365050.061.39%mitochondrial intermediate peptidase [Aedes aegypti]
NCBI nr blastxgi|1571365050.061.39%mitochondrial intermediate peptidase [Aedes aegypti]
Group
Gene OntologyGO:00065082.5e-116proteolysis
GO:00042222.5e-116metalloendopeptidase activity
KEGG pathway 
InterPro domain[227-669] IPR0015672.5e-116Peptidase M3A/M3B
[502-668] IPR0240772.4e-85Neurolysin/Thimet oligopeptidase, domain 2
[355-501] IPR0240793.1e-46Metallopeptidase, catalytic domain
Orthology groupMCL13171 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215726-TA
ATGAAGACTTTAAAACCACTTTGGGTGACGGGCAGTAAGCGGGTGAGCACTTGGTCGCCCCTAGCCACTGCATTTAACACGAGACCTACATCCAGACCTATTTTTGATACTTTGAGGGAAAAAACTGGTCTCTTCAACAAACCAGAACTAAAAACCTTTGAGGGTTTCTATACATTAAGAGAGCAAGCCATTGCAGCTACAGATCGTCTTATAGAGGAAGCAACAGGTAATCCGGCTCGGCCTATGGTAGACATATTTGATGAACTCTCAGACACACTGTGCAAAGTGGCTGACTTGGCGGAGTTTGTGAGGATAGCGCACCCACAACCACACTTTGCACATGCAGCAGAAGAAGCCTGCATCAGTGTAAGCGGTGTAGTAGAAAAATTAAATACACACAAGGGTTTGTATCAGTCTTTGCGTAATTCAGTGGAGAATGGAACTTGCGGAGATCGTCATCTTGCAGAGTTGTTTTTGTTTGATTTTGAGCAGAGCGGTATACACTTGCCAGACGGACCACGGCAGAAAGTTGTGGCCCTTAATGACCTTATCCTGCAAACGGGACAGCAGTTCATGGCCGGAGCGGCAAAACCGAGAAAGGTTCCAAGATCAGCTGTGCCTCAAAATGTCAGGCAATTTTTTAGTTGTGAGGGTGACACGGTGACTGTGAGTGGTGTGTATGCCGAGTGTGGTGAGGCGGGGGCTCGTGAGGCCGCTTACAGACTGTACCTGGCGGCTGACGGACGACAGGAACTGCTGTTGGGGAAGACACTGATGGCTAGGAGAGAACTGGCCGGCTTGTGTGGATTTAAATGCTATGCGGACCGCGCTATAAAAGCCAGCACCATGGAGACGTCGTCTAACGTCCGTCAGTTCCTGGACGTGTTGTCGGACAATCTGCACGACAGAGCCAACATCGACTTCGAGGCAATGGCGGCCATGAAACGAACGGAGACTCCGTACCAGAAATCGTTGATGTGTTGGGACACTCCGTACTTCACTCACAAGGCGAAGGCCCAGCTGCTGAACGTGTCTCCGTCCGAGTTCAGTCCGTACTTCTCGCTGGGCGCCTGTATGGAGGGACTCAACATGCTGTGCCAGGAACTCTACAACATCACATTACAGTCCGAAGAAATGTTGCCAGGCGAGTCTTGGTCTCCTGACGTGTACAAGGTTCGTGTGGAGCACGCTACAGAGGGTACTTTGGGACACATTTACTGTGACCTCTACGAGAGGCCCGGGAAACCGCATCAAGACTGCCACTTCACTATCCGGGGAGGAAAACTACTGCCCGATGGAACATACCAGAATCCAGTGGTGGTGGTGATGCTGTCTCTGTCTGGCGGTCACCGCTCCGGCCCCGCCCTGCTGGGTCCGTGCTCTGTGGACAACCTTTTCCACGAGCTGGGTCACGCGCTCCACTCCATGTTGGCCCGCGCCCCTCACCAGCACGTGGCCGGCACGCGCTGCGCCACGGACCTCGCCGAGCTACCCTCGGTCCTCATGGAACACTTCGCTGCGGAACCTCGCGTGGTGCGTCGTTTCGCGCGTCACTTCCAAACAGGAGAGCCGATGCCGGAGGACATGTTGCAGCGGCTGTGCGCCTCCAAACATCTCTTCGGCGCCAGTGAGATGCAGTTACAGGTGTTCTACTCGGCCTTAGACCAACAGTACCACGGACCGGAAGCGGGTCAGGGCGAGGACACCACCGAGGTACTGAGACACGTTCAGAAACAGTACTACGGATTACCGTATGTGGAGAACACGGCCTGGCAACATCGGTTCAGTCATCTCATCGGGTACGGCGCGAAATACTACTCTTACCTGATATCAAGAGCGTTGGCGTGGAGCGCCTGGAGGACGCACTTCCACACACAACCGCTCAGTCGGACGGCCGGCGAGCGTCTGAGGCACGGACTCCTAAAACACGGAGGGTCCGTCCCGCCACAGATCTTGCTAAAAGACTACCTGGAGACGGAGATCACACCTCACACGCTGGCGATGGCCCTGACTGAGGAGCTCGACTATCACAAGGACTACCTCGACACTGTGTTCAAGATAGCTGACAAGTAA

Protein sequence:

>DPOGS215726-PA
MKTLKPLWVTGSKRVSTWSPLATAFNTRPTSRPIFDTLREKTGLFNKPELKTFEGFYTLREQAIAATDRLIEEATGNPARPMVDIFDELSDTLCKVADLAEFVRIAHPQPHFAHAAEEACISVSGVVEKLNTHKGLYQSLRNSVENGTCGDRHLAELFLFDFEQSGIHLPDGPRQKVVALNDLILQTGQQFMAGAAKPRKVPRSAVPQNVRQFFSCEGDTVTVSGVYAECGEAGAREAAYRLYLAADGRQELLLGKTLMARRELAGLCGFKCYADRAIKASTMETSSNVRQFLDVLSDNLHDRANIDFEAMAAMKRTETPYQKSLMCWDTPYFTHKAKAQLLNVSPSEFSPYFSLGACMEGLNMLCQELYNITLQSEEMLPGESWSPDVYKVRVEHATEGTLGHIYCDLYERPGKPHQDCHFTIRGGKLLPDGTYQNPVVVVMLSLSGGHRSGPALLGPCSVDNLFHELGHALHSMLARAPHQHVAGTRCATDLAELPSVLMEHFAAEPRVVRRFARHFQTGEPMPEDMLQRLCASKHLFGASEMQLQVFYSALDQQYHGPEAGQGEDTTEVLRHVQKQYYGLPYVENTAWQHRFSHLIGYGAKYYSYLISRALAWSAWRTHFHTQPLSRTAGERLRHGLLKHGGSVPPQILLKDYLETEITPHTLAMALTEELDYHKDYLDTVFKIADK-