Monarch geneset OGS2.0

DPOGS209834
Transcript        DPOGS209834-TA  2988 bp
Protein           DPOGS209834-PA  995 aa
Genomic position  DPSCF300117 + 708854-717516
RNAseq coverage   107x (Rank: top 60%)
Annotation
Heliconius      HMEL004351        0.0     88.79%
Bombyx          BGIBMGA008017-TA  5e-139  33.11%
Drosophila      CG14516-PB        0.0     46.63%
EBI UniRef50    UniRef50_F5HLF5   0.0     46.09%  AGAP013001-PA n=4 Tax=Culicidae RepID=F5HLF5_ANOGA
NCBI RefSeq     XP_001651480.1    0.0     49.90%  protease m1 zinc metalloprotease [Aedes aegypti]
NCBI nr blastp  gi|307210584      0.0     49.85%  Aminopeptidase N [Harpegnathos saltator]
NCBI nr blastx  gi|307210584      0.0     49.85%  Aminopeptidase N [Harpegnathos saltator]
Group
Gene Ontology   GO:0006508  0         proteolysis
                GO:0008237  1.1e-126  metallopeptidase activity
                GO:0008270  1.1e-126  zinc ion binding
KEGG pathway    aag:AaeL_AAEL005806  0.0
                K11140 (ANPEP) maps to: Glutathione metabolism; Renin-angiotensin system; Hematopoietic cell lineage
InterPro domain  [24-995]   IPR001930  0         Peptidase M1, alanine aminopeptidase/leukotriene A4 hydrolase
                 [116-501]  IPR014782  1.1e-126  Peptidase M1, membrane alanine aminopeptidase, N-terminal
Orthology group  MCL10074  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209834-TA
ATGACTCAAGGTCACGAACCCCTGTCTATAATGGACGGCAACCTTCAGACAGATTTCTCCACGAAAACCAAACGAGGCATTTTCCTGTCCAAGAGCCTCACCCTTGTGATTCTCATCATATTCGCTTTAGCACTCGTCGCTACAGCGTTCATAGTGTACAATTTTGCTGCATGCCCAAGAACTGATCATTTAGCTAATGTCACCAAATTAACTTACTGTGAACAATCTAAGTTACTAGTTATACCTCTAAATACTGATAGTACTACAAGTGTTTCTATTGAAAGTACAACCAGTTACACAGAAACAACCACGAAAGAAGTAGATAACACTGTGACTGATGTTAGACTTCCAACCAATATAAAACCAGATAGCTATTTGTTAAAATTAACTCCTTATATAATTGAAGGAAACTTCACATTTGATGGCGAAATCAGCATTGTTATAACTGTGAAAAATAACACGAACAGAATAACGTTTCATGGTGTAGAATTAAGTTTTATTAAAATAAATTTGTACAAGAAGGAAGACAGAAAAGAAATTAAAATAATAAGAAGGACTGAAGATGTCGCTAGACAATTCCAAATTCTTGACGTAACAGAAACTTTGATAGCGGGACACCAGTATGTTCTTAACATAACTTATGTAGGAATCTTGAACGACAATCTTCATGGATTTTATCGAAGTTCGTATGAAGAGAGGAAAGTCAAAAGATGGATAGCAGTGACTCAGTTTCAAGCTACAGACGCAAGACGTGCATTTCCATGCTGGGACGAGCCTGCTTTGAAAGCCAAATTTACTATCAGTATAGCGCGATTAAATAATATGACTTCAGTGTCTAATATGAATATGGTTAGAAGAAGCCCACATGAAGTTCTGCAGGACTATACTTGGGATCATTATGCGGAGTCACTTCCAATGTCGACTTACTTAGTCGCCTTCGCTGTGACAGATTTTGGAAACATGAGTGACCATAACTTCTCCGTATGGGCTAGGAAAGAAGCTCTACCCTCAGCGGCCTACGCGCTTGAAATCGGACCAAAAATATTAAAGTTTCTAGAGGAATATTATAAAATTAAATTTCCATTGCCAAAAATAGACATGATCGCTTTACCAGATTTTAAAGCAGGGGCAATGGAGAATTGGGGCCTACTCACATTTAGAGAGATTGCTATGTTATATGAAGAAGGCGTGTCCCCTACAACGGCAAGGGCCCGAGTAGCTTCTGTTGTTGCTCACGAGATTGCCCATCAATGGTTTGGAAACCTGGTGACACCAGCTTGGTGGTCGGACATATGGCTTAATGAGGGTTTTGCTAGCTATGTGGAATATGTAGCTGTGGATGCTGTTGAAAAATCTTGGAAGCTCATGGAGGTGTTTGTTTTAAATGAAGTTCAAAGCGTGTTCAAATTAGATGCACTTACATCATCTCATCAAATATCGGTGGAAGTCGGAAATCCGGAAGAAATTGGAGCTATTTTTGATAAAATTTCTTATGGCAAAGGGTCAGCGATTCTTCGTATGATGAACCACTTTTTAACGGACGAGGTTTTTAACTCTGGCATTACTGATTACTTAAATGCCAAAAAGTACGGGGATGCGGAGCAAAGAGATCTTTGGAGTGCACTTACTAATGCTGCAAGAGAAAAGGGTTCTTTTGATGCCGATGTAGCAGTCGTTATGGACTCTTGGACTTTGCAGACTGGGTTCCCGGTATTATCAATAACAAGAGATTATAAGACTGGTTCCATTACATTCAGACAGGAACGTTTTGTATTGATAAACGAAACAAGTGAGTTGCATAATTCTTCAGTTTGGTGGATACCCATATCATATACAACCGCAATTGAAAAGGACTTTGAGTCTACTCGGCCTAAAATATGGCTAAGAGGAGAGAGATCCATTGTTGTACATAATATAACCATTAGTGAAAATGACTGGCTTATTGGTAATATACAACAAACAGGATTCTACCGTATAAATTACGATCAAAGAAATTGGGC
GATGTTGGTTCAGATTCTTAACGATAAATCTCGTTTTGAAGAAATACATCCGATTAATCGAGCTCAAATTGTCGATGATGCTATGAACTTGGCATTATCTGGCCGCTTGGACTACATGACTGCTTTGGATATTACAAATTATTTAGCCCATGAAAGAAGTTACGTGCCCTGGAAGGCTGGACTCGTGGCGTTAGGTTACATTGATACCATGTTGTCTAAGGGCGCGTACTATCTAGAATATAAGCGGTACGTTTTAAGTCTCTTGAATGGAGCTGTCCAGGAGCTAGGTTGGGAAGTGACGAGTAATGAAAGCGTGGTTCGAGCTCAGCACCGAGTCGATATCATATCTACAGCTTGTCATCTACAGCATGTTGAATGTTTGGAACATGCTGTGAGGCTGTATACTAATTGGATGCTTACACCTAATCCTGACGCGTATAACGAGATACATGCTGATATTCGTAGCACTGTTTATTGTGTGGGCATTCAAGCTGGAGGAGCTAGGGAATGGCAGTTCGCTTGGGAGAGGTTCCTTGTTGCAAGCGCGCCTTCGGAAAGAGAACTTTTGCTTTCTGTACTAGGCTGTACAAGGGCGCCATACTTGTTGTACAGGTATTTGGATCTATCGTTAAGGAACGACAGTGGAATACGTAAACAAGATACGATCAGAGTTTTTTCAGCTGTCGCCAGTTCTTCTATTGGGGAACCGATTGCTTTTAACTTCGTCAGGGCTAACTGGCTCCGGCTTAAGGAATATGTCGGCTCCGTGTCCACTTTGAATTCTATTTTGAAAGTTGTAACGAGAAGATTAAACGAAGTTCATGAATATGAGGAGTTGAAGAGATTTGTCGGCGAGTCCTGCAGTGATTTGGGAAGACCAGTCCAGCAGGTATTGGAAAGAACAGCTGCAAATGTCCAATGGATGCAAAAGAATTACCGAAATATTGTGGACTGGCTTCTGGCAGCGGAAAAAAATAAGACCGCATAA

Protein sequence:

>DPOGS209834-PA
MTQGHEPLSIMDGNLQTDFSTKTKRGIFLSKSLTLVILIIFALALVATAFIVYNFAACPRTDHLANVTKLTYCEQSKLLVIPLNTDSTTSVSIESTTSYTETTTKEVDNTVTDVRLPTNIKPDSYLLKLTPYIIEGNFTFDGEISIVITVKNNTNRITFHGVELSFIKINLYKKEDRKEIKIIRRTEDVARQFQILDVTETLIAGHQYVLNITYVGILNDNLHGFYRSSYEERKVKRWIAVTQFQATDARRAFPCWDEPALKAKFTISIARLNNMTSVSNMNMVRRSPHEVLQDYTWDHYAESLPMSTYLVAFAVTDFGNMSDHNFSVWARKEALPSAAYALEIGPKILKFLEEYYKIKFPLPKIDMIALPDFKAGAMENWGLLTFREIAMLYEEGVSPTTARARVASVVAHEIAHQWFGNLVTPAWWSDIWLNEGFASYVEYVAVDAVEKSWKLMEVFVLNEVQSVFKLDALTSSHQISVEVGNPEEIGAIFDKISYGKGSAILRMMNHFLTDEVFNSGITDYLNAKKYGDAEQRDLWSALTNAAREKGSFDADVAVVMDSWTLQTGFPVLSITRDYKTGSITFRQERFVLINETSELHNSSVWWIPISYTTAIEKDFESTRPKIWLRGERSIVVHNITISENDWLIGNIQQTGFYRINYDQRNWAMLVQILNDKSRFEEIHPINRAQIVDDAMNLALSGRLDYMTALDITNYLAHERSYVPWKAGLVALGYIDTMLSKGAYYLEYKRYVLSLLNGAVQELGWEVTSNESVVRAQHRVDIISTACHLQHVECLEHAVRLYTNWMLTPNPDAYNEIHADIRSTVYCVGIQAGGAREWQFAWERFLVASAPSERELLLSVLGCTRAPYLLYRYLDLSLRNDSGIRKQDTIRVFSAVASSSIGEPIAFNFVRANWLRLKEYVGSVSTLNSILKVVTRRLNEVHEYEELKRFVGESCSDLGRPVQQVLERTAANVQWMQKNYRNIVDWLLAAEKNKTA-
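
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code and assuming that the trailing "-" in the protein record marks the stop codon; the function name `translate` and the variable names are illustrative and not part of the database. It translates the first and last codons of DPOGS209834-TA and confirms that a 2988 bp coding sequence corresponds to 995 residues plus one stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the record uses the standard nuclear code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon, ignoring a trailing partial codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        # The record writes the stop codon as "-", so mirror that here.
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 21 and last 12 nucleotides copied from the DPOGS209834-TA record.
first_codons = "ATGACTCAAGGTCACGAACCC"
last_codons = "AAGACCGCATAA"

print(translate(first_codons))  # MTQGHEP
print(translate(last_codons))   # KTA-

# 2988 bp / 3 = 996 codons; one is the stop codon, leaving 995 residues.
transcript_length_bp = 2988
print(transcript_length_bp // 3 - 1)  # 995
```

Passing the full nucleotide sequence (with line breaks removed) to `translate` should reproduce the DPOGS209834-PA record if both assumptions hold.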