Monarch geneset OGS2.0

DPOGS209841
TranscriptDPOGS209841-TA5982 bp
ProteinDPOGS209841-PA1993 aa
Genomic positionDPSCF300117 + 954998-978110
RNAseq coverage125x (Rank: top 57%)
Annotation
HeliconiusHMEL0090030.059.06% 
BombyxBGIBMGA008066-TA0.055.06% 
DrosophilaCG13409-PA5e-13145.10% 
EBI UniRef50UniRef50_E0VS711e-11534.86%Aminopeptidase N, putative n=1 Tax=Pediculus humanus corporis RepID=E0VS71_PEDHC
NCBI RefSeqXP_002073688.14e-13345.10%GK13003 [Drosophila willistoni]
NCBI nr blastpgi|3503997587e-14949.37%PREDICTED: transmembrane protein 181-like [Bombus impatiens]
NCBI nr blastxgi|3503997582e-15750.45%PREDICTED: transmembrane protein 181-like [Bombus impatiens]
Group
Gene OntologyGO:00065084.1e-222proteolysis
GO:00082372.6e-42metallopeptidase activity
GO:00082702.6e-42zinc ion binding
KEGG pathwayame:4128083e-125 
 K11140 (ANPEP)maps-> Glutathione metabolism
    Renin-angiotensin system
    Hematopoietic cell lineage
InterPro domain[26-992] IPR0019304.1e-222Peptidase M1, alanine aminopeptidase/leukotriene A4 hydrolase
[389-507] IPR0147822.6e-42Peptidase M1, membrane alanine aminopeptidase, N-terminal
Orthology groupMCL12184 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209841-TA
ATGCCTTTATCAACTAGCCGACAGCAATTCTTGGCGTACGAATCTCATGGTGATCCAGATATACACTATACTAGAAAAGGCGGAGTATTTATATCTACCTGCATTTGCGCTATCTTCGTTATTTTCGCTATATTAATAGCTATACTTGTGGGTATAATCGTATACTACATCACATACATTAAGATATCTCAAAAATCTGAAGATTTCTGGGATGAGACAGATCCATTTGGTCAAAGTAGCTCACCTTCACCAGACCTACGCCTGCCTTCGAGTATAATACCAAGCTTTTATAGACTTAAAATCAAAGCTGATTTAGAGAACAAAAATTTCAGCGGAGATGTGTACATTACTATAAAAGCAAATAGAAAAGTTAACCAAGTCATACTCCACTCCAAAGATCTCGTTATTGGACATGAATTGACTTTAACAGAACAAATATATGAAAAGGTGGAAACATTACACTCTGCTAAATCTAAACGAGAAATAGTGACAGAGAATAATACTGTAACAGAGACTAATTCAACATTACCAAGTGCTACAACGGTAGACACGGTCACAGCTGATAATGCTACATATGTAGAAACAACTGATAATTCATCAAGTATTTTAACCACAACACCTCCAGTTAATACTCAGGTCACTCACAGTAATGTGAGAAACATAAATATTACATCAGTGAAGTTTAGATCCGGCGATCGACTGATTCTAAACCTCGGATCATATCTGACTCCATATGTTGATTATACCTTACAAGTGCCCTTTAAAGGAAACATTTCTAGCTCATTGACCGGCTTCTACAAGAGTTCTTATACGGAGGACAACCAAGTGAGACAGTTGGCCGTTACACAGTTTGAACCGACATCAGCGAGGGCAGCTTTTCCATGCTTTGACGAACCAGAATTCAAAGCAAACGAATCAGATCCAGAATGGAAATGGACACATTTTGAGAGATCAGTCAATATGTCTACTTATCTGGTTGCTTATGTACTATCTGACTTTGTAAATGTAGAGACAAATTATACAAGTGTCGGTAACATAACCAAACCAATCAGGATATGGACAAGACCCTCCTTGATATCCAAAGCGAACTATGCACTAACTATTGTACCTAAGCTGCTGTCGTTCTACGAGGAAATCTTTGGTCGGTGCCATGGAAAATTGGGGGGTCTAATCACTTTCAGAGAGACAACCCTATTATATGATGAAGTTGAGGGTATTCCCCGTGAGAAACAAAACGTAGCTATCGACGTAGCTCATGAGTTAGCTCATCAATGGTTTGGCGATCTCGTCACTATGAAGTGGTGGACGGACCTGTGGCTCAACGAGGGTTTCGCCACATACATAGAGTATGTCGGCGTTGACCATATTGCACCTGAATGGGACATGTTTGAATCATTCACAAGGGACAAAATGAATCTTCTTAGAACAGATGCTCTGAAGAACACAGCTCCTGTATCGCAAAAGGTCATCGACGCCTCAGAAATATCCCAGAAGTTTGATGAGATATCATATTCAAAGGGCGCCAACTTAATAAGAATGTTGAACCACACTATATCCAAGGAATTGTTCCACAAGGGATTGCTTATATATTTGAACCTTTGGAAATACCGGAACGCCGAGGAGAACGATCTATGGCAGGCGATGTCTCTAGCTACTAAGGAGTCCCCGCGTCTGAAGGGCCTGTCTGTTGTCGATTTCATGAACACTTGGACTAAACAGCCGGGTTACCCTGTGGTCAGGGTCCTCCGAAACTATGAAAACGATTATGTTACCTTTGAGCAGAATCTCTTCACCAGCAATAAAAATAATAAGAAGGAACAAAAATGGCAAATACCCATAAGTTACAGTACTAACAGCAGCGACTGGAGTACCGAGGCGAAATTCTTTTTGAACGACGACGCTATCACGACTCAAATTGATATCAACAGCTCGCAGGCGCTGTACGTTAATGTTGAAGCTATCGGATACTATCGGGTTAACTACGATCACAGGAACTGGGATCTGTTAAATAAGGCTCTGAAGAACGGCACAATCAAAAGCCCGATAGCGAAAGCCCAGCTTATAGATGACGCTTTCAATTTGGCCAAAACTAACCAATTGGAGTACAGCTACGCGCTCGGGCTAACCACGTGCGTCATAGACGGAGAGGAGTCCAAGACTGTTTGGGACTTATTATTAAACAATATGGCGTTTTTGAGTCACAACCTGAGAGCGACTTCCGGGTACATGTACTTCCAGGACTACATGAAGATAATACTGAAAAAACAACTGGAGCGTCTCAACTACGGTCTGAATAAACCCAAGGACGACAACGAAGCGTTCTTAATAGAGAACCTGGTGCTGTGGGAGTGTCTCGTGGAGTCGCCGCGGTGTCTACAGTGGACCAGGGAACAATTTGAGACTTGGACCAGTAAACCAAACATGACTGATAATCCTATCCCGAGCTTCCTTCGGTCACTAGTCTACAACATGGCCATCAAAAACGGAGGTAGACGGGAGTTTGAAATACTTTGGAACATCTTCTTAAACACCACCGACCCTAATATCAAGAGCCTGATTATATCCAACTTGCCCAGCACCAAGGAGGAATCGTTGATAACTCTACTGCTCGAGAAGAGTCTGTCGGAGATACCGACGCAGTACGCGATATCGGCTTGGAGCGTGGACGCGCCAATCGGCACTAAGATAGCTCAGGACTTCCTCATAGACAACTTCGACAAGGTGTACAAGAGATTCAACGAGATGGACTCCTTCATGTTCGCTGGAGTACTGAACGGAGCGTTCGGCTTCATCACTACCAACGACGAATTGAACAGGTTTAAAAAATTCGCTTTGGACCACAAATCTGAGCTGCAGCCAATGTCTCACACGCTCCAGAAGATAGCTGACAGCGGAGCGGTCAGGATATCCTGGATCAACACACACGCTAGGAACATTAACAACTGGTTCAAGACATATGTAGAAGGTCCGTCTCCGACATTAACAACGTCCGTGTCCAGCGGAGTCAACGCTAGTTCTTTGGCCCACGGACCGTACCGCCTCCGAAGCCCCGCGATGGGAGCGCGCTCTCAGCAACTGAAATACCGGAACGCCGAGGAGAACGATCTATGGCAGGCGATGTCTCTAGCTACTAAGGAGTCCCCGCGTCTGAAGGGCCTGTCTGTTGTCGATTTCATGAACACTTGGACTAAACAGCCGGGTTACCCTGTGGTCAGGGTCCTCCGAAACTATGAAAACGATTATGTTACCTTTGAGCAGAATCTCTTCACCAGCAATAAAAATAATAAGAAGGAACAAAAATGGCAAATACCCATAAGTTACAGTACTAACAGCAGCGACTGGAGTACCGAGGCGAAATTCTTTTTGAACGACGACGCTATCACGACTCAAATTGATATCAACAGCTCGCAGGCGCTGTACGTTAATGTTGAAGCTATCGGATACTATCGGGTTAACTACGATCACAGGAACTGGGATCTGTTAAATAAGGCTCTGAAGAACGGCACAATCAAAAGCCCGATAGCGAAAGCCCAGCTTATAGATGACGCTTTCAATTTGGCCAAAACTAACCAATTGGAGTACAGCTACGCGCTCGGGCTAACCACGTGCGTCATAGACGGAGAGGAGTCCAAGACTGTTTGGGACTTATTATTAAACAATATGGCGTTTTTGAGTCACAACCTGAGAGCGACTTCCGGGTACATGTACTTCCAGGACTACATGAAGATAATACTGAAAAAACAACTGGAGCGTCTCAACTACGGTCTGAATAAACCCAAGGACGACAACGAAGCGTTCTTAATAGAGAACCTGGTGCTGTGGGAGTGTCTCGTGGAGTCGCCGCGGTGTCTACAGTGGACCAGGGAACAATTTGAGACTTGGACCAGTAAACCAAACATGACTGATAATCCTATCCCGAGCTTCCTTCGGTCACTAGTCTACAACATGGCCATCAAAAACGGAGGTAGACGGGAGTTTGAAATACTTTGGAACATCTTCTTAAACACCACCGACCCTAATATTAAGAGCCTGATTATATCCAACTTGCCCAGCACTAAGGAGGAATCGTTGATAACTCTACTGCTCGAGAAGAGTCTGTCGGAGATACCGACGCAGTACGCGATATCGGCTTGGAGCGTGGACGCGCCAATCGGCACTAAGATAGCTCAGGACTTCCTCATAGACAACTTCGACAAGGTGTACAAGAGATTCAACGAGATGGACTCCTTCATGTTCGCTGGAGTACTGAACGGAGCGTTCGGCTTCATCACTACCAACGACGAATTGAACAGGTTTAAAAAATTCGCTTTGGACCACAAATCTGAGCTGCAGCCAATGTCTCACACGCTCCAGAAGATAGCTGACAGCGGAGCGGTCAGGATATCCTGGATCAACACACACGCTAGGAACATTAACAACTGGTTCAAGACATATGTAGAAGGATATTCGTACCATCTGCCCAGTGGCGGATGGAATTACAAAATTCGTAACACGCTGTCCCAATTTAGTGATTTATTTAGTGAATTCAACAAATATATAGCACCAGCTTACCACCACGACCGTTGTGAAAGATCTGTTCAAATGAGGATTTATTCAATGCACAAAGGAGAATTCGTGATGGTTTTTATAGCATTTTTTGCTTGCTTTGGTCTTGGTGTATTTATAGGGTTAGCAGGTCCGTCTCCGACATTAACAACGTCCGTGTCCAGCGGAGTCAACGCTAGTTCTTTGGCCCACGGACCGTACCGCCTCCGAAGCCCCGCGATGGGAGCGCGCTCTCAGCAACTGTGGTTGCTGGCGGAAATACTCACAGATAATGACGACGAGGAGATATTTGACAAGAGCTTTCAAATAAGTATATCAATAGATGGCGTATTAAGTGACCACACAACTGTCAATCTCCTGCCCGAGTCGGAGGCGACTAACAGAACACAACACCTGAAATGCAAGAAGCAGGTTTGCGAGGACGTGATGGTGCTGCACCTGGGTTCACTGGAGTACACGCACTACGTGTTCAGCGTACATTTCTACGGTCTAGAGGAGTTCCATAAACGATACAATATACGTGAAATAATTTTTTACGTTTGCGAGGACGTGATGGTGCTGCACCTGGGTTCACTGGAGTACACGCACTACGTGTTCAGCGTACATTTCTACGGTCTAGAGGAGTTCCATAAACGATACAATATACGTGAAATAATTTTTTACTTCAAAACCTACAATCCAGTGTTCACTCAAATGGAGACCTGGTTCAGATTCATCTTCCTCCTCACAACATTCACAGTGGCTTGTTGGTTCGGCCACACGCTCCGCCGATACTCCACACAAGACTGGGCCATCGAACAGAAGTGGGTCTCCATACTACTACCGTTACTACTACTATATAATGACCCTCTGTTTCCCCTCCGGCTGGTGTCAAGCGGCGTGCTGTCTCCGCTGTTGGACGTGGTCTTTCAGACTTCATTCTTGTCGTCCGTGTTGCTGTCGTGGTTGTCCCTCTACCACGGGCTGAGACAGGTGGTGAAAATAATGTTCTTCGTGGCTGTGACATTATATTTCCTGTATCTGCTCGTCCTTATTGTGAAAGCGTATAGTGATTTACGTAACATGCCATTTTTCGACGTCCGCCTCCGCTGCCTGTCGTTGGTGGTGTGTATCGTGTGTGTCGTGTGCGTCCTGGTGTGTGTCCGTGGTTGGGGCCCCGCAGCCCTGCAGGACCACTGGGCGTCCCAGGCCTCCGCCCGCTACGACACCTCCGCCGCCTTCATGGCTATCTACGGCTTATTCAACTTCAACGTATACGTCATGGCGTATCTGTTTTCACCCGGAACTAGCGCTGTACACGAGACAGCCATAACCAAAGACAATCCAGCATTTTCTATGATCAACGACTCCGATGAAGAAGTTATCTATGGCTCCGACGAAGAGAGCAGACGTCCCCTTAATTCTCACTCCCACAGAGCGGCTACAGAAGATATATGA

Protein sequence:

>DPOGS209841-PA
MPLSTSRQQFLAYESHGDPDIHYTRKGGVFISTCICAIFVIFAILIAILVGIIVYYITYIKISQKSEDFWDETDPFGQSSSPSPDLRLPSSIIPSFYRLKIKADLENKNFSGDVYITIKANRKVNQVILHSKDLVIGHELTLTEQIYEKVETLHSAKSKREIVTENNTVTETNSTLPSATTVDTVTADNATYVETTDNSSSILTTTPPVNTQVTHSNVRNINITSVKFRSGDRLILNLGSYLTPYVDYTLQVPFKGNISSSLTGFYKSSYTEDNQVRQLAVTQFEPTSARAAFPCFDEPEFKANESDPEWKWTHFERSVNMSTYLVAYVLSDFVNVETNYTSVGNITKPIRIWTRPSLISKANYALTIVPKLLSFYEEIFGRCHGKLGGLITFRETTLLYDEVEGIPREKQNVAIDVAHELAHQWFGDLVTMKWWTDLWLNEGFATYIEYVGVDHIAPEWDMFESFTRDKMNLLRTDALKNTAPVSQKVIDASEISQKFDEISYSKGANLIRMLNHTISKELFHKGLLIYLNLWKYRNAEENDLWQAMSLATKESPRLKGLSVVDFMNTWTKQPGYPVVRVLRNYENDYVTFEQNLFTSNKNNKKEQKWQIPISYSTNSSDWSTEAKFFLNDDAITTQIDINSSQALYVNVEAIGYYRVNYDHRNWDLLNKALKNGTIKSPIAKAQLIDDAFNLAKTNQLEYSYALGLTTCVIDGEESKTVWDLLLNNMAFLSHNLRATSGYMYFQDYMKIILKKQLERLNYGLNKPKDDNEAFLIENLVLWECLVESPRCLQWTREQFETWTSKPNMTDNPIPSFLRSLVYNMAIKNGGRREFEILWNIFLNTTDPNIKSLIISNLPSTKEESLITLLLEKSLSEIPTQYAISAWSVDAPIGTKIAQDFLIDNFDKVYKRFNEMDSFMFAGVLNGAFGFITTNDELNRFKKFALDHKSELQPMSHTLQKIADSGAVRISWINTHARNINNWFKTYVEGPSPTLTTSVSSGVNASSLAHGPYRLRSPAMGARSQQLKYRNAEENDLWQAMSLATKESPRLKGLSVVDFMNTWTKQPGYPVVRVLRNYENDYVTFEQNLFTSNKNNKKEQKWQIPISYSTNSSDWSTEAKFFLNDDAITTQIDINSSQALYVNVEAIGYYRVNYDHRNWDLLNKALKNGTIKSPIAKAQLIDDAFNLAKTNQLEYSYALGLTTCVIDGEESKTVWDLLLNNMAFLSHNLRATSGYMYFQDYMKIILKKQLERLNYGLNKPKDDNEAFLIENLVLWECLVESPRCLQWTREQFETWTSKPNMTDNPIPSFLRSLVYNMAIKNGGRREFEILWNIFLNTTDPNIKSLIISNLPSTKEESLITLLLEKSLSEIPTQYAISAWSVDAPIGTKIAQDFLIDNFDKVYKRFNEMDSFMFAGVLNGAFGFITTNDELNRFKKFALDHKSELQPMSHTLQKIADSGAVRISWINTHARNINNWFKTYVEGYSYHLPSGGWNYKIRNTLSQFSDLFSEFNKYIAPAYHHDRCERSVQMRIYSMHKGEFVMVFIAFFACFGLGVFIGLAGPSPTLTTSVSSGVNASSLAHGPYRLRSPAMGARSQQLWLLAEILTDNDDEEIFDKSFQISISIDGVLSDHTTVNLLPESEATNRTQHLKCKKQVCEDVMVLHLGSLEYTHYVFSVHFYGLEEFHKRYNIREIIFYVCEDVMVLHLGSLEYTHYVFSVHFYGLEEFHKRYNIREIIFYFKTYNPVFTQMETWFRFIFLLTTFTVACWFGHTLRRYSTQDWAIEQKWVSILLPLLLLYNDPLFPLRLVSSGVLSPLLDVVFQTSFLSSVLLSWLSLYHGLRQVVKIMFFVAVTLYFLYLLVLIVKAYSDLRNMPFFDVRLRCLSLVVCIVCVVCVLVCVRGWGPAALQDHWASQASARYDTSAAFMAIYGLFNFNVYVMAYLFSPGTSAVHETAITKDNPAFSMINDSDEEVIYGSDEESRRPLNSHSHRAATEDI-