Monarch geneset OGS2.0

DPOGS209792
TranscriptDPOGS209792-TA2820 bp
ProteinDPOGS209792-PA939 aa
Genomic positionDPSCF300117 - 804380-810993
RNAseq coverage833x (Rank: top 15%)
Annotation
HeliconiusHMEL0084140.060.19% 
BombyxBGIBMGA008017-TA0.058.92% 
DrosophilaCG14516-PB1e-13431.76% 
EBI UniRef50UniRef50_P918870.057.28%Aminopeptidase N n=26 Tax=Ditrysia RepID=AMPN_PLUXY
NCBI RefSeqNP_001036834.10.058.61%aminopeptidase N [Bombyx mori]
NCBI nr blastpgi|84889650.059.70%aminopeptidase N [Manduca sexta]
NCBI nr blastxgi|84889650.059.70%aminopeptidase N [Manduca sexta]
Group
Gene OntologyGO:00065085.8e-256proteolysis
GO:00082373.6e-118metallopeptidase activity
GO:00082703.6e-118zinc ion binding
KEGG pathwayaag:AaeL_AAEL0127831e-148 
 K11140 (ANPEP)maps-> Glutathione metabolism
    Renin-angiotensin system
    Hematopoietic cell lineage
InterPro domain[1-920] IPR0019305.8e-256Peptidase M1, alanine aminopeptidase/leukotriene A4 hydrolase
[34-427] IPR0147823.6e-118Peptidase M1, membrane alanine aminopeptidase, N-terminal
Orthology groupMCL10421 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209792-TA
ATGTTCACTTTAATATTTATATCTCTTCTGGCTGTGGTGTCAAGTATACCTTTGAATTCCGAAAATGTAATTAGTAGCATACGAGATGCAGATTATATTTTACCAGGAACAGTGATTCCATCTTTTTATGACGTCTCTCTTATTCTGGACCCACGTAATAATGAAACATTCAGTGGAAACGTCTCAATTACATTACAACCCCTAGTGGCAACAAATGAAATTGTCCTACAAGCTAATCAGATGCGCTTACAGAAAATTCAATTGTTTGCTAACGCAAATCAAACCTTGGACATTTTCTCGAGGTATACTCTAGCAACAGATGACACCCATTTTCTGAAAGTATATTCGAACGGTGAATTGAACGTGAACCAATCATACATTTTGAGAATCGAATACACTGCTGAATTATCTACTGATATGTTTGGCGTGTACCTATCCTCTTACAATGATCAGTCTGGGAATCGTGTAAATCTTATAGCTTCTCAGCTCCAGCCTACTTACTCTCGCCGAGTTTTTCCCTGTTTTGATGAACCACGCTATAAGGCTGAATTCAGAACGACTATTTACGCGCCACCACAATTCACAGTAGTGAGGACTAACATGCCGGAAAGGACGGACTTATCAAAACCACAGATATTAGGATATAATAAATATGAATTCCAAGACACACCATTAATGTCCACTTATTTGATTGCGTATATTGTATCCAATTTTGATTATATTACTAACATCGGCAACATGACATTCTCAAAACCATTAAGAATATACGCAAGACCGGGATACAGGAATAGTTCAGAGTTCGCTTTAGAATTCGGTGAAAAGAATATGGTGGCATTTGAGAGGTATATTGAACTGCCATATGATTTACCGAAGATGGACAAAGTTGCTATACCTGATTTTTCAGCAGGCGCAATGGAAAATTGGGGATTGGTGTTGTATAGAGAAGTTGCTCTTTTAGTACGAGATGGCGTTACAACAACTTCTGCATTACAGAATGTAGGACGTATTATATGTCATGAGAATATGCACATGTGGTTTGGAAACGAAGTCAGTCCACTTAACTGGACATATACGTGGCTGAATGAAGGTTTTGCGAATTTTTTCGAAAACTACGGAACAGAAATGGTTCGACCACACTGGCGGATGATGGATCAATACGTTCTTTTGATACAAGGCGTTCTACAAGATGATGCCGTTCTTAGTGTTAACCCAATGACACACCCTGTATATACTCCTTCGCAAATAATGGGAACTTTTAATGCTGTCGCTTACCAAAAGTCTGGATCTGTCATTCGTATGTTGCAACACTTCCTTACCCCAGAAATATTTAGAAGGGGTCTTGTCCTTTACATTAGGAATAATGTACGGCAAGCAGTTTCACCATCCAATTTGTATGAAAGTTTACAACAAGTGGTCAATGAAACGAATACTAGTCTACCAGATTCTGTAGCCAATATTATGGAGCGTTGGACAACTCAAGGAGGTTTCCCAGTTCTAATGGTTCGAAAACAAGCACCAACAGCTAATTCTCTGTCTATAAGTCAGCAACGATTCCTCACTGATACATCACTGAGTTCATCTAACGAATGGCACGTACCTATCAATTGGGTTCTGTCAACCAATCCAAACTTCAGTGATACTAGTCCTCAAGCTTGGGTCCGGCCTTCCTCTCCAGCTTTAGCAGTTGACATACCCGGATTATCCAACGCTTCATGGTTCATTATTAACAAACAACAAACAGGTTACTATAGAGTTAACTACGAAGATGAGAACTGGGCTGCGTTAGCTGATGTTTTGGCCAGATCTCACAACGTTATCCACCATTTGAACAGAGCCCAAATTTTAGATGACGTATTCAACCTAGCTAGAAATGGAAGAACTCACTACAGACACGCATTGGAAGTATCCCGTTATTTAATTAACGAAACTGATTACATCCCGTGGGGAGCAGTCAACGCTGCTTTCAGCTACCTCGACATTGTGCTCAGTAGTACCCCCGTCTATGAACTATTCCAGCGTCACATTCTTCGTCTGTCCGAACCTTTATTTGATCAACTCGGTTTCGAGCCCAAGCAAAACGAAGAGTTCGTTACGCCTTACCAGAGAAATATCATCCTAAACTTCAATTGCCGATTTGGAAATGAGAGATGTCTTAACAAATCACGAGAAATTCTGATGCAATTTAGAGAAAACCCTAGTCAGCGTATTCATCCGGACCTTAAAACGACGATATACTGCTCCGCTTTGAGGGAAGGAGACGCTGATACCTTCAACTTCCTGTGGGAACAATTCCGAGCCAGTCAAGAATCTAGCGAACAGACTATCCTTTTGAATTCCTTAGGTTGCACCTCTAATGACACACTTCGTTCTTTCTATATGAACCAAGTTATCTCACCTAATTCAGAAGTAAGAGAACAAGATCGTCATACAATCCTTGTTTCCGTTATTAACTCCAGTCCAGCTGGTATGGAAGCTGCTTTAGACTTCGTCATTGAAAATTTTGCCGCCATAAAGCCCAGAGTATCAGGACTTTCGGGCACTCGAAACATTCTGACTGCCTTCGCTAGACGACTTACATCGAGAAATCACGCTGCAAAAATTGAAACATTTATCAGTCGCTACCAAAGCATCTTCTCAGCTGCTGAAGAAGCTGGCGTTCGTAGTATTAATGAAAATATCGCAGCCTCAATCATTTGGAGCAGTAATAATTATAACGCAGTTAACGGCTGGTTACGTAACCAATATGGCAATAGCGCCAATATGATTACATCGAGCTTATTGATCTTGGTCTCTATATTTATAGCCTTTTACAACCACTGA

Protein sequence:

>DPOGS209792-PA
MFTLIFISLLAVVSSIPLNSENVISSIRDADYILPGTVIPSFYDVSLILDPRNNETFSGNVSITLQPLVATNEIVLQANQMRLQKIQLFANANQTLDIFSRYTLATDDTHFLKVYSNGELNVNQSYILRIEYTAELSTDMFGVYLSSYNDQSGNRVNLIASQLQPTYSRRVFPCFDEPRYKAEFRTTIYAPPQFTVVRTNMPERTDLSKPQILGYNKYEFQDTPLMSTYLIAYIVSNFDYITNIGNMTFSKPLRIYARPGYRNSSEFALEFGEKNMVAFERYIELPYDLPKMDKVAIPDFSAGAMENWGLVLYREVALLVRDGVTTTSALQNVGRIICHENMHMWFGNEVSPLNWTYTWLNEGFANFFENYGTEMVRPHWRMMDQYVLLIQGVLQDDAVLSVNPMTHPVYTPSQIMGTFNAVAYQKSGSVIRMLQHFLTPEIFRRGLVLYIRNNVRQAVSPSNLYESLQQVVNETNTSLPDSVANIMERWTTQGGFPVLMVRKQAPTANSLSISQQRFLTDTSLSSSNEWHVPINWVLSTNPNFSDTSPQAWVRPSSPALAVDIPGLSNASWFIINKQQTGYYRVNYEDENWAALADVLARSHNVIHHLNRAQILDDVFNLARNGRTHYRHALEVSRYLINETDYIPWGAVNAAFSYLDIVLSSTPVYELFQRHILRLSEPLFDQLGFEPKQNEEFVTPYQRNIILNFNCRFGNERCLNKSREILMQFRENPSQRIHPDLKTTIYCSALREGDADTFNFLWEQFRASQESSEQTILLNSLGCTSNDTLRSFYMNQVISPNSEVREQDRHTILVSVINSSPAGMEAALDFVIENFAAIKPRVSGLSGTRNILTAFARRLTSRNHAAKIETFISRYQSIFSAAEEAGVRSINENIAASIIWSSNNYNAVNGWLRNQYGNSANMITSSLLILVSIFIAFYNH-