Monarch geneset OGS2.0

DPOGS209835
TranscriptDPOGS209835-TA2991 bp
ProteinDPOGS209835-PA996 aa
Genomic positionDPSCF300117 + 726573-732116
RNAseq coverage134x (Rank: top 56%)
Annotation
HeliconiusHMEL0084060.058.67% 
BombyxBGIBMGA008059-TA0.055.37% 
DrosophilaCG31198-PA2e-14134.68% 
EBI UniRef50UniRef50_G3ESW00.057.60%Aminopeptidase N3c n=3 Tax=Obtectomera RepID=G3ESW0_OSTNU
NCBI RefSeqNP_001104835.10.057.60%aminopeptidase N3 [Bombyx mori]
NCBI nr blastpgi|1624626920.057.60%aminopeptidase N3 precursor [Bombyx mori]
NCBI nr blastxgi|1478821470.057.14%aminopeptidase N [Ostrinia furnacalis]
Group
Gene OntologyGO:00065085.6e-190proteolysis
GO:00082375.8e-112metallopeptidase activity
GO:00082705.8e-112zinc ion binding
KEGG pathwaynvi:1001242862e-123 
 K11140 (ANPEP)maps-> Glutathione metabolism
    Renin-angiotensin system
    Hematopoietic cell lineage
InterPro domain[53-945] IPR0019305.6e-190Peptidase M1, alanine aminopeptidase/leukotriene A4 hydrolase
[58-457] IPR0147825.8e-112Peptidase M1, membrane alanine aminopeptidase, N-terminal
Orthology groupMCL11206 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209835-TA
ATGGCGACAATGAATGTGATCTTTTTATTTTTAATGTGGACCTACGCTAGTGGATTATCTAGTCCACCAGTTACAAGGAATACAATTTTTGGCGATGAAAAACAAATAGGAGAAATATTTGAAAATATTGACACTGTTAAGGATGACGCTGATAATTATACGTTATACCGACTTCCTACAACCACACGCCCAAAACATTATGATATTTTATGGACCATTGATACCTACAAACGTGTGTTTGATGGCCAAGTTGATATAGAACTATATGCCACTCAACCAAATGTTAGTGAAATTGTAATTCATGCCCATGAAATGAATTATTCCTTTGTGGAACTAAAATATAATGACGCAATTATATCACGGCAATATAGCTTACTACCTGAGGCACAGTTTATGAGAATCATACTGCAAAATAGTTCTATGGAATACAATGCGTCTAACCCGGTCGTATACGTACTTAGCGTTGGCTTTAATGCAGCTCTAAGAAAAGATATGCTCGGAATATATGAAAGTTGGTACAGAAACCCTGGAGGAGAAGAAAAGTGGATGGCTACAACTCAGTTCCAAGCAACTGCAGCTCGAAAGGCTTTTCCTTGCTATGACGAGCCTAGTTTTAAAGCAACATTCAACATAACTATAAGACGTCCAACCGAACTAAAAAGTTGGTCACTTACACGTAAATTGTACACGGAGAATGCAACTCTAGCAGGTTACCAAAATGACGTTTATAGTAAGACACCAGTCATGTCCACTTATCTCCTTGCTTTAATTGTTGCTGATTATGACAGTATAACTTTACCTAGCAATGTGAGTGAACAATTACATTACGAGGTTATTGGAAGAAGATCAGTACTTAGAAGTCAAGGAAACTATGCCCTGTACCTAGGAAAGAATCTAACTGAAGTCATGGGTATTCATACGGGCAAAGATTACTTCAGCTTGCATGAGAATCTAAAAATGACACATGCTGCTATTCCTGATTTTGATGCTGGTGCTATGGAAAACTTCGGACTTATTACGTACAGAGAAGCGTACTTAATGTATGACACAGAGCATACAAATGATTATTTTAAACAAATAATAGCTTATATCCTGTCCCATGAGATAGCTCACATGTGGTTTGGTAACTGGGTTACCTGTGACTTTTGGGATAGTTTATGGTTAAACGAAGGTTTCGCTAGGTACTACCAATACTTCTTGACACATTGGGTTGAAGGTTATATGGGTTTAGATTCTCGCTTTATTAACGAGCAACTTCATACGTCGCTTCTAGCTGATTCGTCTGACACCGCCCATGCATTGACCAATCCAAAAGTTGGTAGTCCGAGTTCAATACGCGGTATGTTTGATACTATAACTTACAACAAAGGTGCATCTGTGATACGTATGACTGAACATCTTCTCGGTTCAGAAATACTCGAAATGGGCTTACAAAAATATTTAGCTGATAATGCATATAAAACCGCACGACCAATTGATCTTTTCGAAGCACTGCAAAACGTGAGTTTGTCCACTGGCGCAATTTCTCAGTACAGAAACTTCTCGTTTATTGAATATTATAGAAGTTGGACTGAACAGGCTGGGCACCCGATAGTTAATGTGCAAGTTAATCATAAAACTGGAGATATGGTGATTACACAGCGCCGTTTTAACATAAATACTGGTTTTTCCCGCAATAACAAACAGTATGTTATTCCGCTCTCATTCACAAGTGCTGACAATATAGATTTTAATAACTTAAAACCGAGTCACATTATGAGGGATGGTGTTATCGTGATTAATCGGGGTAGTGTAGGAAACCACTGGGTCATCTTCAATAAACAGCAGACAGGATTTTACAGAGTCAACTATGATGATTACACTTGGGATCTCATAACTGCTGCCCTCCGGAGTTCAAACAGAACCTTGATTCACGAATATAACAGAGCACAAATTGTAAACGATGTGTTTCAATTCGCAAGATCTGGCATAATGTCTTATAATAAAGCTTTCAACATTCTGTCATTCCTGGAGAATGAAACTGAATATACTCCCTGGGTTGCTGCAATTACTGGCTTTAACTGGCTTAGGAATAGATTCGCTGGTACCAATTATTTAGCACCACTCGAGGAGCTTATCACAAAATGGGCATCAACGGTCATGAATCAGCTCACATACTATCCCCGACAAAATGACACTTTCATGACTTCATACTTAAGATACCAGCTTGCACCATTCATGTGCAGGATGAATGTTCGACGGTGCCTTGACGCTGCTGAATCACAATTTCGTGCATTAGTTAACAATCAAACGGAGGTCCCAGTTAACAGTAGAAACTGGGTGTACTGCCATGCACTAAGGCTTGGTTCTAAGGCAGATTTTGACTATCTCTGGCAACGTTTTAACACGCACAATGTCTATACGGAGAAGATTTTACTCCTCATGACCTTAGGATGTACAAATGATGAAGAATCCTTAAATACTTTGCTGAATAATATCGTTGAAGAGAATTTCGTTATACGAAAACAAGATTATACAACGGCTTTCAACACCCTACTCAATGAAAACTATGAAAACGTACAGATCGCCTTTCGCTTTATCCAAAGAAACCTCACACAAGTACTGAAAGCATTCGACGGTGCAAGTCCGATTACCAACATAGCCTCTAGATTAAGATCATCACAAGACATCAATGCTTTTAAAAGTTGGGCTTCACAAAACAATAATACACTGGGAAATTATTACAAGGGTATTGTAGAGCAGGCTCAGTCTACCCAGAGCAGTTTGGATTGGGCTGTAGAAGTTCAAGACGATATTGGTCAGTATCTTCAGGAAGGTGATACAGAAATTACCACGACAACATTTGCACCACCAACGACACCACAAATATCAACAATTTCTCCCCCACCTTTGGTTGAACCGGACTCACCCAATTTACCGGATTCCTCCGTCACAACTATGATATCATTATCGCTTATCATAATAACAGCAATTGTTCATTACTTGATATAA

Protein sequence:

>DPOGS209835-PA
MATMNVIFLFLMWTYASGLSSPPVTRNTIFGDEKQIGEIFENIDTVKDDADNYTLYRLPTTTRPKHYDILWTIDTYKRVFDGQVDIELYATQPNVSEIVIHAHEMNYSFVELKYNDAIISRQYSLLPEAQFMRIILQNSSMEYNASNPVVYVLSVGFNAALRKDMLGIYESWYRNPGGEEKWMATTQFQATAARKAFPCYDEPSFKATFNITIRRPTELKSWSLTRKLYTENATLAGYQNDVYSKTPVMSTYLLALIVADYDSITLPSNVSEQLHYEVIGRRSVLRSQGNYALYLGKNLTEVMGIHTGKDYFSLHENLKMTHAAIPDFDAGAMENFGLITYREAYLMYDTEHTNDYFKQIIAYILSHEIAHMWFGNWVTCDFWDSLWLNEGFARYYQYFLTHWVEGYMGLDSRFINEQLHTSLLADSSDTAHALTNPKVGSPSSIRGMFDTITYNKGASVIRMTEHLLGSEILEMGLQKYLADNAYKTARPIDLFEALQNVSLSTGAISQYRNFSFIEYYRSWTEQAGHPIVNVQVNHKTGDMVITQRRFNINTGFSRNNKQYVIPLSFTSADNIDFNNLKPSHIMRDGVIVINRGSVGNHWVIFNKQQTGFYRVNYDDYTWDLITAALRSSNRTLIHEYNRAQIVNDVFQFARSGIMSYNKAFNILSFLENETEYTPWVAAITGFNWLRNRFAGTNYLAPLEELITKWASTVMNQLTYYPRQNDTFMTSYLRYQLAPFMCRMNVRRCLDAAESQFRALVNNQTEVPVNSRNWVYCHALRLGSKADFDYLWQRFNTHNVYTEKILLLMTLGCTNDEESLNTLLNNIVEENFVIRKQDYTTAFNTLLNENYENVQIAFRFIQRNLTQVLKAFDGASPITNIASRLRSSQDINAFKSWASQNNNTLGNYYKGIVEQAQSTQSSLDWAVEVQDDIGQYLQEGDTEITTTTFAPPTTPQISTISPPPLVEPDSPNLPDSSVTTMISLSLIIITAIVHYLI-