Monarch geneset OGS2.0

DPOGS213611
Transcript: DPOGS213611-TA, 2601 bp
Protein: DPOGS213611-PA, 866 aa
Genomic position: DPSCF300033 + 836334-844466
RNAseq coverage: 1003x (Rank: top 13%)
Annotation
Heliconius      HMEL013686        0.0  88.45%
Bombyx          BGIBMGA011674-TA  0.0  86.37%
Drosophila      Psa-PC            0.0  64.13%
EBI UniRef50    UniRef50_P55786   0.0  58.53%  Puromycin-sensitive aminopeptidase n=94 Tax=Eumetazoa RepID=PSA_HUMAN
NCBI RefSeq     XP_002428559.1    0.0  68.26%  Aminopeptidase N precursor, putative [Pediculus humanus corporis]
NCBI nr blastp  gi|242015848      0.0  68.26%  Aminopeptidase N precursor, putative [Pediculus humanus corporis]
NCBI nr blastx  gi|242015848      0.0  68.26%  Aminopeptidase N precursor, putative [Pediculus humanus corporis]
Group
Gene Ontology    GO:0006508  0       proteolysis
                 GO:0004177  0       aminopeptidase activity
                 GO:0008237  2e-138  metallopeptidase activity
                 GO:0008270  2e-138  zinc ion binding
KEGG pathway
InterPro domain  [1-863]   IPR015568  0       Peptidase M1, puromycin-sensitive aminopeptidase
                 [1-863]   IPR001930  0       Peptidase M1, alanine aminopeptidase/leukotriene A4 hydrolase
                 [10-397]  IPR014782  2e-138  Peptidase M1, membrane alanine aminopeptidase, N-terminal
Orthology group  MCL11352  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213611-TA
ATGCCAGAACACAAACCGTTTCAACGTTTACCAAAAAATGTTGTGCCCAAACATTATGAACTGCATTTGGTGCCAAATCTTGAAAAATTTACCTTTACAGGGAAGACTACGGTGAAAGTATCGATTGTTAATACTACCAAAGAAATAGTGCTTAACAGTTTAGATTTGGATTTAAAAAGTGTAAGACTCCAAATCAATGATGGGGGTTCAGTTTCAACACTTAATCCGGTGGAAGTACGACTTGAACCGGCTGATGAAACTGCAATAATAGTTTTTGACAAGCAGCTTCCAGTGGGTGAAGCAACTTTATATTGTGAATTTATTGGGGAAATAAATGATAAAATGAAAGGCCTGTACCGTAGCAAATATCTTACTCCTAGTGGAGAAGAGCGCTACGCTGCCGTTACTCAATTTGAAGCGACTGATGCACGGCGATGTTTCCCATGCTGGGACGAACCTGCCATAAAGGCAACGTTCGATATTACTCTTGAAGTACCGACTGATCGTGTAGCTTTGTCAAATATGCCTGTAAAAGTTGAAAAGGTGAATGGTGATAAGAAAGTAATGCAATTTGACACAACACCGATAATGTCTACATACCTTGTAGCCGTTGTTGTGGGGGAGTATGACTATGTTGAAAAAACATCTCGCGATGGAGTGTTAGTCCGTGTGTATACTCCAGTTGGCAAGAGCAAGCAAGGCATGTTCGCATTAGAGGTAGCCGCAAAAGTACTGCCATATTACAAAGAATACTTTGACATTGCCTATCCTCTACCTAAGATAGATTTGATAGCAATAGCTGATTTCTCTGCTGGTGCCATGGAGAATTGGGGTCTCGTAACCTATAGGGAAACATGCTTGCTAGTTGATGAAGAACACACATCAGCTGTTCGTAGACAATGGATTGCGCTTGTGGTGGGACATGAGCTTGCACACCAGTGGTTTGGCAACCTAGTGACAATGGAATGGTGGACACATCTCTGGCTCAATGAAGGCTATGCTTCATTTGTTGAATTTCTTTGTGTTAATCATTTGTTCCCCGAATATGATATTTGGACACAATTTGTCACAGAAACATACATAAGAGCACTCGAATTAGATTGCTTGAAAAACTCCCATCCTATTGAAGTACCTGTTGGTCACCCATCAGAGATAGATGAAATTTTTGATGACATTTCATACAACAAAGGTGCATCAGTAATCCGCATGTTGCATAGATACATTGGGGATGATGATTTCCGTAAAGGCATGAACATATATCTTACTAGACACCAGTATAAAAATACATTTACTGAGGACCTTTGGGCGGCTTTGGAGGAGGCGTCTAATAAGCCCGTAGGTGCCGTGATGTCAACATGGACCAAACAAATGGGTTTCCCGATGGTTGAAGTCAGTTCCGAGCAGCGTGGCTCTGATAGAGTTTTGAAGTTAACTCAGAAAAAGTTCTGCGCTGATGGCAGTCAGAGCGACGACGCTTTGTGGATGGTGCCCATCACTATATCCACCCAGGAACAACCTTCGAAGGTTGCATTATCAACTGTTTTGGAGAAACGAACACAGGAGGTAGTGTTGAAAAATGTCGCCGAAGATTCGTGGGTCAAGCTCAATCCTGGAACAGTGGGGTATTACCGCACTCGTTACCCGGCCGCCATGCTGGAGCAGCTGGTGCGTGCTGTGAGGGACGGCAGTCTGCCGCCGCTCGACAGGCTCGGACTGCTGGATGATTGTTTCGCACTCGTTCAGGCTGGACACGCACACACATCCGAGTCATTAAAACTCATGGAGGCGTTCAACAACGAAGCCAACTTCACCGTTTGGTCGTCTATTTCAAACTGCCTCGCCAAGCTGAGCGCTTTGTTTTCACACACGCCTCTCGACAAGCCGCTGAAGAACTATGGTAGGAAGTTATTTGCTAACGTCACCCGTCGCCTGGGATGGGATGCCAAAGATAAGGAAAGCCATCTCGACACTTTGCTCAGAAGCTTAGTGTTGAATAAAAT
GATCAGCTTCGAAGACCCTGACACGATTAAGGAGGCTCAGAGCCGCTTCGAGAAGCACCTGTCGGGCGAGTGTACCCTGCCGGCGGACCTGCGCTCGGCGTGTTACCGCGCGGTGTTGGCGAGCGCCGGCGAGGACACCTTCGGTCGCTTCCTGCAGCTGTACCGCGCCGCTGACCTCCACGAGGAGAAGGACCGCATCAGCCGAGCTCTCGGGGCGGTCAATGACCCCGCGCTGCTCAAAAAAGTGCTGGAGTTCGCTATATCTGACGAGGTTAGGGCACAGGACACCGTCTTCGTCATTGTGTCGGTGGCTTTGAGCCGTAATGGACGGGATTTAGCCTGGCAGTTCTTCAAGGACCATTGGCAGGAATTTATGGACCGTTACCAGGGCGGCTTCCTGCTGGCTCGGCTGGTGAAGTCGACTACTGAGAATTTTGCGTCTGAAGCATGCGCTCAGGAGATCGAGGAGTTCTTCCGCACTCATCACTCGCCCGGCACTGAGCGGTCCGTGCAACAAGCCTTGGAGACCGTCAGGCTGAACGCGGCCTGGCTACGGAGAGACCTCGCCTCCACCACCACATACCTCCAGCCTTATCACTGA

Protein sequence:

>DPOGS213611-PA
MPEHKPFQRLPKNVVPKHYELHLVPNLEKFTFTGKTTVKVSIVNTTKEIVLNSLDLDLKSVRLQINDGGSVSTLNPVEVRLEPADETAIIVFDKQLPVGEATLYCEFIGEINDKMKGLYRSKYLTPSGEERYAAVTQFEATDARRCFPCWDEPAIKATFDITLEVPTDRVALSNMPVKVEKVNGDKKVMQFDTTPIMSTYLVAVVVGEYDYVEKTSRDGVLVRVYTPVGKSKQGMFALEVAAKVLPYYKEYFDIAYPLPKIDLIAIADFSAGAMENWGLVTYRETCLLVDEEHTSAVRRQWIALVVGHELAHQWFGNLVTMEWWTHLWLNEGYASFVEFLCVNHLFPEYDIWTQFVTETYIRALELDCLKNSHPIEVPVGHPSEIDEIFDDISYNKGASVIRMLHRYIGDDDFRKGMNIYLTRHQYKNTFTEDLWAALEEASNKPVGAVMSTWTKQMGFPMVEVSSEQRGSDRVLKLTQKKFCADGSQSDDALWMVPITISTQEQPSKVALSTVLEKRTQEVVLKNVAEDSWVKLNPGTVGYYRTRYPAAMLEQLVRAVRDGSLPPLDRLGLLDDCFALVQAGHAHTSESLKLMEAFNNEANFTVWSSISNCLAKLSALFSHTPLDKPLKNYGRKLFANVTRRLGWDAKDKESHLDTLLRSLVLNKMISFEDPDTIKEAQSRFEKHLSGECTLPADLRSACYRAVLASAGEDTFGRFLQLYRAADLHEEKDRISRALGAVNDPALLKKVLEFAISDEVRAQDTVFVIVSVALSRNGRDLAWQFFKDHWQEFMDRYQGGFLLARLVKSTTENFASEACAQEIEEFFRTHHSPGTERSVQQALETVRLNAAWLRRDLASTTTYLQPYH-
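
The transcript length (2601 bp), protein length (866 aa) and genomic coordinates (836334-844466) given in this record can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming that the transcript consists of coding sequence only (start codon through stop codon) and that the genomic coordinates are 1-based and inclusive. The record states neither assumption, and the function names are illustrative.

```python
def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Length in bp of a coding sequence encoding `protein_aa` residues.

    Assumption: three bases per codon, plus one stop codon if requested.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def genomic_span(start: int, end: int) -> int:
    """Number of bases covered by a 1-based, inclusive coordinate range."""
    return end - start + 1


# Values copied from the record for DPOGS213611.
transcript_bp = 2601
protein_aa = 866

# True if the transcript length is fully explained by the protein plus a stop codon.
print(expected_cds_length(protein_aa) == transcript_bp)

# Span of the gene on scaffold DPSCF300033 under the inclusive-coordinate assumption.
print(genomic_span(836334, 844466))
```

Under these assumptions, 866 codons plus one stop codon account for all 2601 bp of the transcript. The genomic span of 8133 bp exceeds the transcript length, which would be consistent with intronic sequence in the gene model.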