Monarch geneset OGS2.0

DPOGS213400
TranscriptDPOGS213400-TA1590 bp
ProteinDPOGS213400-PA529 aa
Genomic positionDPSCF300109 + 512595-515754
RNAseq coverage2320x (Rank: top 5%)
Annotation
HeliconiusHMEL0145040.072.39% 
BombyxBGIBMGA009138-TA3e-14863.26% 
Drosophilagranny-smith-PB3e-15653.38% 
EBI UniRef50UniRef50_Q8NDH36e-15053.69%Probable aminopeptidase NPEPL1 n=61 Tax=Eukaryota RepID=PEPL1_HUMAN
NCBI RefSeqXP_968492.13e-16355.51%PREDICTED: similar to GA20276-PA [Tribolium castaneum]
NCBI nr blastpgi|910941276e-16255.51%PREDICTED: similar to GA20276-PA [Tribolium castaneum]
NCBI nr blastxgi|910941272e-15555.51%PREDICTED: similar to GA20276-PA [Tribolium castaneum]
Group
Gene OntologyGO:00065082e-91proteolysis
GO:00041772e-91aminopeptidase activity
GO:00056222e-91intracellular
GO:00195381.9e-42protein metabolic process
GO:00057371.9e-42cytoplasm
GO:00301451.9e-42manganese ion binding
GO:00082351.9e-42metalloexopeptidase activity
KEGG pathway 
InterPro domain[204-510] IPR0008192e-91Peptidase M17, leucyl aminopeptidase, C-terminal
[281-298] IPR0113561.9e-42Peptidase M17
Orthology groupMCL17398 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213400-TA
ATGTCAAACTCGGTGAACATCAAGTTTAAATGGGGTCTGAGTACTTCGGACCCCGAGCAAAAGCCTGTGCTGTTTGTGGGTCAAACGGCACACATAGCAGCTCTGTCCTGGCAAGATGTCCGCTGTAAGCTGGAGCCTAGAGTCACTGAAGAGGTGTGGCGGCGTGCAGTGTCCGTGATGGAGGGAGGCGAGGTGTGCGAGGTGTGGCCGCGGGGCGTAGCCCTGGGCGCTCTGCCACCGCGGCGCTCCCGACACGCGGCGCCGGCTCGATCACACGCCCTGTCCAAGCTGGTCAGAACGTCTCTGAGGTCCGCTGCCAGCGAGTTTGTCGTGTTGGTGTGTCGCAAGCGTGACGTATTGTCGAGCGCGGTGGGTGTGGCGCGCGCCGTGCCGCTGTACTCGGCCAGCTCCGGGCCTGCGCCCCTCGCCCACGGGAACCATCACGACGCGGCCTGCGCCACACCGCGCACGCTCACCGTGGAGATACAGCTCGTCCAAGATGACGGTGTGGAGGACGACGAGGACTTGGACCCCGCGGAGCCAATCTTGAGGGACGGCGTTCTGTCCTCGGAGGACCTCAAGACCATACAGGACGTCGCGGACGCGACCCGCCTCGCCGCCCGGATCACCGACACACCCGCCAACATCATGGACGTGGACGCGTTCATACAGGAAGCTATAAACCTCGCCAAGGAGCTGGAGATCCCCCCGCCCACGATCATCCGCGGCGAAGAGTTGAAGGCGCGTGGTATGGGCGGCTTGTACGGCGTGGGCAAGGCGGCCGCTCGTCCGCCCGCCCTCGTCGCGCTGTCCTACCGCCCGCCCTCCGCCAGCCAGACGGTCGCGTGGGTCGGGAAGGGCATCGTCTACGACACCGGCGGGCTCAGTCTCAAGGCTCCCAAGTCGATGTGCGGTATGAAGTATGACTGCGGCGGCGCGGCAGCCGTGCTGGGCGCCTTCAGCGCGGTCGTCAGGGCTCGGCCGTCGGTGGCGCTCCACGCCGTGCTCTGCCTGGCCGAGAACGCGATCGGTCCGCTCGCCACCAGGCCGGACGACATCCACCAGCTGTACTCGGGCCGCACGGTGGAGATCAACAACACGGACGCCGAGGGCCGGCTGGTGCTGGCGGACGGCGTGGTGTTCGCGCAGAGAGACCTCAAGGCCGACACTATCGTGGACGTCGCCACGCTGACGGGAGCCCAGGGCATAGCGACGGGCAAGTACCACGCGGCCGTCGTGTCCAACTGCGGCTCCCTGGAGGCGAGCTGCGTCCGCGCGGGTCGCATCAGCGGCGACCTCACCCACCCACTGCCCTTCGCACCCGAACTGCACTTCTACGAGTTCAGCAGCGCCGTCGCCGACATGAAGAACAGCGTCGCCGACCGGGAGAACGCGCAGTCTTCGTGCGCCGGACTGTTCGTCCTATCGCACCTCGGTTTCGACTTCCCCGGCCGCTGGCTGCACGTGGACATGGCCGCTCCCTCCAGATGTGGTGACCGGGCGACCGGGTACGGCGTGGCGCTGCTCGCGGTGCTGTTCGGAGGCTCCACGGACAGCCGGCTGCTGCGGGCGCTGGCTCCCCACAAGTGA

Protein sequence:

>DPOGS213400-PA
MSNSVNIKFKWGLSTSDPEQKPVLFVGQTAHIAALSWQDVRCKLEPRVTEEVWRRAVSVMEGGEVCEVWPRGVALGALPPRRSRHAAPARSHALSKLVRTSLRSAASEFVVLVCRKRDVLSSAVGVARAVPLYSASSGPAPLAHGNHHDAACATPRTLTVEIQLVQDDGVEDDEDLDPAEPILRDGVLSSEDLKTIQDVADATRLAARITDTPANIMDVDAFIQEAINLAKELEIPPPTIIRGEELKARGMGGLYGVGKAAARPPALVALSYRPPSASQTVAWVGKGIVYDTGGLSLKAPKSMCGMKYDCGGAAAVLGAFSAVVRARPSVALHAVLCLAENAIGPLATRPDDIHQLYSGRTVEINNTDAEGRLVLADGVVFAQRDLKADTIVDVATLTGAQGIATGKYHAAVVSNCGSLEASCVRAGRISGDLTHPLPFAPELHFYEFSSAVADMKNSVADRENAQSSCAGLFVLSHLGFDFPGRWLHVDMAAPSRCGDRATGYGVALLAVLFGGSTDSRLLRALAPHK-