Monarch geneset OGS2.0

DPOGS205036
Transcript | DPOGS205036-TA | 3891 bp
Protein | DPOGS205036-PA | 1296 aa
Genomic position | DPSCF300388 - 35984-59714
RNAseq coverage | 33x (Rank: top 75%)
Annotation
Heliconius | HMEL003410 | 0.0 | 77.09%
Bombyx | BGIBMGA001642-TA | 7e-174 | 61.30%
Drosophila | CG32473-PC | 1e-32 | 26.97%
EBI UniRef50 | UniRef50_E2BRC9 | 7e-60 | 32.40% | Glutamyl aminopeptidase n=1 Tax=Harpegnathos saltator RepID=E2BRC9_HARSA
NCBI RefSeq | XP_395725.2 | 3e-60 | 30.71% | PREDICTED: similar to CG32473-PA, isoform A [Apis mellifera]
NCBI nr blastp | gi|345496100 | 2e-59 | 32.59% | PREDICTED: endoplasmic reticulum aminopeptidase 2-like [Nasonia vitripennis]
NCBI nr blastx | gi|307202300 | 3e-56 | 31.59% | Glutamyl aminopeptidase [Harpegnathos saltator]
Group
Gene Ontology | GO:0006508 | 1.5e-46 | proteolysis
Gene Ontology | GO:0008237 | 6.2e-34 | metallopeptidase activity
Gene Ontology | GO:0008270 | 6.2e-34 | zinc ion binding
KEGG pathway | ame:412263 | 8e-60
KEGG pathway | K11141 (ENPEP) | maps-> Renin-angiotensin system
InterPro domain | [41-1215] IPR001930 | 1.5e-46 | Peptidase M1, alanine aminopeptidase/leukotriene A4 hydrolase
InterPro domain | [38-357] IPR014782 | 6.2e-34 | Peptidase M1, membrane alanine aminopeptidase, N-terminal
Orthology group | MCL26587 | Lepidoptera specific

Nucleotide sequence:

>DPOGS205036-TA
ATGGATGGTATACAGATGGAACCCATTGGAAAATATAAGACAGACCGCAACGGTTCAGCTAAAGCTGCTGCGTGGCGTAACATGGTGGACCAGGACCTAGACGACGTCGCCTTCCTAGCGGGTGAAGTATCTATAGACTTCAAAGTGGACAGAGACACGACATTTGTGGTCCTCAATCTTAGAGACATGAACGTCACCGAGCGCGCTCTGTTTAAGGCCGGAGGTTCTTTGGGGCCCAAGGTTGCGAAAACCCTTGACTACCCACCCGCCGATCAAACCTACATCGACTCATTTTCTAAATCTTATTTTTTAATATACGCACGATTGTTCCTCCTCTTCAATTCCGCGTTTATGAACTACACGCTCAGTCTGCGTTTCATAACTAAACTGGACAGGAGCGATCGTCAGAGAGGTTTTTTCATAGCTGGTACACACAGACAGCGTTGCGCTATATCCAGGTTTTGGCTGACCCACGCTCGCTCGGCCTTCCCCTGTCTAGACGAACCGCATCTGAGGGCCACCTTCAAACTGACCATAGTCAGAGATCGGTTCCACGTGTCACTGACTAACATGCCGATAGTCGCAACCGAGGAGGCTGGTTTTTATTTAGGGCACAGACTGTTACAAGACGAGTTCGCGTTGTCGCCTCCGATGTCCCCTCACATGATGTCGCTGGCTGTGTGCCGTCTCCAGCGGCGCGGCGCGGTGCCACTCCCGGTGCCGACCGCGGACACCACGGAAGCACCACCGGAGGCAGCCCCCGAGATAAGTCTGTACAGCGACCAGCAGGTCATACTGGACGAGTCAGGACCGCTTCTAGAGTGGGTGCAAAAAACGATTCAATTGTTCGCTTACGAACTGAACACGTCCTACCCGCTACCTAAATTCGACATTGTGGTAGTTGACGGTGGTGGTTCGTACTCCGAGGGCTGGGGTCTCATAACGCTGTCACCCTCGCTTCTCACTGACACCAAGGTCATCGCTAGACTGCTGGCGCAACAGTGGTTCGGGGGCCTCGTGTCCCCTCGCTGGTGGAGTTCCCAGTGGTTGTTGGAGGCCCTCACTTCAGTGCTGGGTGAGAAGGCCCCCGCTTTAGGGGGACTCCCCGCCGAGAGGCAACAGGACGCGTTGTTGTTAGATCATGTGTTGCCAGCTCTGAGACTTGAAAAATCAGACATAGAGTCTGCTACCGACGAGTTATCGTTACATAAGGGGGCGGCTATAGTTAGTATGGCGATAGAGGCGGCCGGTGTGGACGTGGGTCGAGCCGCTCTGGCCAGATTACTGAAAGATCATCGCACAGCCAGCGCTGATGCCAAAGACCTTTGGAGGGCGCTGCAACACTCGTCTACCGCTCCGCCCCACGCGTGGGACGGTTGGTGTGAAAGGCCAGGATATCCGCTACTATGTGCCACACAAGTGGACGCGGATATAGTGTTGCGACAGGAGAGGTTCATAATGTCAGCGCCGCCGCCTGAGCCACAACCGGTTATGCTGAATGACCTGCTAACACGCGACCTGAGGTCGGAACTTGAAGAGCTGTTCTATGAACCTGAAAATTACACGGAGATAATCGAGGAGGACATAAATAGCACATCTACCACACCAGCGCCAACCACGACCACCACGGAGAAACCCACAACCAAACCCACGAAACTACCACCAGCACCCAAATGGGTGATACCAGTCACCTTCACCGTGGGACCCCTCATAGAGGAGGAGGAGATGGAGAATATAACGAAGCTATGGAAGAACTCCACGGAGAACATCCAGAATGGCACCTGGTACGACATCTTAAACGACACAAAGCGCTCGTCAAAATGGTCTGAGAATACCGCTACTGGAGTGGGTGCAAAAACGATACAGTTGTTCGCTTACGAACTGAACACGTCCTACCCGCTACCTAAATTCGACATTGTGGTAGTTGACGGTGGTGGTTCGTACTCCGAGGGCTGGGGTCTCATAACGCTGTCACCCTCGCTTCTCACTGACACCAAGGT
CATCGCTAGACTGCTGGCGCAACAGTGGTTCGGGGGCCTCGTGTCCCCTCGCTGGTGGAGTTCCCAGTGGTTGTTGGAGGCCCTCACTTCAGTGCTGGGTGAGAAGGCCCCCGCTTTAGGGGGACTCCCCGCCGAGAGGCAACAGGACGCGTTGTTGTTAGATCATGTGTTGCCAGCTCTGAGACTAGATTCCAGTAACTCGGTGCGTGCTGTCGCCTCACCGAGACTTGAAAAATCAGACATAGAGTCTGCTACCGACGAGTTATCGTTACATAAGGGGGCGGCTATAGTTAGTATGGCGATAGAGGCGGCCGGTGTGGACGTGGGTCGAGCCGCTCTGGCCAGATTACTGAAAGATCATCGCACAGCCAGCGCTGATGCCAAAGACCTTTGGAGGGCGCTGCAACACTCGTCTACCGCTCCGCCCCACGCGTGGGACGGTTGGTGTGAAAGGCCAGGATATCCGCTACTATGTGCCACACAAGTGGACGCGGATATAGTGTTGCGACAGGAGAGGTTCATAATGTCAGCGCCGCCGCCTGAGCCACAACCGGTTATGCTGAATGACCTGCTAACACGAGACCTGAGGTCGGAACTTGAAGAGCTGTTCTATGAACCTGAAAATTACACGGAGATAATCGAGGAGGACATAAATAGCACATCAACCACGCCAGCCCCAACCACGACCACCACGGAGAAACCCACAACCAAACCCACGAAACTGCCACCAGCACCCAAATGGGTGATACCAGTCACCTTCACCGTGGGACCCCTCATAGAGGAGGAGGAGATGGAGAATATAACGAAGCTATGGAAGAACTCCACGGAGAACATCCAGAATGGAACCTGGTACGACATCTTAAACGACACAAAGCGCTCGTCAAAATGGTCCGAGAATGTAACTTATCTGCTATGGATGAATGACACTGAGATGGTGATACCAGATTTGGGTAAACACAAGTGGGTGAGGTACAACGTCGGTGCTCGCGGCCTCTACCGCGTCGCGCCACAAGACAGACATCTGAACGAGACGGCTGAGGCGGCAGCTAGACGAGCGTCAGATTTGTACTCCAGCGGAGCGCCGGCCGAGAGAGCGCTCTTACTAGACGACGCGTTCGTGTTAAGCAGAGCGAGGAGGTTACCAGCTAGTGTAGCGATCGCGGCCGCCGCCCACCTGTCCACGGAGCGCCATTGGGCCGTGTGGCGTGTGGTGGTGTCCCACCTGTCGTGGTGGCGGGAACTGCTGAGGGTGTCTTCATCAGCACCGCATCTGCTAAGCCTCCTCAGCACTCTACCACCCACACTCCCACTCTACACCTCTCAGGATATAGCAGACGCCGCCGTCAGCGAGGATCAGCTGTGGTTGAGTGGCGCCCTCCTGACGGCCGGCGTGGAGTGGGGCAACGAGAACGTGACGCAACAGGCTCTGACGCTGTTCGACGGCTGGACAAACGACAACGAGACCATACCCGAGATATACCAGGAGGCAGCATTCATAGCTGGTGTCCGCGCGCACGGGGCCCCGGCCTGGTCGGCCTGCTGGCGAGCCCTGATCACCTCGGCCTCCGCCCCCCGGCCTCTGTACTCGCACAGGGCTCTGCTAGCGGCCCTCAGCGCTCCCGATGACGACTGGCTTGTGTATCGATTCGCGTACACGGTGCTGTCAAGCGAGGCTCAGAAAGGTCGAGATTGGGATACGTGGGTGACCGCACTTTATGAGGGCCTTTGCAGGACGTGTTTGTGTGACGGGAGCGTCACGTTCAAGGAACTATTCGGCGATCAGAAAGGCGCGGCGTCGGCCTTGGACATCATAGCCTTGAACACAGCGTGGGTCGCGAAGGCGGACGCTGATTTGGTCGCGTATTTCAACTCGATAAAACAAAATGAATGA

Protein sequence:

>DPOGS205036-PA
MDGIQMEPIGKYKTDRNGSAKAAAWRNMVDQDLDDVAFLAGEVSIDFKVDRDTTFVVLNLRDMNVTERALFKAGGSLGPKVAKTLDYPPADQTYIDSFSKSYFLIYARLFLLFNSAFMNYTLSLRFITKLDRSDRQRGFFIAGTHRQRCAISRFWLTHARSAFPCLDEPHLRATFKLTIVRDRFHVSLTNMPIVATEEAGFYLGHRLLQDEFALSPPMSPHMMSLAVCRLQRRGAVPLPVPTADTTEAPPEAAPEISLYSDQQVILDESGPLLEWVQKTIQLFAYELNTSYPLPKFDIVVVDGGGSYSEGWGLITLSPSLLTDTKVIARLLAQQWFGGLVSPRWWSSQWLLEALTSVLGEKAPALGGLPAERQQDALLLDHVLPALRLEKSDIESATDELSLHKGAAIVSMAIEAAGVDVGRAALARLLKDHRTASADAKDLWRALQHSSTAPPHAWDGWCERPGYPLLCATQVDADIVLRQERFIMSAPPPEPQPVMLNDLLTRDLRSELEELFYEPENYTEIIEEDINSTSTTPAPTTTTTEKPTTKPTKLPPAPKWVIPVTFTVGPLIEEEEMENITKLWKNSTENIQNGTWYDILNDTKRSSKWSENTATGVGAKTIQLFAYELNTSYPLPKFDIVVVDGGGSYSEGWGLITLSPSLLTDTKVIARLLAQQWFGGLVSPRWWSSQWLLEALTSVLGEKAPALGGLPAERQQDALLLDHVLPALRLDSSNSVRAVASPRLEKSDIESATDELSLHKGAAIVSMAIEAAGVDVGRAALARLLKDHRTASADAKDLWRALQHSSTAPPHAWDGWCERPGYPLLCATQVDADIVLRQERFIMSAPPPEPQPVMLNDLLTRDLRSELEELFYEPENYTEIIEEDINSTSTTPAPTTTTTEKPTTKPTKLPPAPKWVIPVTFTVGPLIEEEEMENITKLWKNSTENIQNGTWYDILNDTKRSSKWSENVTYLLWMNDTEMVIPDLGKHKWVRYNVGARGLYRVAPQDRHLNETAEAAARRASDLYSSGAPAERALLLDDAFVLSRARRLPASVAIAAAAHLSTERHWAVWRVVVSHLSWWRELLRVSSSAPHLLSLLSTLPPTLPLYTSQDIADAAVSEDQLWLSGALLTAGVEWGNENVTQQALTLFDGWTNDNETIPEIYQEAAFIAGVRAHGAPAWSACWRALITSASAPRPLYSHRALLAALSAPDDDWLVYRFAYTVLSSEAQKGRDWDTWVTALYEGLCRTCLCDGSVTFKELFGDQKGAASALDIIALNTAWVAKADADLVAYFNSIKQNE-
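
Length check (illustrative):

The stated lengths of this record agree with each other: 1296 residues plus one stop codon need 3 x (1296 + 1) = 3891 nucleotides, which matches the transcript length. The Python sketch below shows one way to run the same check on FASTA text. The helper names `read_fasta` and `cds_matches_protein` are hypothetical and not part of any database tooling. It assumes that the trailing `-` (or `*`) in the protein record marks the stop codon and that the transcript record is the coding sequence including that stop codon. Sequence lines wrapped over several lines are joined before counting.

```python
from typing import Dict


def read_fasta(text: str) -> Dict[str, str]:
    """Parse FASTA text into {record name: sequence}, joining wrapped lines."""
    records: Dict[str, str] = {}
    name = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            name = line[1:].strip()
            records[name] = ""
        elif name is not None:
            records[name] += line  # a sequence may span several lines
    return records


def cds_matches_protein(cds: str, protein: str) -> bool:
    """True if the CDS length equals 3 * (residues + 1 stop codon).

    Assumption: a trailing '-' or '*' in the protein is a stop marker,
    not a residue, and the CDS includes the stop codon.
    """
    residues = protein.rstrip("-*")
    return len(cds) == 3 * (len(residues) + 1)


# Toy records in the same layout as the record above (not real data).
toy = """>toy-TA
ATGGAT
GGTTGA

>toy-PA
MDG-
"""
records = read_fasta(toy)
print(cds_matches_protein(records["toy-TA"], records["toy-PA"]))

# Same arithmetic with the lengths stated in this record.
print(3 * (1296 + 1) == 3891)
```

To check the real record, pass the full text of the two FASTA entries above to `read_fasta` and compare `DPOGS205036-TA` against `DPOGS205036-PA`.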