Monarch geneset OGS2.0

DPOGS208533
TranscriptDPOGS208533-TA2499 bp
ProteinDPOGS208533-PA832 aa
Genomic positionDPSCF300064 + 610239-618818
RNAseq coverage299x (Rank: top 37%)
Annotation
HeliconiusHMEL0048220.073.58% 
BombyxBGIBMGA010679-TA0.066.61% 
DrosophilaCG8773-PB2e-16947.21% 
EBI UniRef50UniRef50_Q9VFW75e-16345.96%CG32473, isoform A n=34 Tax=Drosophila RepID=Q9VFW7_DROME
NCBI RefSeqXP_001980293.14e-16947.38%GG19555 [Drosophila erecta]
NCBI nr blastpgi|1949015067e-16847.38%GG19555 [Drosophila erecta]
NCBI nr blastxgi|1951461161e-16546.49%GL24464 [Drosophila persimilis]
Group
Gene OntologyGO:00065081.9e-277proteolysis
GO:00082372.5e-56metallopeptidase activity
GO:00082702.5e-56zinc ion binding
KEGG pathwaydme:Dmel_CG87731e-167 
 K11141 (ENPEP)maps-> Renin-angiotensin system
InterPro domain[208-831] IPR0019301.9e-277Peptidase M1, alanine aminopeptidase/leukotriene A4 hydrolase
[202-350] IPR0147822.5e-56Peptidase M1, membrane alanine aminopeptidase, N-terminal
[127-216] IPR0131034.9e-25Reverse transcriptase, RNA-dependent DNA polymerase
Orthology groupMCL10431 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208533-TA
ATGGCGGGCAGACCGCGAAAGATGTATAATTTATCTCAAGAAAATAGCGATAGTGAGAATGAATCTGAAGTTAGAGATGAACAACTAAGTGCAGGAATAGCAACGATAGAACCGATGACATGGAATGAAATAGAAGATAGTGATGACTTAAAAGCATGGCAAATTGCCCTAGAAGATGAGATCCTAGCGCAACTAAAAAATAAAACGTGGGAAATTGTACCACGACCGAAACAGAGAAAAGTTATTGGAAGTCGACTTGTTCTTAGTACTAAAAATAGAGTACTCGCAAATGAGAAAGTTGGGTCACAGAAAAGAACAGTAACAGAGAAGAATATACTGAAGACAGCTAAAAGTTGGAATGATGCTTTGAATGGATGTGTTAACAGTGTATGTTTGTTGAAAAAGTCTCTGTATGGTTTGCGTCAGTCTGGATTGCAGTGGCATAGAGAATTAGTGAGAAAATTAAAGGACTGTAACTTTGAAGGGTTGCAGCAGGATCCGTGTGTGTTTGTTGCACAAAAGGGTGAGCGAATCATGTTAATTGGTATATATGTAGATGATATTATTCTGGCGACGGATGATGTTGATTGGATGTGTGATATAAAGAAAAGTTTGTCATCAGCATTTGAAATGAAAGATATGGATATGATTGCTATTCCGGACTATGTGTCTGGAGCAACGGAACATTGGGGTCTGATAACATATAGGGAAACATCCTTTCTTATCGACAATGATTTGGCGTCGTCAAGGAATAAATTAAGTGTTGCCAACACAGTGGCACATGAATTAGCGCATATGTGGTTTGGTAACTTAGTTACAATGAAATGGTGGGATGAGGTTTGGTTGAATGAAGGTTTTGCTTCATACATGCAAGTCAAAGCTTTAAATGCTATTGAACCTTCTTGGACAATGTTGGATCAGTTTTTAGTGAGGACCATGCATTCTGTGCTAGATATTGATTCAAAATTATCGAGTCACCCCATAGTGCAAACAGTTGAAACACCCGATGAAATAACAGCCATCTTTGATTCAATCTCATACAATAAGGGAGCTTCAGTATTAAGAATGTTAGAAGGTTTTATTGGTGAGGAGAATTTCAGACTTGGTGTTTCGGATTATTTAAAGAAATATAAATATGGCAACACCATCACACAAGACTTGCTTTCGTCGCTAGAACCATATTTTAGAAAAGATAATCCCACCCTCAGTCTAACTTATATAATGGACACTTGGACAAGACAGATGGGTTACCCTCTGATAAGTATCAAATCGGGTGATAAGCCGAACACCTACATCATCCGACAAGAACGATTCCTCATAGATCCAGAGGCAAAAGACCCTGAACCATCAAAATATAACTACCGTTGGCTCATACCAATTACATACACCAGTGATAAGGGTAGAAATGATAATATCACCTGGTTCCCTGACAACAGTGACTCGATCCAGTTGACTTTAAATGATGGCGAAGAATGGTTTAAAATTAATAACAATCAAATCGGTTACTACAGAGTCAACTATGAAGATAGCATGTGGATTAAATTGGTAGAGCAACTGAAAAATAAATCCACTAAGCTAACAATATCAGACAGATCTCATCTCCTAAACGATGTGTTCGCTTTGGCAGAAGCTCAAATAGTGTCGTATGATGTGGCTTTAAACCTGTCTGCTTATTTGGACGTTGAAACTGATTATGTGCCTTGGGAGACGGCGAGCTCGATATTCTCCGAACTATCAGACAGATTACTAAACACCACCGCCCATGATCACCTCGAGGTATACATACAAAAATTAATAAAACCACTGTATGATACAAGATCATGGGAGAGAAGTAACTTGACGGTTATTGAAGGCTTACTCCGTACCAGAGTTCTCTCTCTGGCTACAAACTATCAGCTACCCGAAGCGAACTCGAAGGTCCGCAGTTTATTCTTGTCCTGGTTGAGTTCACCCAACGAAACTAAAATAGAACCGGATCTTAGAGACTTTGTTTACTATCACGGGATGAAATCAGCGACGCAGGAAGAGTGGGATAAATTGTGGCAAATATATGTGAACGAGGAAGACGTCCAAGAACTGTCCAAATTGAGGAGTGCTTTATCAGCGCCCAGAGATGGAAATATATTACAGAGATACCTCACATTAGCGTGGGATGAGAACAATATAAGGAGTCAAGACTACCTGACTGTGGTGCAACAGATAAGTTCCAACCCATCAGGAACAGATCTCGTGTGGGATTACGTTAGGAACAATTGGACAAAATTCGTTGATAGATTTACTTTGAACAGTCGCTACTTGGGCAATCTTATACCAGGCGTGACCGGAAGCTTTAAAACTGTTGATAAACTTAAAGAGATGGAGTCGTTCTTCGCTAAATATCCTGATGCGGGCGCTGGAGAACTGGCCCGTCAACGAGCGCTGGAGAACGTCCGCGACAATATCCGCTGGACCAACAAACACATGACAGTTGTGGCAAATTGGCTCAAGAACAGATTATAA

Protein sequence:

>DPOGS208533-PA
MAGRPRKMYNLSQENSDSENESEVRDEQLSAGIATIEPMTWNEIEDSDDLKAWQIALEDEILAQLKNKTWEIVPRPKQRKVIGSRLVLSTKNRVLANEKVGSQKRTVTEKNILKTAKSWNDALNGCVNSVCLLKKSLYGLRQSGLQWHRELVRKLKDCNFEGLQQDPCVFVAQKGERIMLIGIYVDDIILATDDVDWMCDIKKSLSSAFEMKDMDMIAIPDYVSGATEHWGLITYRETSFLIDNDLASSRNKLSVANTVAHELAHMWFGNLVTMKWWDEVWLNEGFASYMQVKALNAIEPSWTMLDQFLVRTMHSVLDIDSKLSSHPIVQTVETPDEITAIFDSISYNKGASVLRMLEGFIGEENFRLGVSDYLKKYKYGNTITQDLLSSLEPYFRKDNPTLSLTYIMDTWTRQMGYPLISIKSGDKPNTYIIRQERFLIDPEAKDPEPSKYNYRWLIPITYTSDKGRNDNITWFPDNSDSIQLTLNDGEEWFKINNNQIGYYRVNYEDSMWIKLVEQLKNKSTKLTISDRSHLLNDVFALAEAQIVSYDVALNLSAYLDVETDYVPWETASSIFSELSDRLLNTTAHDHLEVYIQKLIKPLYDTRSWERSNLTVIEGLLRTRVLSLATNYQLPEANSKVRSLFLSWLSSPNETKIEPDLRDFVYYHGMKSATQEEWDKLWQIYVNEEDVQELSKLRSALSAPRDGNILQRYLTLAWDENNIRSQDYLTVVQQISSNPSGTDLVWDYVRNNWTKFVDRFTLNSRYLGNLIPGVTGSFKTVDKLKEMESFFAKYPDAGAGELARQRALENVRDNIRWTNKHMTVVANWLKNRL-