Monarch geneset OGS2.0

DPOGS205490
TranscriptDPOGS205490-TA2292 bp
ProteinDPOGS205490-PA763 aa
Genomic positionDPSCF300166 + 303476-309280
RNAseq coverage13x (Rank: top 82%)
Annotation
HeliconiusHMEL0118040.066.98% 
BombyxBGIBMGA008423-TA0.054.32% 
DrosophilaNep5-PB3e-16138.50% 
EBI UniRef50UniRef50_Q7QC438e-17241.70%AGAP002441-PA n=5 Tax=Culicidae RepID=Q7QC43_ANOGA
NCBI RefSeqXP_312504.31e-16940.89%AGAP002441-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3479678793e-17141.70%AGAP002441-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479678792e-16942.07%AGAP002441-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00065082e-235proteolysis
GO:00042222e-235metalloendopeptidase activity
GO:00082371.2e-65metallopeptidase activity
KEGG pathway 
InterPro domain[24-763] IPR0007182e-235Peptidase M13, neprilysin
[564-763] IPR0240792.1e-91Metallopeptidase, catalytic domain
[53-515] IPR0087531.2e-65Peptidase M13
[577-760] IPR0184971.5e-50Peptidase M13, neprilysin, C-terminal
Orthology groupMCL12660 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205490-TA
ATGAATGCCTGGTCGGGCAATAATAGTGATGCTATGGGACCGACGGTTACTCATGTCAAACAAAAACGGAGCAAACCTATACGTATATGCGAAAGCAAACAATGTTTAAGATCAGCTGCGAACCTCGCGCTGTCCATGGACAAGTCAGTGGATCCTTGCAATGATTTTTACCAATACGTATGCGGCAATTGGCCTAAAGAACATCCCAGGCCGGATGCTTATAGTTCATATGATTGGTTCAATGACAAACAGAGGAAAGTATTTGCCACTATACGAGATTTCCTCGCCAAGAATGCCACCAACGAACCGAAACCGGTAAAGCAGGCTAAGGATATATACAGTGCTTGTATAGATACAGAAGAACTAGATAAGAGGGGACTAAAACCTGTGATAAAAATATTAGAATCTTTAGGACTACCCGCGTATCCAACAGTTATCAACGTAACTGATGACATCGACTACTCCACCTACAAATTTAATTGGCTAGAAGCTGTCATAAAAATTAAAACACTATTAGGGATGGATGTACTCATTGGATTTGATATCTTCACTGATCCAAAAAATTCTTCAGTTTATAGACTTGTTATGGGATCACCCGAAGCAACAAATCCTTTTCCAAGCTTTCTCATTGAAAAGAAACGTCATGTAGGACGTAAAAGTATTTTGTTAAGGAAAAAAGAAAAACATAAAAATGCTCCATTTGTAATAGACAATAAAAATAAAAAGAAGAATGTAGATAATAATAATGATGATGAAGAGAAGACTGCACATATATACAAACTATTTTACGCAGAAATGATGAAGCTTTTTGTAGTTGAAGCCACGGCTAAGAACTTTAGTTTGACTGAAGTAGAGTTAGATCAAAACATATTCCTGTCTGCCAATGAATATTATAATATGAATGAAGATATGTATGAGCTAGAAGACGATGTCAATGTTACGACGAGTGATGATGAAATTTATATAAATATACCTGAATACACGGTAGAAGAAATTCAAGAACATACAGATATGATAGTTCAGGATAATGACGGCGTTAAAATGGACATTTGGAAAGAATACCTTGAGGGTATTTTTAAAATAAGCAATGTCGAATTAGATTTTAAAAAAGATTTAATTCTAGTTTCCGATCCCGATTTAAAATACATGTCACTTATGGCTGCCTACGTATCTAAAGCTTCTCCTGTTTCAATTGAACTCTACATTTGGATAAAGGTAGTTGAAGTTATGGCAGTTCACACGACGACGGAATTGAGACTGTTATTCCAGAGGTCATATGACGCATTGAGATCTCGAGAACTGTCGATCACACCGCGCAGCCTTCAGTGTGCTAGTGCCACAAACGATATGATGGGCATGGCTGTGTCCTATGCCATAGCCGACTCCCACTTCTTTAGTGATACCAAGCCAAAGATTGAAGTAATGCTACACGAAATGAAAAACGCACTTGCCCGACTTGTTGGTAAAGCTAAATGGATGGATGATAACACAAAACTAGCGACCTATCAGAAGATCATAGATATGAAGTCATTCGTTGGATTCCCTGACTGGTTACTACATGTGGATTCACTTGAGAAATATTACGAGGGAATCGAAGTTAGTCCTAAAACGCATTTAGAAAATATGATTAAAATAATACAGGTGAAAATAAGGAAAGCCCTTAATAAATTTCGCACTGGCAACGAATTCGCTTGGGCTACCGACCCCACAGAAGTAAATGCTTATCATACTTTTCAAGAAAATACCATAAGTGATTCTCTAAATTATGGAGCATTGGGCAGTGTACTTGGACATGAAATAACACACGGTTTTGATGATTTCGGTAGACGTTTCGACAAGAACGGGAACTTGCTACCGTGGTGGTCCAACGACACCATACAATCGTTCGTCAATATGACCCAATGCTTCGTTGACCAATACTCAAATTTTTATATTCCTGAACTTGGAGAGCATGTAGATGGCAAAAAAACCTTAGGAGAAAATATCGCTGATAACGGTGGTGTTAGGGAGTCTTTCGGGGCTCTTAAAGAACACTTACAAAAGTATGGCCCGGAACAAAAACTACCCGGCTTTGAAGAATTTACTCCAGAACAATTATTTTTTATTTCATATGGAAATTTGTGGTGTGAGGTATCTACCAAGGAGTCCCTGAAGTCAGGTCTATCCGACGAGCACTCTCCTCAACATCTCCGGGCCCGCGGTGCACTACAGAACAATGCAGATTTCTCAAGAATATGGAAATGCCCCCCCGATTCTCCGATGAACCCAACAAAAAGATGTATTATCTATTAG

Protein sequence:

>DPOGS205490-PA
MNAWSGNNSDAMGPTVTHVKQKRSKPIRICESKQCLRSAANLALSMDKSVDPCNDFYQYVCGNWPKEHPRPDAYSSYDWFNDKQRKVFATIRDFLAKNATNEPKPVKQAKDIYSACIDTEELDKRGLKPVIKILESLGLPAYPTVINVTDDIDYSTYKFNWLEAVIKIKTLLGMDVLIGFDIFTDPKNSSVYRLVMGSPEATNPFPSFLIEKKRHVGRKSILLRKKEKHKNAPFVIDNKNKKKNVDNNNDDEEKTAHIYKLFYAEMMKLFVVEATAKNFSLTEVELDQNIFLSANEYYNMNEDMYELEDDVNVTTSDDEIYINIPEYTVEEIQEHTDMIVQDNDGVKMDIWKEYLEGIFKISNVELDFKKDLILVSDPDLKYMSLMAAYVSKASPVSIELYIWIKVVEVMAVHTTTELRLLFQRSYDALRSRELSITPRSLQCASATNDMMGMAVSYAIADSHFFSDTKPKIEVMLHEMKNALARLVGKAKWMDDNTKLATYQKIIDMKSFVGFPDWLLHVDSLEKYYEGIEVSPKTHLENMIKIIQVKIRKALNKFRTGNEFAWATDPTEVNAYHTFQENTISDSLNYGALGSVLGHEITHGFDDFGRRFDKNGNLLPWWSNDTIQSFVNMTQCFVDQYSNFYIPELGEHVDGKKTLGENIADNGGVRESFGALKEHLQKYGPEQKLPGFEEFTPEQLFFISYGNLWCEVSTKESLKSGLSDEHSPQHLRARGALQNNADFSRIWKCPPDSPMNPTKRCIIY-