Monarch geneset OGS2.0

DPOGS209689
Transcript: DPOGS209689-TA (2280 bp)
Protein: DPOGS209689-PA (759 aa)
Genomic position: DPSCF300134 + 401605-427120
RNAseq coverage: 539x (Rank: top 23%)
Annotation
Heliconius: HMEL002156; 0.0; 86.24%
Bombyx: BGIBMGA000529-TA; 0.0; 84.74%
Drosophila: Nep2-PB; 0.0; 53.70%
EBI UniRef50: UniRef50_UPI00015B5C72; 0.0; 50.68%; UPI00015B5C72 related cluster n=1 Tax=unknown RepID=UPI00015B5C72
NCBI RefSeq: NP_001036959.1; 0.0; 84.84%; neutral endopeptidase 24.11 [Bombyx mori]
NCBI nr blastp: gi|27733413; 0.0; 86.40%; zinc metalloprotease [Manduca sexta]
NCBI nr blastx: gi|27733413; 0.0; 86.40%; zinc metalloprotease [Manduca sexta]
Group
Gene Ontology: GO:0006508; 0; proteolysis
GO:0004222; 0; metalloendopeptidase activity
GO:0008237; 1.8e-113; metallopeptidase activity
KEGG pathway 
InterPro domain: [4-759]; IPR000718; 0; Peptidase M13, neprilysin
[543-759]; IPR024079; 2.8e-131; Metallopeptidase, catalytic domain
[113-491]; IPR008753; 1.8e-113; Peptidase M13
[552-758]; IPR018497; 1.7e-60; Peptidase M13, neprilysin, C-terminal
Orthology group: MCL10307; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209689-TA
ATGAGTGGAGACGCTGAGTACATCACGAGTAACTACAACATGAGGAATAATTCCAAGCCATCATTTTGGGCGCAGAGAAGTAGGCTCGAGAAGAGGCTTTTACTATGTCTGGGCGTGATATCTGTCCTAGCTGTAGCTTTTCTCGCAGCATTCCTCGCCACCGCCCTAATGAAGACATCGACTGATGTTAATGACATGCCACAGACTAGCGAATTGCGACTTTCACCGTCCATGCCACCCGCAGTCATCTCAAAGGACTACAATTTATGTACTTCTCCTGGCTGCATCCATACAGCATCTAAGCTACTAATCAATATGGACGACAAAACGGATCCTTGCGACGATTTCTACGACTTTGCCTGCGGTTCATTCGTAAAGAATACGAGGATTCCTGACGACAAGACTTCGGTGAATACCTTCTCCATCATCACAGATCAGCTTCAGGAACAAATACGGGCCTTACTAGATGAACCGATCTCTGAAAATGAACCACGGCCATTCGTTCTAGCAAAAACTTTGTACCAGGCTTGCATGAACAGAACTGCTATTGAGGCCAGAGGAGTCCAGCCTTTACTCGAAATGCTTCTTCGACTCGGAGGATGGCCAGTTCTACAAGGTGACACCTGGAATGAGCGCTCATTCTCCTGGGAGGAATCCGTATACCGCTTCAGGGAAGCTGGTTACTCTGTGGACTACTTTCTCGATTTCTCTATCAGCGTTGATCTGGATCAAGCATCATTGGGCTTAAGCCGGGAATACTTAAATCGCGGTTTTAGCGATAAGCTAGTGCAGGCCTACTATGAATATATGGTTGATATCGCCACTCTATTAGGAGCTGAACGAGCCAAGGCCGAAGTTGAACTTAAAGAATCTCTCCAGTTTGAAATGAAACTTGCTAATATATCCTTGCCATTGGAAAAGAGACGTAACGCCACCAGTCTTTACAATCCGATGACTATTGCTGAGCTGCAGCAAAAGTTTCCGAGGATCCCATGGCTGGCATACATCAACCGACTCCTGTCTCCACATGTACAAGTGGGCTTAGATGAAGTAACCATCGTGAACGTACCTAAATATATAACTGATCTTGAGGACTTATTAGAGAAGACTCCGAAGCGTGTGCAAGCGAACTACGTGATGTGGCGAGTAGCCGGGGCATCGGTCTCCTACTTGACGGAGGATCTGCGTCGAAGGCAGCTAGCTTACGTCACTGCGCTGTCCGGGAAGACTGAGAGAGAGAGCCGTTGGAAGGAGTGTGCTGATACGACCAGTGTTAGTATGTCCATCGCTGTAGGAGCTTTGTACATAAGAAAATATTTCAACGAAAACTCCAAGTCCAACGCTTTAGAAATGGTGAATGACATCAGACAACAATTCCGAAAGACTTTGGAGACTGTTGATTGGATGGATGAAAAGACTCGTCGTGAAGCTTTAGAGAAAGCAGACGCTATGGCTTCACACATCGCCTATCCCAGTGAGATGCTTGACAACGATAAACTCACCGAATTCTACTCTGGGCTGGAGATGTCGTCCGACAAGCTGATGGAGTCGGTTCTGAATCTCACGCTCTTCGGAACTGAATATTTGTTTGGTAAACTTCGGGAGCCTGTCAACAAGACCGACTGGGTGACGCACGGCCGACCCGCCATCGTCAATGCTTTCTATTCTTCTATAGAAAACAGTATTCAGTTCCCGGCCGGTATTCTCCAAGGGGCGTTCTTTTCTGCAAACCGTCCCGCTTATATGAATTACGGTGCTATTGGATTCGTTATTGGACACGAAATAACCCATGGATTTGATGACCAGGGCCGTCAATTTGACAAAAATGGCAACTTGGTAGACTGGTGGCAAGAAACTACCAAACAAAAATATTTGGAAAAAGCAAAATGTATTATAGACCAGTATTTCAACTATACTGTAAAGGAAGTGAATATGAAGCTGAACGGAGTGAACACACAAGGAGAAAACATTGCTGATAACGGGGGCATTAAAGAAGCCTA
TTATGCGTACCAAGCTTGGACACAAAGACACGGAGAAGAAGCGCGTTTACCAGGTCTAGAAAAGTATAGTGCACGCCAGTTGTTCTGGATGAGCGCAGCCAATACTTGGTGTTCAGTTTACCGCAATGAGGCTATCAAACTTCGTATAACTACTGGTTTCCACGCGCCCGGTCGCTTCCGTGTTATAGGCCCCATGTCCAATATGGAAGAATTTGCGGCAGACTTCAAGTGCCCGGCCGGTTCACCGATGAACCCAGTCAAAAAGTGCAAAGTGTGGTAA

Protein sequence:

>DPOGS209689-PA
MSGDAEYITSNYNMRNNSKPSFWAQRSRLEKRLLLCLGVISVLAVAFLAAFLATALMKTSTDVNDMPQTSELRLSPSMPPAVISKDYNLCTSPGCIHTASKLLINMDDKTDPCDDFYDFACGSFVKNTRIPDDKTSVNTFSIITDQLQEQIRALLDEPISENEPRPFVLAKTLYQACMNRTAIEARGVQPLLEMLLRLGGWPVLQGDTWNERSFSWEESVYRFREAGYSVDYFLDFSISVDLDQASLGLSREYLNRGFSDKLVQAYYEYMVDIATLLGAERAKAEVELKESLQFEMKLANISLPLEKRRNATSLYNPMTIAELQQKFPRIPWLAYINRLLSPHVQVGLDEVTIVNVPKYITDLEDLLEKTPKRVQANYVMWRVAGASVSYLTEDLRRRQLAYVTALSGKTERESRWKECADTTSVSMSIAVGALYIRKYFNENSKSNALEMVNDIRQQFRKTLETVDWMDEKTRREALEKADAMASHIAYPSEMLDNDKLTEFYSGLEMSSDKLMESVLNLTLFGTEYLFGKLREPVNKTDWVTHGRPAIVNAFYSSIENSIQFPAGILQGAFFSANRPAYMNYGAIGFVIGHEITHGFDDQGRQFDKNGNLVDWWQETTKQKYLEKAKCIIDQYFNYTVKEVNMKLNGVNTQGENIADNGGIKEAYYAYQAWTQRHGEEARLPGLEKYSARQLFWMSAANTWCSVYRNEAIKLRITTGFHAPGRFRVIGPMSNMEEFAADFKCPAGSPMNPVKKCKVW-
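
The lengths listed in this record are mutually consistent: 759 residues plus one stop codon is 760 codons, which is 2280 bp. The following is a minimal sketch of how that check could be run on FASTA text, assuming the transcript is a pure coding sequence and that a trailing "-" or "*" in the protein marks the stop codon. The function names and the toy records are hypothetical and are not part of the database.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped lines."""
    records = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            current = line[1:].strip()
            records[current] = ""
        elif current is not None:
            records[current] += line
    return records


def cds_matches_protein(cds, protein):
    """True if the CDS length equals 3 * (residues + 1 stop codon)."""
    residues = protein.rstrip("-*")  # drop a trailing stop marker, if any
    return len(cds) == 3 * (len(residues) + 1)


# Toy records for illustration only (not the real sequences above).
toy = """>toy-TA
ATGAGT
TAA
>toy-PA
MS-
"""
recs = read_fasta(toy)
print(cds_matches_protein(recs["toy-TA"], recs["toy-PA"]))
```

Passing the two records above through `read_fasta` and then `cds_matches_protein` would apply the same check to DPOGS209689-TA and DPOGS209689-PA. Line wrapping inside a sequence does not matter, because wrapped lines are joined before the lengths are compared.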