Monarch geneset OGS2.0

DPOGS208900
TranscriptDPOGS208900-TA3780 bp
ProteinDPOGS208900-PA1259 aa
Genomic positionDPSCF300009 - 689807-729679
RNAseq coverage491x (Rank: top 25%)
Annotation
HeliconiusHMEL0146841e-15978.41% 
BombyxBGIBMGA002474-TA6e-12970.25% 
DrosophilaCG14869-PB2e-3727.31% 
EBI UniRef50UniRef50_Q5W7F40.066.02%A disintegrin and metalloproteinase with thrombospondin motifs 1 n=4 Tax=Obtectomera RepID=Q5W7F4_BOMMO
NCBI RefSeqNP_001036981.10.066.02%A disintegrin and metalloproteinase with thrombospondin motifs 1 [Bombyx mori]
NCBI nr blastpgi|1129834320.066.02%A disintegrin and metalloproteinase with thrombospondin motifs 1 precursor [Bombyx mori]
NCBI nr blastxgi|1129834320.066.02%A disintegrin and metalloproteinase with thrombospondin motifs 1 precursor [Bombyx mori]
Group
Gene OntologyGO:00065081.2e-26proteolysis
GO:00042221.2e-26metalloendopeptidase activity
GO:00082704.3e-05zinc ion binding
KEGG pathway 
InterPro domain[208-405] IPR0240792.2e-49Metallopeptidase, catalytic domain
[208-402] IPR0015901.2e-26Peptidase M12B, ADAM/reprolysin
[691-748] IPR0008841.4e-11Thrombospondin, type 1 repeat
Orthology groupMCL15449 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208900-TA
ATGCGTGAGCGCAGTGTGGCATGTGCCAGCCAGAGAGGCGAAGCCCGCGGCGGCGCGCGGACGATGACGCGCTGGCTGGGGGCTACGACGCTTCTCCTTACTATAGCGACCCTGCACGCTTGGCGACCACCGCCGCATCCCATCCTACTGCCAAATTCTTCCCATATCAGGCTACCGAGACACGCAAGAAGTATACATCATCTGCATATCCTTGGCTGGCATCTGGAATTGCAGGAGAATCGCGCGATTCGATCGCCGTATTACAACCAGTGCCAGTTTTATAAGGGACGTATTTTGTATGAAGAAGAATCAACAGTTACGGTGACAGAATGTGAGGGCCAGCTGTATGGATTGCTGCAGGTTCGCGGCGAGGATTTTGTGTTGCAGCCAAATAGACCGGACGGGAGTCATGTTTTACGACGTCGTGATGTGCTGGTGTCCGAACAGCCAGCGATGTACAATCTCACAGGGGACACAGTGACTGACCTTGACATCGACTTCGAGGAAGATGGACCTAAAACTTCCACCCATGTTCATCCTCGACAAAACTATCAGTCGGACATCGATTATTTTAGAAGCATGCTCCCAGTTACAAGACCAGTGTCAGGTGGTAAAGGCCTCTGGCTGGAGCTAGCCATAGTTGCCGACCACACAATGGTTAAATTCCATGGACGTGAACGAGTGAAACACTATATCTTAGCTATAATGAATATTGTCAGTGCCATCTTTAACGACCACTCCCTTGATTCAAACATGACGCTCGTTATAAACAAACTGTTTCTGTATGAGGAGAAAGATTCAGTGATCAAATATGGAAACGTTAAGAAATCACTTGAGGCAGTCAACAAATGGAACTATCGACACCTCATGAAGCTACCAGCTGATAGCACGGGTTGGGACGCAACAGTATGGCTAACACGTGCGCAGTTAGGGGGACCTTCGGGATTCGCCCCCGTTGGTGGAGTTTGCACCAGGACTAGATCTGCTGCTATTGACAGAGACGAGGGGCTCACTAGTGCTTTCGTCATCGCTCACGAGCTGGGACATCTATTGGGTTTGACACACGATAGCGATGAACAGTGCGAGTCCGAAGGAAACCGAGGTTCAGTGATGGCACCAACAGTGTTGGCAACGCTGCATAATTACGCTTGGTCTTCGTGTTCACGAGAACAGTTCCACGAGAAGTCTAAGAAGTGGTGGTGCCTTCATGAGCGAAGTCAGGATGATGGTGTTGAATTAGGTGGTGCCAAAGAACTTTATAATTACGTTTTTACCATGGATGAACAGTGCCGTACCGAGTTTGGGGAAGGATTCTCAGTATGTAAGTCGGTGAAAGTACGGTCAGCGTGCGCCAAGCTGTGGTGCGCTCATCGGGCTATGCCTCATGTGTGCCGTTCAAAACGAGCGCCACCATTAGAAGGGACGCCTTGCGGAACAAATCAATGGTGTGTGGATCGTGTCTGTGAGCCGATGCCAGGTCACGCTGTTGAGAATAAGAATAAGGAAGTAGAATCCAAAACACCGCAGTGGGGGGAATGGAGCCCGTGGAGCGATTGTAACACCGAGTGTGGATACGGCCTTCGATCACGGACACGGAGATGTAAATATAAAGGATTGGGTTTGACACACGATAGCGATGAACAGTGCGAGTCCGAAGGAAACCGAGGTTCAGTGATGGCACCAACAGTGTTGGCAACGCTGCATAATTACGCTTGGTCTTCGTGTTCACGAGAACAGTTCCACGAGAAGTCTAAGAAGTGGTGGTGCCTTCATGAGCGAAGTCAGGATGATGGTGTTGAATTAGGTGGTGCCAAAGAACTTTATAATTACGTTTTTACCATGGATGAACAGTGCCGTACCGAGTTTGGGGAAGGATTCTCAGTATGTAAGTCGGTGAAAGTACGGTCAGCGTGTGCCAAGCTGTGGTGCGCTCATCGGGCTATGCCTCATGTGTGCCGTTCAAAACGAGCGCCACCATTAGAAGGGACGCCTTGCGGAACAAATCAATGGTGTGTGGATCGTGTCTGTGAGCCGATGCCAGGTCATGCTGTTGAAAATAAGAATAAGGAAGTAGAATCCAAAACACCGCAGTGGGGGGAATGGAGCCCGTGGAGCGATTGTAACACCGAGTGTGGATACGGCCTTCGATCACGGACACGGAGATGTAAATATAAAGGGGTGACTTCGACATTATGCGAAGGTGCGGGGTCCCAAGTATCTACTTGTTGGACCGGAGCACCCTGTGCTAGTACCAGGGATGCGAGAGCTGACGCTTGTTCCAGACAGGCAACAAATTTCATACCTCATCTACACGTTGATGAATCTAATCACTGTGAGACGTGGTGTGTGGATTTCGCCGGTGGAAATCTTTCCAACTTTGGACCTCTACCCGATGGGGCTCCCTGTAGTTATGAGAGGCCTTACGATCTTTGCTATCAGGGCACATGCGTGAAGGGACAGTGCAATGCAACGGATCCGGTGTGCAACTGGTGTCCTGATGGTTATTGCAACAATAACACTCACATATACACAAGACAATTGGGCAAAGGTTGGACACGTTTGACAGTAATACCTCACGAAGCTGCACAGCTCTCAGTTCACATAGCTACCCCCGTACCATTAAACATCGCGATACGTGAGCGACGCCGTGACCGTCCAATACTACAGCTGACAAAACATTCTAGAACCATCGAGATAGATCACAAACAAGACGACAATCACAAACAGGGGCAGGATTATTTAAAGTACTACGTGCCACAGAACTTGCAGATCATAGACGTGGATAGCAATGTTTTAGACATAAAGGAACGCTTCGGGCTCGAAGGAGAAGTGGCAGCCGCTGGTAGTCTCTTGAGATGGACTCAGACTGAATCAGACGTAATGATAACATCCACTTCAAGATTACAAACTGATTTAATGATAATGGCTGTTCCGACCACATCTACTGAAGACACAATCGCTGTGGAAGTATCTGTCAACTACAGTACGCCTGCTGGTCGTACCAGACCTTTGGAGTACAGGTGGTCGAGCGAACGTGGTCCGTGTTCTGTATCTTGTGGTGGTGGATTGCGCGCAGTACGACCACGTTGTCAACGAGGCGGTCAGTCATGCGGCCCGGTGAGATATGAAACGTGCAACTCAAACAGGTGGTCGAGCGAACGTGGTCCGTGTTCTGTATCTTGTGGTGGTGGATTGCGCGCAGTACGACCACGTTGTCAACGAGGCGGTCAGTCATGCGGCCCGGTGAGATATGAAACGTGCAACTCAAACAGTTGTGACAACGTTTGGGCTCCGGCTGAGTGGGAAGAATGCAGCTCTACTTGCGGTCAAGATGGCTACCAAGAGCGTCAGCTGTTTTGTATTCCTTCTAATATAAGTGTAACAACAAAACATGAACTCATCAAACACAGTGTATCTCCCGCTCTCTGCCCAGCGTCAAAACCAGCCAAGACACAACCCTGCAATAGGATACCCTGTCCTGTTTATTGGCAAGAAATGCCTTGGACACCGTGTTCCACAACTTGTGGCCGCGGAGTTTCCCATCGGCCTCTTTCCTGCCCCGCTTCAGACCCTGCTCTCTGTGGACCGAAGCCCCGGGAGCGACGTCGACGCTGTCGTCTCAGAAAGTGCCCCAAACCGCCCGCCCCCTGCCCAGAAACTGACGCAACCCAATACTGCGAGCTTTTCACCAGCGATCAACTGGAACGAAACTGTGTAGTACCGCCCTTTAGAAAATACTGTTGCAACGCCTGTCAGTACATCAGGAAAAGGGGAGAGTAG

Protein sequence:

>DPOGS208900-PA
MRERSVACASQRGEARGGARTMTRWLGATTLLLTIATLHAWRPPPHPILLPNSSHIRLPRHARSIHHLHILGWHLELQENRAIRSPYYNQCQFYKGRILYEEESTVTVTECEGQLYGLLQVRGEDFVLQPNRPDGSHVLRRRDVLVSEQPAMYNLTGDTVTDLDIDFEEDGPKTSTHVHPRQNYQSDIDYFRSMLPVTRPVSGGKGLWLELAIVADHTMVKFHGRERVKHYILAIMNIVSAIFNDHSLDSNMTLVINKLFLYEEKDSVIKYGNVKKSLEAVNKWNYRHLMKLPADSTGWDATVWLTRAQLGGPSGFAPVGGVCTRTRSAAIDRDEGLTSAFVIAHELGHLLGLTHDSDEQCESEGNRGSVMAPTVLATLHNYAWSSCSREQFHEKSKKWWCLHERSQDDGVELGGAKELYNYVFTMDEQCRTEFGEGFSVCKSVKVRSACAKLWCAHRAMPHVCRSKRAPPLEGTPCGTNQWCVDRVCEPMPGHAVENKNKEVESKTPQWGEWSPWSDCNTECGYGLRSRTRRCKYKGLGLTHDSDEQCESEGNRGSVMAPTVLATLHNYAWSSCSREQFHEKSKKWWCLHERSQDDGVELGGAKELYNYVFTMDEQCRTEFGEGFSVCKSVKVRSACAKLWCAHRAMPHVCRSKRAPPLEGTPCGTNQWCVDRVCEPMPGHAVENKNKEVESKTPQWGEWSPWSDCNTECGYGLRSRTRRCKYKGVTSTLCEGAGSQVSTCWTGAPCASTRDARADACSRQATNFIPHLHVDESNHCETWCVDFAGGNLSNFGPLPDGAPCSYERPYDLCYQGTCVKGQCNATDPVCNWCPDGYCNNNTHIYTRQLGKGWTRLTVIPHEAAQLSVHIATPVPLNIAIRERRRDRPILQLTKHSRTIEIDHKQDDNHKQGQDYLKYYVPQNLQIIDVDSNVLDIKERFGLEGEVAAAGSLLRWTQTESDVMITSTSRLQTDLMIMAVPTTSTEDTIAVEVSVNYSTPAGRTRPLEYRWSSERGPCSVSCGGGLRAVRPRCQRGGQSCGPVRYETCNSNRWSSERGPCSVSCGGGLRAVRPRCQRGGQSCGPVRYETCNSNSCDNVWAPAEWEECSSTCGQDGYQERQLFCIPSNISVTTKHELIKHSVSPALCPASKPAKTQPCNRIPCPVYWQEMPWTPCSTTCGRGVSHRPLSCPASDPALCGPKPRERRRRCRLRKCPKPPAPCPETDATQYCELFTSDQLERNCVVPPFRKYCCNACQYIRKRGE-