Monarch geneset OGS2.0

DPOGS212426
TranscriptDPOGS212426-TA1515 bp
ProteinDPOGS212426-PA504 aa
Genomic positionDPSCF300258 + 31346-34056
RNAseq coverage385x (Rank: top 31%)
Annotation
HeliconiusHMEL0058641e-10053.50% 
BombyxBGIBMGA002885-TA5e-14963.78% 
DrosophilaMmp1-PC3e-6032.69% 
EBI UniRef50UniRef50_E9H7T71e-7834.13%Putative uncharacterized protein n=1 Tax=Daphnia pulex RepID=E9H7T7_DAPPU
NCBI RefSeqXP_554330.39e-8336.51%AGAP003929-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3214612024e-7834.13%hypothetical protein DAPPUDRAFT_326457 [Daphnia pulex]
NCBI nr blastxgi|3214612023e-8234.72%hypothetical protein DAPPUDRAFT_326457 [Daphnia pulex]
Group
Gene OntologyGO:00310125e-49extracellular matrix
GO:00065085e-49proteolysis
GO:00042225e-49metalloendopeptidase activity
GO:00082705e-49zinc ion binding
GO:00082372.2e-34metallopeptidase activity
GO:00081524.8e-16metabolic process
KEGG pathway 
InterPro domain[37-267] IPR0240799.8e-71Metallopeptidase, catalytic domain
[114-265] IPR0018185e-49Peptidase M10, metallopeptidase
[296-490] IPR0005852.8e-47Hemopexin/matrixin
[110-267] IPR0060262.2e-34Peptidase, metallopeptidase
[88-101] IPR0211901.9e-33Peptidase M10A, matrix metallopeptidase
[35-98] IPR0024774.8e-16Peptidoglycan binding-like
[446-489] IPR0184871.2e-09Hemopexin/matrixin, repeat
Orthology groupMCL20736 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212426-TA
ATGGAATTGGATAAATGTGTGTTGGTTGTGCTGTTGCTGCTGGGCTTGCTGAGCCCAGCGGCAGCTCGGACGATATTTCTTGAAGAAGACATTGTCACGTTTGAAGAGGTAAGCTTTTTAAAAAAATATGGTTACCTATCTGAACAAGGGTTTGGCTCCCCCACTTACACGGCGCAATCTATTGCTGAAGCCGTACGAAGTATGCAGACCTTTGCTGGCCTTCCGCCAACTGGACAGCTCGATTCAGAAACCAGAAAGCTATTTAAAAGAAAACGTTGTGGTGTGAAAGATATCGAAACGAAATCAAACAAAATCAAAAGATATATCCTGCAACAAGGATGGGGGAGAAAGGCTATCACATACAGGGTTATCAATGGCTCGAGTACATTAGAGAAGTCCCGCGTAGAGGCTTTAATGGCCAACGCGTTGGCCGTATGGGCGCCACATGGAAACCTACGCTTTAAATCCCTAAGCTCCGCAGCCGACATACAAGTGTCCTTCGCCAGCAAGGATCATGGAGATGGGTTTCCTTTTGACGGGCCAGGTCATGTGGTGGCCCACGCGTTCCCACCTCCCCATGGCGCGATGCATTTTGACGATGATGAGCAATGGGGTGATAACGCCAACGAGGAAGACGAAGACGTCACCGACTTCTTTGCGGTGGCCGTCCATGAGGTCGGCCACGCCCTGGGGCTGTCGCATTCGAATGTTAAATCCTCAGTGATGTATCCATATTACCAAGTTCCCGTCGAAAAGCTACATGAAGATGACATTTTGGGGATGCAGGAACTATATCTGAAGGAGCAGCCATCGTCGGCGGAGATCGCAGCCAGTTTCACAATAGCTCCACGGTCCACAGTGGCTGACAGCGATGAAAACCTCCTGCCAGACCTGTGCTACGCCAACTACGATACTCTACAGGTCATCCAGAACAAGCTGTACGTGTTCGAGGAGGAGTGGGTATGGGTTTTATCCGAAAGAAAGATCATAGAGGAAGGCTATCCGAAGCGGTTCCACGATGTGTTTGTGGGATTGCCGAGGCATTTTAAAGTTATACGCACCATATATGAAAACAGGAACGGCCATATTGTTATATTTTCAGGTCGCAGTTACTTTGCATTCAGTTCGCGTTTCCACCTCATCAAACGCGGGAGAATCACAGATTTTAAGATACCGTCCCGAGTAGCCGAACTGACCACCGTGTTTTTGTCAAATTACAACAATAAGACGTACCTCATAGACGACGAGAGATACTGGCGTTACGACGAAGACACGGAAACCATGGACAAAGGATATCCGAAACAAATGTCAGCGTGGAGGGATGTCCCGTATCCCGTGGACGCAGCACTGATCTGGAAAGGAGACACGTTCTTCTTCCGAGGGCCTCGTTTCTGGCGTTTCGATAACAAGTCTGTGAAGGCGCATCCCTACTACCCGCTGCCCACCGCCGTCGTGTGGTTCCCTTGCGAACCTACACCAGATATAGCTATATATCTCACCAACTCCGAGCCTTGA

Protein sequence:

>DPOGS212426-PA
MELDKCVLVVLLLLGLLSPAAARTIFLEEDIVTFEEVSFLKKYGYLSEQGFGSPTYTAQSIAEAVRSMQTFAGLPPTGQLDSETRKLFKRKRCGVKDIETKSNKIKRYILQQGWGRKAITYRVINGSSTLEKSRVEALMANALAVWAPHGNLRFKSLSSAADIQVSFASKDHGDGFPFDGPGHVVAHAFPPPHGAMHFDDDEQWGDNANEEDEDVTDFFAVAVHEVGHALGLSHSNVKSSVMYPYYQVPVEKLHEDDILGMQELYLKEQPSSAEIAASFTIAPRSTVADSDENLLPDLCYANYDTLQVIQNKLYVFEEEWVWVLSERKIIEEGYPKRFHDVFVGLPRHFKVIRTIYENRNGHIVIFSGRSYFAFSSRFHLIKRGRITDFKIPSRVAELTTVFLSNYNNKTYLIDDERYWRYDEDTETMDKGYPKQMSAWRDVPYPVDAALIWKGDTFFFRGPRFWRFDNKSVKAHPYYPLPTAVVWFPCEPTPDIAIYLTNSEP-