Monarch geneset OGS2.0

DPOGS201745
Transcript: DPOGS201745-TA; 2442 bp
Protein: DPOGS201745-PA; 813 aa
Genomic position: DPSCF300279 - 131843-147109
RNAseq coverage: 23x (Rank: top 78%)
Annotation
Heliconius: HMEL006703; 0.0; 74.10%
Bombyx: BGIBMGA002645-TA; 0.0; 61.83%
Drosophila: stl-PC; 8e-51; 30.30%
EBI UniRef50: UniRef50_E2A767; 2e-156; 40.65%; A disintegrin and metalloproteinase with thrombospondin motifs 14 n=5 Tax=Formicidae RepID=E2A767_CAMFO
NCBI RefSeq: XP_001121221.1; 4e-155; 41.10%; PREDICTED: similar to CG3622-PB, isoform B [Apis mellifera]
NCBI nr blastp: gi|350421209; 4e-165; 42.47%; PREDICTED: A disintegrin and metalloproteinase with thrombospondin motifs 2-like [Bombus impatiens]
NCBI nr blastx: gi|383863909; 3e-165; 41.82%; PREDICTED: A disintegrin and metalloproteinase with thrombospondin motifs 2-like [Megachile rotundata]
Group
Gene Ontology:
  GO:0006508; 1.5e-14; proteolysis
  GO:0004222; 1.5e-14; metalloendopeptidase activity
  GO:0008270; 3.3e-05; zinc ion binding
KEGG pathway 
InterPro domain:
  [205-458] IPR024079; 4.3e-43; Metallopeptidase, catalytic domain
  [208-417] IPR001590; 1.5e-14; Peptidase M12B, ADAM/reprolysin
Orthology group: MCL18995; Insect specific

Nucleotide sequence:

>DPOGS201745-TA
ATGGCGCGAACGGTTGGAATAGTGTTTGTAGCACTGTTTTTCATGCTACGAGTCACCACGGCTGCGCGTTTTCCAGACATATCGGCTTTCAACGGGTGGTGGTCAAAAGGGGTCCACCGAGAGGATGATTCAAAAGGGAACAAAGACGTAGAAGTGGTATACCTTCCAGCGCTAATACCTCGTGAAGCACAGGTTGCTGAAGATTCTGCACAAAATGACGTTCCACTACCTTACAGCTTCGAAGCCTTCGGGAAGAACTTCGACCTCCAACTCTTACCGAATAGGCGACTCGTGTCTCCTCAGTTCCGAGTATGGTCCGAGGACGGCCCCGAGGCCCCCTTGTCGGTTCCCGATTCTTCTTGCCACTTCCTTCATTTATTTTATTTAAAATGTCCGAAACAACACGGTCTTATCTTGACTGATAATTCCACATATGAAGTGCGACCTTTAAAGACTGGAGAAGGAAGGTCAGAACACGGGAAACATCATAGAGATCGCAGGGCTCATATCATCCGTCGTGCGACTCCTCCTCTTATGACTGTCAACGATGACCGTCCACTGAGACACAGAGCTCGTCGCCCCCGTCTCAATATTAACAAACCACCGCCTTCTTCCTACACTGTCGAAATAGCACTGTTCCTGGACGAAGCCGCTTACAAAATATTTCATCCTCATCTAAATTACAATGAGGCTGATCTACGAGATATGTTGTTGGCGTACATTAATGGGGTGCAGGCCTTATATCACCATTCGTCTCTGGGGACCCGTGTTCAGCTGTCTCTGGTGAGACTAACTTTGCTTCGGACACAACCAGCGGCTCTATCGTTGCAGGCGGAGCGCGGTCGTTTGTTGGACTCATTCTGCGCATACCAGCGATCGCTGAACGTAGATGATGACGACGACCCTGAACATTGGGATATGGCTTTATTACTTTCTGGGTTAGACTTTTATTCAGAGGAAGGAGGTCGTCGGAACGGCGTGACGATGGGTCTAGCTCCTGTGGGAGGAGTTTGTCTCCCGGCACACGCGTGTGTCGTCGCTGAGTTCGGAGCCGCGGACACACTCGGGAGACCGTATCCCTCTGCTGGGTTCACATCCGTCTACATTCTAGCACATGAGATCGGACACAATCTGGGCATGCATCATGATGGGACTGGTAATGCGTGTTCTCGCGACGGCTACATCATGTCTCCATCGCGCGGCACCAACGGCGAAGCGACCTGGTCACACTGCAGCGCACAAGTCGTCGCTGACTTGAAATGGGCAACATGTTTATTCGATGGCGGTGACGATCCAGACATACCACCACAACTAGAGCATGAGAGATTTGGTGACGCCCCGGGACTTGTTTGGGTCGCGAAGAAACAGTGCGAAGTGCTCCTCCGCGATAAGGACGCGACGCCAGCGTCTCCAGAACCTGGTGTGAGTGTGTGTATGCAGCTGGCGTGTAGGACTCCTCACCGAGCAGGGTTTTATTACGCCGGACCCGCGCTCCCAGGAACACCCTGTGCACCGGGGAAGGTGCTCCTCCGCGATAAGGACGCGACGCCAGCGTCTCCAGAACCTGGTGTGAGTGTGTGTATGCAGCTGGCGTGTAGGACTCCTCACCGAGCAGGGTTTTATTACGCCGGACCCGCGCTCCCAGGAACACCCTGTGCACCGGGGAAGTGGTGTCATGGCGGGGAGTGCGTGGCTGCTGATCCTACAGTGGCAGCCCTGCCTCCCGTAGTGAGTGACAGCGGCAGTTCCTGGAGTGAGTGGTCTTCAGGATCGTGTCGTTCAGGCTGCACACTGGAGGGCTTAGGAGCGGTGGAGAAACGACGCACTTGTCCTCAGAACGCAATTTGCGCAGGACCTTCTTATGATGTGGCACTTTGTGATGATTCGAAGGTGTGCGGTAAGAAACGGCGCACAAGTGCGAGTGAGTTGGCCGGTCGTCGATGTGCTCAGTACGCAGCGCGCATCCCAGCTCTTGATGCAAGAGGAGGTGGTCTACAGGC
GCCTCATGATCCTACTCGCATGTGGATGGGATGCGCGATCTTCTGTCGTCGTGCGAGCGGCGGCGGGTTCTACGCGCCTCGGGTTGAGCTGAACGATGCTGGACTGGATCCTTACTTCCCCGACGGCACGTGGTGCCATCACGACGGACAGAACCACTACTACTGCCTTCAACACCACTGTTTGCCAGAGAATTTCAAGATGTCAGCTCAGTACCACATCTGGGAGTTACCGAGCGAGGATGTCGGTGGATCTTTCAACGCCAGGGCACGCGCGGCGCCTGATGACGGAGCCTCTGCAGCCCTTCGTGCTTACATGACCCTGGACGACGCTGGAGCACCTCTCTTCAGAGCCGCCATACCACCACACATCCCAGAGGAGCCTGAGAGCGACTGGGAAGTAATTGATTATGTCGAAATACCAGCCAGAAACAACACAGATTGA

Protein sequence:

>DPOGS201745-PA
MARTVGIVFVALFFMLRVTTAARFPDISAFNGWWSKGVHREDDSKGNKDVEVVYLPALIPREAQVAEDSAQNDVPLPYSFEAFGKNFDLQLLPNRRLVSPQFRVWSEDGPEAPLSVPDSSCHFLHLFYLKCPKQHGLILTDNSTYEVRPLKTGEGRSEHGKHHRDRRAHIIRRATPPLMTVNDDRPLRHRARRPRLNINKPPPSSYTVEIALFLDEAAYKIFHPHLNYNEADLRDMLLAYINGVQALYHHSSLGTRVQLSLVRLTLLRTQPAALSLQAERGRLLDSFCAYQRSLNVDDDDDPEHWDMALLLSGLDFYSEEGGRRNGVTMGLAPVGGVCLPAHACVVAEFGAADTLGRPYPSAGFTSVYILAHEIGHNLGMHHDGTGNACSRDGYIMSPSRGTNGEATWSHCSAQVVADLKWATCLFDGGDDPDIPPQLEHERFGDAPGLVWVAKKQCEVLLRDKDATPASPEPGVSVCMQLACRTPHRAGFYYAGPALPGTPCAPGKVLLRDKDATPASPEPGVSVCMQLACRTPHRAGFYYAGPALPGTPCAPGKWCHGGECVAADPTVAALPPVVSDSGSSWSEWSSGSCRSGCTLEGLGAVEKRRTCPQNAICAGPSYDVALCDDSKVCGKKRRTSASELAGRRCAQYAARIPALDARGGGLQAPHDPTRMWMGCAIFCRRASGGGFYAPRVELNDAGLDPYFPDGTWCHHDGQNHYYCLQHHCLPENFKMSAQYHIWELPSEDVGGSFNARARAAPDDGASAALRAYMTLDDAGAPLFRAAIPPHIPEEPESDWEVIDYVEIPARNNTD-