Monarch geneset OGS2.0

DPOGS204583
TranscriptDPOGS204583-TA3429 bp
ProteinDPOGS204583-PA1142 aa
Genomic positionDPSCF300400 - 105908-140429
RNAseq coverage225x (Rank: top 44%)
Annotation
HeliconiusHMEL0083855e-10754.55% 
BombyxBGIBMGA001435-TA0.073.47% 
DrosophilaCG4096-PB3e-18044.59% 
EBI UniRef50UniRef50_E1ZYT50.054.18%A disintegrin and metalloproteinase with thrombospondin motifs 7 n=8 Tax=Formicidae RepID=E1ZYT5_CAMFO
NCBI RefSeqXP_001602029.10.049.41%PREDICTED: similar to adamts-7 [Nasonia vitripennis]
NCBI nr blastpgi|2700020660.052.82%hypothetical protein TcasGA2_TC001014 [Tribolium castaneum]
NCBI nr blastxgi|2700020660.049.78%hypothetical protein TcasGA2_TC001014 [Tribolium castaneum]
Group
Gene OntologyGO:00065084.3e-32proteolysis
GO:00042224.3e-32metalloendopeptidase activity
GO:00082704.3e-32zinc ion binding
GO:00082373.6e-18metallopeptidase activity
GO:00055783.6e-18proteinaceous extracellular matrix
GO:00310126.3e-18extracellular matrix
KEGG pathway 
InterPro domain[405-624] IPR0240791.5e-66Metallopeptidase, catalytic domain
[64-210] IPR0028704.3e-32Peptidase M12B, propeptide
[411-624] IPR0015907.5e-27Peptidase M12B, ADAM/reprolysin
[707-765] IPR0008844.7e-20Thrombospondin, type 1 repeat
[720-738] IPR0132733.6e-18Peptidase M12B, ADAM-TS
[902-1016] IPR0102946.3e-18ADAM-TS Spacer 1
Orthology groupMCL10258 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204583-TA
ATGGATGTGCGGTGTTCGATATTCAAACTGCTGTTGTTTCAAATTGTGATTATAAGATGTGGAGCAAAACACATACGCACAGCGAGGGTAGAGTTGGCTGGTGATGTGGAGAGGGAAGTCCAGGATAGCATAAGGAACCTTATACACACCGGTATCTATTCACACCAACACCTGGATCATTCTCAAGTGCAAGTGGTGACCCCGGTGAAGGTTTCATCAGAGGGCGAACTTATCTCTCATGAAGTTGAACATGCACACGGGCATGGTCACGCGCGCTCACGCCGTGACCTTCATGGCATAGAACATCATCTGCCCCATTCTCTTCACTACAACCTCACTGTGGATGGACGGAATCTTAGGCTGGACCTAAGGCCCTCAGTAACGTTCATCACTCCAGCGTTGGTTGTAGAGCGACACAGGGTTAACTCGAGAACTCGGGCGAGGCCTCTGGAGGCAGCTACTGCGTGCCACTACACCGGCTCCGTGAGAGGACAGCCCGGCTCGAATGTCGCACTATCAGCCTGCGATGGACTTGCAGGCATGTTGCGGACGAAATATGGAGAATATTGGATTGAGCCATCAAATCAACTAACGACCGACAGCTCGTCTGGGCGACCACATGTTATATTTAAGAGATCTGCTGTCGATAAAGTTGAGGCATTCCATAGAAGAAAGAGGGAAGTCGATAGGAGAATGACTTCTTCGTCTAGAAATAACAATGAAAAGAATTATAAAGAAAAAAATTCAAAACGAAACACGAGAAGAAAGGATAAGGAGGACATGGATCAGCGTAGACGAGCGTATATAGAAGAGAGACGGAGGAGATTAGAGGCAATGAGACGTGATCCAGCAGCATATAGAAGAAATAAAAAGAATCTACGAATGGAGCATAGAAGACCGCTGTCGGAATCAGCGTCTTCATCCGTATCGAGGAGCACATCCATGGAAAATAGTCATTCCTTGGAACAAAATTTAACAACGAGCAGGTCGAGTATTGAACAGCGACCACGAAGGATAAGAACGAGACGAAAGAGACGGAGACGTTCAAAGAACTGTGCAACAAAACAACCACCGTATCAATGGAAGTATAGAAATTTTAAAGACCTTCATTCAAAGGAAGAACGTAGCAAATCAAAATATTACAAGAGAACCAATAAAACTCAACGCAATAATGTAAACAGACGGTATCTAGAAGCATCAAAACGATCGACGCGATCCGTTAGCAAGCCGCGTCACGTTGAAGTGTTACTTGTCGCCGATAAGTCTATGACTGATTTCCACAATCAAGGCAACTTGGAAACTTATCTACTGACAATCATGAACATGGTATCATCCCTGTATATGGATCCATCAATCGGGAACTACATCAAAGTGGTGGTAGTCAAAATTATTTTGGTCGAGGAAATGCACGCGGCACCTGAGCTATCAGTGTCCACTAACGCGGACTCAACCCTCGCGTCCTTCTGTAGGTGGCAGCACCAGCTGAACCCGGACGACGACCACGATCCACACCACCATGACGTCGCCATACTCATCACACGACGGGATATATGCAGTCAGCAGGACACACCTTGTAGTACTTTGGGCGTGGCGCACGTGGCGGGCATGTGCAAGCCGGACCGCAGCTGTTCCGTCAACGAGGACAATGGCATCATGCTGGCGCACACCATCACGCACGAGCTGGGACACAATTTCGGCCTGTACCATGACACCGAGAAAATTGGTTGTCACAGACGCGATGGGGCTACTCTTCATATTATGACACCGATCTTTGAACCAGATACGGTACAAGTCGCCTGGTCACGATGCAGTAAACGGGATGTCACCAACTTTTTAGATGCCGGTTTAGGGGATTGCTTGAGCGACAGACCATCTCAGGAAGAACCGTACGTCTATCCTGAACTACCCGCAGGAGTAACATTTGACGCGGCTTATCAATGTCACTTACAATTCGGTGAGGAAGCCGTAGTGTGTTCCAAGCTGACGGAGCTTTGTGAGCATCTGTGGTGTCTCGTCAACAACACGTGCAAATCGATGTTAAGACCGGCCGCGCCGGGGACAACTTGTGGAGAGAATATGTGGTGTCAGAATCAGACGTGCGTCCCAAAGGCACCGTCCCCCGTACCACGTGACGGTGGTTGGGGCGCCTGGAGCGAGTGGAGCGAATGTTCCCGCACTTGTGGTGCAGGGGTGTCCACTCAACATCGAGAGTGCAACAATCCTGAACCACTCAACAATGGAAACTACTGCATCGGGGATAGGAGCCGGTACAAAGTTTGCAACACCGATCCGTGCCCTATCAATGAGCCGACGTTCAGGGAAGTTCAGTGCTCCAAGTTTAATAACATGACATACAAGAACGAAACCATCTCCGAATGGATACCATACATTGATCAAGATAAACCGTGCGATTTACAATGCGTGCCTCGCAATCGGAATGATATAGAGATGATCGGCAGCTTCGTAACAGACGGAACACCGTGCAGACAGGGTCTCGGAGTTCGGGACATGTGTATAGCTGGCGTGTGCTACAAAGTGGGTTGTGACTGGATCGTGGACTCTGATGCTGAAGAGGACGCGTGCGGTGAGTGCGGAGGAGACGGCTCCGCTTGCGTGTGCTACAAAGTGGGTTGTGACTGGATTGTGGACTCTGATGCTGAAGAGGACGCGTGCGGTGAGTGCGGAGGAGACGGCTCCGCGTGTAGGACAGTACAAGGGATATATAACAAGGATACTACTAGACAGTCTGGTTATAGCGAGGTTGCGGTAATACCAGCTGGTTCTAGAAATGTGAAGATTCAAGAAAAAGTCAGCCCTGGAAACTATATTTCAATCGGAAGCGCTAAATCTAGAAAGATATATCTCACTGGAGCGCGAAATGCAACATTAACTGAGTACTTCGTAGCCGGAGCTCAGGCGATCTATGAAAGGGACCGTGATTGGGAAAAAGTTAGGATCAGTGGACCCCTTGCTGAGGACATAAAGGTTTATCAACGCATCTTCCGCGGCAAGCATCGCAATCCTGGCGTGACGTATCAGTATACAGTGGACCAACCTAAATCATTGGTCCGCTATCGTCTGTCAGACTGGAGCGCCTGTTCTGTAACCTGCGGCCTCGGGCGGATGTACCGACACTACATCTGCGTGGACCAGCACAACCGTCAAGTTGACCTATCCATGTGTTACCACATGGAGCAACCTCGTCTTGGCGCGTTGGTGCAGCAGTGCCGGTCAGTCTCCTGTACACACTGGTGGGTCGGACCATGGAAACCATGCTCCATTTGCCATATGCCGGGCGAGGAAGCTACCAAGAAACGTAATGTCCACTGTGTGAACAAGACATCCAACCAGGTAGTTGACGACTCAGAATGTGATATGTCAACTAAGCCGATAGGCGTTATCAAATGCGCAGACGTACCAGCTTGTTGA

Protein sequence:

>DPOGS204583-PA
MDVRCSIFKLLLFQIVIIRCGAKHIRTARVELAGDVEREVQDSIRNLIHTGIYSHQHLDHSQVQVVTPVKVSSEGELISHEVEHAHGHGHARSRRDLHGIEHHLPHSLHYNLTVDGRNLRLDLRPSVTFITPALVVERHRVNSRTRARPLEAATACHYTGSVRGQPGSNVALSACDGLAGMLRTKYGEYWIEPSNQLTTDSSSGRPHVIFKRSAVDKVEAFHRRKREVDRRMTSSSRNNNEKNYKEKNSKRNTRRKDKEDMDQRRRAYIEERRRRLEAMRRDPAAYRRNKKNLRMEHRRPLSESASSSVSRSTSMENSHSLEQNLTTSRSSIEQRPRRIRTRRKRRRRSKNCATKQPPYQWKYRNFKDLHSKEERSKSKYYKRTNKTQRNNVNRRYLEASKRSTRSVSKPRHVEVLLVADKSMTDFHNQGNLETYLLTIMNMVSSLYMDPSIGNYIKVVVVKIILVEEMHAAPELSVSTNADSTLASFCRWQHQLNPDDDHDPHHHDVAILITRRDICSQQDTPCSTLGVAHVAGMCKPDRSCSVNEDNGIMLAHTITHELGHNFGLYHDTEKIGCHRRDGATLHIMTPIFEPDTVQVAWSRCSKRDVTNFLDAGLGDCLSDRPSQEEPYVYPELPAGVTFDAAYQCHLQFGEEAVVCSKLTELCEHLWCLVNNTCKSMLRPAAPGTTCGENMWCQNQTCVPKAPSPVPRDGGWGAWSEWSECSRTCGAGVSTQHRECNNPEPLNNGNYCIGDRSRYKVCNTDPCPINEPTFREVQCSKFNNMTYKNETISEWIPYIDQDKPCDLQCVPRNRNDIEMIGSFVTDGTPCRQGLGVRDMCIAGVCYKVGCDWIVDSDAEEDACGECGGDGSACVCYKVGCDWIVDSDAEEDACGECGGDGSACRTVQGIYNKDTTRQSGYSEVAVIPAGSRNVKIQEKVSPGNYISIGSAKSRKIYLTGARNATLTEYFVAGAQAIYERDRDWEKVRISGPLAEDIKVYQRIFRGKHRNPGVTYQYTVDQPKSLVRYRLSDWSACSVTCGLGRMYRHYICVDQHNRQVDLSMCYHMEQPRLGALVQQCRSVSCTHWWVGPWKPCSICHMPGEEATKKRNVHCVNKTSNQVVDDSECDMSTKPIGVIKCADVPAC-