Monarch geneset OGS2.0

DPOGS200771
TranscriptDPOGS200771-TA2886 bp
ProteinDPOGS200771-PA961 aa
Genomic positionDPSCF300318 + 96008-104129
RNAseq coverage48x (Rank: top 70%)
Annotation
HeliconiusHMEL0110600.067.44% 
BombyxBGIBMGA001969-TA0.077.91% 
Drosophilastl-PC5e-14738.27% 
EBI UniRef50UniRef50_D6WRW70.048.47%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WRW7_TRICA
NCBI RefSeqXP_975032.10.049.95%PREDICTED: similar to AGAP004961-PA [Tribolium castaneum]
NCBI nr blastpgi|910858530.049.95%PREDICTED: similar to AGAP004961-PA [Tribolium castaneum]
NCBI nr blastxgi|910858530.049.95%PREDICTED: similar to AGAP004961-PA [Tribolium castaneum]
Group
Gene OntologyGO:00065085.9e-21proteolysis
GO:00042225.9e-21metalloendopeptidase activity
GO:00082701.9e-06zinc ion binding
KEGG pathway 
InterPro domain[249-463] IPR0240796.4e-54Metallopeptidase, catalytic domain
[249-463] IPR0015905.9e-21Peptidase M12B, ADAM/reprolysin
[552-614] IPR0008841.9e-07Thrombospondin, type 1 repeat
[40-159] IPR0028701.9e-06Peptidase M12B, propeptide
Orthology groupMCL17908 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200771-TA
ATGTCATGTAAATTGCTGGTTTTTTTCATATCCTTTTGTTCTGTTCAGGTCGAATCCAAAGAGTTATTCCGTCATTTATCAGAATCAGAGAAAATATTTCTATTTGGAACCGCAGACAAAGATCATCAGTTTGAAATAGCTCATCTAAAAAGACCAAGACGTCGCAAACGATCAGTTGGTACGGAGGATAGTCAGGATTTAGTCAGCCTGAAGAAGCATTTAAATGTCCAGCTAAGCCCCAGGCACAGTTTCATAGATCCACAGTTCCTGTTGATGAAGAGATGGTCCAATGGAACTGAGGCTATTCTGGATGAACACACTAACGACGAATTAGATTTATGTTTTTATAAAAGTGATAACGCTGCCTTATATATCTGTGATGACGTGAGAGGTGTAGTCAAATCGAATGATAGTTTTTATACTATTCACCCACTGCCAGAAAGATTCCACAACGAACATTCTAAAGCTCATTTAATTATAAAGAGAAGTAAGGAAATACTAGCAAGCGATGATGTTGAAGGAGAATGCATAGATAAAAATGATTTGAATGTGAAAAACTATGACCCAATAAAATGGTCTAAACACAGACGAAAACGTCAAGTCGAAAATAGTCCAGACGTTAGTTTAGATAAATTCAAATTAAACTTAGAACATAGAAATAGAACAATAAGTGATTTTGTAGAAAAATTAGTAACAAGAGACAATCAAACCGATAGGCAGAAAAGATCGGTGCTGCCAGCGATATTTGTAGAAACAGCTGTATTTGTAGATAGAGATTTATACAAACTAATGACAATCAACTTCCCAAAAGATACAGAAAGGGAATTAGTGAGATTTGTTTTAGCAATGATAAACGCTGTGCAGTTGATATATCACGATCAAAGCTTGGGCAGGCCAGTCAATTTTATACTCAAGAGATTGGAAATCCTTCATGAAGATCCATCCAATCTGAAGAGACCTCACGATATAGATAGGTTTTTAAGCAATTTCTGTACGTGGCAGAGGTTGGAAAATCCTCCCGGTGACAACGACCCCCTGCACTGGGACCACGCCTTAATACTCACTGGTTTAGATCTGTATGTTGTTAACAAGAATGGTAAAGTTAGCAGTCAAGTTGTCGGTCTGGCACCAGTAGCGGGAATGTGCACAGTAACCAGTAGTTGCACAGTCAATGAAGGGAGACACTTTGAAAGTGTTTATGTCGTGGCGCATGAGATTGGTCACAATCTGGGTATGCGCCACGACGGTCCGAAGGCGGACAACAGCTGCGACCCGAGCGCGTACATCATGAGCCCGACCCTCGGAAGTGGAAAGATTACCTGGTCCCAATGCAGTAGGAACTATCTACAGAAGTTTCTAGACACATCACAGTCTAGATGTCTGTTGGATCACGGCAACTCCGCCGGCCAATTGGATCACAGCGCTGAGGGCATCCTGCCGGGAGAAAGATTTGATGCTGACCAGCAATGTATGCTAAAATACGGTCGCGGCAGTCGACACTCCTCAGCCCAAGACCTGAAGGACGTGTGTCGCGACCTGCACTGCCAGAGGGAGCGGTACACCTGGACGTCACATCCAGCACTAGAAGGCACGCGATGTGGAGACGACATGTACTGTCGTGGCGGAGAGTGCGTGAGTCGCGGGGGGAGAGGTCCGGGCGCGCTGTGGGGTCCTTGGTCGCGGTGGTCGGAGTGCGCGTCCGCCTGCCTCCTCCACGAGGACCAGACCCCCGCCGCCGCCGGAGTGTCCGTCGCCACCCGTCGGTGTAACAGACCGAGGCTCGAAAGCGGTAAAAGCTACTGTCAAGGTCACGACAAAAAATACCAGTCCTGCCACGCGGAACAGTGTAAGACGGTTCCCAGGATGACCGTAGGGGAATTCGCCGACCAGATCTGCCGCCGGGCGAGGGACGTCGACCCTGACCTCATCGGCACCGGACTACAGAGACTCGACACTGATGACAGTCACAGTACGTGTGCCGTGTGGTGCGACACTCGCGGCGGCGGGTACAAGTCACGCGGCTGGACCCTTCCCGATGGAACTGCTTGCTCCACTGTAGCGAATAAATACTGTATCAGTGGGGTTTGTAGGAAGTTCTCATGTCTCGGCAACGCAGACTCGGAGTTCAGTCTGGCGTCCGGCGACTGTGAGTGGGAAGCGCTCCACGTACAGACAGCCACTCCTCACACCAAGTCGTCTGGGACGTGGCGGCGTGAGGTTGGCCGGTGGATCGCGACCTCCGCGTGTCATTACCGCTGTATGCTGAGAGGGTCCGGACTCAGGCTGGTCAGGGCACAGCAGCGGACCTCCATACAGCTGTGTACACCAGACACGGACACCGCAGATAGAGGTTGTAGCGACCACATGAGCCCTTACCAGTTCGCTACGATGGTGTGCTCCAAGTACAAGGAGCGTGTGAGACGTCTCTCCGGTCTGGGGATGCAGATATCACCAGCACTTGAGGAGCCCGACCGACCGTGCCGCGTGGCGTGTCAGGACGAGCGTGTGTCTCATAGATTCTATTTAGTGAACGGAGCGGACGGCTGGTTCCCTCTCGGCACCTCCTGCGGCGGGAACAACTCTTACTGCGAATTCGGTTCCGATCTAACTCCTTTGTCTGAGATGGTGTTCACTCTTCCACTCTTGAACCGTGGTTCCGTCCGTCATCGTCGGAGTCTCTATCGGGATCGCCTGACAGTCAGGACGACCCTCCACAGGCAACACCTGGAGGATATCATAGCTAGGCTCAACTTGACCGATAACACGAGAGGGAGCGCGTATTTTCAAACGATACCAGAAAATATAGAAATTGATTTTACAAATCCGATACACATATCACCAGAGGACACTATGCTTAGGAAAAAACCGAGACCTACTTGGGACTAA

Protein sequence:

>DPOGS200771-PA
MSCKLLVFFISFCSVQVESKELFRHLSESEKIFLFGTADKDHQFEIAHLKRPRRRKRSVGTEDSQDLVSLKKHLNVQLSPRHSFIDPQFLLMKRWSNGTEAILDEHTNDELDLCFYKSDNAALYICDDVRGVVKSNDSFYTIHPLPERFHNEHSKAHLIIKRSKEILASDDVEGECIDKNDLNVKNYDPIKWSKHRRKRQVENSPDVSLDKFKLNLEHRNRTISDFVEKLVTRDNQTDRQKRSVLPAIFVETAVFVDRDLYKLMTINFPKDTERELVRFVLAMINAVQLIYHDQSLGRPVNFILKRLEILHEDPSNLKRPHDIDRFLSNFCTWQRLENPPGDNDPLHWDHALILTGLDLYVVNKNGKVSSQVVGLAPVAGMCTVTSSCTVNEGRHFESVYVVAHEIGHNLGMRHDGPKADNSCDPSAYIMSPTLGSGKITWSQCSRNYLQKFLDTSQSRCLLDHGNSAGQLDHSAEGILPGERFDADQQCMLKYGRGSRHSSAQDLKDVCRDLHCQRERYTWTSHPALEGTRCGDDMYCRGGECVSRGGRGPGALWGPWSRWSECASACLLHEDQTPAAAGVSVATRRCNRPRLESGKSYCQGHDKKYQSCHAEQCKTVPRMTVGEFADQICRRARDVDPDLIGTGLQRLDTDDSHSTCAVWCDTRGGGYKSRGWTLPDGTACSTVANKYCISGVCRKFSCLGNADSEFSLASGDCEWEALHVQTATPHTKSSGTWRREVGRWIATSACHYRCMLRGSGLRLVRAQQRTSIQLCTPDTDTADRGCSDHMSPYQFATMVCSKYKERVRRLSGLGMQISPALEEPDRPCRVACQDERVSHRFYLVNGADGWFPLGTSCGGNNSYCEFGSDLTPLSEMVFTLPLLNRGSVRHRRSLYRDRLTVRTTLHRQHLEDIIARLNLTDNTRGSAYFQTIPENIEIDFTNPIHISPEDTMLRKKPRPTWD-