Monarch geneset OGS2.0

DPOGS206275
TranscriptDPOGS206275-TA5010 bp
ProteinDPOGS206275-PA1669 aa
Genomic positionDPSCF300290 + 8433-30591
RNAseq coverage516x (Rank: top 24%)
Annotation
HeliconiusHMEL0131780.076.28% 
BombyxBGIBMGA010830-TA0.061.59% 
DrosophilaNeu3-PC0.049.39% 
EBI UniRef50UniRef50_D6X2H80.056.76%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6X2H8_TRICA
NCBI RefSeqXP_966486.10.056.76%PREDICTED: similar to ADAM metalloprotease, partial [Tribolium castaneum]
NCBI nr blastpgi|910936970.056.76%PREDICTED: similar to ADAM metalloprotease, partial [Tribolium castaneum]
NCBI nr blastxgi|2700129940.056.76%hypothetical protein TcasGA2_TC010657 [Tribolium castaneum]
Group
Gene OntologyGO:00065081.5e-54proteolysis
GO:00042221.5e-54metalloendopeptidase activity
GO:00082703.7e-18zinc ion binding
KEGG pathway 
InterPro domain[182-379] IPR0240791.1e-59Metallopeptidase, catalytic domain
[183-379] IPR0015901.5e-54Peptidase M12B, ADAM/reprolysin
[473-619] IPR0065862e-44ADAM, cysteine-rich
[394-472] IPR0017621.5e-26Blood coagulation inhibitor, Disintegrin
[17-117] IPR0028703.7e-18Peptidase M12B, propeptide
Orthology groupMCL10705 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206275-TA
ATGGCGATTGGAGATATCGCTTACCCCAGGGAAGGCATCCATCATCCAGAGCTGGTTATGAAGATGAACTTTGATGGTCGCGAGCACGTCCTTGACTTGAGACTGAACGAGGATCTCATTACCAAGGATCATGTGATAGCATACCAGAAGGATGGGGAGACGGTGATACATCGACCTACATTGAAGGAGCTCGACATATGCCAGTACTCTGGCAAGGTGAGGGACAAGAAAGAATCGTGGGTCGCCGTGTCCACATGCGACGGAGTGAGGGGGATCATTCACGATGGACAGACAATGAGATATATAGAACCAGCCGATAGAAACGAAATCGACTCTCAGCACTATCTATACGAGCACTCGGATCTGAACACCGATTTCCACTGCGGGTACAGCGGAGGCATCACTACCAATGACACGTACGACCCCGAGCTCATGAAGCGACACATGCATAGCAGGAACGTGGAGAAGAGCAGAATAAGTCGGTACAAACGTGATGCGTACGAGGACACAGAGGTGAGGGGTCCGTTCAAGGTCAACAAACTGTCCCGCTTCGTGGAGTTGGTGCTCGTGGCGGACAACAGAGAGTTCAGAGCCAACGGGGAGAGCAAGGAAACGGTGCACAGACAGCTCAAGGACGTCGCTAATATTATTAATTCTGTGTACACCCCGCTTAATATCTTCATAGCGCTAGTGGGTGTTGTCGTGTGGAACGAAAGAGACGAAATACGGTTAGAGGAGGACGGAGATAAAACTCTCACAGAGTTCCTACATTACAGGAAAAGGCTGCTCCCTGTCATGCCCAACGACAACGCACACCTGTTAACCCGTCAGAAATTTAAAGATGGCGTCGTGGGGAAGGCTCTAAAAGGGCCGATATGTACGTACAATTTCTCTGGTGGTGTCGCCACAAACCATTCGGAGGTGATCGGTCTGGTGGCGACCACTATAGCCCACGAGATGGGCCACAACTTTGGCATGGAACACGACACTGAGGCCGACTGCGAGTGTCCCGATGAGAAGTGCATCATGAGCCCCTCCAGTACGTCGGTCACCCCTACCAAATGGTCGTCCTGCAGCTTGAGATCACTCGCGCTGGCGTTCGAGAGAGGCATGGATTACTGTCTGCGTAATAAGCCAAAGCGTCTATTCGAGCCTTCCACTTGCGGCAACGGATTCATTGAACCCGGCGAGCAGTGTGATTGTGGCCTGGCGGGCGATCCAGCCTGCACTGCTTGCTGTGACCCGCGGGCGTGCGTGTTACGCTCTAACGCGACCTGCGCGGCGGGAGAGTGCTGTGATACGACGACTTGTCGTCCGAAGCCGGCGGGGACGGTGTGCAGGGCGGCCGACAAGGAGTGTGATCTGGCGGAGTACTGCAGCGGACACTCGGAGTACTGTCCGCGGGATGTGTACAAGATGGACGCCACGCCCTGTGGGGGAGGGAAAGCGTACTGTGCGGGCGGGTCTTGTCGGACCCACACGGATCAATGCCGACTTCTCTGGGGTTTCTCCGGAGAGAACTCGGACGTTCAATGTTACACCAACTCTAATACTAAAGGGGATAGGAAGGGGAACTGCGGCTACCATCGCGAAGACCCGCCCGTCTACTACAAATGTTCTAAAGAAGATTCTCTCTGCGGTCTGCTGCAGTGTCGCCATCTCAATGAAAGACTCGAATTCGGCATGGAGTCCGTGTCTACACTGTCAGCTGTCTTCATTAATAATAACGGCACGATAATTCCCTGCCGCACGGCCATGGTGGACATGGGCACGAGCGATCCCGACCCGGGCTTCGTACCAGACGGCGCGAAATGTGGAGACGATAAAATGTGTATGAAACATAGATGCGTTTCAATAGCGGAAGTGACGTCAGAGATCGCTCGGAAAGAAACATCCGTCTGTCCGTCCAACTGTTCGGGCCATGGAGTGTGTAACTCAGAAGGACATTGTCACTGCGACTCGGGCTTCGCCCCTCCACTGTGTGAGCTCCCCGGGCCGGGAGGTTCCGTGGACTCCGGACCAGCCACTGACGCTTCAATTCAACGGAACTTCATGGTCGCTATGTACATAATCTTCCTGGGCATCCTGCCGTCCGTGCTGCTGGTGATGCTGCTCATGTACTACTCGCGTCACAACGTGCTGCTGTGCTGGAAGAAACCCAAAAAATCGTACGTAAATAACATTTTCAACGGCGACCGATTCAAAAGATTCAAAACATCGACCGATTCCTTCGTAAGACTAATCAGCTTTAGGCGGACACAGAAGAAAAATATGTGCAGGAAATGTCAGGACGATATATACAGTAATATATGCGAACACAAAGAAAATATAGACAGTACGTGGAGTTTTAACATTAGTTCTAAGATAATCAATATGTTGAATAGTAACAAACACGATGAAAGCAAAAAGTTTCAGAGGAAGATAAACAAAGACGACATCAAAGTCGCCGACGACTTAGATCTAGCTAACGTTAGGGTTAAAGTTGAACCGAAAATCAACAAATCAAACATAGTCATAGTTAAAACCGGCCTGGCCTCGACTACCAACGAACATGTTAAGGCCGAGATCAACACAACCAAACAGGAAGTTAGCTTAGATAGGAACAAAACCAAAAAGAACATTGTTGTGACAAATAAAATTAATACAGATGTCATATATGAAAACTGTATACCGAATAAGACTCCTTTTTCATATTTCAGTGTACAATCGCCAAAGAAGAACAGCTTGCAGCGTCGCCTGTCGCGAAGTGCGACGAAATTTGCTGCAAATTTCCAAAATAATTCACAAAACAACGCCCAGCCCGTCAACGTTCATACTCTGTCGAACTCTGACGACATGAGCTCCAGCCTCCTCAGAAGTGATTCCGATCGTAGCCCTTCAGGCAACATAAACCCCTCGGTAAATTTCTTCGGAAACTTCAAAGGATTCTCACTCACCCCGATGGATAAGAACTCACAAAACGAAACTGATGTCAAAGATAAAAAAGATAATGTACAGAAAAGCGCCAAAATTACACCCGTGCATCGAAGCGGTAGCAACAGTCAGAATATAGCACAGGGATCGAAGCCGATACTGAGATCAGCACCACCGCTGCCTGTGGTTCCGAACACGGCTAAACTAAGCCCGAAAACCAGCCCATCCATCAAAAGAACGAACAGCTCCGTTCAGAATCGCATTAAAGCTTTCATGGGAACAGAAAAGGCTGAAGAAATACCCGTGAACACAGCTCCGAGACCGACAATATCTAGTCCCATTCTGGAAGCATCGACGTGTACAGCGAAAGAACTCATCTCTCCTCTCCAGGGTTCCAAAACCTTGGGTCCTGTCCGCGCCGCTCCTACCGTTCCTAACTTCTCTCCGGACTTACCGAAGAGGCCGTTAAGCATGCACTCAGCGGGAAATGTACCACAGAAACCACTGCCGGAAGAACCGAAGAAAGTTAAAGAAGGCATATCTCTCAATAGGATTGCGTCGTTCCTGAAACAAGATAAACCAAAAGAAAAAGATAGGAACCCTGTGGAGAGAAGCCATTCGCTACCCAAAAATGGTAACAACCAATTAAAAGTCAAAACCGGTGATAAAGTCGCACTGCGCAATTTGCAAATATCTGGTCCTATTTTGCAAAAGGAAATAGATTTACCTGTTACTACTGTCCCAGTCGTTTCGGATTCAGAAGAAGCCGACGATTCGAAGGCCTTCGTAAACAGAGCGCAGAGTATGCGAGCACCTGCCAGCCAAAAGCCAGTCCTACAAAGCTTTGCATCGATGAGACAAGCGCCAGGTGTACCGCGGCCCTTATCGTGTGTGGGAAGACCAACAGCGCCCCCTCCCCCGTTACCATCACAACCGAAAAACGAAGAACAATCCATTTACCAAAATCCGAAAGTGCAAAATGACATCAAATCGACTGATTATGTTGATTGCATAGAGGAAAAACAGGTCCCATTGGCGCACATCGATGAAGAATCTGGGGACAATATTTACGCCATCATAGAAGAAAGTCCCGAAAAGCATTTCAAACCGATGCCGGGACGTCCTCCTAAATCTACACAGGCGCCGTTCGAAGAATACAATGTACCAAAACCTATTACTTCCAATTCGGGAAGCTCTGAAAGTTTAGGTCTACTCGGAGAAATTGTCAATGAAATACAAAACCGCAATTTCGACTCCATTTACTGCACCAATTCTTTGGCGAGAATGAAAGATAAGAACAAAAATACGGATAGTAACAGAGATAGCACGTACATGAACACAGACTATAAAAGCCCGGAGAGCGTTTACAGCAACTCTGAGACAAAATCTAGTGCAGCCTCCACAACTAGCAGCGGCTACCTTCACCCGTCCGCCGTGAACGTACCGACTTACATGCAAAAAGACAGCGATGAACTAGAAATTGAAAAGCCTCCGTCCCCCACATTAAAAACTAATTCGAAAATACCTACGTTTACCAGACAAGTCACCCCGCCGGGGTTAAGAACTTTCAAAAATATACCGCAATCACCGAAGACGACAACGAGGAGTAATCTTAAAACGATTCCGAACAGTCCCGACCTAGTATCGAGCTGTGCTGTTCCCGAAACACAGAATGCTAAAGCTCCGGATGTTATAAACAATAATAAAACAGAACCACCTAAATTAGCGACTAAACCGAATACGACCAAGACGACCGATAACCGACCCCCACTCAAACCGGTTCCGTCGGAGAAGAAACCTAACGTCAAACCAACACCGGTCCCAAAAACTAATTCTGCCTTAAGTATGAACAAAACAGATAAAAATCCTCCCCTCAACAGAACCACTTCTAAGACAGACTCCAACGTTAAGGCGATAGCTGACAGTTTGAACAAAAATCGACCAAAAATTGTCCCAAAGCCTAACAACATACAGAAGACTGAAGCTGTGAAAACAAACGCTACCAAATTATCAGCGAAACCGTCAAACGTTGCAAGTTTGCAGCAAAAATTTGAAAACAGGAAGTCATTAGGAAAAGAAATAAGTGTCAAAAAATAA

Protein sequence:

>DPOGS206275-PA
MAIGDIAYPREGIHHPELVMKMNFDGREHVLDLRLNEDLITKDHVIAYQKDGETVIHRPTLKELDICQYSGKVRDKKESWVAVSTCDGVRGIIHDGQTMRYIEPADRNEIDSQHYLYEHSDLNTDFHCGYSGGITTNDTYDPELMKRHMHSRNVEKSRISRYKRDAYEDTEVRGPFKVNKLSRFVELVLVADNREFRANGESKETVHRQLKDVANIINSVYTPLNIFIALVGVVVWNERDEIRLEEDGDKTLTEFLHYRKRLLPVMPNDNAHLLTRQKFKDGVVGKALKGPICTYNFSGGVATNHSEVIGLVATTIAHEMGHNFGMEHDTEADCECPDEKCIMSPSSTSVTPTKWSSCSLRSLALAFERGMDYCLRNKPKRLFEPSTCGNGFIEPGEQCDCGLAGDPACTACCDPRACVLRSNATCAAGECCDTTTCRPKPAGTVCRAADKECDLAEYCSGHSEYCPRDVYKMDATPCGGGKAYCAGGSCRTHTDQCRLLWGFSGENSDVQCYTNSNTKGDRKGNCGYHREDPPVYYKCSKEDSLCGLLQCRHLNERLEFGMESVSTLSAVFINNNGTIIPCRTAMVDMGTSDPDPGFVPDGAKCGDDKMCMKHRCVSIAEVTSEIARKETSVCPSNCSGHGVCNSEGHCHCDSGFAPPLCELPGPGGSVDSGPATDASIQRNFMVAMYIIFLGILPSVLLVMLLMYYSRHNVLLCWKKPKKSYVNNIFNGDRFKRFKTSTDSFVRLISFRRTQKKNMCRKCQDDIYSNICEHKENIDSTWSFNISSKIINMLNSNKHDESKKFQRKINKDDIKVADDLDLANVRVKVEPKINKSNIVIVKTGLASTTNEHVKAEINTTKQEVSLDRNKTKKNIVVTNKINTDVIYENCIPNKTPFSYFSVQSPKKNSLQRRLSRSATKFAANFQNNSQNNAQPVNVHTLSNSDDMSSSLLRSDSDRSPSGNINPSVNFFGNFKGFSLTPMDKNSQNETDVKDKKDNVQKSAKITPVHRSGSNSQNIAQGSKPILRSAPPLPVVPNTAKLSPKTSPSIKRTNSSVQNRIKAFMGTEKAEEIPVNTAPRPTISSPILEASTCTAKELISPLQGSKTLGPVRAAPTVPNFSPDLPKRPLSMHSAGNVPQKPLPEEPKKVKEGISLNRIASFLKQDKPKEKDRNPVERSHSLPKNGNNQLKVKTGDKVALRNLQISGPILQKEIDLPVTTVPVVSDSEEADDSKAFVNRAQSMRAPASQKPVLQSFASMRQAPGVPRPLSCVGRPTAPPPPLPSQPKNEEQSIYQNPKVQNDIKSTDYVDCIEEKQVPLAHIDEESGDNIYAIIEESPEKHFKPMPGRPPKSTQAPFEEYNVPKPITSNSGSSESLGLLGEIVNEIQNRNFDSIYCTNSLARMKDKNKNTDSNRDSTYMNTDYKSPESVYSNSETKSSAASTTSSGYLHPSAVNVPTYMQKDSDELEIEKPPSPTLKTNSKIPTFTRQVTPPGLRTFKNIPQSPKTTTRSNLKTIPNSPDLVSSCAVPETQNAKAPDVINNNKTEPPKLATKPNTTKTTDNRPPLKPVPSEKKPNVKPTPVPKTNSALSMNKTDKNPPLNRTTSKTDSNVKAIADSLNKNRPKIVPKPNNIQKTEAVKTNATKLSAKPSNVASLQQKFENRKSLGKEISVKK-