Monarch geneset OGS2.0

DPOGS203321
Transcript        DPOGS203321-TA  1341 bp
Protein           DPOGS203321-PA  446 aa
Genomic position  DPSCF300003 - 843316-844656
RNAseq coverage   24x (Rank: top 77%)
Annotation
Heliconius        HMEL008816        6e-119  47.67%
Bombyx            BGIBMGA002081-TA  5e-118  46.73%
Drosophila        CG3731-PB         1e-99   41.16%
EBI UniRef50      UniRef50_Q9VFF0   2e-97   41.16%  CG3731, isoform A n=15 Tax=Eukaryota RepID=Q9VFF0_DROME
NCBI RefSeq       XP_309120.3       2e-100  41.57%  AGAP000935-PA [Anopheles gambiae str. PEST]
NCBI nr blastp    gi|347964781      3e-99   41.57%  AGAP000935-PA [Anopheles gambiae str. PEST]
NCBI nr blastx    gi|347964781      3e-95   41.57%  AGAP000935-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology     GO:0046872  2.9e-55  metal ion binding
                  GO:0003824  2.9e-55  catalytic activity
                  GO:0006508  3.1e-26  proteolysis
                  GO:0004222  3.1e-26  metalloendopeptidase activity
                  GO:0008270  6.4e-23  zinc ion binding
KEGG pathway 
InterPro domain   [233-445]  IPR011237  2.9e-55  Peptidase M16, core
                  [16-230]   IPR011249  4e-37    Metalloenzyme, LuxS/M16 peptidase-like, metal-binding
                  [27-172]   IPR011765  3.1e-26  Peptidase M16, N-terminal
                  [183-359]  IPR007863  6.4e-23  Peptidase M16, C-terminal
Orthology group   MCL20690  Patchy

Nucleotide sequence:

>DPOGS203321-TA
ATGGCCAAAAATTTAGACAAAAGTAGGCTACGTAAGTTTTATTCACCCGTTCATATTACAAATTTGTCCAATGGTTTGAGAGTGGTCACTGAAGAGAGCAGTTCACCACTAGTTTGTACCTCGCTTTTCACTAAAGCGGGATCGAGATTTGAAAATGAAAACAATAATGGTATAGCATACTTCGTAGAACATTTGGCCTATCAAGGCTTTACTTCGATGAGCAATTACGCTTTACAAGATGCAATTAGGAATGGGGGTTCAAAAATGTCTGCTTTTACATCGAGAGATCATCAAGTTTTTTCTGCTGTCGGATTGAAGGAAAGTATTAAACTAAATATAAGTTTGTTGGCTCAAATTATTTGTCAAATAGACTTATCTGAATGTATGATAGAATCACAAAAACAACAATTGTGCTTTGAAGCTATTGAAAATGACAATAACAGTAGATTGGTACTATTTGATTATTTACACCAGACGGCATTTCAAGAGACTCCTTTGGCTCAAACTGTTATGGGTCTTTCTAGAAATTTTTGTAGATTTGATATACGTGATATATGCTCTTATTTTTATCATAATTATCGACCACATCGGATGACTTTAGCTACATCTGGTGGTGTGTCACATGGTGCTGTTGTTGAATATGCTGAAAATTACTTTAATGTTATAAAAGAAAGCGATACTAAAGTTATAAATTTTGGACCAAAGCGTTATACAGGGTCTTCAATCGTATATCGCAATGATGCGTTGCCCGTTGCTCATGTTGCTATTGCTGTAGAAGCACCTGGATATAACAGCCCTGAATATTTACCACTTCTACTGGCCAGCTGTCTAAATGATTCCTGGGAGAGAACACAAGGCGGGGGAGACCGACATGGTTCATTTTTAGCTCGTGCCGCATCAACGTCAAGTTTGTGTGAGAAATTTGAATCATTTTACATAGCGTACCACGATGTCGGTTTGTGGGGTGTTTACTTTGTAGGAACAAACGATTTGGACGACATGGTTTATAATATACAAAAGGATTGGATGAATACATGCACTTCTGTACAGAAGACGGATGTCGACAGAGCGTCTCAACTTTTGAAATTCAAACTAGCGAAAAATGTCGAAGGTGTTGTTAAATCTTCTTATGATATTGGTTTGCAAATGATGTATACATCTAGCCGAAGAAATCTTTGTCAAATATATCAAGATTTATCATCGATCACAGTTGAAAGATTAAGAGACGTTGCTTTTAAATATTTATACGACAAATGCCCAGTGGTTGCAGCTGTTGGACCAACAGAAACTCTACCAGACTATAACGGAATCAGATCTGGAATGTACTGGTTAAGATTATGA

Protein sequence:

>DPOGS203321-PA
MAKNLDKSRLRKFYSPVHITNLSNGLRVVTEESSSPLVCTSLFTKAGSRFENENNNGIAYFVEHLAYQGFTSMSNYALQDAIRNGGSKMSAFTSRDHQVFSAVGLKESIKLNISLLAQIICQIDLSECMIESQKQQLCFEAIENDNNSRLVLFDYLHQTAFQETPLAQTVMGLSRNFCRFDIRDICSYFYHNYRPHRMTLATSGGVSHGAVVEYAENYFNVIKESDTKVINFGPKRYTGSSIVYRNDALPVAHVAIAVEAPGYNSPEYLPLLLASCLNDSWERTQGGGDRHGSFLARAASTSSLCEKFESFYIAYHDVGLWGVYFVGTNDLDDMVYNIQKDWMNTCTSVQKTDVDRASQLLKFKLAKNVEGVVKSSYDIGLQMMYTSSRRNLCQIYQDLSSITVERLRDVAFKYLYDKCPVVAAVGPTETLPDYNGIRSGMYWLRL-
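
The lengths reported in this record can be cross-checked against each other. The sketch below is only an illustration: it uses the numbers stated in the record (1341 bp, 446 aa, positions 843316-844656). It assumes that the trailing "-" in the protein sequence marks a stop codon and that the genomic coordinates are inclusive. The helper names are hypothetical and not part of any database tool.

```python
# Values copied from the record; nothing here is read from the sequences themselves.
TRANSCRIPT_BP = 1341
PROTEIN_AA = 446
GENOMIC_START = 843316
GENOMIC_END = 844656


def coding_length(n_residues: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode n_residues, optionally plus one stop codon."""
    codons = n_residues + (1 if include_stop else 0)
    return 3 * codons


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming both ends are inclusive."""
    return end - start + 1


if __name__ == "__main__":
    # 446 residues plus one stop codon should account for the whole transcript.
    print("CDS matches transcript:", coding_length(PROTEIN_AA) == TRANSCRIPT_BP)
    # A genomic span equal to the transcript length would suggest no introns.
    print("Span matches transcript:", span_length(GENOMIC_START, GENOMIC_END) == TRANSCRIPT_BP)
```

Under these assumptions both checks agree with the record: 3 x (446 + 1) = 1341, and 844656 - 843316 + 1 = 1341.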