Monarch geneset OGS2.0

DPOGS204399
Transcript	DPOGS204399-TA	1587 bp
Protein	DPOGS204399-PA	528 aa
Genomic position	DPSCF300002 - 1070191-1075495
RNAseq coverage	550x (Rank: top 23%)
Annotation
Heliconius	HMEL015688	0.0	84.67%
Bombyx	BGIBMGA002582-TA	4e-45	27.99%
Drosophila	CG8728-PA	0.0	62.55%
EBI UniRef50	UniRef50_Q7K3W2	4e-180	62.55%	CG8728 n=28 Tax=Eumetazoa RepID=Q7K3W2_DROME
NCBI RefSeq	XP_317371.3	0.0	64.98%	AGAP008086-PA [Anopheles gambiae str. PEST]
NCBI nr blastp	gi|312383237	0.0	65.59%	hypothetical protein AND_03778 [Anopheles darlingi]
NCBI nr blastx	gi|312383237	0.0	64.50%	hypothetical protein AND_03778 [Anopheles darlingi]
Group
Gene Ontology	GO:0046872	2.9e-65	metal ion binding
	GO:0003824	2.9e-65	catalytic activity
	GO:0006508	1.8e-33	proteolysis
	GO:0004222	1.8e-33	metalloendopeptidase activity
	GO:0008270	4.7e-33	zinc ion binding
KEGG pathway 
InterPro domain	[318-513]	IPR011237	2.9e-65	Peptidase M16, core
	[64-277]	IPR011249	2.9e-37	Metalloenzyme, LuxS/M16 peptidase-like, metal-binding
	[77-226]	IPR011765	1.8e-33	Peptidase M16, N-terminal
	[233-433]	IPR007863	4.7e-33	Peptidase M16, C-terminal
Orthology group	MCL11953	Single-copy universal gene

Nucleotide sequence:

>DPOGS204399-TA
ATGTCTCATGTATCTGACATGAAAATTTTATTTTCTAGGTTACCTACATTTAAACATGGTAGAACATTCAGTCAGAATGTCGAAAGACCACCTCAGCCATTACCAAAAGAAAGTGTCACACCCTTACCTCCATTATCTGAACCAATGTCAAATCTCCCTCCAGTAGTGTATGCTTCTTCAAAATCAGAGAACTATATAACTGAGGTAACAACTCTTAGCAATGGATTAAGAGTGGCTTCTGAGAAAAAATTTGGTCAATTTTGTACTGCTGGTGTGGTTATTGACTCTGGCCCAAGATATGAAGTTGCATATCCAAATGGTATCTGCCATTTTCTTGAAAAATTAAGTTTTGGGGCTACCCACAAATATCCAACCCGTGATGTGATGTTGAGGGAATTGGAGCGTCATGGAGGTATATGTGACTGTCAGGGCTCCAGAGATACTACAGTGTATGCAACCAGTGCTGATTCACGAGGCTTGGAAGCTGTTACTCAGGTGCTCGCTGAAGTGACATTAAGACCAATGCTTTCAAGTGAAGAAATAGAAGGAGCAAGACAGGCTGTTGCATTTGAATTGGAAACACTCTCCATGAGACCCGAACAAGAGACAATATTGATGGATATGATACATGCTGCTGCATATAGAGGGAATACATTAGGTCTACCAAAAATTTGTCCCATGGAAAATGTTTATAAAATTGATAAGGGCATTATTCTAAACTATTTGAAGAATCATTACACGCCGGACAGAATGGTTGTTGCAGCTGTTGGAGTAGACCATGAGCCCTTTGTGGAATATGTTCAAAAGTATTTTGTAGACATGAAGCCTACTTGGCATTCAGAAGACAGTGACACATTCAAAGCACAGACCGACAAATCCATTGCACAATACACTGGTGGATTGGAAAAGGAGGAATGTGAAATTCCATTGTACCCAGGATCGGATTTACCTGAACTGTCCCATGTTGTTATTGGATTGGAGAGTTGCTCTCACAGCGATCCAGACTTTGTGGCAACCTGTGTGTTGAATATGATGATGGGTGGTGGCGGTTCATTCAGTGCTGGTGGTCCAGGCAAGGGAATGTACACTCGCCTTTATACCAATGTATTGAATCGTTATCACTGGATGTTCAATGCAACATCATACAACCACGCATATGGCGACACTGGTCTGATGTGTGTGCACTCCGCTTCACCGCCCGCTAGACTACATGACACTGCGCTAGTTATTGCCCGAGAACTGGCAAACATGGCTGGACACGTCGGGGAGACTGAACTCAGACGTGCCAAAACGCAATTGCAATCGATGTTACTGATGAATCTCGAAGCAAGACCGGTCGTGTTTGAGGACATCGGGAGACAAGTGCTGGCTACCGGCAAACGAAAACCACCATCATTCTTCATTAATGAAATTGAAAAAATAACCGGTGAAGACATTATTCGCGTCGCTCGTCGTATGCTCAGCAAGAAGCCATGTGTAGCTGCGAGAGGCAAACTCTCAAACCTTCCCAGCTTCGAAGACATTCAAGCCAACATGACCGTCAAAAACGACACGAGCGGTAGACGGCTTAATTTCTTCCGCGCATAA

Protein sequence:

>DPOGS204399-PA
MSHVSDMKILFSRLPTFKHGRTFSQNVERPPQPLPKESVTPLPPLSEPMSNLPPVVYASSKSENYITEVTTLSNGLRVASEKKFGQFCTAGVVIDSGPRYEVAYPNGICHFLEKLSFGATHKYPTRDVMLRELERHGGICDCQGSRDTTVYATSADSRGLEAVTQVLAEVTLRPMLSSEEIEGARQAVAFELETLSMRPEQETILMDMIHAAAYRGNTLGLPKICPMENVYKIDKGIILNYLKNHYTPDRMVVAAVGVDHEPFVEYVQKYFVDMKPTWHSEDSDTFKAQTDKSIAQYTGGLEKEECEIPLYPGSDLPELSHVVIGLESCSHSDPDFVATCVLNMMMGGGGSFSAGGPGKGMYTRLYTNVLNRYHWMFNATSYNHAYGDTGLMCVHSASPPARLHDTALVIARELANMAGHVGETELRRAKTQLQSMLLMNLEARPVVFEDIGRQVLATGKRKPPSFFINEIEKITGEDIIRVARRMLSKKPCVAARGKLSNLPSFEDIQANMTVKNDTSGRRLNFFRA-
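
The lengths and domain coordinates listed in this record can be cross-checked with a few lines of Python. The sketch below is illustrative only and rests on assumptions not stated in the record: that the transcript is coding sequence only (no UTRs), that the trailing "-" in the protein sequence stands for the stop codon, and that the InterPro spans are 1-based protein coordinates. The function names are hypothetical helpers, not part of any database tool. Under those assumptions, (528 + 1) x 3 = 1587, which matches the listed transcript length.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1587
PROTEIN_AA = 528

# InterPro domain spans (start, end) from the record.
# Assumption: these are 1-based, inclusive protein coordinates.
DOMAIN_SPANS = {
    "IPR011237": (318, 513),
    "IPR011249": (64, 277),
    "IPR011765": (77, 226),
    "IPR007863": (233, 433),
}


def cds_length_consistent(transcript_bp, protein_aa, has_stop=True):
    """Return True if a CDS of transcript_bp bases encodes protein_aa residues.

    Assumption: the transcript contains only coding sequence; when has_stop
    is True, one extra codon is counted for the stop codon.
    """
    codons = protein_aa + (1 if has_stop else 0)
    return transcript_bp == 3 * codons


def domains_within_protein(spans, protein_aa):
    """Return True if every (start, end) span lies inside 1..protein_aa."""
    return all(1 <= start <= end <= protein_aa for start, end in spans.values())


if __name__ == "__main__":
    print("CDS length consistent:", cds_length_consistent(TRANSCRIPT_BP, PROTEIN_AA))
    print("Domains within protein:", domains_within_protein(DOMAIN_SPANS, PROTEIN_AA))
```

A failed check would not by itself mean the annotation is wrong; it could equally mean that one of the assumptions above (for example, CDS-only transcripts) does not hold for this geneset.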