Monarch geneset OGS2.0

DPOGS201428
Transcript        DPOGS201428-TA   1005 bp
Protein           DPOGS201428-PA   334 aa
Genomic position  DPSCF300006 - 1383403-1384806
RNAseq coverage   5079x (Rank: top 2%)
Annotation
Heliconius      HMEL009086        7e-138  88.76%
Bombyx          BGIBMGA002582-TA  2e-131  84.40%
Drosophila      CG3731-PB         6e-115  75.10%
EBI UniRef50    UniRef50_Q9VFF0   8e-113  75.10%  CG3731, isoform A n=15 Tax=Eukaryota RepID=Q9VFF0_DROME
NCBI RefSeq     XP_001998997.1    6e-115  75.10%  GI23318 [Drosophila mojavensis]
NCBI nr blastp  gi|289742983      9e-114  71.05%  mitochondrial processing peptidase beta subunit [Glossina morsitans morsitans]
NCBI nr blastx  gi|289742983      1e-109  71.05%  mitochondrial processing peptidase beta subunit [Glossina morsitans morsitans]
Group
Gene Ontology    GO:0046872  2.4e-93  metal ion binding
                 GO:0003824  2.4e-93  catalytic activity
                 GO:0006508  5e-31    proteolysis
                 GO:0004222  5e-31    metalloendopeptidase activity
                 GO:0008270  5e-31    zinc ion binding
KEGG pathway 
InterPro domain  [117-333]  IPR011237  2.4e-93  Peptidase M16, core
                 [121-325]  IPR011249  4.6e-45  Metalloenzyme, LuxS/M16 peptidase-like, metal-binding
                 [83-249]   IPR007863  5e-31    Peptidase M16, C-terminal
Orthology group  MCL11541   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201428-TA
ATGTTGAAAGTCGCAAATACTTTACGATTGGTATCTAAAAAGGGGAACAATGCTCGAGCCCTGGCTACAGCAGTTGGTTACAAACAGGCCTTAGTTAATGTTCCCCCAACTCAGTTAACAGTACTGGATAATGGAATCCGCATTGCGTCTGAAGATTCTGGCGCTGCTACTGCGACCGTGGGGCTGTGGATCGACGCTGGATCAAGGTATGAGACATCCAAAAACAATGGTGTTGCACATTTCTTAGAACACATGGCTTTCAAAGGAGCTGGAGGAATTGAACATGGTAAACTTGTAGACCTAGCCCAAAAACATCTTGGTGGTTTGAAGAACACTCCAGTCGATGTCCCCGAATTAGCTCCTTGCCGTTATACTGGTTCAGAAATTAGGGTCCGTGATGATTCCATGCCTTTGGCTCATATTGCCATTGCTGTTGAGGGTGCTGGCTGGACTGACCCTGATAACATTCCTTTAATGGTTGCCAATACACTTGTTGGGGCTTGGGATCGTTCTCAAGGTGGTGGCACCAACAATGCATCTTATCTGGCGAGAGCTGCATCTGCTGGTAATCTATGCCACAGCTTCCAATCATTCAATACTTGTTACAAGGACACCGGGCTCTGGGGCATCTACTATGTTGCTGAGCCCATGCAGATTGAGGATATGTTATTCAACATTCAACACGAATGGATGAAACTGTGTACGTCCGTGACCGAAGGTGAAGTCGAAAGGGCAAAGAACATCCTTAAGACAAACATGCTGTTGCAACTGGACGGAACAACCCCAGTATGTGAGGATATTGGTCGCCAGATTCTTTGCTACAACCGCCGCATTCCCATCCACGAACTCGATGCGCGCATCAATGCTGTGACAGCTCAGAATGTCCGCGATGTTTGTTACAAGTTCATCTACGACCGTTGTCCAGCCGTAGCTGCTGTTGGACCCACCGAGGCCCTGTTAGATTACACCAGAATCCGTGCCGGAATGTACTGGCTCAGGGCGTAA

Protein sequence:

>DPOGS201428-PA
MLKVANTLRLVSKKGNNARALATAVGYKQALVNVPPTQLTVLDNGIRIASEDSGAATATVGLWIDAGSRYETSKNNGVAHFLEHMAFKGAGGIEHGKLVDLAQKHLGGLKNTPVDVPELAPCRYTGSEIRVRDDSMPLAHIAIAVEGAGWTDPDNIPLMVANTLVGAWDRSQGGGTNNASYLARAASAGNLCHSFQSFNTCYKDTGLWGIYYVAEPMQIEDMLFNIQHEWMKLCTSVTEGEVERAKNILKTNMLLQLDGTTPVCEDIGRQILCYNRRIPIHELDARINAVTAQNVRDVCYKFIYDRCPAVAAVGPTEALLDYTRIRAGMYWLRA-
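
Consistency check (illustrative sketch, not part of the original record):

The transcript and protein above can be checked against each other by translating the coding sequence. The sketch below assumes the standard genetic code (NCBI translation table 1) and assumes that the trailing "-" in the protein record marks the stop codon; neither is stated on the page. The helper name `translate` and the two short excerpts are chosen here for illustration. The excerpts are the first 30 and last 12 nucleotides of DPOGS201428-TA.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and last 12 nt of the transcript listed above.
first_30 = "ATGTTGAAAGTCGCAAATACTTTACGATTG"
last_12 = "CTCAGGGCGTAA"

print(translate(first_30))  # MLKVANTLRL
print(translate(last_12))   # LRA*

# Length check: 334 residues plus one stop codon account for 1005 bp.
assert 3 * (334 + 1) == 1005
```

Both excerpts agree with the start (MLKVANTLRL) and end (LRA followed by the stop marker) of DPOGS201428-PA. Passing the full 1005 bp sequence to `translate` would extend the same check to the whole record.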