Monarch geneset OGS2.0

DPOGS213158
Transcript: DPOGS213158-TA; 1086 bp
Protein: DPOGS213158-PA; 361 aa
Genomic position: DPSCF300016 + 1294765-1298856
RNAseq coverage: 253x (Rank: top 41%)
Annotation
Heliconius: HMEL015078; 1e-110; 67.40%
Bombyx: BGIBMGA007915-TA; 2e-130; 70.07%
Drosophila: CG15255-PA; 4e-50; 42.98%
EBI UniRef50: UniRef50_G3M5H2; 1e-125; 69.39%; Hatching enzyme-like II n=3 Tax=Obtectomera RepID=G3M5H2_BOMMO
NCBI RefSeq: NP_001129355.1; 8e-128; 69.39%; hatching enzyme-like protein [Bombyx mori]
NCBI nr blastp: gi|341942503; 9e-127; 74.10%; hatching enzyme-like protein [Antheraea pernyi]
NCBI nr blastx: gi|341942503; 6e-132; 73.13%; hatching enzyme-like protein [Antheraea pernyi]
Group
Gene Ontology:
GO:0006508; 1.4e-44; proteolysis
GO:0004222; 1.4e-44; metalloendopeptidase activity
GO:0008237; 1e-25; metallopeptidase activity
GO:0008270; 1e-25; zinc ion binding
KEGG pathway 
InterPro domain:
[87-290] IPR024079; 6.2e-55; Metallopeptidase, catalytic domain
[95-290] IPR001506; 1.4e-44; Peptidase M12A, astacin
[92-242] IPR006026; 1e-25; Peptidase, metallopeptidase
Orthology group: MCL19838; Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213158-TA
ATGATTCGGACCGTGTTGTTACTGTGCGTGTTAGGGTTGGCGACGGCGTCGCCCGCGCTAGTCAGGAGCCGAGAGGAGGTCGAAGCGTTTAGGAACTTTTTAGAGAGTACGAAAACCAATGATGGGAATAACTTTCTGGCTCGCACGAAACTGTCTCCGTTGTCCAATCCTGAGGAAAACAGCGGCAAGTACCAGGGTGACATCGTTTTGGATGACTTCATGATCGAAGATATGGTACAAGGTTACGCCGTGGGTCGCAATGCTTACATCTTCCCCGACACCCATTGGCCAAATAACACGGTTGTTTGGCAGTTTGGAGAAGGTGAATTCGATCCTGTACAGCAGCAGGCGATTAAGGACGGAATTCAAGATATAGAAAATCACACTTGTATTAAGTTCCGCTACCGAGAGCCAGAAGATACTGTCTTTGTAAATGTTACTGGCGGTCCAGGAGGTTGCTATGCTCATGTAGGTTACTGGGAACCCCGCGAGGTGCACGTCATGAACCTGGCCGCAAACCTGCCCGGCGTGGGTTGCTTCAGACACGCCACTATAGTGCACGAGTGGATGCACATCCTCGGCTTCCTGCACATGCACTCCACTTACAACAGAGACGATTACGTAGACATCATTGAGGAGAACGTTGCACCTGGTAGGTTCCATAACTTCGACATCTACACCTCGGAGCTCGTCAGCAACAACGGCATTGAATATGATTACGTCAGCTGTCTCCACTACGGCCCGTTCGCGTTCACGGTCAATGGTGAACCAACAATCGTACCTAAAAAGGAAATCGAAGGTACAATGGGTCAGAGAGTTTTTATCACGGAGAAGGATTGGCTCAGAATCAACAGGCACTATAATTGTTCCGGAGCTTGGGATGAAGTGAAAGAAGAAATAAAAGAATATAGCAAGCAAGAAGAAGATGACGTTGATGACGAAGAACAAGAAATCGAAATAGTAGGAGATAGTGAAGATGTTGACGAAGAATATGGAGAAAACGGAGAAGTTCTGGATGTAGATGCAGAAGATGAAGAATTAATTAAACGATTGATAGCAGTACAGCTGCTACAGATGAAAAAATAA

Protein sequence:

>DPOGS213158-PA
MIRTVLLLCVLGLATASPALVRSREEVEAFRNFLESTKTNDGNNFLARTKLSPLSNPEENSGKYQGDIVLDDFMIEDMVQGYAVGRNAYIFPDTHWPNNTVVWQFGEGEFDPVQQQAIKDGIQDIENHTCIKFRYREPEDTVFVNVTGGPGGCYAHVGYWEPREVHVMNLAANLPGVGCFRHATIVHEWMHILGFLHMHSTYNRDDYVDIIEENVAPGRFHNFDIYTSELVSNNGIEYDYVSCLHYGPFAFTVNGEPTIVPKKEIEGTMGQRVFITEKDWLRINRHYNCSGAWDEVKEEIKEYSKQEEDDVDDEEQEIEIVGDSEDVDEEYGENGEVLDVDAEDEELIKRLIAVQLLQMKK-
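
Length consistency check:

The lengths reported for this record can be cross-checked with simple arithmetic: 361 amino acids plus one stop codon is 362 codons, or 1086 bp, which matches the transcript length. The sketch below is only an illustration in Python. The function names are hypothetical, and it assumes the genomic coordinates are 1-based and inclusive, which the record does not state.

```python
# Values copied from the record; the helper names are illustrative only.
TRANSCRIPT_BP = 1086
PROTEIN_AA = 361
GENOMIC_START, GENOMIC_END = 1294765, 1298856


def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Return the coding-sequence length in bp for a protein of this length."""
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


def genomic_span(start: int, end: int) -> int:
    """Return the interval length, assuming 1-based inclusive coordinates."""
    return end - start + 1


cds_bp = expected_cds_length(PROTEIN_AA)
span_bp = genomic_span(GENOMIC_START, GENOMIC_END)

# True when the transcript is exactly the coding sequence plus a stop codon.
print(cds_bp == TRANSCRIPT_BP)
# Genomic bases within the locus that are not part of the transcript.
print(span_bp - TRANSCRIPT_BP)
```

Under that coordinate assumption the locus spans 4092 bp, so about 3006 bp of the genomic interval are absent from the 1086 bp transcript. That difference would be consistent with intronic sequence, although the record itself does not list the exon structure.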