Monarch geneset OGS2.0

DPOGS215916
TranscriptDPOGS215916-TA1035 bp
ProteinDPOGS215916-PA344 aa
Genomic positionDPSCF300029 + 505470-507743
RNAseq coverage80x (Rank: top 64%)
Annotation
HeliconiusHMEL0054044e-11761.88% 
BombyxBGIBMGA000349-TA5e-9551.58% 
DrosophilaCG14231-PA2e-7545.70% 
EBI UniRef50UniRef50_G6DD693e-10058.48%Putative uncharacterized protein n=3 Tax=cellular organisms RepID=G6DD69_DANPL
NCBI RefSeqXP_974817.13e-9150.63%PREDICTED: similar to AGAP005215-PA [Tribolium castaneum]
NCBI nr blastpgi|910830395e-9050.63%PREDICTED: similar to AGAP005215-PA [Tribolium castaneum]
NCBI nr blastxgi|910830394e-8750.63%PREDICTED: similar to AGAP005215-PA [Tribolium castaneum]
Group
Gene OntologyGO:00065082.6e-65proteolysis
GO:00042222.6e-65metalloendopeptidase activity
KEGG pathway 
InterPro domain[2-281] IPR0009052.6e-65Peptidase M22, glycoprotease
[2-280] IPR0178614.6e-63Peptidase M22, glycoprotease, subgroup
Orthology groupMCL12608 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215916-TA
ATGCATCGTGAAAATATTGAAAAAGCAGTATTTAAATGTTTAGAAAATTCTTCTTTATCCATGGACAATATTGATGCTATCGCTGTTACTGTGAAACCTGGTTTATTAATTAGTTTAGAAATTGGTGTCAAGTATGCCAAGTATCTTTCAAAAATTTACAAAAAGCCCTTAATTCCTATACATCATATGGAGGCGCATGCCTTAGCTGCCAGAATGTTTCAAGATATACAACTACCGTTCCTAACATTGCTAATTTCGGGTGGTAACTGTCTGTTAGCATTTGTGAAAGAAATAGATAATTTTCTCTTACTTGGTGACACTATGGATAATTCACCAGGAGAAGTTTTAGACAAAGCTGCTAGGAGAATGAAACTGAGAAATTTACCAGAATATTCAGGGATGGCTGGTGGTAGAGCTATTGAGGTGGCAGCTAAAAATGCAGTTAATCCATTTCTATTTGATTTCCCATTACCACTCAATAGAAATAGAGATTGCAATTTCAGTTTCAGTGGTCTTCAGGATGCTTTTTTGAGACATTTGTTACATAAAGAAAAATATCATAATATCATGGGTGATGAAATTATTCCGGAAGTAAATGAACTGTGTGCTGCTTTTCAATTGGCAATGGCTGAACATATAGCACATAGAACAGAAAGGGCCATAAAGTATTGTGAAATAACTAATCTGTTCAGAGGGGATGTCAAAAATATTGTTGTTTCTGGTGGTGTTGCCTGCAATGATTTTATATTTAAAAGTATTGAGTGTATTGGTAATAAGTATGGATGCAAAGTCTTTAGACCTCCACCCAAATTATGTACTGACAATGGAGTTATGATAGCATGGAATGCCTTGGAGAAGTTAAAACATAAGTCAGATAGTGTAAACGAGCCCATAGAAATAAATCCTACTGCACCACTTGGCCTGCCGTTAGCCCAAGGTGTAATTCTGACATGGGACTCATCACGTCTCAGAGCTTCTGTACGAAGTTCTGGTTTTAATGACCGCCGCAAAATTTTTGCAGCAATGTTGGAGTAG

Protein sequence:

>DPOGS215916-PA
MHRENIEKAVFKCLENSSLSMDNIDAIAVTVKPGLLISLEIGVKYAKYLSKIYKKPLIPIHHMEAHALAARMFQDIQLPFLTLLISGGNCLLAFVKEIDNFLLLGDTMDNSPGEVLDKAARRMKLRNLPEYSGMAGGRAIEVAAKNAVNPFLFDFPLPLNRNRDCNFSFSGLQDAFLRHLLHKEKYHNIMGDEIIPEVNELCAAFQLAMAEHIAHRTERAIKYCEITNLFRGDVKNIVVSGGVACNDFIFKSIECIGNKYGCKVFRPPPKLCTDNGVMIAWNALEKLKHKSDSVNEPIEINPTAPLGLPLAQGVILTWDSSRLRASVRSSGFNDRRKIFAAMLE-