Monarch geneset OGS2.0

DPOGS207003
Transcript: DPOGS207003-TA (1494 bp)
Protein: DPOGS207003-PA (497 aa)
Genomic position: DPSCF300001 + 1037044-1045756
RNAseq coverage: 1568x (Rank: top 8%)
Annotation
Heliconius: HMEL008667  4e-154  51.14%
Bombyx: BGIBMGA012923-TA  3e-90  59.13%
Drosophila: gd-PA  1e-49  27.84%
EBI UniRef50: UniRef50_UPI0002063EFB  4e-66  33.74%  UPI0002063EFB related cluster n=3 Tax=unknown RepID=UPI0002063EFB
NCBI RefSeq: XP_001121433.1  1e-66  33.60%  PREDICTED: similar to CG9649-PA [Apis mellifera]
NCBI nr blastp: gi|380012872  6e-69  34.34%  PREDICTED: serine protease gd-like [Apis florea]
NCBI nr blastx: gi|380012872  8e-69  34.22%  PREDICTED: serine protease gd-like [Apis florea]
Group
Gene Ontology:
  GO:0003824  1.5e-71  catalytic activity
  GO:0004252  4.1e-49  serine-type endopeptidase activity
  GO:0006508  4.1e-49  proteolysis
KEGG pathway 
InterPro domain:
  [230-495]  IPR009003  1.5e-71  Peptidase cysteine/serine, trypsin-like
  [247-490]  IPR001254  4.1e-49  Peptidase S1/S6, chymotrypsin/Hap
  [278-293]  IPR001314  6.7e-09  Peptidase S1A, chymotrypsin-type
Orthology group: MCL15549, Insect specific

Nucleotide sequence:

>DPOGS207003-TA
ATGACGTACCGTGTGTTTTTTAGTTTTTTAATATTAGGCTTGCTAAGTGTATGCGGACAGCAATTTAACTCACCCTGTCCAGATTACTTTGGTTATAGACAAGACTCGGGGGGCATATACGGTTTAATTACCATAAAGTCATTTGGAGTTGTGTCTTCTTTACTAGTTCGAGCTAATTTTACTATTGCAGGAAGATTGCCATCCACATATGCGGGTAGCCTCCAACCTATCGGAACAGAGCTATACCTTCTGCGTGACTTCAATAGGGGAAAGACTTTGGAATACAGGGTTAGCTTCCCAGTCGTGTCTCCTTTACCCCGACTGACTTCCATCTCAGTCAACGATAAATTAGTCTGTTATGGACCTGGAGACTCTATCGGCGAGGTTCAATACATCACTACGATAAGCTTGCAGCACATGCTTTTCTACAAGACCGGCACTCAGGGGATTTACGGACACGATACAAAATTAGCTGAACAGTTGCCTGCCACATTACCTCATACTAATGGTGGGGACGACGTGAATCCATTTACAATTGACAACTCAAAGCCGGTTTATGTACCTCCCAAACCGCTAACACCAGCCCCGGTTCCTGAATATGTTGTCACAGCGACCCCTGCGACACGCCGACCACCACGCCCCGAACCCGATCCTCTGTCAATTAAACCGGAACCGCAGAGCACAAAATGTGGTATCAATAGTAAGAACGATAACCCAGACGCCATACCTGCGGCCCCATTAATATACAATGGGGTCACCTACGACCCTGGTGAATGGCCATGGCTTGTGGCGATGTACCAGCGTAGGTTCGGCAACTTGAACTACATCTGTGCTGGGACCCTGGTTACAGCGAATCATATTGTTAGTGCCGCTCACTGCATTCATCGCAAAAGTACATACACTCGCAAGAAGAATATAGTTATAAGAGCTGGCATATATGGCTTGGAAGATTGGAACGATGACATCGTTACAAGATCACTTAAGGAGGTTTATATCCATGAAGACTACAACTCCACGACGCTTGAGAATGATATCCTTATTATGACTCTTGAGAGTCCAGTGCCTTTTAACAAAATGATACAGCCAGCCTGCCTATGGAGCGGTCCCATTGTTTTGAACGAAATTGTCGGAAAATCTGGAATTGTTGCTGGTTGGGGAGCAAACGAACAGGGTTCTGGAGGCAAGGGTATTCCCCGAATGGTGACCATGCCTGTTGTTAGTACAGAGGACTGCAAAGCTAGCAAGCCAGATTTCCACAGGCTAACGTCTTCCAGGACTTTGTGTGCAGGTGACAGAACCGGGGCTGGTCCATGTCTAGGCGACTCTGGTGGCGGTCTATATTTGCTTCATCGTGGCCGCTGGCGTCTTCGCGGTATTGTTTCGCTCTCTCTTCTATCTGATGACGAGAGTCAATGTGACCTCAAGCAGTACATTGTGTTCACCGACGCAGCACAGTACATGCCGTGGATCACTGATGTGTTGTCCATCACTTGA

Protein sequence:

>DPOGS207003-PA
MTYRVFFSFLILGLLSVCGQQFNSPCPDYFGYRQDSGGIYGLITIKSFGVVSSLLVRANFTIAGRLPSTYAGSLQPIGTELYLLRDFNRGKTLEYRVSFPVVSPLPRLTSISVNDKLVCYGPGDSIGEVQYITTISLQHMLFYKTGTQGIYGHDTKLAEQLPATLPHTNGGDDVNPFTIDNSKPVYVPPKPLTPAPVPEYVVTATPATRRPPRPEPDPLSIKPEPQSTKCGINSKNDNPDAIPAAPLIYNGVTYDPGEWPWLVAMYQRRFGNLNYICAGTLVTANHIVSAAHCIHRKSTYTRKKNIVIRAGIYGLEDWNDDIVTRSLKEVYIHEDYNSTTLENDILIMTLESPVPFNKMIQPACLWSGPIVLNEIVGKSGIVAGWGANEQGSGGKGIPRMVTMPVVSTEDCKASKPDFHRLTSSRTLCAGDRTGAGPCLGDSGGGLYLLHRGRWRLRGIVSLSLLSDDESQCDLKQYIVFTDAAQYMPWITDVLSIT-
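Consistency check (illustrative):

The transcript and protein records above should agree: 1494 bp is 498 codons, that is 497 amino acids plus one stop codon, which the protein record writes as a trailing "-". The sketch below is one way to check this, assuming the standard genetic code and that the transcript is the coding sequence in frame from its first base. The names `translate` and `CODON_TABLE` are hypothetical helpers written for this example and are not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First five and last three codons of DPOGS207003-TA, copied from the record.
print(translate("ATGACGTACCGTGTG"))  # MTYRV
print(translate("ATCACTTGA"))        # IT-
```

Running `translate` on the full DPOGS207003-TA sequence and comparing the result with the DPOGS207003-PA record would extend the same check to the whole entry.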