Monarch geneset OGS2.0

DPOGS202618
TranscriptDPOGS202618-TA762 bp
ProteinDPOGS202618-PA253 aa
Genomic positionDPSCF300140 + 412187-413943
RNAseq coverage106x (Rank: top 60%)
Annotation
HeliconiusHMEL0024531e-7555.87% 
BombyxBGIBMGA006534-TA1e-5045.81% 
DrosophilaCG17475-PA7e-4034.45% 
EBI UniRef50UniRef50_D6WK621e-4039.91%Serine protease P145 n=5 Tax=Tenebrionidae RepID=D6WK62_TRICA
NCBI RefSeqXP_001999302.13e-4239.29%GI24438 [Drosophila mojavensis]
NCBI nr blastpgi|1951094545e-4139.29%GI24438 [Drosophila mojavensis]
NCBI nr blastxgi|1951094545e-4238.86%GI24438 [Drosophila mojavensis]
Group
Gene OntologyGO:00038243.1e-69catalytic activity
GO:00042522.2e-58serine-type endopeptidase activity
GO:00065082.2e-58proteolysis
KEGG pathway 
InterPro domain[9-244] IPR0090033.1e-69Peptidase cysteine/serine, trypsin-like
[22-239] IPR0012542.2e-58Peptidase S1/S6, chymotrypsin/Hap
[50-65] IPR0013145.1e-11Peptidase S1A, chymotrypsin-type
Orthology groupMCL20449 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202618-TA
ATGTCTAATACACTACTATTTTTTTGCATTTTCTGTATTTACTTCGGTGTCGAGGGAACACCGCGATTAGTAGGAGGAGATAGAATCCCTATCGAATTAGGAAAGTTTCATGCATCACTCCAAAATGTCACTCGACATCATGTCTGTGGGGGCGCTATAATATCTCAAAAGCATGTTGTTACAGCTGCTCACTGTGTTTTCAACGCTGAACCGAAATATATACATGTAGTAATTGGAACTGCTAATTTGGATAACGGTGGATTGATGTACAGTGTTGAATCAGTATACGTTCATGATGACTATAACAGCACTCTAAGATTAAATGATATAGCGATATTAAAAATTAGAGGTCTTTTTAATTTATGCAAAGCTAAAATGCTACGACTCGATACCGAGAAATTAAAGGAAGGAGATAATGTTACTGTCGTTGGGTTCGGTGCGAAGAAGCCGAATGGAGAATCAGCCCGCAAAATGAACGCTTTAAATCTGACAGTATTCAGCCAGGAAACTTGTCAGTATGCAATGCGATACACGAGAAAGATTTATGACAGCATGTTCTGTACGTTCACAGGAATTGGACAGGGTACTTGCCACGGAGATTCTGGTGGACCACTGGTTAAAGATAACAAACTGGTTGGAATAGTTTCCTGGGGAATACCATGTGCGGTCGGTTTCCCAGACGTTCATACCAGAATACAGCCTTATATACCATGGATACAAAATATAATGGATAAAGTTTCGTGTGGCTCATGTCGTAAATAG

Protein sequence:

>DPOGS202618-PA
MSNTLLFFCIFCIYFGVEGTPRLVGGDRIPIELGKFHASLQNVTRHHVCGGAIISQKHVVTAAHCVFNAEPKYIHVVIGTANLDNGGLMYSVESVYVHDDYNSTLRLNDIAILKIRGLFNLCKAKMLRLDTEKLKEGDNVTVVGFGAKKPNGESARKMNALNLTVFSQETCQYAMRYTRKIYDSMFCTFTGIGQGTCHGDSGGPLVKDNKLVGIVSWGIPCAVGFPDVHTRIQPYIPWIQNIMDKVSCGSCRK-