Monarch geneset OGS2.0

DPOGS201312
TranscriptDPOGS201312-TA771 bp
ProteinDPOGS201312-PA256 aa
Genomic positionDPSCF300176 + 201129-202713
RNAseq coverage13x (Rank: top 82%)
Annotation
HeliconiusHMEL0172586e-7650.00% 
BombyxBGIBMGA003104-TA3e-4642.32% 
DrosophilaCG3355-PA2e-2229.32% 
EBI UniRef50UniRef50_E1ZZV41e-2129.34%Putative trypsin-6 n=3 Tax=Formicidae RepID=E1ZZV4_CAMFO
NCBI RefSeqXP_002052132.12e-2230.62%GJ23363 [Drosophila virilis]
NCBI nr blastpgi|1953868804e-2130.62%GJ23363 [Drosophila virilis]
NCBI nr blastxgi|3838505166e-2232.66%PREDICTED: chymotrypsin-1-like [Megachile rotundata]
Group
Gene OntologyGO:00038243.6e-46catalytic activity
GO:00042523.3e-33serine-type endopeptidase activity
GO:00065083.3e-33proteolysis
KEGG pathwaybta:2817313e-19 
 K01353 (GZMB)maps-> Graft-versus-host disease
    Type I diabetes mellitus
    Allograft rejection
    Natural killer cell mediated cytotoxicity
    Autoimmune thyroid disease
InterPro domain[15-250] IPR0090033.6e-46Peptidase cysteine/serine, trypsin-like
[27-245] IPR0012543.3e-33Peptidase S1/S6, chymotrypsin/Hap
Orthology groupMCL26487 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201312-TA
ATGCTGATAAAAGTTATAGCATTATTTCTGTTTACAAATTCAGTTTCATCTAAACCAAATGAATTAGAAGGGAGGGTAGTCAGAGGAGACGTTGTGTCGATCGAGGACTTCCCATATTCAGCGTTTCTGTTGATGGGTAGAGAGAGGGGCAGCTTTATATGTGGTTCATCCATCATCAATCAGAGAATCTTATTGACGGCAGCACATTGTATCGAAATATGCAATCCCAAGTGCAAGAACGGAGCGGCATTTGTTGGAAATGAACAAAAGAGGATGGGAATCAAAATGACTATAACATTCGCAAAATACCACCCCAGATATAGAACAAATCGTGTGCACTTTGATATAGGTCTTGCATTGCTTTCTAGATCTATAAAGTTTGGTAAATTTGTTAAACGGGTTGCCATTTCAAGGCGTCCGAGGATAAAATCTGTCGCTGATATAGCTGGTTGGGGTTTAGTTGATGAAATAAACAAATTGTCGACAGATTACTTGCATCATATAACGCAAAAGGTGATAAGTCATAGTGATTGTAAGGCCTATATATCCAATATTCCTCCAGGCTCTTTCTGCGCTGGTGAGATTAAGAGCAGGCAGTTTGCATCAGAAGGGGACTCTGGCAGTGCTTTAATAATCAACAAGTACACGCAAATCGGTATCGTGTCTTATAAACGGCCGGACATATCGGCCAGTCTTATTGTATATACAAACGTCTCATTCTATTACGACTGGATAAAACAAACTTCGAGAAAATTGTACTGCGACTATTAA

Protein sequence:

>DPOGS201312-PA
MLIKVIALFLFTNSVSSKPNELEGRVVRGDVVSIEDFPYSAFLLMGRERGSFICGSSIINQRILLTAAHCIEICNPKCKNGAAFVGNEQKRMGIKMTITFAKYHPRYRTNRVHFDIGLALLSRSIKFGKFVKRVAISRRPRIKSVADIAGWGLVDEINKLSTDYLHHITQKVISHSDCKAYISNIPPGSFCAGEIKSRQFASEGDSGSALIINKYTQIGIVSYKRPDISASLIVYTNVSFYYDWIKQTSRKLYCDY-