Monarch geneset OGS2.0

DPOGS215672
Transcript: DPOGS215672-TA | 786 bp
Protein: DPOGS215672-PA | 261 aa
Genomic position: DPSCF300041 - 1168981-1170359
RNAseq coverage: 287x (Rank: top 38%)
Annotation
Heliconius: HMEL014640 | 7e-41 | 39.91%
Bombyx: BGIBMGA003567-TA | 2e-42 | 42.86%
Drosophila: CG3355-PA | 7e-24 | 32.84%
EBI UniRef50: UniRef50_P35042 | 6e-43 | 40.00% | Trypsin CFT-1 n=68 Tax=Ditrysia RepID=TRYP_CHOFU
NCBI RefSeq: NP_001040350.1 | 2e-27 | 39.90% | trypsin-like protease [Bombyx mori]
NCBI nr blastp: gi|464962 | 2e-42 | 40.00% | trypsin [Choristoneura fumiferana]
NCBI nr blastx: gi|464959 | 2e-51 | 46.12% | trypsin [Manduca sexta]
Group
Gene Ontology: GO:0003824 | 1.3e-55 | catalytic activity
Gene Ontology: GO:0004252 | 4.2e-39 | serine-type endopeptidase activity
Gene Ontology: GO:0006508 | 4.2e-39 | proteolysis
KEGG pathway 
InterPro domain: [13-233] | IPR009003 | 1.3e-55 | Peptidase cysteine/serine, trypsin-like
InterPro domain: [25-236] | IPR001254 | 4.2e-39 | Peptidase S1/S6, chymotrypsin/Hap
InterPro domain: [56-71] | IPR001314 | 2.4e-07 | Peptidase S1A, chymotrypsin-type
Orthology group: MCL27845 | Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215672-TA
ATGGCTGCGACTCTAATACTGTTGGCCGGCGTCCTCTCTGCTGTAGCGGCGGTGCCGAATTCGGAGAGAATTGCTGGCGGTGCAATTACGACAATTGCAACGTATCCTTCTGCCGCCAGTATCTCATACAACAGGCTTGGATTTGGTACTTTCACCTTCACCTGTGGAGGAAGTATAGTCAGCAGCAGATCTATTTTAACAGCAGCGTTTTGTGTGCACGGGGATCAGACTTATCGATACCGAGTGCGTGTTGGTTCGTTGACAGCAAGTTCTAACGGAATTCTCCATACCGTCGACTATTTCACTATTCATCCGAATTACAATCCCGTGACCAAAGAACATGACATCGCACTAATTCACGTATTCCCCCATCTATTGTTTACATCTAATGTCCAACTGGCAAATTTTCCTGATGGAAGCTATATCCCTATGAGAAATCAGACGGTCATGGCTATTGGATGGGGACAAATCAATCATGGTGGAGCTCTGTCGGAGAGTCTTCGTCGTGTCCAACTGTGGCTAGTTGACAATAACGAGTGCAGAAATCGTTACTCTGAGCTGGAAGGCCCCAATGTCACTTCTAACATGATTTGTGCTTCTGGGTACGATCGGTCTGGAAGGGGGCAATGTCTAGGTGACAATGGCAGCCCCCTTCTTGATGATGACTCATTATTGGTATATATTCATGGAGTCACCAGTGCGGTACAGTACGTTACCCTAGTGTCAACACCTACATACCAAAATATGCCAACTGGATCAGATCTTACTATTAATTCATATAAATAA

Protein sequence:

>DPOGS215672-PA
MAATLILLAGVLSAVAAVPNSERIAGGAITTIATYPSAASISYNRLGFGTFTFTCGGSIVSSRSILTAAFCVHGDQTYRYRVRVGSLTASSNGILHTVDYFTIHPNYNPVTKEHDIALIHVFPHLLFTSNVQLANFPDGSYIPMRNQTVMAIGWGQINHGGALSESLRRVQLWLVDNNECRNRYSELEGPNVTSNMICASGYDRSGRGQCLGDNGSPLLDDDSLLVYIHGVTSAVQYVTLVSTPTYQNMPTGSDLTINSYK-
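
Consistency note: the 786 bp transcript and the 261 aa protein agree if the transcript is a complete coding sequence, since 786 / 3 = 262 codons, i.e. 261 residues plus one stop codon (the trailing "-" in the protein record). The sketch below is one way to check this by translating the nucleotide record. It assumes the standard genetic code (NCBI translation table 1) and reading frame 1; the function name `translate` and the `stop_symbol` parameter are illustrative and not part of the database.

```python
# Minimal translation sketch, assuming the standard genetic code and frame 1.
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI translation table 1).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon -> amino acid lookup table.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` ("-" matches this record).
    Codons containing characters other than A/C/G/T become "X".
    Trailing bases that do not fill a codon are ignored.
    """
    cds = cds.upper()
    residues = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First seven codons and last three codons of DPOGS215672-TA.
print(translate("ATGGCTGCGACTCTAATACTG"))
print(translate("TATAAATAA"))
```

To check the whole record, pass the full DPOGS215672-TA sequence to `translate` and compare the result with the DPOGS215672-PA sequence, including the final "-".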