Monarch geneset OGS2.0

DPOGS215670
Transcript         DPOGS215670-TA   783 bp
Protein            DPOGS215670-PA   260 aa
Genomic position   DPSCF300041 - 1188117-1189445
RNAseq coverage    1094x (Rank: top 11%)
Annotation
Heliconius         HMEL016357         2e-62   48.28%
Bombyx             BGIBMGA003580-TA   3e-67   50.96%
Drosophila         CG31954-PA         1e-31   32.35%
EBI UniRef50       UniRef50_P35042    9e-68   52.87%   Trypsin CFT-1 n=68 Tax=Ditrysia RepID=TRYP_CHOFU
NCBI RefSeq        NP_001040350.1     7e-41   44.84%   trypsin-like protease [Bombyx mori]
NCBI nr blastp     gi|464962          3e-67   52.87%   trypsin [Choristoneura fumiferana]
NCBI nr blastx     gi|464962          2e-69   52.87%   trypsin [Choristoneura fumiferana]
Group
Gene Ontology      GO:0003824   7e-69     catalytic activity
                   GO:0004252   2.2e-58   serine-type endopeptidase activity
                   GO:0006508   2.2e-58   proteolysis
KEGG pathway 
InterPro domain    [11-260]   IPR009003   7e-69     Peptidase cysteine/serine, trypsin-like
                   [23-256]   IPR001254   2.2e-58   Peptidase S1/S6, chymotrypsin/Hap
                   [56-71]    IPR001314   2.4e-06   Peptidase S1A, chymotrypsin-type
Orthology group    MCL16202   Lepidoptera specific

Nucleotide sequence:

>DPOGS215670-TA
ATGCGTGTTACTGTTCTCTTATTTGTGGGCTGTCTAGCTGTGGCAGCCGCGCTGCCACCACCTGTGAGAATCGTCGGTGGATCAAACGCAGCAATAACATCATACAGATTTGCTGCCAGTCTTTTACATTCTAGAATTGGCGTTGGTACTTTTATATATGGCTGCGGTGGGTCAATCATTACCAACAGAGTGATTCTGACTGCTGCTTATTGCCTTTACAATGAACCTGTATATCGTTGGCGTGTTCGTGTTGGTTCAGCCAGGTCCAGCACTGGGGGAGTCGTTCATAATACTCTGAGAACAGTAGTTCATCCAAATTATAATCCACGGACTGCTGACAGTGACATTGCTTTATTGCACTCAATGACAGTTTTCGTTTTCAACAATAACGTTAATTTGGTTGGAATTGCTAGCGCAAATTATAACCTTCCTGACAATCAGCCTGTTACAGCTATTGGATGGGGAGCTACCAGTCACGGTGGTCAACTCTCTGATAGGCTCCGTCATGTTGACATTTGGACAGTCAATAGAAACGTTTGCCGGACGCGTCATTCTGAGTTGGGATACAGCATTACCGACAACATGCTATGTGCTGGTTGGCTGGATGTCGGAGGTCGTGGCGCTTGCATTGGTGATACTGGCAGCGCTCTCATTCACCTTACCGGCAATGTTCAAACTATTGTTGGAGTGTACTCGTGGAGTTACAATTGTGCACTTCCTCGATACCCTAGCGTTAACACATTTATTCCCAGATATACTAATTGGATTCTAGCCAATGCATAA

Protein sequence:

>DPOGS215670-PA
MRVTVLLFVGCLAVAAALPPPVRIVGGSNAAITSYRFAASLLHSRIGVGTFIYGCGGSIITNRVILTAAYCLYNEPVYRWRVRVGSARSSTGGVVHNTLRTVVHPNYNPRTADSDIALLHSMTVFVFNNNVNLVGIASANYNLPDNQPVTAIGWGATSHGGQLSDRLRHVDIWTVNRNVCRTRHSELGYSITDNMLCAGWLDVGGRGACIGDTGSALIHLTGNVQTIVGVYSWSYNCALPRYPSVNTFIPRYTNWILANA-
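
The transcript and protein records above can be cross-checked against each other: 783 bp is 261 codons, that is 260 residues plus one stop codon, which the protein record writes as a trailing "-". The following is a minimal sketch of that check, assuming the transcript is a complete coding sequence read in frame from its first base and translated with the standard genetic code (NCBI translation table 1). The `translate` helper and its `stop_symbol` parameter are hypothetical names introduced here for illustration; they are not part of the database.

```python
# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; "-" is an assumption chosen
    to mirror the notation used in the protein record above.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


# First 15 nt and last 12 nt of DPOGS215670-TA, copied from the record above.
print(translate("ATGCGTGTTACTGTT"))  # MRVTV
print(translate("GCCAATGCATAA"))     # ANA-

# Length check: 783 nt = 261 codons = 260 residues + 1 stop codon.
print(783 // 3 - 1)                  # 260
```

Passing the full DPOGS215670-TA sequence to `translate` should, under the same assumptions, reproduce the DPOGS215670-PA record including the trailing stop symbol.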