Monarch geneset OGS2.0

DPOGS213430
Transcript: DPOGS213430-TA, 1266 bp
Protein: DPOGS213430-PA, 421 aa
Genomic position: DPSCF300271 + 142244-148195
RNAseq coverage: 264x (Rank: top 40%)
Annotation
Heliconius: HMEL016807    1e-80    49.72%
Bombyx: BGIBMGA004460-TA    3e-53    37.53%
Drosophila: CG4386-PA    5e-17    28.25%
EBI UniRef50: UniRef50_G3TZE0    3e-18    33.16%    Uncharacterized protein n=5 Tax=Eutheria RepID=G3TZE0_LOXAF
NCBI RefSeq: XP_001607869.1    7e-17    32.79%    PREDICTED: similar to Chymotrypsin-2 (Chymotrypsin II) [Nasonia vitripennis]
NCBI nr blastp: gi|345308603    2e-18    32.60%    PREDICTED: brain-specific serine protease 4-like, partial [Ornithorhynchus anatinus]
NCBI nr blastx: gi|344291992    1e-16    32.98%    PREDICTED: hypothetical protein LOC100659461 [Loxodonta africana]
Group
Gene Ontology: GO:0003824    7.4e-40    catalytic activity
               GO:0004252    2.4e-28    serine-type endopeptidase activity
               GO:0006508    2.4e-28    proteolysis
KEGG pathway:
InterPro domain: [211-395]    IPR009003    7.4e-40    Peptidase cysteine/serine, trypsin-like
                 [215-395]    IPR001254    2.4e-28    Peptidase S1/S6, chymotrypsin/Hap
Orthology group: MCL30258    Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213430-TA
ATGCCATTACAGAAAGGTATTAAAATCATAAATGCGAGGAAATTCAACAAACTAAAAGCACAGGAGAGCCGCAAGGCGAATTATAAGAAAAGAGATAAAAAGGTCACGTATACAGCGTTCCCGCCGATAGCTGTTAAAGTTGTGAAGACTTCGGCTGAGAAAAATTACTTCCAAAATGCCAACGACGAGAGAAAACCAAGGGTGAAAAAGGTATTGCCCGAATACGAACTCACAGTTTTTAGGAATAAGAGGTCATTAGATAATACAGAAAACAATAGGGAAACACGGAGAAGAGTAAAAGCCAAGAGATATTTAAGGTCCAAAGATAAAACGACCAAAATTCATAAGAAAATAGGTAGAGGTAGCAGACAGAAGAGAAATCAAAAACATGATCAGCAAGAAAAATTGATATCCAAACGGACTAAAGATAATATATCATATGTTTACAAAAAACATAAAAGCAATAAGAGGTCAGGGGCCAATCTTAGGGTAGGAAAGAAAAGATCGACCATAGATTTGAAATACACAGAGAAAGTTCGTGACGCGGATCAGAAAAAGAAGTTGATTAAACGGAATAAGAAATTAAAACGTCAAAGATTATTCAGAAAAAGGGAAGACGGTAAAGTTGGATATAGGAGGCTTATAGCTGGTAGAGATGCCATGATTAGAGAGTACCCTTATGTGGTGTCAATACAAAAAGGTCGCGAACATTGGTGCGCCGGCGCATTGCTTAACCAAAGGCTCGTTATTACAACAGCCAACTGCATATGGAAGTCTGAACATGTAAGCCGCATGAAGGTGAGAGCTGGTACACGTCATATGGACCGGAAGGGTCAGGTGGCTAAAATTATGGAGGTGGTGAAACATCCGCTGTGGAATATAAGAGGAGGACCAGACAATGATGTTGGACTGCTTCTACTGGACCGGAATATTAAGTTTTCTGACTCAGTCCATAGCGTTGATCTCCCGAATCGTGTGATGTGGCCGGCCTTCGAGGATGTCTGGGTTACCAGCTGGGGTTCCAATAGACGTGACGGTGTATACGACAGTATATCCAGCACGCTCCAAGTGTATCACGCCATGTTGATGAGCAACGAGCAGTGCAACAACGTCACCATGAGGTTCGGAGTGCCTGTCACTGAGAACTTCTTTTGTGTCACACAGACCGGAAGACGCGCGCCGTGCACTGTCTGCGTCTCGCAACGTGACTATCGTCGTATCCCGCCGACGCCAGCAGCTCATCGCATATCGAAAAAATTGACATAA

Protein sequence:

>DPOGS213430-PA
MPLQKGIKIINARKFNKLKAQESRKANYKKRDKKVTYTAFPPIAVKVVKTSAEKNYFQNANDERKPRVKKVLPEYELTVFRNKRSLDNTENNRETRRRVKAKRYLRSKDKTTKIHKKIGRGSRQKRNQKHDQQEKLISKRTKDNISYVYKKHKSNKRSGANLRVGKKRSTIDLKYTEKVRDADQKKKLIKRNKKLKRQRLFRKREDGKVGYRRLIAGRDAMIREYPYVVSIQKGREHWCAGALLNQRLVITTANCIWKSEHVSRMKVRAGTRHMDRKGQVAKIMEVVKHPLWNIRGGPDNDVGLLLLDRNIKFSDSVHSVDLPNRVMWPAFEDVWVTSWGSNRRDGVYDSISSTLQVYHAMLMSNEQCNNVTMRFGVPVTENFFCVTQTGRRAPCTVCVSQRDYRRIPPTPAAHRISKKLT-
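
Consistency check:

The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the original annotation pipeline. It assumes that the transcript is coding sequence only (it begins with ATG and ends with TAA), that the trailing "-" in the protein sequence marks the stop codon, and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical.

```python
def expected_cds_length(protein_aa: int, has_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with a stop codon."""
    codons = protein_aa + (1 if has_stop else 0)
    return 3 * codons


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
TRANSCRIPT_BP = 1266
PROTEIN_AA = 421
GENOMIC_START, GENOMIC_END = 142244, 148195

# 421 residues plus one stop codon should account for the whole transcript.
cds_ok = expected_cds_length(PROTEIN_AA) == TRANSCRIPT_BP

# The genomic locus is longer than the transcript; under the assumptions
# above, the difference is sequence not present in the transcript.
span = genomic_span(GENOMIC_START, GENOMIC_END)
non_transcript_bp = span - TRANSCRIPT_BP

print(cds_ok, span, non_transcript_bp)
```

Under these assumptions, 3 x (421 + 1) = 1266 matches the reported transcript length, and the locus spans 5952 bp on the scaffold, of which 4686 bp are not represented in the transcript (presumably intronic).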