Monarch geneset OGS2.0

DPOGS215346
TranscriptDPOGS215346-TA1176 bp
ProteinDPOGS215346-PA391 aa
Genomic positionDPSCF300120 + 459976-461231
RNAseq coverage134x (Rank: top 56%)
Annotation
HeliconiusHMEL0139043e-0724.90% 
BombyxBGIBMGA007978-TA7e-5140.42% 
DrosophilaCG7142-PA2e-0922.27% 
EBI UniRef50UniRef50_B7PQ962e-0827.08%Putative uncharacterized protein n=1 Tax=Ixodes scapularis RepID=B7PQ96_IXOSC
NCBI RefSeqXP_001868791.12e-1028.18%trypsin 1 [Culex quinquefasciatus]
NCBI nr blastpgi|2607915203e-1028.94%hypothetical protein BRAFLDRAFT_78192 [Branchiostoma floridae]
NCBI nr blastxgi|16986667e-1229.09%early trypsin precursor [Culex quinquefasciatus]
Group
Gene OntologyGO:00038241.2e-21catalytic activity
GO:00042521.4e-13serine-type endopeptidase activity
GO:00065081.4e-13proteolysis
KEGG pathway 
InterPro domain[155-389] IPR0090031.2e-21Peptidase cysteine/serine, trypsin-like
[162-384] IPR0012541.4e-13Peptidase S1/S6, chymotrypsin/Hap
Orthology groupMCL16492 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215346-TA
ATGAGTGACGTGTGCAAAGAATTCAAAGGCTGGTTCAATTCATTCACTAGAACAGAGATCTATCTAACATTATTGCAAGCCGTGTTCCTCGTATCATTAATCACCATCGTAACGTTCCTGGTGCTTCATTTACTCGTGTGTTTGGAAGACGATCGTCGCGAAGACGTCACTGTCACGATCGATAACTTGACCACTACCGTGGAGCGCTCGTCCACCCACCATTATACACCTACAACAGAAAGATCGACGCTGTCGAAGGACGTGGAGTGCACCTGGACACCCTCCGCGAAGACCGAGGCCACTGCTCAGTATTACGAAGGAGGTGAAGTTATCACACTTCCTCCTCCGACTGTCACGCCCGATGGAAACACTCCTGGAGACGTGTCGACGGAGCAGTCAATTAAAACTGATGATGGAAATGAAACGATGGGGTCGGATGATGAACTCGCAGGTGATGTAAAGGCCGGTTGGCTTCTCGCTCTCGTTAAACTACAGCACCCTGGGGAGGTTCTGTTCGGTTGCACTCTCACCGTCGTCTCGAGTCATTGGACGCTCACTGCGGCCAGTTGTATAGAAGCCATAGAGGAAGTGGACACGCTGGATGACTTCGTGATGATGGACAGGCTCGGACGGGGTACGCGGGGCGCGATCCACGAGGTATCAGAGGTGATCGTGCATCCGCTGTACCAAGGTGTCCAGAAAAGTTACGACCTGGTCGCCATAAAATCTGTAGACAGACTCAGCGAAGACGGCGGCCGACACGTGAAGCTCGCCACCCTGTTGGAGGTCTCCCTCGTCACTCTGGGGGAAAGGCTCAGCGTTCTCGGCTTCGGCAAACTGAGTCTGTCGGTGGACCCCGGGGACCGCCGAGTCCGCGAGGTGTCCGTCTTCAAGGTGTCTCCGCGGCAGTGTTCGGGCCACGACACGTGGGCGGCTCGTCACCTGGGGCGTGCGGGCGCGGCGCGGCGTGACGCGGGCGGGCCGGGCGCGTTGTGCGTGGGGCGGGCGGGCGGCGGACGGGCCTGTCCGTGCGCGGGGGCTCCGCTGCTGTCTTCCGACATTTTGCTGGGTGTCATGAGCGACGACGGCGCCTGCGGGGTCTCCTGCGGCCCCACGCTCTACGTCAACATCGCGCTACAAAGAGATTGGCTGGACTCAGTTCTCGATGACGATTAA

Protein sequence:

>DPOGS215346-PA
MSDVCKEFKGWFNSFTRTEIYLTLLQAVFLVSLITIVTFLVLHLLVCLEDDRREDVTVTIDNLTTTVERSSTHHYTPTTERSTLSKDVECTWTPSAKTEATAQYYEGGEVITLPPPTVTPDGNTPGDVSTEQSIKTDDGNETMGSDDELAGDVKAGWLLALVKLQHPGEVLFGCTLTVVSSHWTLTAASCIEAIEEVDTLDDFVMMDRLGRGTRGAIHEVSEVIVHPLYQGVQKSYDLVAIKSVDRLSEDGGRHVKLATLLEVSLVTLGERLSVLGFGKLSLSVDPGDRRVREVSVFKVSPRQCSGHDTWAARHLGRAGAARRDAGGPGALCVGRAGGGRACPCAGAPLLSSDILLGVMSDDGACGVSCGPTLYVNIALQRDWLDSVLDDD-