Monarch geneset OGS2.0

DPOGS201831
TranscriptDPOGS201831-TA1386 bp
ProteinDPOGS201831-PA461 aa
Genomic positionDPSCF300191 - 957355-965312
RNAseq coverage1x (Rank: top 93%)
Annotation
HeliconiusHMEL0129975e-5644.80% 
BombyxBGIBMGA008281-TA4e-7251.20% 
DrosophilaCG5255-PA1e-2330.53% 
EBI UniRef50UniRef50_A5CG731e-7253.17%Chymotrypsinogen-like protein 3 n=9 Tax=Obtectomera RepID=A5CG73_MANSE
NCBI RefSeqXP_001607866.12e-2935.19%PREDICTED: similar to Chymotrypsin-2 (Chymotrypsin II) [Nasonia vitripennis]
NCBI nr blastpgi|3044436151e-7959.20%serine protease 33 [Mamestra configurata]
NCBI nr blastxgi|3044436157e-7859.20%serine protease 33 [Mamestra configurata]
Group
Gene OntologyGO:00038243.9e-43catalytic activity
GO:00042524.2e-29serine-type endopeptidase activity
GO:00065084.2e-29proteolysis
KEGG pathway 
InterPro domain[84-320] IPR0090033.9e-43Peptidase cysteine/serine, trypsin-like
[94-323] IPR0012544.2e-29Peptidase S1/S6, chymotrypsin/Hap
Orthology groupMCL16203 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201831-TA
ATGGACTACTTCGAAAAGTGGCACATAACCAGCTGCGCTCACTGCTTTACTAAAGAAGAATTATCTGTGGAAAAAGAGCTGTATTCAAGAAAGGAGATTTGGTTAGGGTCAAAAGAGCCTAAAAAGCTTGAGCCAAAATGTCAAGGTCCCTACCGTACTAAAAAGGTGCTTCCCAATGACAGGTTCGTAAAAGAAGACACTCCTATTACTATGAAAGCTCTACCTCTCAATGAGGCGGAAGACATGTCCGTCTTCTTCGACCACCCGGCAATAACACCATACATCGTCGGAGGATTGACTGCTGGCAAAGTTCCTCATATGGTGGCTCTGACCACCGGTGTCTTCACTAGATCCTTCACCTGCGGAGGTTCTCTGGTGACCAAGAAACACGTCCTCACTGCAGCACATTGCATTGAAGCTGTGTATAGTCGAGGATCTCTTTTGAGTTCTCTCCGTGGAATTGTCGGCACCAATCGCTGGAATTTTGGGGGAGTCCAACAACAATTCGCCTCAAACATTACGCACCCTAACTACGTCGGTTCCATCATCAAAAACGACATCGGTTTTCTGGTAACAGACGCCGAAGTATCTCTGAACGACAACATACAATTGGTACCAATCTCCTACGATTTCATTGAAGGTGAAGTAGCTGCTGTTATCCATGGATGGGGCAGAATCCGGACTGGTGGATCATTGTCACCAAATCTGTTGGAGCTCAAAACAAAGGTCATCGACGGCGAGCGTTGCGTCTCTGACGTGGCTCGTAGAAGTTCGGAAATTGGTATGAGGGTTCCACCAGTTCAACCAGATCTCGAAGTCTGTACTTTCCTAGCACTCAACTTTGGAAACTGTCATGGTGACTCCGGCAGTGCCCTCCTTCGTCAAAGTGACGGCCAGCAAATCGGTGTCGTGTCTTGGGGTCTTCCTTGTGCTCGCGGCGCACCCGATATTTCTCTCCGTGGAATTGTCGGCACCAATCGCTGGAATTTTGGGGGAGTCCAACAACAATTCGCCTCAAACATTACGCACCCTAACTACGTCGGTTCCATCATCAAAAACGACATCGGTTTTCTGGTAACAGACGCCGAAGTATCTCTGAACGACAACATACAATTGGTACCAATCTCCTACGATTTCATTGAAGGTGAAGTAGCTGCTGTTATCCATGGATGGGGCAGAATCCGGACTGGTGGATCATTGTCACCAAATCTGTTGGAGCTCAAAACAAAGGTCATCGACGGCGAGCGTTGCGTCTCTGACGTGGCTCGTAGAAGTTCGGAAATTGGTATGAGGGTTCCACCAGTTCAACCAGATCTCGAAGTCTGTACTTTCCTAGCACTCAACTTTGGAAACTGTCATGTAAGTTTGTCCACAATATATACATAA

Protein sequence:

>DPOGS201831-PA
MDYFEKWHITSCAHCFTKEELSVEKELYSRKEIWLGSKEPKKLEPKCQGPYRTKKVLPNDRFVKEDTPITMKALPLNEAEDMSVFFDHPAITPYIVGGLTAGKVPHMVALTTGVFTRSFTCGGSLVTKKHVLTAAHCIEAVYSRGSLLSSLRGIVGTNRWNFGGVQQQFASNITHPNYVGSIIKNDIGFLVTDAEVSLNDNIQLVPISYDFIEGEVAAVIHGWGRIRTGGSLSPNLLELKTKVIDGERCVSDVARRSSEIGMRVPPVQPDLEVCTFLALNFGNCHGDSGSALLRQSDGQQIGVVSWGLPCARGAPDISLRGIVGTNRWNFGGVQQQFASNITHPNYVGSIIKNDIGFLVTDAEVSLNDNIQLVPISYDFIEGEVAAVIHGWGRIRTGGSLSPNLLELKTKVIDGERCVSDVARRSSEIGMRVPPVQPDLEVCTFLALNFGNCHVSLSTIYT-