Monarch geneset OGS2.0

DPOGS210713
Transcript: DPOGS210713-TA; 807 bp
Protein: DPOGS210713-PA; 268 aa
Genomic position: DPSCF300013 - 291179-299019
RNAseq coverage: 59x (Rank: top 68%)
Annotation
Heliconius: HMEL007540; 2e-52; 78.81%
Bombyx: BGIBMGA006324-TA; 3e-89; 71.92%
Drosophila: CG11836-PH; 8e-52; 40.32%
EBI UniRef50: UniRef50_A0NDR4; 1e-58; 43.01%; AGAP004569-PA n=2 Tax=Anopheles RepID=A0NDR4_ANOGA
NCBI RefSeq: XP_001845722.1; 5e-59; 42.86%; oviductin [Culex quinquefasciatus]
NCBI nr blastp: gi|347972166; 5e-58; 43.01%; AGAP004569-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|347972166; 1e-58; 43.01%; AGAP004569-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology: GO:0003824; 5.4e-76; catalytic activity
Gene Ontology: GO:0004252; 6.6e-74; serine-type endopeptidase activity
Gene Ontology: GO:0006508; 6.6e-74; proteolysis
KEGG pathway 
InterPro domain: [17-255] IPR009003; 5.4e-76; Peptidase cysteine/serine, trypsin-like
InterPro domain: [29-250] IPR001254; 6.6e-74; Peptidase S1/S6, chymotrypsin/Hap
InterPro domain: [56-71] IPR001314; 1.2e-12; Peptidase S1A, chymotrypsin-type
Orthology group: MCL29899; Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210713-TA
ATGTTACTCAGCTTGCTCCTCCTAGCTGTGCTCGCTGCAGCCAGCGCTTGCAAGACATGCACCTGTGGGGTGGCCCGAGGGGCTCGCGTGGTAGGGGGTGGACCCGTCACTGCCGGGGAATTCCCGTGGCTAGCCGCTGTCAAGAGAGACGGCAAACTAATCTGTGGAGCCACTGTCGTCGCTCGAGACCATCTAATAACAGCAACGCACTGTGTTTATGAAGTGGAAGCCTCCCGACTGACTGTGCTAGTGGGGGAATACAACGTTAACAAATCGAGATCCGAAGGCTACAGGGTGTCCCACGTCATCCAACATCCAGACTTCAACAGATACACATATGATAACGACATAGCTGTGCTGCGACTAGCGGAAGCATTGCCAGATCACCTTTATCGACCCGCGTGTCTACCTGACGATGAAGATGCATTAGAGGGCGTGGACGCTATTGTCTCTGGATGGGGAAGTACCGTGGAAAAGGGTCCGCCTTCAGATATTCCTATGAAGGCGGAAGTACAAATCTGGTCACAAGAGGCTTGCACGGGCGCGGGCTACGGCCGCAGGAAGGTGACCCCTCGCATGTTGTGTGCTAACGCTCCAGACAGGGACTCCTGTACCGGAGACTCTGGTGGGCCGCTGCTGATGACGCAACCACATTATACTGTTGTTGGCATAGTGTCGTGGGGTCGCGGTTGCGCCAGACAAGGCTACCCGGGCGTGTACGCGAGGGTCAATCACTTCATGCCGTGGTTGCGAGTGGCGCTGAGACACGCTTGTACATGCTCACCTCCGAATATATACAAGAAATGA

Protein sequence:

>DPOGS210713-PA
MLLSLLLLAVLAAASACKTCTCGVARGARVVGGGPVTAGEFPWLAAVKRDGKLICGATVVARDHLITATHCVYEVEASRLTVLVGEYNVNKSRSEGYRVSHVIQHPDFNRYTYDNDIAVLRLAEALPDHLYRPACLPDDEDALEGVDAIVSGWGSTVEKGPPSDIPMKAEVQIWSQEACTGAGYGRRKVTPRMLCANAPDRDSCTGDSGGPLLMTQPHYTVVGIVSWGRGCARQGYPGVYARVNHFMPWLRVALRHACTCSPPNIYKK-
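
Consistency check (illustrative sketch, not part of the original record): the listed lengths and the two sequences can be cross-checked by translating the transcript with the standard genetic code. The sketch below assumes that the standard code applies and that the trailing "-" in the protein sequence marks the stop codon. The names `translate`, `cds_start` and `cds_end` are hypothetical, and only the first 30 and last 15 nucleotides of DPOGS210713-TA are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; stop codons become '*'."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 15 nucleotides of DPOGS210713-TA, copied from the record.
cds_start = "ATGTTACTCAGCTTGCTCCTCCTAGCTGTG"
cds_end = "ATATACAAGAAATGA"

print(translate(cds_start))    # MLLSLLLLAV (matches the start of DPOGS210713-PA)
print(translate(cds_end))      # IYKK*      (matches the end, '*' for the stop)
print(807 == (268 + 1) * 3)    # True: 268 residues plus one stop codon
```

Under these assumptions, the 807 bp transcript corresponds to 269 codons: the 268 residues listed for the protein plus the terminal TGA stop codon.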