Monarch geneset OGS2.0

DPOGS209257
TranscriptDPOGS209257-TA303 bp
ProteinDPOGS209257-PA100 aa
Genomic positionDPSCF300111 + 21199-21773
RNAseq coverage1446x (Rank: top 9%)
Annotation
HeliconiusHMEL0164893e-2857.89% 
BombyxBGIBMGA007039-TA6e-2054.02% 
Drosophila% 
EBI UniRef50UniRef50_D9HQA04e-2157.89%Seminal fluid protein HACP057 n=3 Tax=Nymphalidae RepID=D9HQA0_9NEOP
NCBI RefSeqNP_001037057.14e-2051.49%BCP inhibitor [Bombyx mori]
NCBI nr blastpgi|2999307452e-2057.89%seminal fluid protein HACP057 [Heliconius melpomene]
NCBI nr blastxgi|2999307454e-2357.89%seminal fluid protein HACP057 [Heliconius melpomene]
Group
Gene OntologyGO:00082345.9e-13cysteine-type peptidase activity
KEGG pathwaytad:TRIADDRAFT_203253e-07 
 K01374 (CTSO)maps-> Lysosome
InterPro domain[1-87] IPR0131285.9e-13Peptidase C1A, papain
[32-88] IPR0132014.6e-11Proteinase inhibitor I29, cathepsin propeptide
Orthology groupMCL11204 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209257-TA
ATGAAGTTCGTAGTTTGCGTATTTGTGCTTGCCGTAGTAGCAGTGGCACTAGGTGACAAGCCTCACTACGATTTAAACGACGCTCCCGCACTTTTCGACAAATTTGTTAAGGATTTCAATAGAAGTTACAAGGATGCAGCTGACAGAGAAGTACACTATCAGGCTTTTGTTAAAAGCTTGCAATCAATAAACGAAGCTAATGCCAGACCTTCCCCTACTGTTTATGATATTAACAACTTTGCCGATTACACGGATGAAGAACAAAACAATATGCGCGGTTTACTGTTACCAGAAAACGAGTAA

Protein sequence:

>DPOGS209257-PA
MKFVVCVFVLAVVAVALGDKPHYDLNDAPALFDKFVKDFNRSYKDAADREVHYQAFVKSLQSINEANARPSPTVYDINNFADYTDEEQNNMRGLLLPENE-