Monarch geneset OGS2.0

DPOGS209252
Transcript: DPOGS209252-TA; 525 bp
Protein: DPOGS209252-PA; 174 aa
Genomic position: DPSCF300111 - 107285-109410
RNAseq coverage: 1164x (Rank: top 11%)
Annotation
Heliconius: HMEL014517; 3e-44; 51.16%
Bombyx: BGIBMGA007039-TA; 4e-16; 50.65%
Drosophila: (no entry)
EBI UniRef50: UniRef50_D9HQA0; 5e-19; 54.35%; Seminal fluid protein HACP057 n=3 Tax=Nymphalidae RepID=D9HQA0_9NEOP
NCBI RefSeq: NP_001037057.1; 1e-15; 47.25%; BCP inhibitor [Bombyx mori]
NCBI nr blastp: gi|299930745; 1e-18; 54.35%; seminal fluid protein HACP057 [Heliconius melpomene]
NCBI nr blastx: gi|299930745; 1e-21; 54.95%; seminal fluid protein HACP057 [Heliconius melpomene]
Group
Gene Ontology: GO:0008234; 1.4e-21; cysteine-type peptidase activity
KEGG pathway: tad:TRIADDRAFT_20325; 4e-06
    K01374 (CTSO) maps-> Lysosome
InterPro domain: [80-171] IPR013128; 1.4e-21; Peptidase C1A, papain
    [111-167] IPR013201; 4.1e-17; Proteinase inhibitor I29, cathepsin propeptide
Orthology group: MCL11204; Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209252-TA
ATGAAGAATTACAATCGTCACTACAAGAACGAGGCTGACAAAGAAGTTCATTATCTAGCTTTTGTTGAAACTTTGAAAGTTATCAACAGACGGAATGCGTTACCCCATAGCGATACACATGACATTAACAAATTTTCTGACTACACGCCGGAAGAATTGAAAAAAATATATATGGCTGTAATCCTCCAGTCTCATAAGCTAGCTCTCGTCTGTCCACAACATACCTACGTTTTCATAATGAAGGCCGTTATTTGTTTATTTTTCATCGCACTTATTGCAATTTCTAATGGAGACAAGCCACATTACGATATTAATAAAGCTCCTCAATTATTCGAGTTGTTTATGAAGAATTACAATCGTCACTACAAGAACGAGGCTGACAAAGAAGCTCATTACCAGGCATTTGTTGAAAATCTGAAAACTATCAACAGACTGAATGCGTTACCCCATAGCGCTACACATGACATTAACAAATTTTCTGACTACACGCCGGAAGAACTGAAACAAATTCATGATAAGAATTAG

Protein sequence:

>DPOGS209252-PA
MKNYNRHYKNEADKEVHYLAFVETLKVINRRNALPHSDTHDINKFSDYTPEELKKIYMAVILQSHKLALVCPQHTYVFIMKAVICLFFIALIAISNGDKPHYDINKAPQLFELFMKNYNRHYKNEADKEAHYQAFVENLKTINRLNALPHSATHDINKFSDYTPEELKQIHDKN-
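
The nucleotide and protein records above can be cross-checked by translating the coding sequence. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and the record's convention of writing the stop codon as "-". The function name translate is hypothetical and is not part of the geneset tooling.

```python
# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol to match the record above.
    Raises ValueError if the length is not a multiple of three.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First ten codons of DPOGS209252-TA.
print(translate("ATGAAGAATTACAATCGTCACTACAAGAAC"))  # MKNYNRHYKN
```

Applied to the full 525 bp transcript, this yields 175 codons: 174 residues plus the terminal stop, which is consistent with the 174 aa length listed for DPOGS209252-PA.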