Monarch geneset OGS2.0

DPOGS215527
Transcript: DPOGS215527-TA (1446 bp)
Protein: DPOGS215527-PA (481 aa)
Genomic position: DPSCF300467 + 49604-64439
RNAseq coverage: 1695x (Rank: top 8%)
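
The lengths and coordinates in this record can be cross-checked with a few lines of Python. The sketch below is illustrative only: the helper names `parse_position` and `cds_matches_protein` are hypothetical, and it assumes the genomic coordinates are 1-based and inclusive, which the record does not state.

```python
import re

def parse_position(text):
    """Split 'SCAFFOLD STRAND START-END' into scaffold, strand, start, end."""
    m = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", text.strip())
    if m is None:
        raise ValueError(f"unrecognized position: {text!r}")
    scaffold, strand, start, end = m.groups()
    return scaffold, strand, int(start), int(end)

def cds_matches_protein(transcript_bp, protein_aa):
    """True if the transcript length equals the coding length plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)

scaffold, strand, start, end = parse_position("DPSCF300467 + 49604-64439")

# Assumption: coordinates are 1-based and inclusive.
genomic_span = end - start + 1

print(scaffold, strand, genomic_span)
print(cds_matches_protein(1446, 481))
```

Under these assumptions the locus spans 14836 bp on the scaffold, far more than the 1446 bp transcript, which would be consistent with the gene containing introns. The 1446 bp transcript equals 3 x (481 + 1), so it appears to consist of coding sequence plus a stop codon only.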
Annotation
Heliconius: HMEL005534; 5e-110; 46.58%
Bombyx: BGIBMGA004226-TA; 8e-12; 24.28%
Drosophila: (no entry)
EBI UniRef50: UniRef50_UPI0002246E44; 2e-43; 29.35%; UPI0002246E44 related cluster n=1 Tax=unknown RepID=UPI0002246E44
NCBI RefSeq: NP_001154996.1; 5e-26; 32.72%; cysteine-rich/pacifastin venom protein 2 [Nasonia vitripennis]
NCBI nr blastp: gi|345487946; 7e-43; 29.35%; PREDICTED: hypothetical protein LOC100678556 [Nasonia vitripennis]
NCBI nr blastx: gi|345487946; 1e-54; 28.63%; PREDICTED: hypothetical protein LOC100678556 [Nasonia vitripennis]
Group
Gene Ontology: GO:0030414; 1.2e-11; peptidase inhibitor activity
KEGG pathway: (no entry)
InterPro domain: [399-432] IPR008037; 1.2e-11; Proteinase inhibitor I19, pacifastin
Orthology group: MCL17639; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215527-TA
ATGAAGTGCTCTGTGGTAGCATTTATTTTGTGTTTAGCTACGTTTTGTAGAGGCGGTGTTATAAAATGCACCCCCGGTTCAAAAGACGAAACCTGTGTCACGGATAATTTCGGGAAAGAATCGCTTGTCCGACAAAAACGTCAGGACAGTATCGTTGAAGAGTGCAGCCCTAGGACTGAATGGAAGAGCAAATGTCACAGATGTATTTGTTCGGATTCAGGACAAGCCCTGTGTTTCAAGATCGAGGGATGTCGCTCCGACAGTGATGACGTCAGCGACGATACGAGGGAATTAAGATCGAGTAAAACGAAAAGATCTAAGGTGTGTACTCCAAAAGCGACGTACAAAATAGAATGCAACACATGTCTTTGCTCCGACGACGGTGAAAGTTTTGTTTGCACTATGATGGGTTGTAGCGAGGAGGCTTCGAATATGGCTGATGTGGAACTGCTAGCAGATGGAGAGGAAAGAATTTCTAAGGAGCGGATAGATAAGCCGGGTCAGAACGTTTGCAAACCCCGGAGAACCTTCTACATCGGTTGCAACACCTGTTTATGTAACATTTATGGGTCGAACTACTCCTGCACCAATAAGCCCTGCCCTTTGCCTAAGGACGTCGAGATATTCCATGAATTGAAGTTCAGAAGATCGGTGGCACCGTCCAAACCGGTCGTGTGCGCGGCGAACCGAATGTTCATAAAAGACTGCAACACCTGCTGGTGCAATGAGGACGGTACGAGCTTTTTCTGCACGCGAAAAGTCTGTGTCGAGGAACTACCCGAAGAAGTCTCGGAGCCGGTGAAGATCCATGAAATTAACAGCACGTGCCGGCCTGATGAAGTGTTTGAGCTGGACTGTAACACTTGCCGCTGTAACCCTGATGGTTTGTCGTACTCTTGCACGAGACGAGCTTGTCCCATTGGAGGGGAGGAACTGCCGTTGAGAAGGAAGACGAGATCAACATCGCAGCAAACATCAAGAACCGCTGAAGTACCAAGAATAGCTCAGTTAATAAAGGGGACGCCGAAGAACTGCCAGCCGGGGCAGGAGTTCAGGATGGATTGCAACAAGTGTCTCTGTGATAACGAGGGGCAGAACTTCTCGTGCACTCGTATTGATTGCGCCGCACTGAACAGCAACGGCAACGGAGGCACCAGGGTTAGGAGGGAGGTGTCGACACGCGAGGAGTCGGGTTGTACTCCGGGTAGCGTGTTCACGCAGGACTGCAACACGTGCCGCTGCACCGAGGACGGGGGACACGCCACTTGCACCCTCAAACAATGCGTCAAACACGATACAGGATATGAACTGAACCAGCCGGAATCGGATCCCAACTTCCGTTGCAACCCGGGCGAGCAGTTCAAGAGGGACTGCAACGACTGCACTTGCAGCGCTAACGGCCGAGGCGTGTTCTGTACACTTCGCATCTGTGACTTTGAGATATAA

Protein sequence:

>DPOGS215527-PA
MKCSVVAFILCLATFCRGGVIKCTPGSKDETCVTDNFGKESLVRQKRQDSIVEECSPRTEWKSKCHRCICSDSGQALCFKIEGCRSDSDDVSDDTRELRSSKTKRSKVCTPKATYKIECNTCLCSDDGESFVCTMMGCSEEASNMADVELLADGEERISKERIDKPGQNVCKPRRTFYIGCNTCLCNIYGSNYSCTNKPCPLPKDVEIFHELKFRRSVAPSKPVVCAANRMFIKDCNTCWCNEDGTSFFCTRKVCVEELPEEVSEPVKIHEINSTCRPDEVFELDCNTCRCNPDGLSYSCTRRACPIGGEELPLRRKTRSTSQQTSRTAEVPRIAQLIKGTPKNCQPGQEFRMDCNKCLCDNEGQNFSCTRIDCAALNSNGNGGTRVRREVSTREESGCTPGSVFTQDCNTCRCTEDGGHATCTLKQCVKHDTGYELNQPESDPNFRCNPGEQFKRDCNDCTCSANGRGVFCTLRICDFEI-
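
The protein record above ends in `-`, which here appears to mark the stop codon at the end of the coding sequence. As a rough illustration of how the two FASTA records relate, the following sketch translates a nucleotide string with the standard genetic code and writes the stop as `-`. The function name `translate` and the `stop_char` parameter are hypothetical, and the use of the standard code is an assumption rather than something the record states.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds, stop_char="-"):
    """Translate a coding sequence; stop codons are written as stop_char."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    return "".join(residues).replace("*", stop_char)

# First five and last three codons of DPOGS215527-TA, copied from the record.
print(translate("ATGAAGTGCTCTGTG"))
print(translate("GAGATATAA"))
```

The two fragments translate to `MKCSV` and `EI-`, matching the start and end of DPOGS215527-PA. Passing the full transcript to the same function would allow a complete comparison against the protein record.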