Monarch geneset OGS2.0

DPOGS216083
TranscriptDPOGS216083-TA1242 bp
ProteinDPOGS216083-PA413 aa
Genomic positionDPSCF300415 - 81725-86742
RNAseq coverage3339x (Rank: top 4%)
Annotation
HeliconiusHMEL0074762e-3829.52% 
BombyxBGIBMGA007729-TA0.071.43% 
DrosophilaSpn5-PA4e-7936.46% 
EBI UniRef50UniRef50_G6D9U40.0100.00%Serine protease inhibitor 6 n=2 Tax=Obtectomera RepID=G6D9U4_DANPL
NCBI RefSeqNP_001103823.15e-18071.19%serine protease inhibitor 6 [Bombyx mori]
NCBI nr blastpgi|3640236350.075.24%seminal fluid protein CSSFP042 [Chilo suppressalis]
NCBI nr blastxgi|3640236353e-18075.24%seminal fluid protein CSSFP042 [Chilo suppressalis]
Group
Gene OntologyGO:00048672.5e-128serine-type endopeptidase inhibitor activity
KEGG pathwaybfo:BRAFLDRAFT_1307492e-50 
 K13963 (SERPINB)maps-> Amoebiasis
InterPro domain[35-413] IPR0002152.5e-128Protease inhibitor I4, serpin
[1-411] IPR0237962.6e-108Serpin domain
Orthology groupMCL14716 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216083-TA
ATGCTGAGATTCGGAGTTATTTTGTTACTCGGGTTTTCAACTACAGTGAACAGTCAATGCTTTACCAAAGATGACGCTTCAAAGAAACTAAATGCGGAGGCGAGAGCAACATTGTATAAAAATCAGTTGGAATTTACATTAAATCTCTTCAATGTGATCAACGAGGCTGTTCCTAATGACAACGTATTCTTCTCACCTTTCTCCGTATACCACGCCCTACTTCTTGGATATTTTGCTGCTGGTGGTCAAACAGAAAAAGCATTGAAAGAATCACTGCGCATCGCTGACACGCAGGACAAGGTGAACCTGTTGATGGCATACAAAGTGGACAAACATTTGCGGGCCGTGAACAACAACAGCGACAGCTATGAGTTCACAAACGTTAATAAGATGTTCGTTGACACAGCGCTTCAAGTCAGAGAATGCTTCAAAGATGCCTTTGGTGGCGAAATATCGGGCCTGAATTTCCACGATCACCCTGGAGTTGCTGTAGGACATATCAACGAGTGGGTGGCCCACGTCACTAAAAACAATATCAAAGACCTCATTCCTCCTTCTGGCGTAACCCAAGCAACCAAGCTCGTTCTCGCCAACGCAGCTTACTTCAAGGGCGTCTGGGCATCAAAATTCCAAGCTCAAAGCACCAAAAAACAAGTATTCTTCGTCTCGGAAACCCGTCAGACCCTCACACATTTTATGAGACAAAAAGGACAATTCCACTTCATGGTGAATGACGAGCTCGGAGCACAGATCCTGGAATTGCCTTACAAGGGCAATGACATCAGCATGTACATCTTATTGCCACCCTACTCTATGAAGGAAGGAGTAAACAACATTATAGCAAATTTGACGCCAGAGAGATTGGCCGCTGTGGTTGAAGAAGGTTATCTAGGAAGGGAGGTTGTGGTCGAAATTCCCAAATTTACCATCGAAAGGAGTCTACAACTTAGACCGATATTGGAAAGGCTGGGTGTTGGCGACTTGTTTAACGCGAGCTCAGACTTCAGCACCATGATGGAAGATCGTGGTGTCATTTTTGATGACGCCGTCCACAAAGCTAAGATACAAGTCGACGAAGAAGGTACGGTAGCGGCGGCAGCAACAGCTATATTCGGCTTCCGTTCCTCGAGACCAGCGGAGCCCACTGTGTTCATCGCCAACTTTCCATTCGTTTACATTATCTACGAGAAACCAACAAATTCTGTTCTATTCATGGGAGTGTTCAGAGACCCTAAGAAATAA

Protein sequence:

>DPOGS216083-PA
MLRFGVILLLGFSTTVNSQCFTKDDASKKLNAEARATLYKNQLEFTLNLFNVINEAVPNDNVFFSPFSVYHALLLGYFAAGGQTEKALKESLRIADTQDKVNLLMAYKVDKHLRAVNNNSDSYEFTNVNKMFVDTALQVRECFKDAFGGEISGLNFHDHPGVAVGHINEWVAHVTKNNIKDLIPPSGVTQATKLVLANAAYFKGVWASKFQAQSTKKQVFFVSETRQTLTHFMRQKGQFHFMVNDELGAQILELPYKGNDISMYILLPPYSMKEGVNNIIANLTPERLAAVVEEGYLGREVVVEIPKFTIERSLQLRPILERLGVGDLFNASSDFSTMMEDRGVIFDDAVHKAKIQVDEEGTVAAAATAIFGFRSSRPAEPTVFIANFPFVYIIYEKPTNSVLFMGVFRDPKK-