Monarch geneset OGS2.0

DPOGS210152
TranscriptDPOGS210152-TA1449 bp
ProteinDPOGS210152-PA482 aa
Genomic positionDPSCF300465 + 8074-10732
RNAseq coverage1513x (Rank: top 8%)
Annotation
HeliconiusHMEL0074762e-17268.39% 
BombyxBGIBMGA010216-TA2e-10244.39% 
DrosophilaSpn6-PA3e-4628.65% 
EBI UniRef50UniRef50_G9F9J54e-10448.79%Seminal fluid protein CSSFP043 n=1 Tax=Chilo suppressalis RepID=G9F9J5_9NEOP
NCBI RefSeqNP_001036857.12e-9943.85%serine protease inhibitor 12 [Bombyx mori]
NCBI nr blastpgi|3640236371e-10348.79%seminal fluid protein CSSFP043 [Chilo suppressalis]
NCBI nr blastxgi|3640236373e-10148.79%seminal fluid protein CSSFP043 [Chilo suppressalis]
Group
Gene OntologyGO:00048671.7e-98serine-type endopeptidase inhibitor activity
KEGG pathwaydmo:Dmoj_GI194206e-44 
 K13963 (SERPINB)maps-> Amoebiasis
InterPro domain[110-480] IPR0237968.1e-99Serpin domain
[116-482] IPR0002151.7e-98Protease inhibitor I4, serpin
Orthology groupMCL19354 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210152-TA
ATGAAGATAGCATATTGTTTGATAGTAATATTTGTAATTAAAACTCAGGCGCAATTTAATATTGTTAAATGGCCGCAACAATACAATTATAGAAGAGTACTGCCTCTCGCCGCAAACTTTAACAACGAGCCAGCTCAAAATAGTGGTCTAGATTCATACGACAGATATATAAACTCTTTACTATCAAATGAAAACAAAATTGGAGATGTAAATAATGCCAATGATAATATAAATAAAGTTCAACCAACACGCTACGACGACGACGAGTCTCCGATTGTTTCAAAACCGGTTAGATTCGTTGGAGTCGGTACAACTGATTTTGAACAACTGAACAATGTTACTTACGCCCTAACAGAATTTAGCGTCATCTTAATGAGAAGTGTTAACCAATTAATAAGAGGAAATGTTATAGTTTCTCCCACGTCTATTGCGACAGTTTTAGCTCTCCTACAACAAGGAACTTCGGGAGAGGCTCAGGATCAAATGACAAGAGCTCTTTCAATGCCACCTAACGCCACAGCGCCTACTTATCAAAGACTTACGTATGATATGAAGAAAAGAAATTCAAGAAACATACTGACTGTTTCCATCAATTTATACCTTGGAGAAGGTTTTCAAATAGAACCAGATTTCAAAGAGACCTCTGTTCAATATTTCGGTAGCGAAATAACAAATGTTGATTTCAGTAGACCTGGTCCAGCTGTTAGGCAAATAAATAAATGGATATCGATCCAAACTCATAACAGAATTGCCGAACTTCTACCGGAGAGTGCTGTTAGTTCCTCCACCCAAGTGCTAATAGCAAATGTGGTATATTTCAAAGGACTTTGGGAGACAAAATTTAAACCAGAATCCACGAGAGAGCTATTGTTCCACTTAAGTAGTGGCGAAAGTATTACTGTGCCATTTATGCGCATGCGCCATTCTTTTAGATATGGTATAGATGAAGAGAGTAATTCAGCTGTCGTTGTAATGCCATTTGAAAGATACCAGTACTCTCTTATAATTATATTACCACAGCATCAGTCTAATGTAGACAATATATTAAAATCACTATCAAATAATAAATTACTTAACTATCTTAAATTTGATGAAAGTGAAATCCAATTGGAAATTCCGAAATTTACCATAAAATCACACACTAATATGATACCAGTCTTGCAAAAGATGGGCATCACTGAAATTTTTTCACCGCAAGCTGATCTTACGCGTATTGGAACTTACCGGACGTATTCACCTAGAATATCTAGTGCTATACACACCGGTTATCTGTCAGTCGACGAGCAAGGTCTATCAGCAACAGCAGCTACAAGTTTTGCTGCAATCGCCTTATCATATGAGGATCCCCCTGCACTGTTCCAGGCTAATAGACCATTCTTAGCCGTATTATGGGACACTCAATTCGCTATTCCTTTGTTTATAGCCAAAATTGAAGACCCATCTAAGTGA

Protein sequence:

>DPOGS210152-PA
MKIAYCLIVIFVIKTQAQFNIVKWPQQYNYRRVLPLAANFNNEPAQNSGLDSYDRYINSLLSNENKIGDVNNANDNINKVQPTRYDDDESPIVSKPVRFVGVGTTDFEQLNNVTYALTEFSVILMRSVNQLIRGNVIVSPTSIATVLALLQQGTSGEAQDQMTRALSMPPNATAPTYQRLTYDMKKRNSRNILTVSINLYLGEGFQIEPDFKETSVQYFGSEITNVDFSRPGPAVRQINKWISIQTHNRIAELLPESAVSSSTQVLIANVVYFKGLWETKFKPESTRELLFHLSSGESITVPFMRMRHSFRYGIDEESNSAVVVMPFERYQYSLIIILPQHQSNVDNILKSLSNNKLLNYLKFDESEIQLEIPKFTIKSHTNMIPVLQKMGITEIFSPQADLTRIGTYRTYSPRISSAIHTGYLSVDEQGLSATAATSFAAIALSYEDPPALFQANRPFLAVLWDTQFAIPLFIAKIEDPSK-