Monarch geneset OGS2.0

DPOGS200948
Transcript        DPOGS200948-TA  1365 bp
Protein           DPOGS200948-PA  454 aa
Genomic position  DPSCF300215 - 49857-61093
RNAseq coverage   1378x (Rank: top 9%)
Annotation
Heliconius       HMEL007479        5e-42   47.18%
Bombyx           BGIBMGA010212-TA  8e-134  54.55%
Drosophila       Spn27A-PA         1e-59   36.30%
EBI UniRef50     UniRef50_Q2F5W3   2e-131  54.55%  Serine protease inhibitor serpin n=4 Tax=Ditrysia RepID=Q2F5W3_BOMMO
NCBI RefSeq      NP_001040318.1    3e-132  54.55%  serine protease inhibitor 3 [Bombyx mori]
NCBI nr blastp   gi|27733415       7e-146  56.89%  serpin 3a [Manduca sexta]
NCBI nr blastx   gi|27733415       7e-141  56.89%  serpin 3a [Manduca sexta]
Group
Gene Ontology    GO:0004867  1.6e-108  serine-type endopeptidase inhibitor activity
KEGG pathway     aga:AgaP_AGAP005246  4e-43
                 K13963 (SERPINB) maps-> Amoebiasis
InterPro domain  [78-448]  IPR000215  1.6e-108  Protease inhibitor I4, serpin
                 [1-444]   IPR023796  3.7e-95   Serpin domain
Orthology group  MCL10191  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200948-TA
ATGAAGTTATTGCTTTTGATATTTTTGGTACCCTTAACGATATGCTCGGCTAATATCCCGATTTCGCCGGATCTTCTAGAAACTGTCTTTGGTCACCCGGATAATGCGACAAATGATTTGAAGCCAGCTGTTTCTCAGGCCTTACCTGTAGCTCCGGGTTCAGTCAACCCAGCTTATCACCCGACTTACCCAAAACCTATTGAGCAGTATTACCCAGCGTTAGCTGAATACGATAAGTTTGATTGGACGCTTACAAAGCGAGTAGCTTCAAGTTCTCAAGAAAATTTCCTGTTATCTCCGATGGGTGTAAAACTTGCGATGGCCATTCTGATGGAAGCCTCTACAGGATCTACTCATGCAGAGCTATCATCGGTTCTCGGTTTTGATAATGACCGTCAAACAGTGCGGAGGAAATTTGGCTATATACTAAACACATTAAAAACTCAATCATCACTAAACGTTCTAGACCTGGCCAGTAGAATATACGTAGCAGAAAACATAGCAACCAGCCAACATTTCTCAGCGATCGCCGAAACTTTCTATAAGACTGAAATTAAAAACATTAACTTCGACCATCCCGTCAAAGCTGCCTTCGACATTAACCAGTGGATCAACATAACCACGCACGGCAGGATTCCCGGCCTTGTTAATGCAGATGACGTGTACAAGGCTTCAGCTTTCATCCTAAACACAATATTCTTCGAAGGAACATGGCTCCATCAATTCGCGCCGAATGTCACTAAGCCGGACTTTTTCTATTTATCAGCTACGTCCAAAAAGGAAACACCTTTCATGAACATCAGAGACAAGTTCTACTTCGCTGAATCTTCTAAATTCAATGCCAAAATATTAAGAATGCCATATTTGGGTAATGCATTTGCAATGTACATCGTAGTACCGAACACTCTAACGGGTATAGTCAACGTGTTCAATGATCTGAGTGACTTACGATCAGAACTTTACTATCTCCAAGAATACACTGTCGACGTAACTCTACCAAAATTCAAGTTTGAATATACCTCGCAACTGGATGGCATTCTCAAAGAAATGGGAATCAGACAAGTCTTCGAAGATACCGCATCTCTCCCCGGTATTTCTAGAGGACAGAATTTAAATCAAAGACTAAAAGTGTCCAGGGTTATACAACGTTCGGGAATTCAAGTAAACGAACTAGGCAGCATAGCGTATTCTGCTACAGAAGTAGCCCTGGAGAACAAATTCGGAGGGGCATCTGAATACAAGGCCGAGGTGGTTGCAAACAAACCTTTCCTATTCTTCATACAGGACGAAACAACGAAACAGTTGCTATTCACAGGAAGAGTGTCTGACCCCGCGCTGGTCGATGGCGTGTTGAAATTACCATAA

Protein sequence:

>DPOGS200948-PA
MKLLLLIFLVPLTICSANIPISPDLLETVFGHPDNATNDLKPAVSQALPVAPGSVNPAYHPTYPKPIEQYYPALAEYDKFDWTLTKRVASSSQENFLLSPMGVKLAMAILMEASTGSTHAELSSVLGFDNDRQTVRRKFGYILNTLKTQSSLNVLDLASRIYVAENIATSQHFSAIAETFYKTEIKNINFDHPVKAAFDINQWINITTHGRIPGLVNADDVYKASAFILNTIFFEGTWLHQFAPNVTKPDFFYLSATSKKETPFMNIRDKFYFAESSKFNAKILRMPYLGNAFAMYIVVPNTLTGIVNVFNDLSDLRSELYYLQEYTVDVTLPKFKFEYTSQLDGILKEMGIRQVFEDTASLPGISRGQNLNQRLKVSRVIQRSGIQVNELGSIAYSATEVALENKFGGASEYKAEVVANKPFLFFIQDETTKQLLFTGRVSDPALVDGVLKLP-
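
The transcript and protein lengths in this record are consistent with each other: 1365 bp is 455 codons, or 454 residues plus a stop codon, which the protein sequence shows as a trailing "-". The sketch below shows one way to check this kind of consistency for a FASTA record. It assumes the standard genetic code and a coding sequence that starts at the first base; the function names (read_fasta_sequence, translate, expected_protein_length) are hypothetical helpers written for this illustration and are not part of any database tooling.

```python
# Standard genetic code with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def read_fasta_sequence(text):
    """Return the sequence of a single-record FASTA string, header removed."""
    lines = [line.strip() for line in text.splitlines()]
    return "".join(line for line in lines if line and not line.startswith(">"))


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1.

    Stop codons are written as stop_symbol (this record uses "-").
    Codons containing ambiguous bases are written as "X".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(cds_length):
    """Residue count for a CDS that ends in exactly one stop codon (assumption)."""
    return cds_length // 3 - 1


# First four codons of the transcript above, followed by a stop codon.
demo = ">demo\nATGAAGTTATTG\nTAA\n"
print(translate(read_fasta_sequence(demo)))  # MKLL-
print(expected_protein_length(1365))         # 454
```

To check the full record, pass the DPOGS200948-TA sequence to translate and compare the result with the DPOGS200948-PA sequence.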