Monarch geneset OGS2.0

DPOGS213371
Transcript: DPOGS213371-TA | 1707 bp
Protein: DPOGS213371-PA | 568 aa
Genomic position: DPSCF300109 + 105220-108473
RNAseq coverage: 24x (Rank: top 78%)
Annotation
Heliconius: HMEL007594 | 3e-48 | 30.53%
Bombyx: BGIBMGA013849-TA | 1e-41 | 29.33%
Drosophila: Spn77Ba-PB | 2e-19 | 26.29%
EBI UniRef50: UniRef50_Q6Q2D3 | 1e-46 | 31.40% | Serpin-5A n=3 Tax=Obtectomera RepID=Q6Q2D3_MANSE
NCBI RefSeq: NP_001037205.1 | 2e-49 | 30.26% | serine protease inhibitor 5 [Bombyx mori]
NCBI nr blastp: gi|112984548 | 3e-48 | 30.26% | serine protease inhibitor 5 precursor [Bombyx mori]
NCBI nr blastx: gi|45594232 | 1e-48 | 31.40% | serpin-5A [Manduca sexta]
Group
Gene Ontology: GO:0004867 | 1.9e-26 | serine-type endopeptidase inhibitor activity
KEGG pathway: ecb:100050225 | 4e-19
    K13963 (SERPINB) maps-> Amoebiasis
InterPro domain: [194-562] IPR023796 | 4.4e-51 | Serpin domain
    [190-561] IPR000215 | 1.9e-26 | Protease inhibitor I4, serpin
Orthology group: MCL23359 | Specific divergent

Nucleotide sequence:

>DPOGS213371-TA
ATGATTGTGAGTGGTGTTGTAGAATTTAGTGGAGCCTGGTCCCTGCCTTTCCATGAAATCAATACTGTTTTAGAAGACGGTCATAAAGGTAATGGAAAAGTATATATGATGCATCAATGGGCTAATGTAAGATACGCTGATATTGAACAGCTAGGAGCATCGGTTTTAGAACTACCATACGGAGAAGACAAAGAATATAGTATGATTATAATGAGACCAGAGGGAGACTTGAGCGTCACGAATGTACTGGAAAACTTCGCTAAAATAGACTTTTTAGAAGTTATTCATCGATTATACAGTGAAGGACTACAAGAAATAGAAGTCAAACTACCAAAATTCTCTCTAACATCGTCTTTATTACTAAATGGACCTTTGAACGCTATGGAGATGAGCAGCATATTTTCACGGGTTCGAGCAAATATTTTTAAATACAGCAAAAGTAATATGTACATATCTGCGGTGGAGCATAGAACTAAGGTCATGGTTGCTGAAGCGGGAACCGTCGCAACTGCATCAATACCAGGTAACCTGACAAAAGACACTGAGACCTATGGACTTGGAACTGTCGAAGAGACAGTTGCGGTACAGTTCAGTGAAAAAGTTAGAAATTTAAGCCTAGAACTGTTTTATTATAGTGAAAAAGAAAATAATAGTTCTGTAGCAATGGCGCCCTTTACAATATTTCAATTAATATCTCTAGTGGCGTTCAGGTCGGGAGGTGATACTTGGAAACAGCTGCAATCAATTTTCGGAATACCTAAAGAAAATGGAAAACAGTTTAACTCTCTGTTCATGTTTGTAAACGATTTGCTAGTTCAACAAAAGCCGGGAAGAATTTTAAAGAACATACAAGTCATATTCTTCGACACTGACATAGGGCCATATATACGGAAGATTTTCGTTCGCAATGTCCTAAACGCTGGAGTAAGTATGGTAAAACTCAATTTTGATGATAATATCCTGGCTGCAGAATCAGCTAATAACTTTATTCGCTTGTCCGTCACTTCGTCCCCAACGAATGTCGTATTCGACAAAACTGATTTTGAAGAAACATCAATGATTGTGAGTGGTGTTGTAGAATTTAGTGGAGCCTGGTCCCTGCCTTTCCATGAAATCAATACTGTTTTAGAAGACGGTCATAAAGGTAATGGAAAAGTATATATGATGCATCAATGGGCTAATGTAAGATACGCTGATATTGAACAGCTAGGAGCATCGGTTTTAGAACTACCATACGGAGAAGACAAAGAATATAGTATGATTATAATGAGACCAGAGGGAGACTTGAGCGTCACGGATGTACTGGAAAACTTCGCTAAAATAGACTTTTTGGAAGTTATTCATCGATTATACAGTGAAGGACTACAAGAAATAGAAGTCAAACTACCAAGGTTTTCAATTACTTCGGCTTTAATGTTAGACGGTCCCTTAAAGTCTATGGGAGCTAAAAACGGGTTCTTGATGAATCTCGCGGACTTTTCGGGAATATCGTCCGAAGACATTTATATTTCTGCTTTGGAGCAAAGAACAACAGTGATGGTCGGGGCGGGGAGGACAGTGATAATGGCACACACGCCCGGCAATTTTGCTCACAAAACAAGATATTCATCAAATGAAATGGGCCAGCCATTTTTGTTCTTTATAGTGCATCGAAAATATATGAGTATTGTCTTCTGTGGGAGATATGGACGGAAAGATTTTGATTGA

Protein sequence:

>DPOGS213371-PA
MIVSGVVEFSGAWSLPFHEINTVLEDGHKGNGKVYMMHQWANVRYADIEQLGASVLELPYGEDKEYSMIIMRPEGDLSVTNVLENFAKIDFLEVIHRLYSEGLQEIEVKLPKFSLTSSLLLNGPLNAMEMSSIFSRVRANIFKYSKSNMYISAVEHRTKVMVAEAGTVATASIPGNLTKDTETYGLGTVEETVAVQFSEKVRNLSLELFYYSEKENNSSVAMAPFTIFQLISLVAFRSGGDTWKQLQSIFGIPKENGKQFNSLFMFVNDLLVQQKPGRILKNIQVIFFDTDIGPYIRKIFVRNVLNAGVSMVKLNFDDNILAAESANNFIRLSVTSSPTNVVFDKTDFEETSMIVSGVVEFSGAWSLPFHEINTVLEDGHKGNGKVYMMHQWANVRYADIEQLGASVLELPYGEDKEYSMIIMRPEGDLSVTDVLENFAKIDFLEVIHRLYSEGLQEIEVKLPRFSITSALMLDGPLKSMGAKNGFLMNLADFSGISSEDIYISALEQRTTVMVGAGRTVIMAHTPGNFAHKTRYSSNEMGQPFLFFIVHRKYMSIVFCGRYGRKDFD-