Monarch geneset OGS2.0

DPOGS202595
TranscriptDPOGS202595-TA3963 bp
ProteinDPOGS202595-PA1320 aa
Genomic positionDPSCF300363 + 53370-58638
RNAseq coverage133x (Rank: top 56%)
Annotation
HeliconiusHMEL0066620.056.09% 
BombyxBGIBMGA011587-TA2e-7871.71% 
DrosophilaCG43366-PB2e-15150.62% 
EBI UniRef50UniRef50_C0J8H60.057.66%Serpin-27 (Fragment) n=1 Tax=Bombyx mori RepID=C0J8H6_BOMMO
NCBI RefSeqXP_968019.12e-16958.03%PREDICTED: similar to CG14470 CG14470-PA [Tribolium castaneum]
NCBI nr blastpgi|1959720560.057.66%serpin-27 [Bombyx mori]
NCBI nr blastxgi|1959720560.058.37%serpin-27 [Bombyx mori]
Group
Gene OntologyGO:00048671.2e-146serine-type endopeptidase inhibitor activity
KEGG pathwayecb:1000502253e-15 
 K13963 (SERPINB)maps-> Amoebiasis
InterPro domain[854-1320] IPR0002151.2e-146Protease inhibitor I4, serpin
[871-1316] IPR0237964.7e-53Serpin domain
Orthology groupMCL25870 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202595-TA
ATGCGGGTTATTATATACGTGCTCGTTGCCGCTGTTTGCGTTAGCGCACAGAGTAATCAGATCAGGCGCACGAAGCTTCCGTACGTAGCTGACGCGGTTGGAAATAGTCTACCTCCTCCATCGAGTGGTCCAGCGACCGTCGGGAGCCCCGTGAACTTATTAACCCCAGATAAATATGAGTTTTATACTTTTGATGAATCTGGTGAACTGGTTAAAAGATTAATGACTTTAGAAGAAATACAAGCAATAGTAGCAGCAGGTGAAGAAACAGACGGTCTGGTTACTTTCGCTAATGATAATTTACCGACATTAACATTTAACTTTAGTCAACCTTCACAAACTAAAGTACACAATGTTGTTGCGAACGTTCAAAATGTTCTTAAAGCACAAATGGAGGCTCATAAAAATAAACCTTATATTCAACCAACGTTAGATACACCAGACGTGTCAGATTCTTGGAGTTTGATACTGCCATCAATATTTGGTAATACTGGAATGGACATAGTCCCAGATAAATTATCGGATTCTTTTATCACGCCAGAAACCGAAACAATGGATATTAATAGTTTCAATAAGGAAGGTGTGAATTCACAAAGTAAAGAAAGCAACGATATACCAATACATCCAAATTCAGAGACGAATAAAAATACTGGTATATTTGAAAGTACAACCGAAATACCTTCTACTACAAACAAATCTTTAGAACCAACTTCATCACCGATTACAACAACAACAACAACAACAACAACAACACAAAAACCCATTTCAACCACAAGACAGGAAATCTTAACTACGACTCAAAAGCTGTCCTCAACTACTGTAAGGCCGACTACAACTAGAAAAATTGACTTGGAAGCAAAACCGATGTATAAACCTAAAGAAAATTCGACAGATTCTACAAAAATTTCCAATTCTACTGAGGAATACAAAATTATGTCAACAAAACAACCATTTACTGCCATATCTAGTGAAGGTACGCCTCCCAAACAAGTACCTACAACACTAAGCCTAGGCGATAATTTTAATCAATCGGCTATTAAAAAATCTAATATTACCAAAACAAGTACCAATACTGTTGCAGACATAAAGCAATCTCACGAAGAAGGGAATTATATTCCTGTTTCTACAGTGTCATATATAACGGAAAGTTCTACTAAAAGAATTATATTGAATAATAAGTATCCCATGTCTTCTAATGAAAATTCTTTACAAATTAAAAAAACTACAGTCACCAGCACAAACAAACCTATTGTAAACAGAGACAAGGAATTTGAAAAACAAAATGCAACCACTATGACCACTCAATCTATTTCTCCTTCAACTTCTATATCTTCTATAGAATTTTCTTCAACGTTGAAACAAGAATCGCCAAGCAATTATGAACAAAAAATTCCGACACCTTTTATTGATACTCAAAATGATAATTCAGACTCTACCATACCTTTGTTTGATGTTGCACAAAGCTTAAGTCAAATAGCTTCAGATTTAAGTGGAAATTTTTCCCCCGTACCAACATCCACAAATTTATTAGAAACAACTAAAATTAACAATATAGATATCGAGAGTAAGGAAAATTTAGAAATAGATATACCAACGGAATCAAGCATCGAACTTGACGTCAACGCAAATAAAACTGAAGAAGAAGATAAGTTTGTAAAAATTTCTACTTTTGAACCTGCTGGAAATGGCGTGAATGTTAAAGATGAAGTTCTGAATTCATCCCATAACTTAAACGATTCATTAGAGCTTAAACCATCGATGACGTCATCTCCTCCATTGACTAACATGGACACTCTTTTATCAGAATCGATGGATAATCTTCTATCTCAAGTCGCTAACGAAGATCCCGATTCTACGACTGTTGTGTCGTCTGAAAATAACAATGAAAGTAATGATATAATTACTACAACATCTCTTGATTTGACTACAATTAACACATTTAATCTAGAAACGACGACAGAAAATTATCAGATAGAGACAACAACGACTTTGCCAATAACCAGCAATGATATTATAATGGAGAAGAATAATGATAATGACAAAATAAATAAAAAAGATACTATTCCAGATCCATTTTTACTATTAAATAAAACGTTACAAACTAAGACAAACAGTGTTCTTGCATCTGAATCTGTAACTACTGACACTATATCTACTCCCAAAGATAATGTTCAAGTGACCACTAGAGCTCCCATTAATCATTTACAACAAGAAACTTTAACAATGACCAATGATAACCAACAAACTATTAGTACAATTAAAATTACCAGTTCTGTAAACGATATTAACACAACTAACGTACCACATACGATGCAAGAATCAAATAAAAGCAGCAATATAAACAAACAACAAAAAATATCCGAGGGTTTACCTAAAATCGACGATTTTAAGAAAAAGATTCAAAAGGTCAATATTGACGACAAAGAGTTTTCTAATGAGAGAAACAGTTCCTGGAAACTGGTACCTACAGTAGTTAAATTAAGTGAACTTGCAAAGGACAAACAAAATGTCGAGGGATTTTACACTCCCGATAATGACAAAGATATAATTTTGGAATTTCCAAAGGAAAATCAAGGTTTAGAAGTGACCACGAAAGATTTACGTGACGACATAATGGAATTCACGGAGCTTTGCAATGAATTGGCTTTTAAATACTGGAATATCATGACGGAAAAGATAGATAAGAAACGTAGTATGGTATTCTCACCATATTCTATAACTTCTATGGCGGCCATGATGTTCATGGGAGCCAAAGGTTCTACATCAGGGGAAATGAATGAAGTGCTGAGACTTGATGACATGGTTACATTCAATCCTCACTTCACATTGAAGAACATCTCCGATTCCATAGACACAACACCCGCTTCGGGCGTCGCTGTATCAGTGTTCATAAGAGAACTGTACAGTGAAAGAAATAAGGGTAAATTTTTAACCTTCTATAAAGAGAGAGCTCAACATTTCTACAACGGACATGTGGAGGAAGTAAACTTTAAATTGATTAGCGATATAATACGACGAAGAACTAATTTACTCGTGAAAAGATATTCCTGGGGTAAAATTTCAGAATATATGAAAAGTAACAGCATTATTATGAACCCACCTCTAGCGGCTTTTGCAGCCAACATATTTTACACCGATTGCAACGGATCGTCCGTTGAAGGAAGGGATGGTGAGATGTATTTTGTTGTATCACCTAGTGTGAGACAGCGTCGCCTGGTGCCTGTACCAGCCGTGGTCTACCGTGGTAACTTCCTCGCTGGTTACGACCCCGTCCTCGATGCGACAGCAGCAGCTTTAGGTAACACGAAATCTATAATCAGCACTCTCTTCCTCATGCCGGGCCAGCAGGGGAACGTCGTTCAAGCTGATGATTTGGAGAATCTTGAGAAGAGATTATTGAAATCTGATCCTATCACACCAGCGTGGAACAGATTATTGCGTACCCTACTTCCTAGGTTTGGCTTGGAATTGCAAATACCTAGATTCATGCACAAATCCGTGTTCAACGTTTCATCGACATTGCAACGCATGGGATTAAAGGATTTGTTTAGTGAGGAACACGCTGACCTGGGTGGTTTGAACGGTCCGTCGAAGGACCTTTATCTCACTGATATGATTCAACAAACCTCATTCGCTACCTGCGGGGAAGGTCTCATTGGTGAGCAGCATCATATTGAGGAATATCCTGATACGATCGAAGTGAGATCGAAACGTAGGACGTCTAGATGGAACACAGGCTGGGCTGAGCCTAGAGATTACCAACGAGCTTTCCACGATCCCCATGACGCTGGTGAAGCGATGTACTTACCCCTGCATCTACGACCGAGGCAGGCCAGACTCCCCACCAGGAGTTCCCAACCAGCTAGATTAAAATTCGATCGACCTTTCCTATACTTCGTCAGACATAACCCATCTGGAATGATTCTTTATGTGGGCCGTTACAATCCCCGGCTCTTACCTTAA

Protein sequence:

>DPOGS202595-PA
MRVIIYVLVAAVCVSAQSNQIRRTKLPYVADAVGNSLPPPSSGPATVGSPVNLLTPDKYEFYTFDESGELVKRLMTLEEIQAIVAAGEETDGLVTFANDNLPTLTFNFSQPSQTKVHNVVANVQNVLKAQMEAHKNKPYIQPTLDTPDVSDSWSLILPSIFGNTGMDIVPDKLSDSFITPETETMDINSFNKEGVNSQSKESNDIPIHPNSETNKNTGIFESTTEIPSTTNKSLEPTSSPITTTTTTTTTTQKPISTTRQEILTTTQKLSSTTVRPTTTRKIDLEAKPMYKPKENSTDSTKISNSTEEYKIMSTKQPFTAISSEGTPPKQVPTTLSLGDNFNQSAIKKSNITKTSTNTVADIKQSHEEGNYIPVSTVSYITESSTKRIILNNKYPMSSNENSLQIKKTTVTSTNKPIVNRDKEFEKQNATTMTTQSISPSTSISSIEFSSTLKQESPSNYEQKIPTPFIDTQNDNSDSTIPLFDVAQSLSQIASDLSGNFSPVPTSTNLLETTKINNIDIESKENLEIDIPTESSIELDVNANKTEEEDKFVKISTFEPAGNGVNVKDEVLNSSHNLNDSLELKPSMTSSPPLTNMDTLLSESMDNLLSQVANEDPDSTTVVSSENNNESNDIITTTSLDLTTINTFNLETTTENYQIETTTTLPITSNDIIMEKNNDNDKINKKDTIPDPFLLLNKTLQTKTNSVLASESVTTDTISTPKDNVQVTTRAPINHLQQETLTMTNDNQQTISTIKITSSVNDINTTNVPHTMQESNKSSNINKQQKISEGLPKIDDFKKKIQKVNIDDKEFSNERNSSWKLVPTVVKLSELAKDKQNVEGFYTPDNDKDIILEFPKENQGLEVTTKDLRDDIMEFTELCNELAFKYWNIMTEKIDKKRSMVFSPYSITSMAAMMFMGAKGSTSGEMNEVLRLDDMVTFNPHFTLKNISDSIDTTPASGVAVSVFIRELYSERNKGKFLTFYKERAQHFYNGHVEEVNFKLISDIIRRRTNLLVKRYSWGKISEYMKSNSIIMNPPLAAFAANIFYTDCNGSSVEGRDGEMYFVVSPSVRQRRLVPVPAVVYRGNFLAGYDPVLDATAAALGNTKSIISTLFLMPGQQGNVVQADDLENLEKRLLKSDPITPAWNRLLRTLLPRFGLELQIPRFMHKSVFNVSSTLQRMGLKDLFSEEHADLGGLNGPSKDLYLTDMIQQTSFATCGEGLIGEQHHIEEYPDTIEVRSKRRTSRWNTGWAEPRDYQRAFHDPHDAGEAMYLPLHLRPRQARLPTRSSQPARLKFDRPFLYFVRHNPSGMILYVGRYNPRLLP-