Monarch geneset OGS2.0

DPOGS201679
Transcript        DPOGS201679-TA   2274 bp
Protein           DPOGS201679-PA   757 aa
Genomic position  DPSCF300103 + 497020-505267
RNAseq coverage   215x (Rank: top 45%)
Annotation
Heliconius      HMEL011928        9e-156  81.63%
Bombyx          BGIBMGA005381-TA  1e-143  82.93%
Drosophila      dbe-PA            4e-131  67.73%
EBI UniRef50    UniRef50_Q13601   3e-110  65.81%  KRR1 small subunit processome component homolog n=204 Tax=root RepID=KRR1_HUMAN
NCBI RefSeq     NP_477240.1       1e-129  67.73%  dribble [Drosophila melanogaster]
NCBI nr blastp  gi|350408959      3e-129  70.03%  PREDICTED: KRR1 small subunit processome component homolog [Bombus impatiens]
NCBI nr blastx  gi|91081317       3e-133  68.30%  PREDICTED: similar to dribble CG4258-PA [Tribolium castaneum]
Group
Gene Ontology    GO:0003824  6.8e-82  catalytic activity
                 GO:0004252  1e-73    serine-type endopeptidase activity
                 GO:0006508  1e-73    proteolysis
KEGG pathway 
InterPro domain  [493-748]  IPR009003  6.8e-82  Peptidase cysteine/serine, trypsin-like
                 [507-743]  IPR001254  1e-73    Peptidase S1/S6, chymotrypsin/Hap
                 [538-553]  IPR001314  3.9e-11  Peptidase S1A, chymotrypsin-type
Orthology group  MCL15003  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201679-TA
ATGGATGATCAGGAAATTGAAGAAGAGTCTCCCAATACTGGGCCGGTAGAAAATGCTTGGGCTATGAAAATTCCTAAATTCACACAAGAGGATAATCCACATGGTCTCTTAGAAGAAAGCAAATTTGCTACACTGTTTCCCAAATATCGTGAACAATATCTAAAAGAATGCTGGCCACTGGTTCAAAAAGTATTGAAAGCACATCACATTGTTGCAGAGTTGGATCTTATTGAGGGAAGCTTAACAGTAAAAACAACAAGGAAAACTTGGGATCCTTACATTATTATTAAAGCGAGGGATTTTATGAAATTACTTTCCAGAAGTGTACCATTTGAACAAGCTGTAAGGGTATTGGATGATGAAATTGGATGTGACATTATCAAAATTAATTCTTTCGTTAGCAAAAAAGAAACATTCTTGAAAAGGCGTCAAAGATTGATAGGCCCTAATGGTGTCACTTTAAAATCAATAGAGTTGCTAACAGATTGCTATGTATTAGTACAAGGAAATACAGTTTCTACAGTTGGACCATATAAAGGTTTACTACAAGTTAGAAGAATAGTTGAAGATACTATGAAAAATATTCATCCAATGTACAATATTAAAAATCTTATGATTAAACGGGAGCTCATGAAGGACCCTAAATTAAAAAATGAAAGCTGGGATAGATTCCTACCCAAATTCAAAAGTAAAAATGTACCTAGGAAACAACCTAAAAATAAAATCAAAAAGAAACCATACACTCCCTTCCCGCCGCCCCAACCTGAAAGTAAAATTGACAAAGAGCTGGCTTCAGGTGAATACTTTCTCAAAGACGAGCAAAAGAAAGCTAAACGTCACCATGAGAAGGAAGAGAAACAGATGTTGGCAAAGAAAGCAAGACAAGAAGAAAGAAAGAAAGATTTCATTCCTCCGACTGAACCTGCTTCATCACATAATAAAGTATCTGAACAATCAACGGTAGACATTAATCAATTTAAGGCAAAAATGAAGAAAGTTTCAAAACAAAACAAAGCTCTTAATAAAAAAGGATACTCAATCAAAAGCAATCGACGGCATCCCTTTGCTACCTGTGGCTATAGTGGTTTCAATGAGATTGTTTGTTGTCCGGACGATCAAAAACTTATGATTGTAGCATCAATAAGGAAATATCCTGAGAAATATACAAATATAGTGAAATTTGCAGATAATTTTGTTGATGACAAATGCAAACCCAATTTAGAAATTAATGATGGAACATGTAAGCTGATAACTGATTGTGAGGTCGCTCGAAGCTCAATCAAAAGCAACCGACGGCATCCCTTTGCCACTTGTGGCTATAGTGGTTTCAATGAGATTGTTTGTTGTCCAGACGAACAAAAAGTTATGGTAGTAGCATCAATAAGAAAATATCCTGAAAAATATCCAAATATAGTGAAATTTGCAGATAATTTTGGTGATGAAAACAATAGATCATTTGTAGTAGCTGATAGGGCCTGTGAAGAAATAACAAAAAATCGGCTACCTCCTCTTGGGCTACAAATAATTAATGGTGTTGAAGCTTCTCTCGGAGAATTCCCACACATGGTGGCTTTAGGATACGGTGGACCGGATGTGTATGAGTTCAATTGTGGTGCTTCGCTGTTGTCAGAGCTGTATGCATTAACAGCAGCGCATTGCGTCGACACGCTCAATCAAATTAAACCAACTATAGCCCGTATGGGTGTCGTTGAACTTGATGAGCAGACATTTAATCCGAACACTGATCACAGGATAGCAGATATATTGATACACCCAGGTTACTCCAGACGCACTAAATATCATGACTTGTCATTAGTGAGGCTAGAGAGACCAGTGCAATTTGACCCTTTCTTAAGTCCAATTTGCTTGCACACGCGTTTTCAAGATCCATTTGAAAGTCTTACTGTAACCGGATGGGGGACAACTAGTAGTTCAAGGCTGACAAGGAGTACAACCTTGATGAAAGCCGACGTGACTGTTGTTTCAAGGAGCGAATGTAA
TAAATCATTCATTAACTGGCCGAAATTGCCCCGTGGCATAATTGACGGACAAATATGCGCAGGAGACACGAGATCGGATACTTGTTATGGTGACTCTGGTGGTCCAATGCAATATCCTAATGACTACGACGGCCAGTACCGTCTCGTGGGTGTGACGTCATTCGGTCGCGGATGCGGAACAGCAATGCCGGGTGTCTATACACGTGTAGCATACTACATTAACTGGATAGAAAACATAGTATGGCCGGCTGGTTTAAACACTTGGACAAGCTGA

Protein sequence:

>DPOGS201679-PA
MDDQEIEEESPNTGPVENAWAMKIPKFTQEDNPHGLLEESKFATLFPKYREQYLKECWPLVQKVLKAHHIVAELDLIEGSLTVKTTRKTWDPYIIIKARDFMKLLSRSVPFEQAVRVLDDEIGCDIIKINSFVSKKETFLKRRQRLIGPNGVTLKSIELLTDCYVLVQGNTVSTVGPYKGLLQVRRIVEDTMKNIHPMYNIKNLMIKRELMKDPKLKNESWDRFLPKFKSKNVPRKQPKNKIKKKPYTPFPPPQPESKIDKELASGEYFLKDEQKKAKRHHEKEEKQMLAKKARQEERKKDFIPPTEPASSHNKVSEQSTVDINQFKAKMKKVSKQNKALNKKGYSIKSNRRHPFATCGYSGFNEIVCCPDDQKLMIVASIRKYPEKYTNIVKFADNFVDDKCKPNLEINDGTCKLITDCEVARSSIKSNRRHPFATCGYSGFNEIVCCPDEQKVMVVASIRKYPEKYPNIVKFADNFGDENNRSFVVADRACEEITKNRLPPLGLQIINGVEASLGEFPHMVALGYGGPDVYEFNCGASLLSELYALTAAHCVDTLNQIKPTIARMGVVELDEQTFNPNTDHRIADILIHPGYSRRTKYHDLSLVRLERPVQFDPFLSPICLHTRFQDPFESLTVTGWGTTSSSRLTRSTTLMKADVTVVSRSECNKSFINWPKLPRGIIDGQICAGDTRSDTCYGDSGGPMQYPNDYDGQYRLVGVTSFGRGCGTAMPGVYTRVAYYINWIENIVWPAGLNTWTS-
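
Consistency check (illustrative):

The listed lengths agree with each other: 757 amino acids plus one stop codon gives 758 codons, or 2274 bp. The sketch below checks that arithmetic and translates the first and last 15 bases of DPOGS201679-TA. It assumes the standard genetic code (NCBI translation table 1) and writes the stop codon as "-", matching the protein record above. The function names `translate` and `expected_cds_length` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop: str = "-") -> str:
    """Translate a coding sequence in frame 0; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))
    return protein.replace("*", stop)


def expected_cds_length(protein_aa: int) -> int:
    """Length in bp of a coding sequence: one codon per residue plus a stop codon."""
    return 3 * (protein_aa + 1)


# Lengths as listed in the record.
print(expected_cds_length(757) == 2274)

# First and last 15 bases of the transcript, copied from the record.
print(translate("ATGGATGATCAGGAA"))
print(translate("ACTTGGACAAGCTGA"))
```

Running `translate` on the full nucleotide sequence (with line breaks removed) should reproduce the protein sequence if the record is internally consistent.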