Monarch geneset OGS2.0

DPOGS203610
Transcript: DPOGS203610-TA; 2340 bp
Protein: DPOGS203610-PA; 779 aa
Genomic position: DPSCF300063 + 77915-103702
RNAseq coverage: 31x (Rank: top 75%)
Annotation
Heliconius: HMEL003448; 0.0; 85.26%
Bombyx: BGIBMGA007277-TA; 0.0; 95.39%
Drosophila: rho-5-PA; 3e-156; 40.56%
EBI UniRef50: UniRef50_D6WXI7; 0.0; 53.80%; Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WXI7_TRICA
NCBI RefSeq: XP_970266.1; 0.0; 53.32%; PREDICTED: similar to rhomboid [Tribolium castaneum]
NCBI nr blastp: gi|270012402; 0.0; 53.80%; hypothetical protein TcasGA2_TC006551 [Tribolium castaneum]
NCBI nr blastx: gi|270012402; 0.0; 54.58%; hypothetical protein TcasGA2_TC006551 [Tribolium castaneum]
Group
Gene Ontology: GO:0004252; 4.1e-111; serine-type endopeptidase activity
               GO:0006508; 4.1e-111; proteolysis
               GO:0016021; 4.1e-111; integral to membrane
KEGG pathway:
InterPro domain: [283-739] IPR002610; 4.1e-111; Peptidase S54, rhomboid
                 [570-708] IPR022764; 2.5e-33; Peptidase S54, rhomboid domain
Orthology group: MCL10640 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203610-TA
ATGTTTCCGTCACCTAGTTGGCGATCTAGTGCTGGGCTGCTGGCCTCCCAACTCACTTGCCAGAGTTCACCTGGACCACCTCCTTCAGCACCCCCAATCCCCCAACCAACCTCCCCTCCTCGAATCGTCTCAGCACCCCCACCCCCAGAACCTCAACTACCTGAAGAAGAAGGGGTGAAATTAAGACGGGCTCATACAGTAGCTGTTAGAGATTATTTGAAAAGAGAGACACAGACATTTTTTGGTGTTCAAAGAGATAATGAAAGAGAACAGAGAAGGCTATGGTATGAAAGGAGGAGAAGACATGCAGCGAGAGCACTTGGAGACCTCAGGCTGGACCTACCCCCAGATCATCATCCTGATCATGACGACATTAGTATGTATGGAGCATCTGGTGCAGCGCCAAGAGAGGCTCGGGGGCACCGTGAGCGACCCGACGTGCTGCCAGCGCCGGCGTCGCTCGATACTAATCAACCAGTTCCCCGCCTCGGCAAGGACAGCGTGGCCCGGCTTACTCTCACAGGACTATCCTACGTCGTTACTGCGGTGACGCGTCGTTCTCGTCAGTCTACCTCTGTCCAATGGTCCCGGTCTTTTCGTGGCAGTGCCAGTAGTCCGCACGAAGAAATAGAAATTGATGACGTCTTCTTCGCCCCACAAGCGATGACGTCACTCAGCCAGCATCACGAAGACATAGACGTCGTAGACGGAAGTAAAGGGGCCAGGTCTCCTCTGAGGACGTCCTCGAGTGTGGAGTACAGGAGGCCTGAGAGGAATGTTCAGTGGGGAACAGGCGGGATCCGCATCCACGCTCCTATGCTGGAACCCTTGGCAGATAACTCTAATAGAAGGCAATTTGGAATGGGCATTGTTGGTAGATTCTTTAGGAGGTCATTACGAAAATCAGTGACGCATCAAGAAGATGTGGCTAGACAGTTGGACGAATTAACGGATTACAGGCCTTACTTCACTTGGTGGGTCAGCACGGTACAAACATTAGTGCTGTTGCTGTCATTACTTTGCTATGGCTTCGGACCAGTCGGTTTTGGACGACACACGCATTCAGGGCAGGTGCTGGTGAAGAGTTTAAGCTTACAACAGGTCGAATGGGAGGAACCGGCTTCTTTCTGGCTAGGTCCTCGAGCAGCTGATCTTATACATTTAGGTGCAAAGTTCGCACCATGCATGCGAAGGGATGTGAGGATAGCACGGGCTATAGCTGCATCAGCTCGAAGGGAAAGAGACACGGCTTGCTGCATAAGAAACGACGATTCTGGATGCGTACAGTCTTCTAAAGCCGATTGTTCCAATACAATATCGACGTGGAAGAAATGGTCATCCGGGGAATCTGGTCCTGGTGGTAGAATATCAGGATCGGTGTGCGGTTTGGATCCAAAATTCTGTGAGGCGCCGCGTTCCATCGCTCCACACGAATGGCCGGACGACATAACAAAATGGCCAATATGCAGGAAGTCAGTAATTGATGGATCAGCTGCTGCAGGGCGAGCTGGACACGCGGCGGAGCACATGGCCTGTGAGGTTATAGGTCATCCATGCTGTATAGGAGTTCACGGACAGTGCGTCATCACCACCAGAGAGCATTGCGATTTTGTTAAAGGCTACTTCCACGAGGAAGCTTCTTTGTGTTCACAAGTATCATGTTTGGACGACGTATGTGGAATGTTACCATTCATGCGGCGGAGACGTCCAGATCAGTTATACAGAGCCTGGACATCACTGTTTGTGCATGCGGGACTATTACACTTGGCGGCCTCGTTAGCACTCCAGTGGCTCTTTATGCGAGATCTGGAAAAGATGGCTGGACCTGTCAGGATGGCGGTGATATATCTTGGCAGTGGTGTTGCAGGAAATATGGCTTCAGCTATTTTTGAACCGTATAGAGCAGAGGTTGGTCCAGCAGGCTCACATTTTGGTCTACTCGCGTGCTTAATAGTGGAAGTGATAGGAGCGTGGCCGCTTTTAAGGCACCCTCGGCGAGC
TCTTCTCAAGCTTATAGGACTTGCGTTAGCACTTTTCCTCCTAGGCCTTCTGCCTTGGATTGACAACTTTGCCCATGTGTTCGGATTTGTCTTTGGTTTTTTGCTTTCATACGCGCTGCTGCCCTTCATAACATTCGGACCGTACGAGAGACGGCGGAAAATAGTTTTGGTGTGGGTGTGTATGGTGTCAGCCGGTGCCATGTTATGTGCTCTAATAGCGTTGTTCTACGCGGCTCCGGCATATGAGTGCGCTGCATGTGCGTACTTTACGTGCCTGCCCTTCGCCCCGGATATGTGCGCGTCACAGGACGTCCGCGTACGGCAGCTGGACGGGGTATGA

Protein sequence:

>DPOGS203610-PA
MFPSPSWRSSAGLLASQLTCQSSPGPPPSAPPIPQPTSPPRIVSAPPPPEPQLPEEEGVKLRRAHTVAVRDYLKRETQTFFGVQRDNEREQRRLWYERRRRHAARALGDLRLDLPPDHHPDHDDISMYGASGAAPREARGHRERPDVLPAPASLDTNQPVPRLGKDSVARLTLTGLSYVVTAVTRRSRQSTSVQWSRSFRGSASSPHEEIEIDDVFFAPQAMTSLSQHHEDIDVVDGSKGARSPLRTSSSVEYRRPERNVQWGTGGIRIHAPMLEPLADNSNRRQFGMGIVGRFFRRSLRKSVTHQEDVARQLDELTDYRPYFTWWVSTVQTLVLLLSLLCYGFGPVGFGRHTHSGQVLVKSLSLQQVEWEEPASFWLGPRAADLIHLGAKFAPCMRRDVRIARAIAASARRERDTACCIRNDDSGCVQSSKADCSNTISTWKKWSSGESGPGGRISGSVCGLDPKFCEAPRSIAPHEWPDDITKWPICRKSVIDGSAAAGRAGHAAEHMACEVIGHPCCIGVHGQCVITTREHCDFVKGYFHEEASLCSQVSCLDDVCGMLPFMRRRRPDQLYRAWTSLFVHAGLLHLAASLALQWLFMRDLEKMAGPVRMAVIYLGSGVAGNMASAIFEPYRAEVGPAGSHFGLLACLIVEVIGAWPLLRHPRRALLKLIGLALALFLLGLLPWIDNFAHVFGFVFGFLLSYALLPFITFGPYERRRKIVLVWVCMVSAGAMLCALIALFYAAPAYECAACAYFTCLPFAPDMCASQDVRVRQLDGV-
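
The transcript and protein lengths listed in this record can be cross-checked against each other. The sketch below is illustrative and not part of the database: it assumes the transcript sequence is coding sequence only (it begins with ATG and ends with TGA), and that the trailing "-" in the protein sequence marks the stop codon. The function names `parse_fasta`, `protein_length`, and `coding_length_consistent` are hypothetical helpers introduced for this example.

```python
def parse_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def protein_length(seq):
    """Count residues, ignoring a trailing stop marker ('-' or '*')."""
    return len(seq.rstrip("-*"))


def coding_length_consistent(transcript_bp, protein_aa, has_stop=True):
    """True if the transcript length equals 3 * (residues + optional stop codon).

    Assumption: the transcript contains no untranslated regions.
    """
    codons = protein_aa + (1 if has_stop else 0)
    return transcript_bp == 3 * codons


# Lengths as listed in the record: 2340 bp transcript, 779 aa protein.
# 779 residues + 1 stop codon = 780 codons = 2340 bp.
print(coding_length_consistent(2340, 779))
```

With the listed values the check passes, since 780 codons (779 residues plus a stop) account for all 2340 bases.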