Monarch geneset OGS2.0

DPOGS204467
TranscriptDPOGS204467-TA1044 bp
ProteinDPOGS204467-PA347 aa
Genomic positionDPSCF300002 + 535302-539042
RNAseq coverage521x (Rank: top 24%)
Annotation
HeliconiusHMEL0062555e-15273.49% 
BombyxBGIBMGA007807-TA3e-7265.02% 
Drosophilarho-7-PA3e-7946.99% 
EBI UniRef50UniRef50_G3JWY32e-12573.94%Rhomboid-7-like protein (Fragment) n=1 Tax=Trichoplusia ni RepID=G3JWY3_TRINI
NCBI RefSeqXP_972425.15e-9656.39%PREDICTED: similar to rhomboid-7 CG8972-PA [Tribolium castaneum]
NCBI nr blastpgi|3362465076e-12573.94%rhomboid-7-like protein, partial [Trichoplusia ni]
NCBI nr blastxgi|3362465078e-12373.94%rhomboid-7-like protein, partial [Trichoplusia ni]
Group
Gene OntologyGO:00042521.5e-51serine-type endopeptidase activity
GO:00065081.5e-51proteolysis
GO:00160211.5e-51integral to membrane
KEGG pathway 
InterPro domain[147-340] IPR0026101.5e-51Peptidase S54, rhomboid
[184-326] IPR0227642.2e-22Peptidase S54, rhomboid domain
Orthology groupMCL13764 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204467-TA
ATGTTCTTCAGAAATGTTTGGTCTGCGGCGGAGATAAATGTTCTTTGCAAGCCACTATTCTATAAGCAAACTTCGCAAAATTTGGGTAATGCTCGATTACAAATCCGTAGCGCGTTTCATAATTCAAGACGAAGTGCGCGTCGCCCAGGGCCGGATACCAATCCTTTAGATAGTATAAATGTTGAAACGGGGCCGTTATATGCTAGAGCCCTCTTTAAACCATTTCTCTTTACTATAGGGGTATCCTCAATAAGTCTAGCTGGTTGTGTGATTTGGGAGTATGAAAACTTAAGAGCTCATGCTACTTCTTTCCTCAGAAGGCCAGGCAGTTGGTTGCATGCCCATCAAAAGAGATTTACAGCACACATGAAATCAGATCCTGGTCCTATAGAAAAATGGTGGAAATCGCTAAAAGAAAGTGAGAAAGTTTTCTACCCTATCCTCGCTGCAAATGTACTAGTCTTTGGTGCTTGGCGAGTTAGATCATTTCAGCCATTTATGATCAAATACTTCTGTTCAAACCCGTCCAGCGTTGTAAAATGTCTGCCTATGGTATTGTCTACATTCAGTCATTATTCGGCTCTGCATTTAGCAGCAAACATGTATGTTCTCTATAGTTTTATGCCAGCTGCCATAGCTTCTCTTGGCAAAGAGCAATTTGTTGCGATGTATCTAAGCGCTGGTGTTATCAGCAGTTTTGCAAGTTTCATCTATAAAGTAATTTCCAATCAGCCCGGTCTCAGTCTCGGAGCGTCAGGTGCAATAATGTCAGTATTGTCTTACGTCTGCGTACAATACCCTGACACTAGACTCAGCATTATATTCCTTCCCATGTACACATTTGCTGCTGGAAATGCAATAAAAGTTATAATGAGCGTTGACTTTGCTGGTGTTCTTTTTGGGTGGAAATTCTTTGATCATGCCGCTCATCTCGGCGGAGCTCTCTTTGGAATGGCCTGGTGTTATTGGGGTTCTCAACAAATTTGGGCGAAGAGGGAAAAGTTGCAGCAGTATTACCACAGCCTCAGAAAAGATTCCAGATAA

Protein sequence:

>DPOGS204467-PA
MFFRNVWSAAEINVLCKPLFYKQTSQNLGNARLQIRSAFHNSRRSARRPGPDTNPLDSINVETGPLYARALFKPFLFTIGVSSISLAGCVIWEYENLRAHATSFLRRPGSWLHAHQKRFTAHMKSDPGPIEKWWKSLKESEKVFYPILAANVLVFGAWRVRSFQPFMIKYFCSNPSSVVKCLPMVLSTFSHYSALHLAANMYVLYSFMPAAIASLGKEQFVAMYLSAGVISSFASFIYKVISNQPGLSLGASGAIMSVLSYVCVQYPDTRLSIIFLPMYTFAAGNAIKVIMSVDFAGVLFGWKFFDHAAHLGGALFGMAWCYWGSQQIWAKREKLQQYYHSLRKDSR-