Monarch geneset OGS2.0

DPOGS206854
Transcript        DPOGS206854-TA  1062 bp
Protein           DPOGS206854-PA  353 aa
Genomic position  DPSCF300001 - 2891292-2902361
RNAseq coverage   183x (Rank: top 49%)
Annotation
Heliconius      HMEL006138        3e-169  90.43%
Bombyx          BGIBMGA013057-TA  4e-108  89.87%
Drosophila      rho-4-PA          1e-99   50.94%
EBI UniRef50    UniRef50_F4WTS2   1e-114  58.36%  Rhomboid-related protein 3 n=10 Tax=Formicidae RepID=F4WTS2_ACREC
NCBI RefSeq     XP_972541.1       3e-120  57.43%  PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastp  gi|340718076      2e-120  58.56%  PREDICTED: rhomboid-related protein 3-like isoform 1 [Bombus terrestris]
NCBI nr blastx  gi|328781454      4e-121  59.32%  PREDICTED: rhomboid-related protein 3-like [Apis mellifera]
Group
Gene Ontology    GO:0004252  2.6e-118  serine-type endopeptidase activity
                 GO:0016021  2.6e-118  integral to membrane
                 GO:0006508  1.9e-69   proteolysis
                 GO:0005509  1.7e-16   calcium ion binding
KEGG pathway
InterPro domain  [1-353]    IPR017213  2.6e-118  Peptidase S54, rhomboid, metazoan
                 [113-347]  IPR002610  1.9e-69   Peptidase S54, rhomboid
                 [168-316]  IPR022764  3.7e-35   Peptidase S54, rhomboid domain
                 [20-87]    IPR011992  1.7e-16   EF-hand-like domain
                 [61-86]    IPR018248  1.6e-07   EF-hand
Orthology group  MCL14708  Insect specific

Nucleotide sequence:

>DPOGS206854-TA
ATGCCGTCAGGTCAGAGAGAAACAACTGTCTCCATACCACTTACCCCAACGCGATCAGATTTGTACTGGAGAAGGATATTTAATAAGTACGATAGAGATAATGATGGTCGCATCTCATACGTTGAGTTAAAGGCGATCATAGAGTCCAGGGAGTACGACCATGATTTACCAGACAACGTGGTTGAAAAGATCATGGCACTCGCCGATAGAGATAACTCCGGATATCTGGACTTCACGGAGTTCGTCGCCATGATGAAGAACCCAGAACTTAAGGCTGTATTCGGACATTTTGTAGCCAGATATGTTCACTCCATCATACCGCATCGAGACCCAGTTGATACTATTGACGGTACATACGAAGAGGAGTATAGCTGCTGGCCGCCGGCCATATGTATGATATTGATATCCATAGTCGAGATAGTACTGTTCTGCTATGACGCCGCCCAGGGGAAGACGGACGCCACCGGTCCCATCGCCCAGATATTCATATACAATCCCCACAAGAGACAGGAGGCTTGGCGGTTCTTGACCTACATGCTGGTCCATGTTGGTGTAGTTCACCTTCTAGTTAACTTGTTGGTTCAATTGTTCCTGGGAGTCCCACTGGAGATGGTTCATCGCTGGTGGCGCGTGGTGCTGGTGTATCTGGCGGGAGTCGCTGCTGGATCTCTAGCGACCAGCCTCACAGATCCTAAAGTATATCTAGCTGGCGCATCCGGAGGGGTGTATGCTCTTATCGCAGCTCATATCGCGACCATTATTATGAACTGGTCTGAGATGGAGTTTGCGATAATCCAGCTGTTAGTGTTCTTACTTCTGGCAACGGTTGACATCGGTACGGCCGTGTACGATCGTTACTGGAGACACCTGCAGCAGAACATCGGATATGTCGCCCATTTAGCAGGCGCAGTGGCTGGTCTTCTGGTTGGTATAGGAGTACTTCGTAATCTAGAAAAGAGAACTTGGGAGAAACGTCTGTGGTGGGCGGCGGTGGTGCTGTACTTCTCGCTCATGATAGCGGGGATATTAGCCAATGTGTTCTGGAAATCACATTTCCAATAA

Protein sequence:

>DPOGS206854-PA
MPSGQRETTVSIPLTPTRSDLYWRRIFNKYDRDNDGRISYVELKAIIESREYDHDLPDNVVEKIMALADRDNSGYLDFTEFVAMMKNPELKAVFGHFVARYVHSIIPHRDPVDTIDGTYEEEYSCWPPAICMILISIVEIVLFCYDAAQGKTDATGPIAQIFIYNPHKRQEAWRFLTYMLVHVGVVHLLVNLLVQLFLGVPLEMVHRWWRVVLVYLAGVAAGSLATSLTDPKVYLAGASGGVYALIAAHIATIIMNWSEMEFAIIQLLVFLLLATVDIGTAVYDRYWRHLQQNIGYVAHLAGAVAGLLVGIGVLRNLEKRTWEKRLWWAAVVLYFSLMIAGILANVFWKSHFQ-
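The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, that translates a coding sequence with the standard genetic code and renders the stop codon as "-", as the protein entry does. The function name `translate` and the two short fragments are illustrative assumptions; the fragments are copied from the start and end of DPOGS206854-TA, and the use of the standard code (NCBI table 1) for this nuclear gene is also assumed.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1 (hypothetical helper).

    Stop codons are written as stop_symbol, unknown codons as "X",
    and an incomplete trailing codon is ignored.
    """
    seq = cds.upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Fragments copied from the record: first 30 nt and last 15 nt of DPOGS206854-TA.
first_codons = "ATGCCGTCAGGTCAGAGAGAAACAACTGTC"
last_codons = "TCACATTTCCAATAA"

print(translate(first_codons))  # MPSGQRETTV
print(translate(last_codons))   # SHFQ-

# Length check: 353 residues plus one stop codon, three nucleotides each.
print(3 * (353 + 1))  # 1062
```

Passing the full nucleotide sequence to `translate` should reproduce the protein entry, including the trailing "-", provided the sequence was copied without line breaks or stray characters.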