Monarch geneset OGS2.0

DPOGS201740
Transcript: DPOGS201740-TA (696 bp)
Protein: DPOGS201740-PA (231 aa)
Genomic position: DPSCF300279 - 202450-203145
RNAseq coverage: 1393x (Rank: top 9%)
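
The lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the genomic coordinates are 1-based and inclusive (an assumption, not stated in the record); the function name `span_length` is hypothetical. Under that assumption the genomic span equals the transcript length, and the transcript holds one more codon than the protein has residues, which would fit a terminal stop codon.

```python
def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


# Values copied from the record above.
genomic_start, genomic_end = 202450, 203145
transcript_bp = 696
protein_aa = 231

genomic_bp = span_length(genomic_start, genomic_end)
codons = transcript_bp // 3

print(genomic_bp)                   # 696
print(genomic_bp == transcript_bp)  # True: the span has no room for introns
print(codons)                       # 232
print(codons - 1 == protein_aa)     # True: 231 residues plus one stop codon
```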
Annotation
Heliconius      HMEL006693        2e-107  76.96%
Bombyx          BGIBMGA002640-TA  5e-106  76.09%
Drosophila      Uch-PA            4e-70   55.56%
EBI UniRef50    UniRef50_E5EVW1   8e-104  76.09%  Ubiquitin carboxy-terminal hydrolase CG4265 n=11 Tax=Neoptera RepID=E5EVW1_BOMMO
NCBI RefSeq     XP_966886.1       1e-74   57.58%  PREDICTED: similar to Ubiquitin carboxy-terminal hydrolase CG4265-PA [Tribolium castaneum]
NCBI nr blastp  gi|312597590      3e-103  76.09%  ubiquitin carboxy-terminal hydrolase CG4265 [Bombyx mori]
NCBI nr blastx  gi|312597590      7e-100  76.09%  ubiquitin carboxy-terminal hydrolase CG4265 [Bombyx mori]
Group
Gene Ontology:
  GO:0006511  8.6e-109  ubiquitin-dependent protein catabolic process
  GO:0005622  8.6e-109  intracellular
  GO:0004221  8.6e-109  ubiquitin thiolesterase activity
KEGG pathway:
InterPro domain: [1-227]  IPR001578  8.6e-109  Peptidase C12, ubiquitin carboxyl-terminal hydrolase 1
Orthology group: MCL10534 (Multiple-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201740-TA
ATGGCGACTGAAACTTTGGTCCCCTTGGAATCTAATCCTGATGTGATGAATAAATTTCTACAAAAACTTGGAGTGCCAAGCAAGTGGGGTGTCGTAGATGTGATGGGTTTAGAATCGGATATGCTTTCTTGGGTTCCCAGACCGGTACTCGCCCTTACACTCCTTTTTCCGATTTCTCAATCTTACGAGCAACATAAAGAAAAAGAGGAAAGTGAAATTCTGGCTAAAGGCCAAGAAGTCTCCAACAATCTTTTTTACATGAAACAAAACATTAGCAATGCCTGTGGAACAGTAGCCCTTGTTCATGCCGTTGCAAATAATCTAGATGAAATTGGGTTAAATGATGGTTGTTTGAAAACTTTTTTGGAAGAAGCTAAGAACATGGATGCTGTTGCAAGGGGAAAGCTTTTGGAAAAGTGTGAGGGCATCATCAAAGCTCATACTGAGCTGGCACAGGAAGGGCAGACTAATATGCCCAACGCTGAAGATCCCATCAATCATCATTTCATTACTTTTGTACACAAAGATGGTTCTCTGTATGAGTTGGATGGCCGTAAAGCTTTCCCAATAAATCATGGCCCGTGTACACCGGACTATCTACTGGAAGATGCTGCAAAAGTCTGCAAAGAGTTTATGGCCCGTGACCCTCAGGAGGTTCGTTTCACAGTCATTGCTTTCACTATTGCAGATATTTAA

Protein sequence:

>DPOGS201740-PA
MATETLVPLESNPDVMNKFLQKLGVPSKWGVVDVMGLESDMLSWVPRPVLALTLLFPISQSYEQHKEKEESEILAKGQEVSNNLFYMKQNISNACGTVALVHAVANNLDEIGLNDGCLKTFLEEAKNMDAVARGKLLEKCEGIIKAHTELAQEGQTNMPNAEDPINHHFITFVHKDGSLYELDGRKAFPINHGPCTPDYLLEDAAKVCKEFMARDPQEVRFTVIAFTIADI-
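
The relationship between the two sequences above can be illustrated by translating the nucleotide sequence codon by codon. The following is a minimal sketch, assuming the standard genetic code and that the trailing `-` in the protein sequence marks the stop codon; the names `translate` and `CODON_TABLE` are illustrative and not part of the database. The examples use only the first and last codons of DPOGS201740-TA.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1.

    Stop codons are written as `stop_symbol` (assumed here to match the
    record's trailing "-"); codons with non-ACGT characters become "X".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six codons of the transcript, followed by a stop codon.
print(translate("ATGGCGACTGAAACTTTGTAA"))  # MATETL-
# Last four codons of the transcript.
print(translate("GCAGATATTTAA"))           # ADI-
```

Passing the full DPOGS201740-TA sequence to `translate` and comparing the result with DPOGS201740-PA would be a quick consistency check on the record.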