Monarch geneset OGS2.0

DPOGS204814
Transcript        DPOGS204814-TA    1146 bp
Protein           DPOGS204814-PA    381 aa
Genomic position  DPSCF300221 - 78337-79482
RNAseq coverage   4092x (Rank: top 3%)
Annotation
Heliconius      HMEL016506        0.0  100.00%
Bombyx          BGIBMGA001415-TA  0.0  98.43%
Drosophila      Ubi-p63E-PC       0.0  100.00%
EBI UniRef50    UniRef50_P0CG48   0.0  99.74%   Polyubiquitin-C n=939 Tax=root RepID=UBC_HUMAN
NCBI RefSeq     NP_728908.1       0.0  100.00%  Ubiquitin-63E, isoform A [Drosophila melanogaster]
NCBI nr blastp  gi|73695428       0.0  99.74%   Ubc protein [Rattus norvegicus]
NCBI nr blastx  gi|194749344      0.0  100.00%  GF10254 [Drosophila ananassae]
Group
Gene Ontology    GO:0005515         2.9e-36  protein binding
KEGG pathway     mmu:100048105      0.0
                 K08770 (UBC) maps-> PPAR signaling pathway
InterPro domain  [11-31] IPR019956  5e-38    Ubiquitin subgroup
                 [1-72] IPR000626   2.9e-36  Ubiquitin
Orthology group  MCL11187  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS204814-TA
ATGCAGATTTTTGTAAAAACGCTTACTGGAAAGACCATAACTTTGGAAGTCGAACCTTCGGATACTATTGAAAATGTTAAAGCCAAAATTCAAGACAAAGAGGGAATCCCACCAGATCAGCAACGATTGATCTTTGCAGGAAAACAATTGGAGGACGGACGTACTCTTTCCGATTACAACATTCAAAAAGAGTCTACTCTTCATCTTGTTTTGCGTCTTCGCGGAGGTATGCAGATCTTTGTTAAAACCCTTACGGGTAAAACTATAACGTTAGAGGTAGAGCCTTCTGATACTATTGAAAACGTGAAGGCTAAAATCCAAGACAAGGAAGGCATCCCCCCAGATCAACAACGTTTGATTTTCGCTGGTAAACAATTAGAAGATGGTCGTACTCTCTCCGACTACAACATCCAGAAGGAATCTACACTCCATCTCGTACTACGTCTGCGTGGTGGTATGCAGATCTTCGTCAAGACCTTGACTGGCAAAACCATCACCTTAGAAGTCGAACCTTCAGACACAATTGAGAATGTAAAAGCGAAAATCCAAGACAAAGAAGGTATCCCTCCCGACCAACAGCGTCTGATCTTTGCTGGTAAGCAGCTGGAAGACGGCCGTACACTTTCTGACTACAACATCCAGAAGGAATCGACACTCCATCTCGTACTACGATTGCGTGGTGGTATGCAGATCTTCGTCAAGACCTTGACTGGCAAAACCATCACCCTAGAAGTCGAACCTTCAGACACAATTGAGAATGTAAAAGCGAAAATCCAAGACAAAGAAGGTATCCCTCCCGACCAACAGCGTCTGATCTTTGCTGGTAAGCAGCTGGAAGACGGCCGTACACTTTCTGACTACAACATCCAGAAGGAATCGACACTCCATCTCGTACTACGATTGCGTGGTGGTATGCAGATCTTCGTCAAGACCTTGACTGGCAAAACCATCACCCTAGAAGTCGAACCCTCCGACACCATAGAGAATGTAAAGGCAAAAATCCAAGACAAGGAAGGCATCCCTCCAGACCAACAGCGTCTGATCTTTGCTGGTAAGCAGCTGGAAGACGGCCGTACACTTTCTGACTACAACATCCAGAAGGAGTCTACCTTGCATTTAGTTCTCCGTCTCAGAGGCGGAATTTGA

Protein sequence:

>DPOGS204814-PA
MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGI-
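
As a quick consistency check on the lengths listed in this record, the sketch below recomputes the transcript length from the genomic coordinates, the protein length from the transcript length, and the number of ubiquitin units in the protein. It assumes the coordinates are 1-based and inclusive, that the last codon is a stop codon (the trailing "-" in the protein sequence), and that one ubiquitin unit is 76 residues long. The function names are illustrative and not part of any database tooling.

```python
UBIQUITIN_LEN = 76  # residues in one ubiquitin unit (assumption stated above)


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a coding sequence whose last codon is a stop."""
    if cds_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return cds_bp // 3 - 1


# Coordinates copied from the "Genomic position" row of this record.
cds_bp = span_length(78337, 79482)
aa = protein_length(cds_bp)

# Whole ubiquitin units and the residues left over after them.
repeats, extra = divmod(aa, UBIQUITIN_LEN)

print(f"{cds_bp} bp, {aa} aa, {repeats} ubiquitin units + {extra} residue(s)")
```

The result agrees with the listed 1146 bp transcript and 381 aa protein, and with the five MQIF...LRGG repeats followed by a single terminal residue in the protein sequence.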