Monarch geneset OGS2.0

DPOGS208022
Transcript: DPOGS208022-TA, 1107 bp
Protein: DPOGS208022-PA, 368 aa
Genomic position: DPSCF300203 - 276388-280115
RNAseq coverage: 870x (Rank: top 15%)
Annotation
Heliconius: HMEL017804, 2e-91, 65.05%
Bombyx: BGIBMGA001495-TA, 1e-105, 61.96%
Drosophila: no hit listed
EBI UniRef50: UniRef50_D6W9J9, 3e-34, 32.44%, Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W9J9_TRICA
NCBI RefSeq: XP_973179.2, 5e-35, 32.44%, PREDICTED: similar to bone marrow stromal cell-derived ubiquitin-like protein [Tribolium castaneum]
NCBI nr blastp: gi|189234542, 9e-34, 32.44%, PREDICTED: similar to bone marrow stromal cell-derived ubiquitin-like protein [Tribolium castaneum]
NCBI nr blastx: gi|189234542, 3e-38, 32.60%, PREDICTED: similar to bone marrow stromal cell-derived ubiquitin-like protein [Tribolium castaneum]
Group
Gene Ontology: GO:0005515, 2.8e-09, protein binding
KEGG pathway: none listed
InterPro domain: [28-81] IPR000626, 2.8e-09, Ubiquitin
InterPro domain: [303-363] IPR009060, 5.3e-08, UBA-like
Orthology group: MCL17123, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208022-TA
ATGGAAAGCCAACCTTGTGTATTTTTAGGAATAAAAGTGAAGTCTGGACCCATCGAACGTTTCAAGCTTGAAAATTTCACATTGGATAATACGGTAGATAAATTAAGAGATCAAGCTGAGAAGAAAACTGGTCTACCATCGTCATCCTTAGAGTTGATCCATCATGGTAAAATATTAAAAGAACCAACTTCACTAATGGACAGTGGTATTAAAAATGGGGAGATGATACATGTGGTGAAAAAAAAGGTAGTCCTACCACCCCCACCCCCACCTACTTACCCTGATTCAGCACTGCAACAGTTGAACATATCTCTCCGCACATTAGGCTGTACTCCTAATGCACCAGGATGGACAAGAGCTATGCAGTTACTCAATGAAGAATCAGCTATATCAGAGATAATAGATCATGCACCATCTTTAGCTGATGATTGCATGACAATCTCCATCTTGCATGAAGTTGAGCTCTTGGCAGCCCTCGGAGCCAATTTGCAAACAATGCGTCGTGGGGCCGACGCTCACCCTGAGCTGCCGAACGCCTTACGGCATCTATTGCGGCTTACAAATTCACAGTCTAAAAGTGCAGCACCTGATTCCGCACCCACATCAGGTTTCGCTTATTCTTTGGAAGCCCTCTCCGAAGATGAAGATGTTGAGGAGGAGGAAACTGAGGAGGGTGAGGAGCGGTCAGGCATCACACCGGAACAACTGGCTTCAGCATTACAGGTAGCTACTCAAGCCCTGATGTCACGTCCAGGGCGGTCGCGCAGTGTTCGCGGTGTACTCCACATGCTCCATGATCAGCCTACTACCACCACTACCACTGCAACCACAACCACCACCAGCGAGCCTACCACCGCCGGTGGTGCGATCACAGCTGAGATGTTCGACGAGGCCATCATAAGAGCCCTACGACCCATGGACAACACTACTGCTGCGTCTTCGTCTCAAAGTTCCGGACGTGACTCCGACTTCACCACTCAACTCAGTCACATGCACGAAATAGGACTATTGGACGACGCCATCAACGTCAGAGCACTCATCATATGTGCAGGTGACGTGAACGCAGCTATAAATCTGGTGTTCAGCGGAGCTATTGGGGACGATTAA

Protein sequence:

>DPOGS208022-PA
MESQPCVFLGIKVKSGPIERFKLENFTLDNTVDKLRDQAEKKTGLPSSSLELIHHGKILKEPTSLMDSGIKNGEMIHVVKKKVVLPPPPPPTYPDSALQQLNISLRTLGCTPNAPGWTRAMQLLNEESAISEIIDHAPSLADDCMTISILHEVELLAALGANLQTMRRGADAHPELPNALRHLLRLTNSQSKSAAPDSAPTSGFAYSLEALSEDEDVEEEETEEGEERSGITPEQLASALQVATQALMSRPGRSRSVRGVLHMLHDQPTTTTTTATTTTTSEPTTAGGAITAEMFDEAIIRALRPMDNTTAASSSQSSGRDSDFTTQLSHMHEIGLLDDAINVRALIICAGDVNAAINLVFSGAIGDD-
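
The protein record above appears to be the direct translation of the DPOGS208022-TA coding sequence. The following minimal sketch shows one way to check that relationship. It assumes the standard genetic code (NCBI translation table 1) and assumes that the trailing "-" in the protein record marks the stop codon. The names `translate`, `CODON_TABLE`, and `stop_symbol` are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in TCAG order; '*' marks a stop codon in this string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; "-" is used by default on
    the assumption that it matches the convention of the record above.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six codons and last four codons of DPOGS208022-TA.
print(translate("ATGGAAAGCCAACCTTGT"))
print(translate("GGGGACGATTAA"))
```

The two fragments translate to the start (MESQPC) and end (GDD-) of the DPOGS208022-PA record. The reported lengths are also consistent: 1107 bp is 369 codons, that is, 368 residues plus the stop codon.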