Monarch geneset OGS2.0

DPOGS214321
TranscriptDPOGS214321-TA915 bp
ProteinDPOGS214321-PA304 aa
Genomic positionDPSCF300020 - 714121-715035
RNAseq coverage472x (Rank: top 26%)
Annotation
HeliconiusHMEL0063586e-14280.13% 
BombyxBGIBMGA004136-TA3e-13180.19% 
DrosophilaTSG101-PA2e-9053.46% 
EBI UniRef50UniRef50_Q9VVA73e-8853.46%Tumor suppressor protein 101 n=18 Tax=Endopterygota RepID=Q9VVA7_DROME
NCBI RefSeqXP_392951.29e-9256.35%PREDICTED: similar to tumor suppressor protein 101 CG9712-PA [Apis mellifera]
NCBI nr blastpgi|3287774402e-9056.35%PREDICTED: tumor susceptibility gene 101 protein [Apis mellifera]
NCBI nr blastxgi|910800411e-9159.03%PREDICTED: similar to AGAP005934-PA [Tribolium castaneum]
Group
Gene OntologyGO:00064643.2e-10protein modification process
GO:00150313.2e-10protein transport
KEGG pathwayame:4094383e-91 
 K12183 (TSG101, STP22, VPS23)maps-> Endocytosis
InterPro domain[231-295] IPR0179161.6e-24Steadiness box
[1-51] IPR0161353.1e-13Ubiquitin-conjugating enzyme/RWD-like
[1-47] IPR0088833.2e-10Ubiquitin E2 variant, N-terminal
Orthology groupMCL13054 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214321-TA
ATGTCAATAAAAGTATCAAAATATGTTGATAATAACGGAAAAATTTATTTACCGTATCTACACGAATGGAAAGCTAATGTTTCCACTTTACAGCGACTCGTTCAGCAAATGATCATTGCTTTTGGAGAATTACCGCCCGTCTATTCGAAACCACGGAACGAAGTGCGTCCACCCTATCCTATGAACTCCTTTATGCCGCAGCCGGCAGGTTATCCATATCCCACAGTCAGCCCCCCTCAGCAGGGATATCCTTCGGTGACTCCATATCCGACAACTTCCCAACTGCCATATCCCAGTTTTGGTTCACCATATCCTGGCACAGTCAATACTAACGGTTCACCATATCCTGGACCAAATCCTCCATATCCACCAGTGACTGTCAATCCAGTAACAGATGTTGCTGGTGGCACTATCACAGAAGAACATATCAAGGCGTCATTGCTCTCTGCAGTAGAAGATAAACTAAGACGCAGACTCAAAGAACAGTCTCAACAATCACAGGCTGAACTCGAAACATTACGTCGGACACAGCAAGAATTGAGAGAAGGAAAAACAAGATTGGAAGATATAATATCACGTTTGCAAAGAGAGAGATCTGAACTGGATAAAAATGTTGCTATTCTACAAGAGAAGGAAAAGGAATTGGAATCGGCAGTGGAACATCTAGGTGAACAGGAAAGTGTTGATGTTGATGAAGCTGTAGTCACAACAGCTCCTCTATATTCACAACTTCTAAATGCCTTTGCTGAAGAAGCTACACTTGAAGATGCTATTTATTATATGGGAGAGGCCTTACGTAAAGAAGTGATCGATTTGGATACATTTTTGAAACAAGTTCGTACTCTTGCTAGACGACAGTTCACCTTACGTGCACTAATGCATAAATGTAGGCAAAAGGCCCAGCTTGCATGTTAA

Protein sequence:

>DPOGS214321-PA
MSIKVSKYVDNNGKIYLPYLHEWKANVSTLQRLVQQMIIAFGELPPVYSKPRNEVRPPYPMNSFMPQPAGYPYPTVSPPQQGYPSVTPYPTTSQLPYPSFGSPYPGTVNTNGSPYPGPNPPYPPVTVNPVTDVAGGTITEEHIKASLLSAVEDKLRRRLKEQSQQSQAELETLRRTQQELREGKTRLEDIISRLQRERSELDKNVAILQEKEKELESAVEHLGEQESVDVDEAVVTTAPLYSQLLNAFAEEATLEDAIYYMGEALRKEVIDLDTFLKQVRTLARRQFTLRALMHKCRQKAQLAC-