Monarch geneset OGS2.0

DPOGS202281
TranscriptDPOGS202281-TA1203 bp
ProteinDPOGS202281-PA400 aa
Genomic positionDPSCF300032 - 281024-285012
RNAseq coverage227x (Rank: top 44%)
Annotation
HeliconiusHMEL0047270.081.22% 
BombyxBGIBMGA004937-TA3e-3462.30% 
Drosophilasav-PA6e-4249.45% 
EBI UniRef50UniRef50_D6WT022e-7542.95%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WT02_TRICA
NCBI RefSeqXP_971039.13e-7643.05%PREDICTED: similar to scaffold protein salvador (shar-pei) [Tribolium castaneum]
NCBI nr blastpgi|910861816e-7543.05%PREDICTED: similar to scaffold protein salvador (shar-pei) [Tribolium castaneum]
NCBI nr blastxgi|910861813e-7743.05%PREDICTED: similar to scaffold protein salvador (shar-pei) [Tribolium castaneum]
Group
Gene OntologyGO:00055153.4e-12protein binding
KEGG pathwaytca:6629903e-08 
 K05633 (AIP5, WWP1)maps-> Ubiquitin mediated proteolysis
    Endocytosis
InterPro domain[234-266] IPR0012023.4e-12WW/Rsp5/WWP
Orthology groupMCL13506 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202281-TA
ATGATATCCCGTAAAGGACAAAAAGCTATTAACGAAGGAGTTGTTGGCAAATATGTAAAAAAAGATACTCCACCAGATTTACCTATTATAAATGTCTGGACAACGGGTAATAACAGAAGGCCGCGAAGCCAACCCTTCGGATCTAATGAGGGGCATATGCAAGGCATAGAAAACAAATTTGGTAAAGCCCAAACTGTTAGTGGTCATGCAGGGAAATATACTCCTAGTGAATCAGTACCAAATTTGGCTCACAGATTCGCAAGTCTATCCACTGGTGAAACAGCTAGCTCTTCAAGCAATTTCAATTCTCAATATCATTTAGATGTACACTTAGCATCTCACTCAACCCTCAATGAAATTGACGATTCATTTAGTAGACAGACAAGCCACAGGTACTATAGACAAACACAGACAGAAGATCCTATCTATCAAAATCAGCAGCAAGTCCAACAACAAAAACACCAGCACTACAGTTCAAGGGAGTCAAGTATTCCACGGATTTCGTCCAACATGAGCCTGACTCCTAGACTACATCCGCCGTCTCCTCACTCTATCAGATTACATTGTAATGAATCATTTCCACCGGCAACACAGTCATCGCCAATATATTCTAACTATGTGAACACTCCAGCGCTCCCGGCAATTCCATATCATAATAAGGGCAACATAATTTCTTCTAGTCAAGGTTCGGAAGAATGCGAGCTACCTTTGCCACCGGGCTGGTCAGCGGACCGCACGCTGCGTGGTCGTCGTTATTATATGGATCATAACACACAGACAACACACTGGACGCACCCGTTAGAGAGTGTACCGCGTCCCTGGCACCGCGTATCCACGCCACATCACGGAGTTTACTATTTCAATGAGATAACTCATCAGACAACATATGTTCATCCCTGTCTTGTCGGAGGATGCTACCTTGTTAGTTCACTCGTACCAGCCCTGGTGCCACCTTACTTGTTGGAGGAAATACCTCACTGGCTGATTGTGTACTCAAAAGCTGATCAAGAACTAGATCACAAGCTACGCTGGAACATGTTCAGGCTGAGCGAGTTAGACTGTTACTCGGATATGCTCACAAGACTGTACAAACAGGAACTGCAGTTGATTGTCATGAAATACGAACAGTACAGGTCGGCCTTGCTACAGGAACTTGAACGTAGAAGACTGGCCAAATCGTGCCACACTCACTCAAATTATTGA

Protein sequence:

>DPOGS202281-PA
MISRKGQKAINEGVVGKYVKKDTPPDLPIINVWTTGNNRRPRSQPFGSNEGHMQGIENKFGKAQTVSGHAGKYTPSESVPNLAHRFASLSTGETASSSSNFNSQYHLDVHLASHSTLNEIDDSFSRQTSHRYYRQTQTEDPIYQNQQQVQQQKHQHYSSRESSIPRISSNMSLTPRLHPPSPHSIRLHCNESFPPATQSSPIYSNYVNTPALPAIPYHNKGNIISSSQGSEECELPLPPGWSADRTLRGRRYYMDHNTQTTHWTHPLESVPRPWHRVSTPHHGVYYFNEITHQTTYVHPCLVGGCYLVSSLVPALVPPYLLEEIPHWLIVYSKADQELDHKLRWNMFRLSELDCYSDMLTRLYKQELQLIVMKYEQYRSALLQELERRRLAKSCHTHSNY-