Monarch geneset OGS2.0

DPOGS209282
TranscriptDPOGS209282-TA1818 bp
ProteinDPOGS209282-PA605 aa
Genomic positionDPSCF300522 + 21582-23399
RNAseq coverage145x (Rank: top 54%)
Annotation
HeliconiusHMEL0176180.064.62% 
BombyxBGIBMGA001710-TA0.070.16% 
DrosophilaCG13025-PA3e-6231.26% 
EBI UniRef50UniRef50_E9J3I61e-7434.39%Putative uncharacterized protein (Fragment) n=1 Tax=Solenopsis invicta RepID=E9J3I6_SOLIN
NCBI RefSeqXP_971391.14e-7433.53%PREDICTED: similar to CG13025 CG13025-PA [Tribolium castaneum]
NCBI nr blastpgi|3407155054e-8534.51%PREDICTED: e3 ubiquitin-protein ligase RFWD3-like [Bombus terrestris]
NCBI nr blastxgi|3358928089e-8434.15%E3 ubiquitin-protein ligase RFWD3 [Apis mellifera]
Group
Gene OntologyGO:00055158.7e-15protein binding
GO:00082708.2e-07zinc ion binding
KEGG pathway 
InterPro domain[340-494] IPR0110468.7e-15WD40 repeat-like-containing domain
[146-216] IPR0130833.8e-13Zinc finger, RING/FYVE/PHD-type
[324-506] IPR0159435.5e-10WD40/YVTN repeat-like-containing domain
[158-204] IPR0189577.4e-07Zinc finger, C3HC4 RING-type
[158-204] IPR0018418.2e-07Zinc finger, RING-type
Orthology groupMCL14480 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209282-TA
ATGGACGAGGATCCGGAACTCACCAGTAGTCCCGTTATTGTAATACCTGAATCACCAGAACGATCAGAAAATATTGATCGACCACAAAACATACAGCCTATAGTTAATGTTGACGATAATGCTCTATTTGAGGATGTTTTAGAGATCATACAGAACAACAATCAAAATTATAATTCGGAGGCTGTGTTGGAGCATTTAAACCCAGATTTGCCACCATCACCGATTTTAGCGTCGACAGGGGTTGGTTTATCTCGAGCAGCCTCAAATCCAGGGGAAGTAGAACTACCAAGGCATTCATCACAAGCACACATATCAAATGAAGACAGTAATCTTATTATAGCAAACGAAGAATCAAACAGTTTAGCTTCATGTCCAGAGAATAGAGATGGAGATGATGTTGATGTTGAGGAACCTCCAGCCAAAGTGAGGAAGATCAGCTCACCCAAACAAGAAGAAGGTGATGGGGAAACATGCCCTATATGTTTAGATACATGGGGTAATTCCGGAGAACACAGACTGGTAGCATTGAAATGTGGACACTTATTTGGCTGGCAGTGCGTAGAGAGGTGGCTGAAGGCCCAGGCAACTAAAGACAGGACATGTCCAACTTGCAAAAGTAAAGCAAACTTGAAGGATATGCGTTTTATATATGCAAGGAGATTAGTCGCAGCGGACACTTCACAGATAGCAGCACTACAGAAGCAAATAGACGTTCTTAAATCTGAGAACAGCAGGGCCGAGTTGGAGCTATTGAAGTCGAGGATCGCTCATAGAGCTTGTGTGTTACAGTTAGAAGTTCTTAGGAGTACATTGATGAAGAGCCAAACAGCTAAAGACCAACCGCCACGAAGATCTTGGAGATTTGCCCTGGAGAAGAACCAGGAGGTCAGTAAAGATGGTGGATGCCGAGTTCTCACTTACAACTGCCGGACTTATGAGTTATACGTCTCACAGAAGAGTACTAACAATCTTTTCCCCGGATACGGCATCAGGAAAGTTAGTTGTATAGATTATAAATTGGGACAGTTTATCCATTTACATCCCAAGCCTATACGAGATATAACATATTCACAACCCAGGGATCTTCTATTAAGCGTAGGCTTGGACAGTTCAGTTAGAATCATAGAACGAGGTATTCCTAGTGCTTCCATCCAGTGTGGTATGCCCCTATGGTCATGTTCCTGGGACTATTTGCGCAGTAACGAATTCTGTGTAGGCGGAGTGGGGGGCGTCATACACCAGTACGATGTGAGAAATACAAATACTTCAATACAAACATTAAACACGAATGATTTATCACCAGTCGTCTCTTTGTGTTCAACGGAGCACGGCCTATTATCCTGCCAGTTGAACTCGTGCTGGCTTTGGGAGTCGAATATGAGGCAGTGGATCCCCAAAGCGATACCAATAGATGGCCCATTCGTCTCTCTCTGCTATGACAATGATTCCCACCGAGCTTTAATCACATCCCGAGCCAGTGGCAACAGTCGATCAAAACTAACCCTGTTCAAATTAAAAGCAAATAATAGTGGGGAAATTATGTTTGATCTGGAGCAGACATTTGTCGGTTCAGCACGATCCACTTTAATGTCCCGGGGGACCTTCGTTAGGACTCCAGGGGCTATCTGGGCTGCGGCTCACAGTGAAAGTGAGTCCGCCTTAAACCTGCACGGCTTAGATGGAGCCAGAACAATGACCTTACCAGCGGTTGAACCAGCTTTAGACATATGTACAGCTCAAGTTAATGGTGACACGATAGTTGCAGCATTATCCGAGTCCAGGTTAAGACTGTATAAGGCTGTGGCTACCAGCTCCTGA

Protein sequence:

>DPOGS209282-PA
MDEDPELTSSPVIVIPESPERSENIDRPQNIQPIVNVDDNALFEDVLEIIQNNNQNYNSEAVLEHLNPDLPPSPILASTGVGLSRAASNPGEVELPRHSSQAHISNEDSNLIIANEESNSLASCPENRDGDDVDVEEPPAKVRKISSPKQEEGDGETCPICLDTWGNSGEHRLVALKCGHLFGWQCVERWLKAQATKDRTCPTCKSKANLKDMRFIYARRLVAADTSQIAALQKQIDVLKSENSRAELELLKSRIAHRACVLQLEVLRSTLMKSQTAKDQPPRRSWRFALEKNQEVSKDGGCRVLTYNCRTYELYVSQKSTNNLFPGYGIRKVSCIDYKLGQFIHLHPKPIRDITYSQPRDLLLSVGLDSSVRIIERGIPSASIQCGMPLWSCSWDYLRSNEFCVGGVGGVIHQYDVRNTNTSIQTLNTNDLSPVVSLCSTEHGLLSCQLNSCWLWESNMRQWIPKAIPIDGPFVSLCYDNDSHRALITSRASGNSRSKLTLFKLKANNSGEIMFDLEQTFVGSARSTLMSRGTFVRTPGAIWAAAHSESESALNLHGLDGARTMTLPAVEPALDICTAQVNGDTIVAALSESRLRLYKAVATSS-