Monarch geneset OGS2.0

DPOGS204817
Transcript          DPOGS204817-TA    1422 bp
Protein             DPOGS204817-PA    473 aa
Genomic position    DPSCF300221 - 983-7438
RNAseq coverage     207x (Rank: top 46%)
Annotation
Heliconius          HMEL007427          0.0       83.20%
Bombyx              BGIBMGA001417-TA    0.0       82.79%
Drosophila          ari-1-PC            2e-171    63.39%
EBI UniRef50        UniRef50_Q9Y4X5     8e-173    65.91%    E3 ubiquitin-protein ligase ARIH1 n=104 Tax=Metazoa RepID=ARI1_HUMAN
NCBI RefSeq         XP_971560.1         0.0       75.56%    PREDICTED: similar to ariadne ubiquitin-conjugating enzyme E2 binding protein [Tribolium castaneum]
NCBI nr blastp      gi|270014562        0.0       75.77%    hypothetical protein TcasGA2_TC004596 [Tribolium castaneum]
NCBI nr blastx      gi|270014562        0.0       75.31%    hypothetical protein TcasGA2_TC004596 [Tribolium castaneum]
Group
Gene Ontology       GO:0008270              2.5e-22    zinc ion binding
KEGG pathway
InterPro domain     [204-265] IPR002867     2.5e-22    Zinc finger, C6HC-type
Orthology group     MCL11058 Multiple-copy universal gene

Nucleotide sequence:

>DPOGS204817-TA
ATGGACTCTGAAGATGACACAAAAGACGATGTCGATTCTGGTAATGAGTCCAGCGGAGACGATGTCGACTTTGTCATGGACGAGACTCACAGTACAAGGGAACGACAAACGGAACTCGAAGAATATCCGTACGAGGTGTTATCTACAGAGGAAATCGTTCAACATATGGTAGATTGCATAAAAGAAGTGAATACAGTAGTTGAGAGAATGCAGTGGGCCCTTACCCATCGAAACTGGAGTGAAGAAGACTTCAAAAGAGTTCTATGGACAGATGAGTCCAAATTTGAAGTGCTTGGGAGCAAAAGACGTGTTTTTGTTCGGCGGAGTGCCAAGGAAAAGATGATGCCAGACTGTATCCTGCCCACAGTGAAGCATGGCGGTGGTTCCATTATGGTTTGGGGTTGTTTCTCGAGCCGTGGAACTGGAGATTTGATTATGACCGGTCTGGAATGTGGTCACAGATTCTGTACACAGTGCTGGTGTGAATATTTAACTACTAAAATAATGGAAGAAGGCCTGGGTCAGACGATAGCGTGTGCGGCACACGCGTGCGACATTCTCGTGGATGACGCGACTGTGATGCGTCTCGTCAGAGATCCGAGGGTCAAACTCAAGTACCAGCACATCATCACCAACAGTTTCGTAGAGTGTAACCGCCTCCTCCGCTGGTGTCCATCCCCCGACTGCAGCAATGCCATCAAAGTGGCCTATGTTGAGGCAGCGGCAGTAACCTGCCGATGTGGTCACACGTTCTGTTTCGCCTGCGGTGAGAACTGGCACGATCCCGTCAGGTGCTGTCTGCTGAGGAAGTGGATAAAGCTTGAAACATCGAACTGGATAGCGGCCAATACTAAGGAGTGTCCCAAATGTAACGTGACCATAGAGAAGGACGGCGGCTGTAACCACATGGTGTGTAAGAATCAGAACTGTAAGGCCGACTTCTGCTGGGTGTGCCTCGGACCCTGGGAGCCTCACGGCAGCAGCTGGTACAACTGCAACCGGTATGACGTGGACGAGGCCAAAGCGGCCCGCGACTCCCAGGAGCGCTCGCGTGCAGCGCTGCAGCGCTATTTGTTCTACTGCAACCGCTATATGAACCACATGCAATCGCTGCGCTTCGAGTCCAAATTGTACGCATCCGTGAAGGAAAAGATGGAAGAGATGCAACAGCATAACATGAGCTGGATTGAGGTGCAATTCTTAAAGCGAGCTGTGGACATCCTCTGCCAGTGCCGTCAGACCCTCATGTACACTTATGTGTTCGCGTACTACTTGAGGAAGAACAATCAGTCTGTCATCTTCGAGGATAACCAACGCGACCTGGAATCGGCCACCGAGACTCTATCGGAATACCTGGAAAGAGACATCACTAGCGAGAATTTGGCTGACATCAAGCAGAAAGTGCAGGATAAGTACAGGTAA

Protein sequence:

>DPOGS204817-PA
MDSEDDTKDDVDSGNESSGDDVDFVMDETHSTRERQTELEEYPYEVLSTEEIVQHMVDCIKEVNTVVERMQWALTHRNWSEEDFKRVLWTDESKFEVLGSKRRVFVRRSAKEKMMPDCILPTVKHGGGSIMVWGCFSSRGTGDLIMTGLECGHRFCTQCWCEYLTTKIMEEGLGQTIACAAHACDILVDDATVMRLVRDPRVKLKYQHIITNSFVECNRLLRWCPSPDCSNAIKVAYVEAAAVTCRCGHTFCFACGENWHDPVRCCLLRKWIKLETSNWIAANTKECPKCNVTIEKDGGCNHMVCKNQNCKADFCWVCLGPWEPHGSSWYNCNRYDVDEAKAARDSQERSRAALQRYLFYCNRYMNHMQSLRFESKLYASVKEKMEEMQQHNMSWIEVQFLKRAVDILCQCRQTLMYTYVFAYYLRKNNQSVIFEDNQRDLESATETLSEYLERDITSENLADIKQKVQDKYR-
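
Consistency check:

The transcript and protein lengths reported in this record can be cross-checked with simple codon arithmetic. The sketch below assumes that the transcript is coding sequence only (no UTRs) and that the trailing "-" in the protein sequence marks the stop codon (the nucleotide sequence ends in TAA). The function name `expected_transcript_length` is a hypothetical helper introduced for illustration, not part of any database tooling.

```python
def expected_transcript_length(protein_aa: int, include_stop: bool = True) -> int:
    """Return the coding length in bp for a protein of `protein_aa` residues.

    Assumption: the transcript contains only coding sequence, so its length
    is three bases per residue, plus one stop codon when `include_stop` is set.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Lengths taken from the record above.
TRANSCRIPT_BP = 1422
PROTEIN_AA = 473

# 473 residues + 1 stop codon = 474 codons = 1422 bp.
lengths_agree = expected_transcript_length(PROTEIN_AA) == TRANSCRIPT_BP
print("lengths consistent:", lengths_agree)
```

Under these assumptions the two reported lengths agree: 473 codons plus one stop codon give 474 codons, or 1422 bp.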