Monarch geneset OGS2.0

DPOGS207596
Transcript         DPOGS207596-TA   951 bp
Protein            DPOGS207596-PA   316 aa
Genomic position   DPSCF300072 + 1018743-1020545
RNAseq coverage    288x (Rank: top 38%)
Annotation
Heliconius        HMEL009731         8e-125   77.93%
Bombyx            BGIBMGA004673-TA   8e-176   94.30%
Drosophila        elgi-PA            1e-140   73.10%
EBI UniRef50      UniRef50_F4WY68    2e-142   74.84%   E3 ubiquitin-protein ligase NRDP1 n=6 Tax=Endopterygota RepID=F4WY68_ACREC
NCBI RefSeq       XP_972869.1        5e-153   80.06%   PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastp    gi|91078878        9e-152   80.06%   PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastx    gi|91078878        2e-152   80.06%   PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene Ontology     GO:0031386   1.1e-83   protein tag
                  GO:0016881   1.1e-83   acid-amino acid ligase activity
                  GO:0016567   1.1e-83   protein ubiquitination
                  GO:0005515   1.3e-10   protein binding
                  GO:0008270   3.4e-06   zinc ion binding
                  GO:0004842   3.4e-06   ubiquitin-protein ligase activity
KEGG pathway      tca:661626   1e-152
                  K11981 (RNF41, NRDP1) maps-> Endocytosis
InterPro domain   [137-316]   IPR015036   1.1e-83   USP8 interacting
                  [8-76]      IPR013083   6.1e-16   Zinc finger, RING/FYVE/PHD-type
                  [78-197]    IPR008974   1.3e-10   TRAF-like
                  [18-54]     IPR018957   3.3e-09   Zinc finger, C3HC4 RING-type
                  [81-132]    IPR013323   3.4e-06   Seven In Absentia Homolog-type
Orthology group   MCL13078   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207596-TA
ATGGGCTTTGAAATCAAAAGGTTTCAAGGTGACGTCGACGAAGAACTTATTTGCCCTATTTGTTCGGGAGTTCTAGAAGATCCATTACAGGCACCTGCATGCGAGCATGCATTTTGCCGTGTCTGCATAACTGAGTGGATAAGTCGCCAACCGACTTGCCCAGTGGACAGGCAGGCTGTTACCGCATGTCAGCTAAGACCCGTTCCTAGAATACTACGCAACCTGCTATCTAGATTATGCACAAGTTGCGACAACTCACCTCATGGATGCAATGCTGTGCTGAAACTGGACTCACTGGCATCACATTTAGTTGAATGCGAATTTAACCCTAAGCGGCCAATGCCTTGTGAAGCAGGCTGTGGTTTAGTCATACCGAAAGATGAGCTGGCAGAGCACAACTGTGTGCGAGAGTTGCGTGCATTAATAACTTCCCAGCAGGGTAAGCTCACGGACTACCAGCAGGAGCTAGCAGAACAAAGACTGGTCATCAATGAACACAAAAGGGAATTGGCTCTACTTAAAGAGTTCATGCGTGCAATGCGTGTGTCAAATCCTACAATGCGAGCACTAGCCGATCAGATGGAGCGTGACGAAGTTGTGCGGTGGGCTGGATCACTAGCCAGGGCCAGAGTCACCCGCTGGGGTGGCATGATCTCTACACCTGATGACGTCCTGCAGGTGATGATGATTAAGAGAAGCTTATCAGAATCTGGATGTCCTCCACACATTATTGATGATCTGATGGAAAATTGCCATGAGAGGCGATGGCCACCTGGTCTCTCTTCCTTAGAAACTAGACAGAACAATAGAAGATTATATGAAAAATATGTATGCAAAAGAGTTCCTGGAAAGCAAGCGGTACTAGTCTTGCAGTGTGACAATACTCATGTCGATGAACATATGATGGTAGAACCTGGTCTTGTAATGATATTTGCTCATGGCATTGAATAA

Protein sequence:

>DPOGS207596-PA
MGFEIKRFQGDVDEELICPICSGVLEDPLQAPACEHAFCRVCITEWISRQPTCPVDRQAVTACQLRPVPRILRNLLSRLCTSCDNSPHGCNAVLKLDSLASHLVECEFNPKRPMPCEAGCGLVIPKDELAEHNCVRELRALITSQQGKLTDYQQELAEQRLVINEHKRELALLKEFMRAMRVSNPTMRALADQMERDEVVRWAGSLARARVTRWGGMISTPDDVLQVMMIKRSLSESGCPPHIIDDLMENCHERRWPPGLSSLETRQNNRRLYEKYVCKRVPGKQAVLVLQCDNTHVDEHMMVEPGLVMIFAHGIE-
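
Consistency check (illustrative sketch, not part of the original record): the lengths reported above can be cross-checked with simple arithmetic. The Python below assumes that the transcript entry is coding sequence only (it starts with ATG and ends with TAA), that the protein length excludes the stop codon, and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical.

```python
def cds_matches_protein(cds_length_bp: int, protein_length_aa: int) -> bool:
    """True if the CDS length equals the protein's codons plus one stop codon."""
    return cds_length_bp % 3 == 0 and cds_length_bp // 3 == protein_length_aa + 1


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record for DPOGS207596.
transcript_bp = 951
protein_aa = 316
locus_bp = span_length(1018743, 1020545)

print(cds_matches_protein(transcript_bp, protein_aa))  # 951 = (316 + 1) * 3
print(locus_bp)                  # genomic span of the gene model
print(locus_bp - transcript_bp)  # bases in the span not present in the transcript
```

Under those assumptions, the genomic span is longer than the transcript, which would be consistent with a gene model containing one or more introns; the record itself does not list exon boundaries.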