Monarch geneset OGS2.0

DPOGS206017
Transcript        DPOGS206017-TA  3471 bp
Protein           DPOGS206017-PA  1156 aa
Genomic position  DPSCF300253 + 184050-204481
RNAseq coverage   458x (Rank: top 27%)
Annotation
Heliconius       HMEL006816        2e-123  43.66%
Bombyx           BGIBMGA012636-TA  0.0     62.06%
Drosophila       Nedd4-PJ          0.0     67.83%
EBI UniRef50     UniRef50_Q9VVI3   0.0     67.83%  E3 ubiquitin-protein ligase Nedd-4 n=15 Tax=Coelomata RepID=NEDD4_DROME
NCBI RefSeq      XP_002094996.1    0.0     67.83%  GE19920 [Drosophila yakuba]
NCBI nr blastp   gi|383864855      0.0     66.83%  PREDICTED: E3 ubiquitin-protein ligase Nedd-4-like isoform 1 [Megachile rotundata]
NCBI nr blastx   gi|347964500      0.0     48.92%  AGAP000787-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology    GO:0006464  1.3e-153  protein modification process
                 GO:0016881  1.3e-153  acid-amino acid ligase activity
                 GO:0005622  1.3e-153  intracellular
                 GO:0005515  1.1e-30   protein binding
KEGG pathway     dme:Dmel_CG42279  0.0
                 K10591 (NEDD4, RSP5)  maps-> Ubiquitin mediated proteolysis
                                              Endocytosis
InterPro domain  [764-1155]  IPR000569  1.3e-153  HECT
                 [59-199]    IPR008973  1.1e-30   C2 calcium/lipid-binding domain, CaLB
                 [66-152]    IPR000008  1.9e-21   C2 calcium-dependent membrane targeting
                 [672-712]   IPR001202  9.8e-19   WW/Rsp5/WWP
Orthology group  MCL11389  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206017-TA
ATGAGAGATATACGTTCTCTGTCCATCATGCATAACGTAAGACCTGTCGAGTTTCCGACAAGGACTATGCAGGAGGAAAACCAGGCTCATAGGAACCGCTCATCTTCTTGGTCCTCTTGGAGACGGACCAATTTGCTCCGTAATAACGAACCCAGATATTCCCTGCAATATGAGACTGATGAGAATAGCTTCATGCTGAGGTTGAAGATAGTAGGAGCTTATTCACTAGCGAAGAAAGACATATTTGGTGCTAGCGACCCCTACGTCCGGGTGGAACTTCAGAAGGTTGACAGCGACTTCACGTTCGAGACATTTCTGACGAAAACTAAGAAAAGGACATTAAACCCGGTATGGAACCAGGAGTTCGTATTCCGGGTGAAGCCTCAAGAGCAGAAGCTGCTGATCCAGGTGTTCGACGAGAACCGCCTCACCCGGGACGACTTCCTAGGGATGGTGGAGCTGGCGCTGGCCGGCGTACCCTCGGAGAGCGCCACAAGCCCGCGACCGGCCAGCGTCAAATACCCCCTGCGTCCGCGCAGGTCCGTCGCTCGGTCCCGTGTCCGCGGGTCCCTGGAGGTGTACTGTGCTCTGACTGGGCGAGTCGATGAGCGAGCGTCGGAAAACGAACCCGCGCCGCCAGCTATCAACACGGTGACAGTACCGTCGCCAACCGGAGACGACTGGGAGTTGGTGGACGCACCAAACAATGTGTCAGGGGAGCCGCTGCCCGCGGGCTGGGAGGAGCGCCAGGACGCTAACGGCCGTACTTACTATGTGAACCACGTGGCCAGGTCCACGCAATGGGAGAGGCCTACGTTGATGTTAGTGACAGTACCGTCGCCAACCGGAGACGACTGGGAGTTGGTGGACGCACCAAACAATGTGTCAGGGGAGCCGCTGCCCGCGGGCTGGGAGGAGCGCCAGGACGCTAACGGCCGTACTTACTATGTGAACCACGTGGCCAGGTCCACGCAATGGGAGAGGCCTACGTTGATGCGTAATATGTCAACGGAGTCACAAGCTGAGCGTATGGAAACAGCAGCCACTGAGTTCCAAAGACGTTTCCATATCTCGGCTGATGAAGACCGGCCTCCAGCGAGACAGGCACAGGAGGAATCGAATGATAGACAGCCGGACAATCCTACCGAGCACACGGCCCACGCTTCACCGATACTATCATCTGAGATCAGTCAGGAATCGACACAGGACGCGACACAGCACGCGACACAGAACGCGACACAGGACGCGACACAGGAAGCGACACAGGAAGCAACACAGGACGCGACACAGGAAGTGACACAGGACGCTTCGCAATCAGAAGACGGTGCACAAGCGTCCACATCAGAAACAACTCAAGAACCTGTACAGGATAATATAACGAAAAACTGTGATACTAATTTAAGAGACAACAAAGAAAACGACGACGTCTGTCAAAATGACGCGACAGGCGGAGGTGATGAAGTGGACAATGCTCAAGCAGATAATAATAATGCTAGTCAAGAAATTATAGAGATACAAGTGGCTGGGGACGTGGATGTAGACGGGGAAGGGGGGATCAACGATACGGAATCAAATGGCGTTGTAGACGATGACGCGAGAACAGAGATCGAGGAAATCACTTTTGGGAGAGAGAGAACGCTCACAGACGAGACTGGGGGCTCCACGGAATCTTTGACTTTTGATGAGAACCATTTCAGCACGCCCACGGGAGGCATCACGCCGGAGAGACGGATGTCCCCGTCACGACGGCGGACGACGAACTCGCTGACGGAGTCGGAGGACGAAACCGACGGATCCTCGGAGAGCACTCGAAGCACCAGCTCCAGCAGCAGCCAGAGCCAGAACCTACCAAACAGCGACGGTCTGCCGTCCGGCTGGAGTATGCAGCGAGCTCCGAATGGAAGAATATTCTTCATAGATCACAACCAGAAAACTACAACATGGATCGATCCCAGAACAGGTTGCGCGTCCAGCCTGCCCTCAGCCGCGGCGAGCGCGGA
GGTGGACACGGACGAGTTGGGCGCCCTGCCCGAGGGTTGGGAGGAGAGAGTCCACACTGATGGAAGGATCTTCTTCATAGACCACAACACTCGCACGACCCAGTGGGAGGACCCTCGGCTGTCCAACCCTCAGATCGCGGGTCCGGCCGTGCCTTACTCCAGGGACTACAAACGGAAATACGAGTACCTCAAGAGTCAGCTGCGGAAACCTAGCAACGTGCCCAACAAGTTCGAGATCAAGGTCCGGCGGAACTCGATCCTGGAGGACTCGTACAGGATCATCAGCTCCGTCAGCCGCATCGACCTGCTCAAGACCAAGCTGTGGGTGGAGTTCGAGTCCGAGGTCGGCCTAGACTACGGCGGTCTCGCGCGCGAGTGGTTCTTCCTGCTGTCCAAGGAGATGTTCAACCCCTACTACGGCCTGTTCGAGTACTCCGCCATGGACAACTACACGCTCCAGATAAACCCGAACAGCGGCGTCTGCAACGAGGAGCATCTCAGCTACTTCAAGTTCATAGGCCGAGTCGCCGGCATGGCCGTCTACCACGGGAAGCTGTTGGACGCTTTCTTCATCCGTCCCTTCTACAAGATGATGGTGGGGAAGACGATCGAGCTGCAGGACATGGAGTCCGTGGACCTGGAGTACTACAACTCGCTCGTGTGGATCAAGAGCCTAGCGAGCTATCAGCCGTCCACCAATCAATCGGCAATCAGAGAAACTCATTCATTCGGTCAATGCGCCGTTCAGTCCATCAATCAAGGGTTTGTTTATACAATGTGTAATTCAATTCACTTGAAGGATATGTTCCAGAAGCATTCCCGTCACCACAGAGAGGAGAACGATCCGTCCGAACTGTATCTGACGTTCGCCGTGGACGAGGAGCAGTTCGGGAACACCATCCAGAGGGAGCTGAAGCCGGGCGGCGCCAACATAGCGGTGGACGAGGACAACAAGGACGAGTACATCAAGCTGGTGATCCAGTGGCGGTTCGTGAGTCGCGTGCAGGAGCAGATGTTCGCGTTCCTGGAGGGGCTGGGGGCGCTGGTGCCGTTGCCGCTGCTGAAGATCTTCGACGAGCACGAGCTGGAGCTGCTGCTGTGCGGCATCCAGCACATCGACGTCCGCGACTGGCGCGCCAACACGCTCTACAAGGGCGACTATCACGCCAACCACCTCGTGGTGCAGTGGTTCTGGAGGGTGGTGTTGTCTTTCTCCAACGAGATGAGATCCCGTCTGCTGCAGTTCGTGACGGGCACCTCGCGCGTGCCCATGAACGGGTTCAAGGAACTGTACGGCTCCAACGGCCCGCAGCTGTTCACCATCGAAAAGTGGGGCGGACCCGACAACTACCCTCGGGCGCACACCTGCTTCAACCGCATCGACCTGCCCCCGTACGAGAGCTACATGCAGTTGAGGGAGAAGCTGGTGAAGGCTATTGAGGGCTCCCAGGGCTTCGCTGGGGTCGACTGA

Protein sequence:

>DPOGS206017-PA
MRDIRSLSIMHNVRPVEFPTRTMQEENQAHRNRSSSWSSWRRTNLLRNNEPRYSLQYETDENSFMLRLKIVGAYSLAKKDIFGASDPYVRVELQKVDSDFTFETFLTKTKKRTLNPVWNQEFVFRVKPQEQKLLIQVFDENRLTRDDFLGMVELALAGVPSESATSPRPASVKYPLRPRRSVARSRVRGSLEVYCALTGRVDERASENEPAPPAINTVTVPSPTGDDWELVDAPNNVSGEPLPAGWEERQDANGRTYYVNHVARSTQWERPTLMLVTVPSPTGDDWELVDAPNNVSGEPLPAGWEERQDANGRTYYVNHVARSTQWERPTLMRNMSTESQAERMETAATEFQRRFHISADEDRPPARQAQEESNDRQPDNPTEHTAHASPILSSEISQESTQDATQHATQNATQDATQEATQEATQDATQEVTQDASQSEDGAQASTSETTQEPVQDNITKNCDTNLRDNKENDDVCQNDATGGGDEVDNAQADNNNASQEIIEIQVAGDVDVDGEGGINDTESNGVVDDDARTEIEEITFGRERTLTDETGGSTESLTFDENHFSTPTGGITPERRMSPSRRRTTNSLTESEDETDGSSESTRSTSSSSSQSQNLPNSDGLPSGWSMQRAPNGRIFFIDHNQKTTTWIDPRTGCASSLPSAAASAEVDTDELGALPEGWEERVHTDGRIFFIDHNTRTTQWEDPRLSNPQIAGPAVPYSRDYKRKYEYLKSQLRKPSNVPNKFEIKVRRNSILEDSYRIISSVSRIDLLKTKLWVEFESEVGLDYGGLAREWFFLLSKEMFNPYYGLFEYSAMDNYTLQINPNSGVCNEEHLSYFKFIGRVAGMAVYHGKLLDAFFIRPFYKMMVGKTIELQDMESVDLEYYNSLVWIKSLASYQPSTNQSAIRETHSFGQCAVQSINQGFVYTMCNSIHLKDMFQKHSRHHREENDPSELYLTFAVDEEQFGNTIQRELKPGGANIAVDEDNKDEYIKLVIQWRFVSRVQEQMFAFLEGLGALVPLPLLKIFDEHELELLLCGIQHIDVRDWRANTLYKGDYHANHLVVQWFWRVVLSFSNEMRSRLLQFVTGTSRVPMNGFKELYGSNGPQLFTIEKWGGPDNYPRAHTCFNRIDLPPYESYMQLREKLVKAIEGSQGFAGVD-
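
Consistency check (illustrative sketch):

The lengths listed in this record (3471 bp transcript, 1156 aa protein) fit a coding sequence of 1156 codons plus one stop codon, and all four InterPro domain ranges fall inside the protein. The Python sketch below shows one way to check this. It assumes that the transcript holds only the coding sequence (no UTRs) and that domain coordinates are 1-based and inclusive; neither assumption is stated in the record. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 3471
PROTEIN_AA = 1156
INTERPRO_DOMAINS = {
    "IPR000569": (764, 1155),  # HECT
    "IPR008973": (59, 199),    # C2 calcium/lipid-binding domain, CaLB
    "IPR000008": (66, 152),    # C2 calcium-dependent membrane targeting
    "IPR001202": (672, 712),   # WW/Rsp5/WWP
}


def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues.

    Assumption: the transcript is CDS-only, so its length is three
    nucleotides per codon, plus one stop codon when include_stop is True.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def domain_within_protein(start: int, end: int, protein_aa: int) -> bool:
    """True if a 1-based, inclusive range lies inside the protein (assumed convention)."""
    return 1 <= start <= end <= protein_aa


if __name__ == "__main__":
    print("CDS length matches:", expected_cds_length(PROTEIN_AA) == TRANSCRIPT_BP)
    for accession, (start, end) in INTERPRO_DOMAINS.items():
        print(accession, "in range:", domain_within_protein(start, end, PROTEIN_AA))
```

The same two helpers could be applied to other records by swapping in their listed lengths and domain ranges. A mismatch would suggest UTRs in the transcript or a different coordinate convention, not necessarily an error in the record.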