Monarch geneset OGS2.0

DPOGS209576
TranscriptDPOGS209576-TA1305 bp
ProteinDPOGS209576-PA434 aa
Genomic positionDPSCF300015 - 1041309-1055022
RNAseq coverage515x (Rank: top 24%)
Annotation
HeliconiusHMEL0170430.087.61% 
BombyxBGIBMGA006637-TA0.094.53% 
DrosophilaCG42797-PD0.072.10% 
EBI UniRef50UniRef50_D6WZM40.075.69%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WZM4_TRICA
NCBI RefSeqXP_973247.10.075.69%PREDICTED: similar to E3 ubiquitin-protein ligase HECW2 (HECT, C2 and WW domain-containing protein 2) (NEDD4-like E3 ubiquitin-protein ligase 2) [Tribolium castaneum]
NCBI nr blastpgi|910908860.075.69%PREDICTED: similar to E3 ubiquitin-protein ligase HECW2 (HECT, C2 and WW domain-containing protein 2) (NEDD4-like E3 ubiquitin-protein ligase 2) [Tribolium castaneum]
NCBI nr blastxgi|910908860.075.69%PREDICTED: similar to E3 ubiquitin-protein ligase HECW2 (HECT, C2 and WW domain-containing protein 2) (NEDD4-like E3 ubiquitin-protein ligase 2) [Tribolium castaneum]
Group
Gene OntologyGO:00064646.7e-137protein modification process
GO:00168816.7e-137acid-amino acid ligase activity
GO:00056226.7e-137intracellular
KEGG pathway 
InterPro domain[96-434] IPR0005696.7e-137HECT
Orthology groupMCL12063 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209576-TA
ATGGGGGTCATATCGAAGACATTCGGTGTCGTTTTCAGCCTCCTCTTCGAGCAGGAGATTATGAGTTACGTGCCGGCATCTGGGGCTCCAGAACCTTCGCCGGTGGCGTCCCCGGCCGCCGCGAGGAGCACCAGGGCTCCCGCCCCCCAGAGAAGGGACTTCGAGGCCAAACTACGAGCCTTCTATAGAAAACTCGAAAGCAAAGGATACGGTCAAGGACCAGGGAAGCTAAAACTTCACATCCGGCGCGAACATCTGCTAGAAGACGCGTTCAGACGGATCATGTCTTGTAGCAAGAAGGAGCTCCAGAAAGGCAAGCTGTGTGTGCTGTGGGAGGGCGAGGAAGGCCTGGACTACGGCGGACCCAGCAGGGAGTTCTTCTTCCTCCTGTCCAGGGAACTCTTCAACCCTTACTATGGACTGTTCGAATATTCAGCGAACGACACATACACAGTCCACGTGTCGCCGATGTCCGCCTTCGTCGACAATCATCACGAATGGTTCAGATTCTCCGGTCGCGTGTTGGGGTTGGCGTTAGTCCACGGCTACCTGTTAGAGGCGTGGTTCACTCGAGCCCTGTACCGAGCCCTGCTCCGCCTGCCGCCGGCTCTCGAGGACGTGGACGCGCTCGACGCACAGTTCGCCGCGTCACTGAGATGGCTTCAGTCCGCTCGCTGCGTGTCCTCACTGGAGCTGACGTTCGCGGTGTCCGAGCGTCTGGCGGACGGGCGGGTGTTAGAGAGAGAGCTGAGGCCGGGCGGGCGGGAGCTGGCCGTCACCGAGAGGAACAAGAAGGACTACCTCGAGAGACTGGTCCGCTGGCGCGTGCAGAGAGGAGTCGCCGATCAGACGGAGTGGCTCGTCAGGGGGTTCCATGAGGTGGTAGACCCTCGCTTGGTGGCAGCCTTCGATGCTCGCGAGCTGGAGCTGGTGATCGCGGGCGCCCCCGAGCTGGACGTGGCGGACTGGAGGACACACACAGAATACAGGGGAGGATACCACGACCACCACCCCGTCATACTCGCCTTCTGGCAGGCCATCGACAGATTCACGAACGAGCAGCGTCTCCGGCTGGTTCAGTTCGTGACGGGAACGTCGTCTATACCGTACGAAGGCTTCTCGGCGCTCCGCGGGTCCACGGGGCCGCGGAGGTTCTGCATAGAGCGCTGGGGTCGCACGGAGTCCCTCCCGCGAGCTCACACCTGCTTCAACCGCCTCGACCTGCCGCCTTACCCCACGCTGCAGCTGCTGCACGAGAAGCTGCTCCTGGCCGTGGAGGAAACCAACACCTTCGGCATAGAATGA

Protein sequence:

>DPOGS209576-PA
MGVISKTFGVVFSLLFEQEIMSYVPASGAPEPSPVASPAAARSTRAPAPQRRDFEAKLRAFYRKLESKGYGQGPGKLKLHIRREHLLEDAFRRIMSCSKKELQKGKLCVLWEGEEGLDYGGPSREFFFLLSRELFNPYYGLFEYSANDTYTVHVSPMSAFVDNHHEWFRFSGRVLGLALVHGYLLEAWFTRALYRALLRLPPALEDVDALDAQFAASLRWLQSARCVSSLELTFAVSERLADGRVLERELRPGGRELAVTERNKKDYLERLVRWRVQRGVADQTEWLVRGFHEVVDPRLVAAFDARELELVIAGAPELDVADWRTHTEYRGGYHDHHPVILAFWQAIDRFTNEQRLRLVQFVTGTSSIPYEGFSALRGSTGPRRFCIERWGRTESLPRAHTCFNRLDLPPYPTLQLLHEKLLLAVEETNTFGIE-