Monarch geneset OGS2.0

DPOGS209902
TranscriptDPOGS209902-TA2004 bp
ProteinDPOGS209902-PA667 aa
Genomic positionDPSCF300049 + 412424-431758
RNAseq coverage694x (Rank: top 18%)
Annotation
HeliconiusHMEL0118502e-7751.20% 
BombyxBGIBMGA000200-TA2e-7071.19% 
Drosophilasip3-PA2e-1624.50% 
EBI UniRef50UniRef50_E2BPI84e-10843.50%Autocrine motility factor receptor, isoform 2 n=1 Tax=Harpegnathos saltator RepID=E2BPI8_HARSA
NCBI RefSeqXP_973078.14e-12043.98%PREDICTED: similar to AGAP007538-PA [Tribolium castaneum]
NCBI nr blastpgi|910865698e-11943.98%PREDICTED: similar to AGAP007538-PA [Tribolium castaneum]
NCBI nr blastxgi|910865697e-11443.75%PREDICTED: similar to AGAP007538-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055155e-07protein binding
GO:00082703.3e-06zinc ion binding
KEGG pathwaytca:6618531e-119 
 K10636 (AMFR, GP78)maps-> Protein processing in endoplasmic reticulum
InterPro domain[417-475] IPR0130835.4e-14Zinc finger, RING/FYVE/PHD-type
[534-573] IPR0038925e-07Ubiquitin system component Cue
[431-468] IPR0018413.3e-06Zinc finger, RING-type
Orthology groupMCL14911 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209902-TA
ATGCCTGTGACACTAGTGGATAGGCTGCCCTTGCCGAATCTCAAGGTTTACTCGGCGGGAAGTGTTTTACTATTGTCAGTAGCCGTGTATTATGCTATATGTGTAACTAGTGACCCCAGCTGGAAGGTAAATGCGACTCTCCAGCGCGAGGAATCGGCAATTTCTCACGATGAAGTAAAGATGGTGCAACCAGATAGTTTGGCTGCGGGCGAGCTCACTGCTGGCTCCAAAAATGTTTCCGATCAGTTCATGGAAGTTATGACCTTTATGATGCAGGAGCCGCTTTGTATGTGGTTGTTGGCGTACTGTTCCCTGGCTCTGTTCGGCCTGGCCATCCAGCGCGCGGTGTTCGGCCGGCTGCGTGTGTCCGAGGCTCAGCGGGTGAAGGACAAATTCTGGAACTACGTCTTCTACAAGTTCATCTTTGTGTTCGGCGTCCTCAACGTGCAGTACATGGACGAGGTTCTGCTATGGAGCGGATGGTTCACCGTCGTTGGATTCCTCCACCTGCTGGGACAGCTCTGCAAAGACAGATTCGACTATACGTTGATCAACATGGCGTACTGTTCCCTGGCTCTGTTCGGCCTGGCCATCCAGCGCGCGGTGTTCGGCCGGCTGCGTGTGTCCGAGGCTCAGCGGGTGAAGGACAAATTCTGGAACTACGTCTTCTACAAGTTCATCTTTGTGTTCGGCGTCCTCAACGTGCAGTACATGGACGAGGTTCTGCTATGGAGCGGATGGTTCACCGTCGTTGGATTCCTCCACCTGCTGGGACAGCTCTGCAAAGACAGATTCGACTATCTGTCCTCGTCGGCCAGCGTGTCCCGGGGGGCGGTCGTTCGTCTGCTGTGTTTGCTGGCGGGCATGTTGCTAGCGGCCGGGGGGCTCGCGGCTGCCGCGGTCACTTGGGGCCTGGCCGCCGGCAGGGACACCTTCGCCTTCATGATCGCTGAGTGCCTACTAGTGGCGGTCCCCACGCTGCACGTGATGGCTCGCTACACGCTCAGGGCGAGGAACGCGGACGCCGCCGGCGCCACAGCCTACTACACACATCTAGCATTCGATTCGGTGTCCCTGGTGGTGGAGACGTGTCACGTGTGTCACATGGTGGTGTACAGTAACGTGGTGGTGTCCATGGCGTCGCTGGTGCTGCTCATGCAGCTGAGGCATCTGCTGCACGCCCTGCTCGCGCGACTCAGGAGGCACCGCCTCTACACCGCCCTTTCCACTCACATGACCAAACACTACCCGATGGCCAGCGTAGAGGAGGTGATGAAACATGAGGACAAGTGCGCCATCTGCTGGGAACCCATGACCGAGGCCAGGAAACTCCCCTGCAAACATCTCTTCCACAACTCGTGCCTGTGTCGCTGGGTGCAGCAGGACGCGTCCTGTCCCACGTGCCGGCGCTCGCTGCAGGCTCGGCCCGCACCCTCGCCCGCCGCACCCCACGCACCCCTCGCGCCCCTCACGCCCGCCGCCACCATGGGACTCGACGCCACACACAACCATCTGTTCCACTTTGACGGCTCCCGGTACGTGTCGTGGCTGCCGAGCTTCTCGGTGGAGGTGACGCGGGTCAGGGACGCGCCGCCTCTTGACGATATGGTGGACCAGGTGTTGGCAGTGTTCCCTCAGTACGGTCGCGAGGCCGTGATGGCGGACCTGGCTCGCTCGCGCTCGCCGGACGTCACCGTACACAACATACTGGAGGGCAGACTGCCGCCGCCCCCGCCCCCGCCCTCGCCCTCGCCACCCCCCGCACACGCCCCGCCCGTCCCGCTGGCGACCCCCGCCATCGTGGCCCCCATACAGACGCACACCCCGCACGTCGGATACCACAGCACGGAGGGGTTCTCCTCGGTAGCGGCGGAGAGAGAGGACACGCTGCGGAGACGGAAGGAGGCGCTGCTGGCGGTGGCCAGGAGGAGATACCTGGACAGGCGGGCGGCGGCCGGCGCGGGGGGCGCGGGGGGCGGGCGGCCCGCACACACCGCCAGCTGA

Protein sequence:

>DPOGS209902-PA
MPVTLVDRLPLPNLKVYSAGSVLLLSVAVYYAICVTSDPSWKVNATLQREESAISHDEVKMVQPDSLAAGELTAGSKNVSDQFMEVMTFMMQEPLCMWLLAYCSLALFGLAIQRAVFGRLRVSEAQRVKDKFWNYVFYKFIFVFGVLNVQYMDEVLLWSGWFTVVGFLHLLGQLCKDRFDYTLINMAYCSLALFGLAIQRAVFGRLRVSEAQRVKDKFWNYVFYKFIFVFGVLNVQYMDEVLLWSGWFTVVGFLHLLGQLCKDRFDYLSSSASVSRGAVVRLLCLLAGMLLAAGGLAAAAVTWGLAAGRDTFAFMIAECLLVAVPTLHVMARYTLRARNADAAGATAYYTHLAFDSVSLVVETCHVCHMVVYSNVVVSMASLVLLMQLRHLLHALLARLRRHRLYTALSTHMTKHYPMASVEEVMKHEDKCAICWEPMTEARKLPCKHLFHNSCLCRWVQQDASCPTCRRSLQARPAPSPAAPHAPLAPLTPAATMGLDATHNHLFHFDGSRYVSWLPSFSVEVTRVRDAPPLDDMVDQVLAVFPQYGREAVMADLARSRSPDVTVHNILEGRLPPPPPPPSPSPPPAHAPPVPLATPAIVAPIQTHTPHVGYHSTEGFSSVAAEREDTLRRRKEALLAVARRRYLDRRAAAGAGGAGGGRPAHTAS-