Monarch geneset OGS2.0

DPOGS202284
TranscriptDPOGS202284-TA1608 bp
ProteinDPOGS202284-PA535 aa
Genomic positionDPSCF300032 - 244337-252807
RNAseq coverage841x (Rank: top 15%)
Annotation
HeliconiusHMEL0047320.080.00% 
BombyxBGIBMGA004940-TA3e-17093.67% 
Drosophilasip3-PA3e-14945.92% 
EBI UniRef50UniRef50_G9I6Y07e-16893.67%Synoviolin-like protein n=1 Tax=Bombyx mori RepID=G9I6Y0_BOMMO
NCBI RefSeqXP_001649607.15e-16852.66%synoviolin [Aedes aegypti]
NCBI nr blastpgi|3796990322e-16793.67%synoviolin-like protein [Bombyx mori]
NCBI nr blastxgi|910870355e-17858.99%PREDICTED: similar to synoviolin [Tribolium castaneum]
Group
Gene OntologyGO:00055151.6e-06protein binding
GO:00082701.6e-06zinc ion binding
KEGG pathwayaag:AaeL_AAEL0046971e-167 
 K10601 (SYVN1, HRD1)maps-> Ubiquitin mediated proteolysis
    Protein processing in endoplasmic reticulum
InterPro domain[287-336] IPR0130832.7e-14Zinc finger, RING/FYVE/PHD-type
[290-328] IPR0018411.6e-06Zinc finger, RING-type
[290-328] IPR0189573.5e-06Zinc finger, C3HC4 RING-type
Orthology groupMCL13507 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202284-TA
ATGAAGGCTTTAGTGGCGACTGTGATCAGTCTAGCCCTCACTACTGTGGTAATAGGCAATGCCTATTACCAGAAGAAACAGTTTTATCCATCTGTAGTTTATCTCACTAATTCCAATCCTAGTATGGCTGTCATGTATTTACAAGCTTTCATACTGGTGTTATTGGTTGGTAAAATTTTGCGCAAAATCTTCTTTGGTCAACTGCGGCCAGCCGAATTCGAGCACTTAATAGAAAGATCTTGGTATGCCATAACGGAGACTTGTCTTGCATTCACGGTGTTCAGAGATGACTTCAATCCTAAGTTCATAGCACTTTTCACTCTCCTGCTGTTTTTGAAGGCATTCCATTGGCTCGCTGAAGATCGAGTCGATTATATGGAAAGGAGCCCGGTGATTGGTTGGCTTTTCCATGTAAGGATTCTAAGTTTATTAACACTTTTGGCTCATGCGGACCTGTATTTCATCCATCACGCGTACTCCTTCACCACATCCAAGGGTCCGTCGGTGCAGGTTGTGTTTGGATTTGAGTACAGCATTCTCATATTTATGATTGCGAATATCTTAATCAAGTACATGCTGCACGCGATCGACTCCCGCTGGGAGGCTCCCTGGGAGAGTAAGGCGGCGGTTCTTCTCTACACTGAGTTGGCTATCAACTTCCTTAAGGTCCTTTTATACATCGGCTTCGTAGCCGTCATGGTGCGAATCTACACTCTGCCGCTGTTCGCCTTCAGACCCATGTACGAAACATTAAGGAGTTTTAATAAAGCGTACAACGATGTGGTATTGTCACGTCGAGCGATCAGGAACATGAACACCCTGTATCCGGACGCTACCCCCGAGGAGCTAGCGGCGGCCGACAATGAATGTATCATATGCAGGGAGGAGATGCATAGCGGAGCTAAGAAGCTTCCGTGCAACCACATCTTCCACGCCGCCTGCCTCCGTCTCTGGTTCCAACGACAGCAGACTTGTCCGACTTGTCGTCTTAACGTTCTTAGAGCGCCAGCGCCTGAGCCGCCGAACGCTAACCCCAACGCCAACGTGCCGAACGTCAACCCGCCAAACGTCAACGCGCCTAACGTGCAGGCTCCGACCCCACCAGCCGTTCCACCACCACCCTTCCCCAATATGCCTCCACCACCACCGATGATGGGTTGGGCGCTCCCTCCTCCTCCCCCCCGGCCGGCCTCGCTCGCTAGTCTCACCACCGAGGAACTGAGACGTATGGAGGGAAACGAGAGAAGGAACATAGAGGCGCGTCTACAGTTACTCATGGAGATCCAGTCGATGCTGGACGCATCCGTGCTGTTGATGCAGCAGTACTCGAACGTTGTCTCCAACCAGCCCGCCAACCTGTCCAACGTGGCCACACAGACGGACATACGAAGGACGGAACCTTCCCCCGAGCCGTCTCCGGTGCGTGTGAGAGCGGATCCGGAACCCGTTGCTTCTACTTCCAAGATGTCGACCTACATCGCTACACCTGGACCATCCAGCAGACACGACTTTGAAGATATGGAGACGAGTGTAAATAGGGAAATATCTGATGCCGACGCTGAAGAGCTCAGGCGACGTAGATTACAGAAGTTTAGTGCAGAAAAGTGA

Protein sequence:

>DPOGS202284-PA
MKALVATVISLALTTVVIGNAYYQKKQFYPSVVYLTNSNPSMAVMYLQAFILVLLVGKILRKIFFGQLRPAEFEHLIERSWYAITETCLAFTVFRDDFNPKFIALFTLLLFLKAFHWLAEDRVDYMERSPVIGWLFHVRILSLLTLLAHADLYFIHHAYSFTTSKGPSVQVVFGFEYSILIFMIANILIKYMLHAIDSRWEAPWESKAAVLLYTELAINFLKVLLYIGFVAVMVRIYTLPLFAFRPMYETLRSFNKAYNDVVLSRRAIRNMNTLYPDATPEELAAADNECIICREEMHSGAKKLPCNHIFHAACLRLWFQRQQTCPTCRLNVLRAPAPEPPNANPNANVPNVNPPNVNAPNVQAPTPPAVPPPPFPNMPPPPPMMGWALPPPPPRPASLASLTTEELRRMEGNERRNIEARLQLLMEIQSMLDASVLLMQQYSNVVSNQPANLSNVATQTDIRRTEPSPEPSPVRVRADPEPVASTSKMSTYIATPGPSSRHDFEDMETSVNREISDADAEELRRRRLQKFSAEK-