Monarch geneset OGS2.0

DPOGS207313
Transcript: DPOGS207313-TA (1818 bp)
Protein: DPOGS207313-PA (605 aa)
Genomic position: DPSCF300008 + 1521464-1528330
RNAseq coverage: 622x (Rank: top 21%)
Annotation
Heliconius        HMEL012317          0.0      90.61%
Bombyx            BGIBMGA012095-TA    0.0      87.17%
Drosophila        CG1317-PB           3e-168   49.19%
EBI UniRef50      UniRef50_D6WXQ6     0.0      58.37%   Putative uncharacterized protein n=2 Tax=Coelomata RepID=D6WXQ6_TRICA
NCBI RefSeq       XP_966509.1         0.0      58.37%   PREDICTED: similar to ssm4 protein isoform 1 [Tribolium castaneum]
NCBI nr blastp    gi|91089089         0.0      58.37%   PREDICTED: similar to ssm4 protein isoform 1 [Tribolium castaneum]
NCBI nr blastx    gi|332019152        0.0      61.33%   E3 ubiquitin-protein ligase MARCH6 [Acromyrmex echinatior]
Group
KEGG pathway: tca:659835   0.0
 K10661 (MARCH6, DOA10) maps-> Protein processing in endoplasmic reticulum
Orthology group: MCL11900 (Single-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207313-TA
ATGGTTTTCCTGGAACATGTATTTTGGGTGGTGTCATTAAACACATTGTTTATTGTTGTGTTTGCCTTTTGCCCATACCACATAGGCAAATTGGGGGCAGCCATGGCTGGCTTAGCTGCAGAAGGACCTTTTGCTGGCTTGTTAACAGCATTGGCGGGATATGTGATTGTAGGAGCCATACTTGCAGTGCTACATGGTATGGCATCTCTGCTTAGACTGAGAAGTGCAAAAAAAGCATTAGGATTTTGCTATGTGGTAGTGAAAGTGGCATTATTATCTGTTGTAGAAATTGGTGTTATACCACTTGTGTGTGGATGGTGGCTGGATTTGTGTTCTTTGTCAATGTTTGATGCAACACTCAAGGATAGAGAGTCCAGTCTTCAGGCTGCGCCATGGACTTTAATGTTTATCCATTGGCTTGTCGGAATGGTCTATGTGTACTACTTTGCTTCCTTTATCCTATTATTAAGAGAAGTGTTAAGACCCGGTGTTTTATGGTTTTTGAAAAATCTTAATGATCCCGATTTCAGTCCCGTGCAGGAAATGATTCATCTCTCGGTGTGGTCTCACATACGCCGTCTCGTTGTGTCCGCTATGGTTTTCGGCACGGCCGTACTATTCATGCTATGGCTTCCAATACGTGTAATAAAATACGTTCTACCCGGCTTCCTGCCTTACGCGGTCGCTGTGCACACCGATGCGCCGGTCAATGAACTCAGTTTGGAGCTTCTGTTGCTACAGGTTATCCTACCGGCTTTGCTGGAACAGTCTCATACCAGGACGTGGTTGAAGGCCGGTCTACGTGCGTGGTGCGGCTGCGCTGCCGGCCTGCTAGGGCTTCGCTCTTACTTACTCGGGGAGGCCTCGAGAGAGAACGAGCCCCACCCACCACCACACCCACCACATCAGTTAGGAGCCGCTCATAGGGCTCTAATATGGCGAGATGGACCGGCTGGTTTCGAGCCTTACGTTCGGGTGTCCTGGTTCCCTTTACGTCTCGGAGCGTTATTGGCACTCGTATCTGTGTCCTTGGTGTTGGCCAGCGCTCTCACATTGGTTATACCGGTAGCCATTGGTAGAAAAGTCATGACGATCTGGCTACCAAAGGCATCAGAGGGAGTACACGAGCTATACACGGCCGCTTGCGGTATGTACGTGTGTTGGGCGGTTGGTCGCGGCGGTGCGTTGGCCGCTGGTTGGGCTCGCGGTGGTCGTGCAGCACTTCTGGCACGTGCGGCACTGTGGACCAGGAGGGCGGCGCGGGCGGCACTAGCGGCACTCGCGCTGCTGGGTTTGGTGCCGCTCATGTTCGGACTGCTTCTAGAATTGGTCCTAGTGATTCCTCTCAGGGTTCCCTTGGAGCAGTCTCCGGTACTGTTTGTATGGCAGGATTGGGCTTTGGGTGTTCTATACACCAAGATAGTCTGCGCTCTCACAATGATGGGACCGGACTGGAGTATGAGGCGGGCCATTGAGAAAGCTTATAGAGACGGCATTAGAGAAATGGATTTGCAATTTATCCTGCGATCTGTGGCCGCGCCACTGGTCCGCTGGCTGGGGCTTGCGCTGGCTGTACCGTACGTGCTGGCACACAGTGTGGCGCCACTGGTACTGAGCGCGCACGCGCAGAGGAACCTGCTGGTGCGACGGGTGTACCCTGCCTTGTTGCTCATAGCACTGCTGGCTGCGCTCGCCCTCTTCCAGATTCGCCAGTTCACTAAACTGTACGAGCACATAAAGAACGACAAGTACCTGGTCGGCCAGCGGCTCGTGAACTACGACCACCGCCGACACAAACACACGGTCACCTCAAACTGA

Protein sequence:

>DPOGS207313-PA
MVFLEHVFWVVSLNTLFIVVFAFCPYHIGKLGAAMAGLAAEGPFAGLLTALAGYVIVGAILAVLHGMASLLRLRSAKKALGFCYVVVKVALLSVVEIGVIPLVCGWWLDLCSLSMFDATLKDRESSLQAAPWTLMFIHWLVGMVYVYYFASFILLLREVLRPGVLWFLKNLNDPDFSPVQEMIHLSVWSHIRRLVVSAMVFGTAVLFMLWLPIRVIKYVLPGFLPYAVAVHTDAPVNELSLELLLLQVILPALLEQSHTRTWLKAGLRAWCGCAAGLLGLRSYLLGEASRENEPHPPPHPPHQLGAAHRALIWRDGPAGFEPYVRVSWFPLRLGALLALVSVSLVLASALTLVIPVAIGRKVMTIWLPKASEGVHELYTAACGMYVCWAVGRGGALAAGWARGGRAALLARAALWTRRAARAALAALALLGLVPLMFGLLLELVLVIPLRVPLEQSPVLFVWQDWALGVLYTKIVCALTMMGPDWSMRRAIEKAYRDGIREMDLQFILRSVAAPLVRWLGLALAVPYVLAHSVAPLVLSAHAQRNLLVRRVYPALLLIALLAALALFQIRQFTKLYEHIKNDKYLVGQRLVNYDHRRHKHTVTSN-