Monarch geneset OGS2.0

DPOGS204037
TranscriptDPOGS204037-TA2226 bp
ProteinDPOGS204037-PA741 aa
Genomic positionDPSCF300138 + 81942-87863
RNAseq coverage645x (Rank: top 20%)
Annotation
HeliconiusHMEL0049560.070.86% 
BombyxBGIBMGA004785-TA0.066.80% 
DrosophilaHrd3-PA1e-16650.76% 
EBI UniRef50UniRef50_Q7QFQ20.057.56%AGAP000615-PA n=23 Tax=Metazoa RepID=Q7QFQ2_ANOGA
NCBI RefSeqXP_971922.10.049.93%PREDICTED: similar to Sel1l protein [Tribolium castaneum]
NCBI nr blastpgi|910862090.049.93%PREDICTED: similar to Sel1l protein [Tribolium castaneum]
NCBI nr blastxgi|910862090.050.82%PREDICTED: similar to Sel1l protein [Tribolium castaneum]
Group
Gene OntologyGO:00054886.3e-28binding
KEGG pathwaytca:6606130.0 
 K14026 (SEL1, SEL1L)maps-> Protein processing in endoplasmic reticulum
InterPro domain[331-445] IPR0119906.3e-28Tetratricopeptide-like helical
[381-417] IPR0065976.7e-09Sel1-like
Orthology groupMCL10994 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204037-TA
ATGAACCGATTAAAACTATATGTTTTGTTGTTGTGCTTTGTGCTGTCGGCGGCTGAGTTAAAAGGTCCCGATGGTCAAAAAACATCAGATAAGACGGAGAATAAGGACAGTGAGGATACCTCGGTCCCTAAAACGAAATCAGATCAAGAGACCTATAAAGATCTAACTAATGCTCTCAACAAGGAAATACAGTTTTTACAAGACTACGCGACCGCTATACAAATATCCAAAGCTTTTGATCAGGAAATTGAGCCAGAAATTGATCTGCAAAAAATGGAAACACAATTGGAAAAGTTGAAAGATTTAAAGACTACTTTATTAAACACTGCAGATCCAGCTTGGACTGAAGAAGAAAAGGAATCAGCAGGGAACAGCAAATCAGAGAAAAAGGAGTTCAATCCTCAAGAAATACTTGATGCATTGCAACCACTACCCGAGGAATTGGAGCTGGATTTGACAGCTGATATGAAAGAAGCCAAAGCTCAGTATGAAGCCGCAATATCCAAGTTGGAGCGACGTGCTCCTGACCTCGTCGGTGCAATACTTCAAATTAAGCATTCAGCTGAACGTGGGTATATGCCGGCAAAAATCAAACTGGCATGGTCTTATATTTTTGGGGAAGGAGTCGAGCTTGATGTGGATAAGGCGAGGAAGATATTTGAAGAGCTTTTAGAGAAAGGCAATGCTGAAGCTCATGCGGGCATGGGATTCCTTTACGCCACTGGTACATCGGTTCCAGTGTCTCAAGCTCGTGCTCTGGTCCACTACACTTTAGGAGCCATAGCTAATTCGGACTACGCCCAAATGGCCCTTGCATATAGATACTGGGCCGGTGTGACTTTACCAGCCTCCTGTGAAAAGGCCATGGACCTCTATATGAAGGTTGCTCATAAAGTGGCCGGCGGCGTGACCTTGAGCGGCGGCCTGGCCGTGCAGCGTGTGCGGCTGGTGGACGAGGCCGAGGGAGGAGCCTCCGCCCTCGATACCGACCTCATAGAGTACTACCAGCTGCTGGCTGAGAAGGGCGATGTGCAGGCACAGGTCGGCCTGGGGCAGCTTCACTTCACCGGGGGTCGCGGAGTGACCCTCGACCTCAACAAGGCGCTGCATTACTTCACGCAGGCAGCTAAAACCGGAAACGCCGTCGCCAACGCGTTCCTGGGAAAGATATATCTGGAAGGCGGTGATGGTATTAAAGCCGATAACGAGACGGCCATGAGATACTTCAAGAAGGCCGCCGAGATGAACAATCCCATAGGCCAGAGCGGGCTGGGAGTGATGCATCTGCAGGGCCGTGGCGTCGCCAAGGACCCAACAGCCGCCTTCAAGTACTTCGCCATGGCGGCCAACCAGGGCTGGGTTGAGGGACAACTACACCTCGGGTTTATGTATTTCGGCGGTATCGGCGTCCGTCGGGACTTCAAGCAGGCGAATAAGTACTTCAGCCTGGCGTCCCAGTCCGGACACGTGCTGGCTCTATACCACCTGGCGCTGATGCACGCCCAGGGGCTCGGAGTCATGAGATCGTGCGCCACCGCCGTAGAGTTGTTGAAGAACGTCTGCGAGCGCGGTCCGTGGAGCTCCCGTCTGATGTTAGCGCACGCGGCGTGGAGCGCCCGGGACACGGACTCTTCCCTGCTGCAGTACCTGGCGCTGGCGGAGAGGGGCATGGAGGTGGCTCAGAGCAACGCGGCCTACATCCTGGACGTAGGAGAGGGGAACGTGGACGCGGACACGAGACACGCGCGCGCGCTGCAGCTGTGGTCGCGGGCCGCCTCTCAAGGCTGCGCCGCCGCCCGCGTCAAGCTCGGTGATTACCACTACTACGGGCTCGGCACGCCGAAGGACCTGGACGCCGCGGCTCACCACTACCGCCTCGCCTCGGAGCACCTTCACTCGGCGCAGGCGACCTTCAACCTCGGGTTCATGCACGAGCGAGGCCTAGGCCTGGTCAGGGACCTCCACCTCGCCAAGCGGTGTTACGACCTGGCCGCGGACGCGTCTCCTGACGCCAGACTGCCCGCGGCACTGGCCCTGGCCCGCCTACACGCACACTCCGCCATCGAATCCCTACTAGAATCCCTGTCTCAGAGTCCTCTGGCAGTCATCTTCATATCGGGCGACTCCATACTGTTGTCCAACTGGGACCTCTATCTCATGACGGTGCTGGTCGGAGCCCTCGGCTTCGTGATATACTTGCGGCGACCTCACCAGCAAGTCAACTGA

Protein sequence:

>DPOGS204037-PA
MNRLKLYVLLLCFVLSAAELKGPDGQKTSDKTENKDSEDTSVPKTKSDQETYKDLTNALNKEIQFLQDYATAIQISKAFDQEIEPEIDLQKMETQLEKLKDLKTTLLNTADPAWTEEEKESAGNSKSEKKEFNPQEILDALQPLPEELELDLTADMKEAKAQYEAAISKLERRAPDLVGAILQIKHSAERGYMPAKIKLAWSYIFGEGVELDVDKARKIFEELLEKGNAEAHAGMGFLYATGTSVPVSQARALVHYTLGAIANSDYAQMALAYRYWAGVTLPASCEKAMDLYMKVAHKVAGGVTLSGGLAVQRVRLVDEAEGGASALDTDLIEYYQLLAEKGDVQAQVGLGQLHFTGGRGVTLDLNKALHYFTQAAKTGNAVANAFLGKIYLEGGDGIKADNETAMRYFKKAAEMNNPIGQSGLGVMHLQGRGVAKDPTAAFKYFAMAANQGWVEGQLHLGFMYFGGIGVRRDFKQANKYFSLASQSGHVLALYHLALMHAQGLGVMRSCATAVELLKNVCERGPWSSRLMLAHAAWSARDTDSSLLQYLALAERGMEVAQSNAAYILDVGEGNVDADTRHARALQLWSRAASQGCAAARVKLGDYHYYGLGTPKDLDAAAHHYRLASEHLHSAQATFNLGFMHERGLGLVRDLHLAKRCYDLAADASPDARLPAALALARLHAHSAIESLLESLSQSPLAVIFISGDSILLSNWDLYLMTVLVGALGFVIYLRRPHQQVN-