Monarch geneset OGS2.0

DPOGS207007
TranscriptDPOGS207007-TA750 bp
ProteinDPOGS207007-PA249 aa
Genomic positionDPSCF300001 + 1103167-1103997
RNAseq coverage431x (Rank: top 28%)
Annotation
HeliconiusHMEL0143757e-7454.55% 
BombyxBGIBMGA012931-TA8e-8061.60% 
Drosophilawbl-PA1e-3937.55% 
EBI UniRef50UniRef50_E2BCB12e-5145.99%Endoplasmic reticulum protein ERp29 n=7 Tax=Formicidae RepID=E2BCB1_HARSA
NCBI RefSeqXP_001120162.15e-5445.87%PREDICTED: similar to windbeutel CG7225-PA [Apis mellifera]
NCBI nr blastpgi|3800263177e-5345.45%PREDICTED: endoplasmic reticulum resident protein 29-like [Apis florea]
NCBI nr blastxgi|3838519103e-5345.45%PREDICTED: endoplasmic reticulum resident protein 29-like [Megachile rotundata]
Group
Gene OntologyGO:00057882.6e-39endoplasmic reticulum lumen
GO:00093062.6e-39protein secretion
GO:00057831.8e-27endoplasmic reticulum
KEGG pathwaygga:4168822e-34 
 K09586 (ERP29)maps-> Protein processing in endoplasmic reticulum
InterPro domain[24-144] IPR0128832.6e-39ERp29, N-terminal
[19-141] IPR0123361.1e-34Thioredoxin-like fold
[143-240] IPR0116791.8e-27Endoplasmic reticulum, protein ERp29, C-terminal
Orthology groupMCL11952 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207007-TA
ATGAGCGGTTTAATATTTCTTGCTTGTATAATTCTTGTAGTTCCTTCAACGTACCAAGAGTCTTTAGGAGGCTTGGTAGAGTTAGATGAAGTAAGTTTTAACAAATTAGTGCCTAAATTCGATGCCACAGTAGTAAAATTCGACGTGGCTTATCCGTACGGTGATAAGCATGACACCTATGTGGCATTATCAAAGGAATCAAAAGACGTGGATAATCTTTTATTTGCCCAAGTAGGAGTAAAAGATTATGGAGAGAAAGACAACGAAGCATTCGCTAAGAAATATGGAGCTGATAAGAATAACTTCCCTGTTGTAAAACTGTTCCTTAAAGATAAAAGCAAACCAATAACTTTTGATGACTCTGAAGAATTTACAATTGATAGATTACGCCAATTCGTTCGGGAACAGAGTGGAATCTATCTCAGTCTCCCGGGCTGCATAAGAAGTCTAGATTTATTGGCTATTAAATTCAAAAACTCAGATACTGATAAAAGGAAGAGCATCGCAAAAGAAACTGAGAAAGTTCTGGAAAATTTATCAAAAGAGGTGGCGGGTAATGGAAAGATCTACAAAACCATAATGGAAAAGATATTGGAAAAAGGTGACGACTTTATCCAAACAGAAATAACAAGAGTTAATAAATTACTTGCTGGAAAAATAAGTAATGAGAAAAAGAATGAACTAAGCCAAAGGATTAACATCCTTAAATCATTTTTATTACCTCTCAAGAATTATAAGGAGGAACTGTAA

Protein sequence:

>DPOGS207007-PA
MSGLIFLACIILVVPSTYQESLGGLVELDEVSFNKLVPKFDATVVKFDVAYPYGDKHDTYVALSKESKDVDNLLFAQVGVKDYGEKDNEAFAKKYGADKNNFPVVKLFLKDKSKPITFDDSEEFTIDRLRQFVREQSGIYLSLPGCIRSLDLLAIKFKNSDTDKRKSIAKETEKVLENLSKEVAGNGKIYKTIMEKILEKGDDFIQTEITRVNKLLAGKISNEKKNELSQRINILKSFLLPLKNYKEEL-