Monarch geneset OGS2.0

DPOGS200151
Transcript: DPOGS200151-TA | 1533 bp
Protein: DPOGS200151-PA | 510 aa
Genomic position: DPSCF300128 - 204500-208834
RNAseq coverage: 240x (Rank: top 43%)
Annotation
Heliconius: HMEL005833 | 0.0 | 88.87%
Bombyx: BGIBMGA002793-TA | 0.0 | 80.50%
Drosophila: Ero1L-PB | 5e-143 | 54.57%
EBI UniRef50: UniRef50_D6WXL9 | 1e-142 | 57.85% | Putative uncharacterized protein n=2 Tax=Coelomata RepID=D6WXL9_TRICA
NCBI RefSeq: NP_001138795.1 | 0.0 | 80.25% | endoplasmic reticulum oxidoreduction 1-like [Bombyx mori]
NCBI nr blastp: gi|223890160 | 0.0 | 80.25% | endoplasmic reticulum oxidoreduction 1-like precursor [Bombyx mori]
NCBI nr blastx: gi|223890160 | 0.0 | 80.25% | endoplasmic reticulum oxidoreduction 1-like precursor [Bombyx mori]
Group
Gene Ontology: GO:0016671 | 1e-205 | oxidoreductase activity, acting on a sulfur group of donors, disulfide as acceptor
    GO:0006467 | 1e-205 | protein thiol-disulfide exchange
    GO:0050660 | 1e-205 | flavin adenine dinucleotide binding
    GO:0005789 | 1e-205 | endoplasmic reticulum membrane
    GO:0055114 | 1e-205 | oxidation-reduction process
KEGG pathway: xla:446754 | 1e-121
    K10976 (ERO1LB) maps-> Protein processing in endoplasmic reticulum
InterPro domain: [44-497] | IPR007266 | 1e-205 | Endoplasmic reticulum oxidoreductin 1
Orthology group: MCL11077 | Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200151-TA
ATGGTATACAAAAAATATTGTGTTGTGCTTCTAATTTTCGCTCTCGCGATAGTTCAAGCTGTCGGTTACGACACGGAGCTGTTCGAGACCGTCGCCTGTGATAGCACTGCATGTTTCGATGCATTACATGGAGCTCTGGGTGACTGTTCTTGTAATGTTGACACGATCGACTATTTCAACAACGTGAAGATATTCCCTCGCATCCAGAGTTTGGTAAGCAAAGACTATTTTCGTTTCTACAAGGTCAATTTGAAAAAAGAATGTCCATTTTGGGCCGACGATAGCAGGTGTGCCATGAAATATTGCCATATAAAAACCTGTTCCAAGGAGAGTGTTCCCGGTTTCGAGAACGATTACGAAAATGAACTCGAGGAGGAACCACCGGCTCTAAAGTATTCTCAGGAGGCTCAGACTCCATGTAATAGTGATGCGGATCATGATCCTGCTCTTGGATATCTCAATATGACTCTAAGTGTCGCAAGCCAATTCGAAATCGCAAAATGGAAAGCTTACGACGATTCTGTTGGCAATTTCTGTGATTGTGATGATAAAGATGCTGAAGCAGAATATGTAGACTTGTCTTTAAACCCCGAGAGATATACAGGATATAAAGGCACCTCTGCACACAGAATTTGGAGAAGCATCTACGAAGAGAATTGTTTTAGACCTAAGGTTAATCCATACAAATCTTTTCCTTATGTTTTAAGTTCAGACTTAGGCAATATGTGTTTGGAGAAGAGAGTGTTCTACAGAGCAGTTTCTGGATTACATACAAGTATAAACATACATTTATGTTCAAAATATTTGCTCTCAGAAAAGAAGTTAGGTTTTGCGGCTCCACCTGAAGGTGAATGGGGACCGAATCTGGCAGAGTTCCAACGTCGTTTTGATCCATCTCAGACATTTGGTGAGGGTCCAAATTGGCTGAAGAATTTATACTTTGTATACTTACTTGAAATGAGAGCTTTGGCTAAAGCTGGACCTTATTTGGAACGGGAAGATTATTTCACTGGTAACCCTGTCGAAGATGAAGAAACGAGAGACGCTATACGTAACATGTTGGGTGTTATATACACTTTCCCCGATCACTTCAATGAATCATCCATGTTCAACGGTGGCAGCCCGGCTGCAAAACTAAAGCAAGAGTTCCGAGATCATTTTTGGAACATCTCACGGATAATGGATTGTGTTGGATGTGACAAATGCAAGTTGTGGGGCAAATTACAAACCCAGGGCTTGGGGACCGCACTCAAAATATTATTCTCCGGCCAGTGGGATAGTTTTGATGAGGAAGCAGAGAAGGGTAGGATAGTACTGAGGCATAAAGCACACAAGCGCTTGCAGAGGACTGAGATAGTGGCACTATTTAATGCCTTCGCAAGACTGTCAAATAGTATAAGGGAGTTAGAAAACTTCAGGAATATGCTCAGCAGCCACACAGACGGTCAGAAGGACGTGTTTAGCGGAGCGGCCGGTGACATCAAACGAGAGCGTTGTACGTCGGGAGTATCGAAACCTAGCCTTTGGAGTTAG

Protein sequence:

>DPOGS200151-PA
MVYKKYCVVLLIFALAIVQAVGYDTELFETVACDSTACFDALHGALGDCSCNVDTIDYFNNVKIFPRIQSLVSKDYFRFYKVNLKKECPFWADDSRCAMKYCHIKTCSKESVPGFENDYENELEEEPPALKYSQEAQTPCNSDADHDPALGYLNMTLSVASQFEIAKWKAYDDSVGNFCDCDDKDAEAEYVDLSLNPERYTGYKGTSAHRIWRSIYEENCFRPKVNPYKSFPYVLSSDLGNMCLEKRVFYRAVSGLHTSINIHLCSKYLLSEKKLGFAAPPEGEWGPNLAEFQRRFDPSQTFGEGPNWLKNLYFVYLLEMRALAKAGPYLEREDYFTGNPVEDEETRDAIRNMLGVIYTFPDHFNESSMFNGGSPAAKLKQEFRDHFWNISRIMDCVGCDKCKLWGKLQTQGLGTALKILFSGQWDSFDEEAEKGRIVLRHKAHKRLQRTEIVALFNAFARLSNSIRELENFRNMLSSHTDGQKDVFSGAAGDIKRERCTSGVSKPSLWS-
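
Consistency check (sketch):

The transcript and protein lengths listed in this record (1533 bp and 510 aa) can be cross-checked with a little arithmetic. The minimal Python sketch below assumes the transcript consists of coding sequence only (no UTRs), so that its length equals three bases per residue plus one stop codon. The function names `lengths_consistent` and `ends_with_stop` are hypothetical helpers introduced for illustration and are not part of the database. The short test string is the last 18 bases of DPOGS200151-TA as shown above.

```python
# Hypothetical helpers for illustration; not part of the source database.
# Assumption: the transcript is coding sequence only (no UTRs).

STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code


def lengths_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if transcript_bp equals 3 bases per residue plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


def ends_with_stop(cds: str) -> bool:
    """Return True if the last three bases of the sequence form a stop codon."""
    return len(cds) >= 3 and cds[-3:].upper() in STOP_CODONS


if __name__ == "__main__":
    # Lengths as listed in the record: 1533 bp transcript, 510 aa protein.
    print(lengths_consistent(1533, 510))
    # Last 18 bases of DPOGS200151-TA, copied from the nucleotide sequence.
    print(ends_with_stop("CCTAGCCTTTGGAGTTAG"))
```

Under that assumption, 3 x (510 + 1) = 1533, and the trailing TAG codon corresponds to the "-" that terminates the protein sequence.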