Monarch geneset OGS2.0

DPOGS210466
TranscriptDPOGS210466-TA1770 bp
ProteinDPOGS210466-PA589 aa
Genomic positionDPSCF300062 + 346521-351234
RNAseq coverage2x (Rank: top 92%)
Annotation
HeliconiusHMEL0134147e-13272.07% 
BombyxBGIBMGA014318-TA3e-10254.81% 
DrosophilaCG7600-PA1e-3033.71% 
EBI UniRef50UniRef50_D6X5569e-6736.97%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X556_TRICA
NCBI RefSeqXP_972360.12e-6736.97%PREDICTED: similar to CG7600 CG7600-PA [Tribolium castaneum]
NCBI nr blastpgi|910945313e-6636.97%PREDICTED: similar to CG7600 CG7600-PA [Tribolium castaneum]
NCBI nr blastxgi|910945312e-5734.66%PREDICTED: similar to CG7600 CG7600-PA [Tribolium castaneum]
Group
KEGG pathwaytca:6632737e-23 
 K09556 (BAG2)maps-> Protein processing in endoplasmic reticulum
Orthology groupMCL15422 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210466-TA
ATGGAAGTGGACGTGTATTCACTGTCTGAAGGCTCTCGCCTACCGCTAATTGACGAAACATCGGCTTTAGGCACAACAGCACCAAAAGACAGATTGATATCCGTATTAGACCAAGTGGAGATGAGAGTGGAGCGACTCAGACGGGACACAGTGCGTATTGAAGAGGAAAAAGACTCCTTACTGTCAACCCTAGACAGCGTGAAACATTCTGAACTGCTCGGTGACATATCTGAATGTGATAAGGAAGATATAATGCGTTATGCCGACCGTATCCTGGCTCGAGCTCTCACTGTGGAGGTGGCTGTGAGGACGGACAGGGACTCACAGCAGGAAGAAGCCTTGTCCCAGTCTCCATTCCTACTTATTATTGTTAGACTGGCGATATCTACGGCGCCGTTGAGTCGCGAGGCGCGTGCGGGGGCCGCGGCGGGGTCCCTGCCTCACGTGGGTCCTGCGCCGCCCGAGGCCGGCTCCCCTTGGATGAAACTCTACCTATATCACCTGGCCGGCTATGGACCACCTACACTGCTGCTGCTTAAAGGTACAGTACTGCGCGCTGTTCCAAGATGTCTGCTAGGTTTCTCACGGTTGGCGGTGTCGTGGTGGGGGGCGGAGTCCGAGTCCTGCCGAGACGGCCGCCGCCCACGAGAAGCGGCTGCCAACTCGGCGCTCCGCCGCGGGCCCGTGTTGCTACAAGCGGCCGGCGTGCGAGCCCCGCCCGTCCTCCTCCACGTGCCATTCCCGCCATCTAACGACGAACAAGATGAGTATCAGAGTCTTTGGCGTCGTCACGGCGGTCTCCGTCGTGTGAGTCGCCGCCTGCGCCTATGTCGGTCCGCGGGGTACGTGACACTAGCGGACATCGGGGTGCCCGACCTGGGGTGTGCGCGACCCCCCGCCGCCGTCCAGCTGGTGACACCCAGGAAGAGGAAGGACGTCTGCGGCATCGCGAAGCCGATCAGCGAGATATCGATATCATCGAACGAATCCACGAAGGAGACTTCGGAGTTCACAAAGAACAGCAGACTTCAATCACCCATAGAGTCCCACTTCGCTAACACTCCGACCGCAGACAGCAAAACTGGTTCCCCCGCTAACGGCTTCACCAGCGCCGAGAGCGGGCAACTTCTCTGCGAAGAACTGGACAATCTGAACCTTGACACTCATCTGAGCAAAGAATCCATAACCAGCGACGATTTCGTCCCCATACATTCCTCTCGAAGTATTGACGATCTAACTCACAAACATTCCACTGAGGCTTCCAAGAGCAAAGAGTCCGAGAACGAAATAGTACCCGTTCACAGCACCAACGTTAAAGATAAAGACAAGTTGGACAACCAGTTGAGCGATCTGTTGAGCCCAGCTGAAGAGTCCATATCGATGTTCACTCAGTTGTCCGACAAGTTGACCGAAATAACAAACGAGGCAGACAGCGGCGTGGATACGGCGCACAATAACTCCGACGATGATACTTGTAAACCAGAGAAATGGACTATTCTGGATCTTCAATTTGGCATTCCATTGTTCGACGAGGCGCTCTGTGAGAATGTCTGTCGCAGTATCATTGATAGAATCGCCAAGCCGGAGTTACTGGAGAAAGTCAAAGAAGACAACGAGTTTATTCGTGCGGACCTGTTGAAGTTTGTCTCGCAATGTCAGTATTACCCCGGCGAGGATATGGGCATAGTGAAGAGAGGGACGTTGGTTCCGCTGCCAAGAAAGAATTTGGTATTTGAAAACGGCCAGATCAGTGAGTGGACGGGAAAATAA

Protein sequence:

>DPOGS210466-PA
MEVDVYSLSEGSRLPLIDETSALGTTAPKDRLISVLDQVEMRVERLRRDTVRIEEEKDSLLSTLDSVKHSELLGDISECDKEDIMRYADRILARALTVEVAVRTDRDSQQEEALSQSPFLLIIVRLAISTAPLSREARAGAAAGSLPHVGPAPPEAGSPWMKLYLYHLAGYGPPTLLLLKGTVLRAVPRCLLGFSRLAVSWWGAESESCRDGRRPREAAANSALRRGPVLLQAAGVRAPPVLLHVPFPPSNDEQDEYQSLWRRHGGLRRVSRRLRLCRSAGYVTLADIGVPDLGCARPPAAVQLVTPRKRKDVCGIAKPISEISISSNESTKETSEFTKNSRLQSPIESHFANTPTADSKTGSPANGFTSAESGQLLCEELDNLNLDTHLSKESITSDDFVPIHSSRSIDDLTHKHSTEASKSKESENEIVPVHSTNVKDKDKLDNQLSDLLSPAEESISMFTQLSDKLTEITNEADSGVDTAHNNSDDDTCKPEKWTILDLQFGIPLFDEALCENVCRSIIDRIAKPELLEKVKEDNEFIRADLLKFVSQCQYYPGEDMGIVKRGTLVPLPRKNLVFENGQISEWTGK-