Monarch geneset OGS2.0

DPOGS207107
TranscriptDPOGS207107-TA1047 bp
ProteinDPOGS207107-PA348 aa
Genomic positionDPSCF300001 + 3173913-3177080
RNAseq coverage1464x (Rank: top 9%)
Annotation
HeliconiusHMEL0132752e-4533.02% 
BombyxBGIBMGA013070-TA6e-10357.74% 
DrosophilaCG10973-PB2e-1730.12% 
EBI UniRef50UniRef50_D6X1Z24e-5037.42%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X1Z2_TRICA
NCBI RefSeqXP_001605940.12e-5439.94%PREDICTED: similar to hsp70 binding protein [Nasonia vitripennis]
NCBI nr blastpgi|1565372833e-5339.94%PREDICTED: hsp70-binding protein 1-like [Nasonia vitripennis]
NCBI nr blastxgi|1565372832e-5339.56%PREDICTED: hsp70-binding protein 1-like [Nasonia vitripennis]
Group
Gene OntologyGO:00054884.2e-34binding
KEGG pathwaynvi:1001223384e-54 
 K09562 (HSPBP1)maps-> Protein processing in endoplasmic reticulum
InterPro domain[62-335] IPR0160244.2e-34Armadillo-type fold
[109-294] IPR0119893.5e-17Armadillo-like helical
Orthology groupMCL13289 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207107-TA
ATGGCTTCAGGAAATCCTAACAATCAAGCTGTGGGCGCTATGAACGTTGAAAACGACGGTGCAGAGCCGGTGCAGCCTCGACAACCTACAAATTTACAGGGACTTCTCCGCTTTGCTGTGGAAGCTACTAAGGCTGAAGATGCCCCTGGCAATTCAGAGCTGGGGCCGATGGATGAAGAGAGACGTAAATTTTTGGAGGAAGCTCTCAAGAGTCTTACAATAGATGTGGCAGAGGTGTTACAAAAGAGCATTAAGATCCTGAGTGATTCTGAGAGGATCCAAAGCATCCAGCTGGGCCAGGAGTTACCGGATGATGTGGATGTTGCTTTTGCTAACATCCTCGAACTTGTTGATAATATTGATACAGCCAACGATTTTTACAAACTTGGTGGTTTTGCAATCTTACCTATCTGTCTTGGAAGTGAGAATGATAAAATCAGGTCTCGAGCTAGTTCAATACTAGCTGAGCTGTGTCAGAACAATCCATTCTGTCAGGCTCGAGCATTGGAGTGTGGATTATTCAATGTGATGTTGCACTTAGCCCCGAGCGAGAAGGGAATGGCACTTGCTAAGTGTATATCGGCCATATCATCCATGGCTCGTGACTTCAAGCCATCACTACAGGAGTTGACGGCACAGGGAGGTTGTGAATTGCTGGCCAACACCCTTCAGGGTTCCGATATCTCAGCAAGGACTAGAGCTGCATTCCTGATAAGATACCTGTGTAACAGTTATGTGGATGCTAAAGATAAATTCATCCATCAGAACATAGTAAAGATAATAGCAGATCTCCTCAAAGAAGGCAGAGATGATACTTCAGAGCACCTCCTGAGTATTTTGGACACTTTGGTACAAGATGTGGATCCCAAAGTGATCAAACTATGCAGAGATCCAGGCTTAAATCTGGATAACATTTTGAAGGAACACCTCAAGAATCCAGAGCTTGATGAATGTTTCATTGAAGAGAGAGATTATTGTCGGTCTATCTTAAGAGTACTTGAAAACTTCCCACAATTTGAACAGCTTAACAGCGAAGTGGATAGATAA

Protein sequence:

>DPOGS207107-PA
MASGNPNNQAVGAMNVENDGAEPVQPRQPTNLQGLLRFAVEATKAEDAPGNSELGPMDEERRKFLEEALKSLTIDVAEVLQKSIKILSDSERIQSIQLGQELPDDVDVAFANILELVDNIDTANDFYKLGGFAILPICLGSENDKIRSRASSILAELCQNNPFCQARALECGLFNVMLHLAPSEKGMALAKCISAISSMARDFKPSLQELTAQGGCELLANTLQGSDISARTRAAFLIRYLCNSYVDAKDKFIHQNIVKIIADLLKEGRDDTSEHLLSILDTLVQDVDPKVIKLCRDPGLNLDNILKEHLKNPELDECFIEERDYCRSILRVLENFPQFEQLNSEVDR-