Monarch geneset OGS2.0

DPOGS202781
Transcript:        DPOGS202781-TA (1623 bp)
Protein:           DPOGS202781-PA (540 aa)
Genomic position:  DPSCF300018 - 933635-937145
RNAseq coverage:   1498x (Rank: top 9%)
Annotation
Source           Hit                 E-value   Identity  Description
Heliconius       HMEL009294          0.0       74.44%
Bombyx           BGIBMGA010475-TA    0.0       76.92%
Drosophila       Hop-PA              8e-159    54.17%
EBI UniRef50     UniRef50_Q9VPN5     1e-156    54.17%    Hsp70/Hsp90 organizing protein homolog n=23 Tax=Endopterygota RepID=Q9VPN5_DROME
NCBI RefSeq      NP_001036957.1      0.0       75.82%    Hsc70/Hsp90-organizing protein HOP [Bombyx mori]
NCBI nr blastp   gi|112983280        0.0       75.82%    Hsc70/Hsp90-organizing protein HOP [Bombyx mori]
NCBI nr blastx   gi|112983280        0.0       75.82%    Hsc70/Hsp90-organizing protein HOP [Bombyx mori]
Group
Gene Ontology    GO:0005488          7.8e-37             binding
                 GO:0005515          4.9e-09             protein binding
KEGG pathway     nvi:100119701       0.0
                 K09553 (STIP1)      maps-> Prion diseases
InterPro domain  [3-137] IPR011990   7.8e-37             Tetratricopeptide-like helical
                 [73-105] IPR001440  4.9e-09             Tetratricopeptide TPR-1
                 [489-528] IPR006636 5.7e-07             Heat shock chaperonin-binding
                 [72-105] IPR019734  6.8e-07             Tetratricopeptide repeat
Orthology group  MCL13961            Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202781-TA
ATGGAACAGGTCAATCATTTAAAGGAGAAAGGTAACGCCGCTTTGTCCTCCGGTCAGTATGCCGAAGCCGTGAAACTATATACTAGTGCTATAGAATTAGATCCAAAAAACCACGTTTTGTATAGTAACCGATCAGCAGCGCATGCAAAAGCAGGTAACTATGCAGAGGCGTTAGAGGATGCAAATAAAACAGTATCCATTAATCCCACTTGGAGCAAAGGATATTCTAGAAAGGGTAGTGCTCTAGCTTACCTAGGCAGACATGAAGAAGCCATCCAGGCTTACGAAAAAGGTTTACAGCTAGATCCTTCTAATCAACAATTAGCGTCAGGCTTAGCGGAGGTAAAGAAGCAAGCTGAAGAAGCAGAACTCGTATCACAGACATGGATTGAGAAACTAAGAGCAAATCCACAAACGAGGGAGTGGTTGAATGATCCGGAATATGTCCAACTAGTCAAGAACTTTAACCCGAGTGATCCAAATTCACTGAATTGGCCGACACAAAGTGAAAGAGTTTTACCAACTATAGCCGTTCTATTAGGACTGAACCCCGAAAAAGGTATGCCAATGGATGTCGATCCTCCGGCATCAGAGCCAAAACCAAAATCGGCACCTAAAAAAGAAGAGCCTCCTAAACCTAAATATGACGATCTCCCTGAGAACCGTCGGCTAGCTTTACAAGAAAAAGATCTAGGAAATGAATATTACAAGAAAAAAGACTTTGATAATGCTATACAGCACTACAACAAAGCTATCGAGCATTACCCTACAGACATTACGTTTTATACAAATCTAGCGGCCGTGTTCTTTGAACAAAAAGAATATGAAAAGTGCATTAAAGAGTGTGAAAAAGCAATTGAAATCGGACGAGAGAATAGAGCCGACTTCAAACTGATTGCCAAAGCATTCACAAGGATCGGTAATGCATACAAAAAGATGGAACAATGGAAACTAGCTAAAACCTATTTTGAAAAATCAATGTCGGAGCATCGTACGCCAGAGATAAAGACGCTCCTCAGCGAAGTAGAGAAGAAGATAGTGGAAGAAGAGAAGAAGGCTTACGTAGATCCAGTGAAGGCTGAACAAGAGAAGGAACTTGGCAATGAATTCTTCAAACAAGGAGACTACAGCACAGCCATGAAACATTACTCGGAGGCTATCAAACGTAACCCCGACGACCCTAAGCTGTATTCAAACAGAGCGGCCTGCTACACCAAGCTGGCGGCCTTTGACCTGGGACTCAAGGACTGTGAACAGTGCTGCAAACTGGATCCCAAGTTCATCAAGGGCTGGATTAGGAAAGGAAAGATATTGCAAGGCATGCAGCAGGCGTCCAAAGCGCTCACAGCCTACCAGAAGGCTCTAGAGCTGGACCCCAGCAATGTTGAGGCGCTGGAAGGTTACCGAGCCTGCTCCACACAGTTACACTCCAACCCAGAGGAGGTACGCAAGCGGGCCATGGCCGACCCTGAGGTGCAGCAGATCCTGCGAGACCCCGCCATGCGCTGCATCCTGGAACAGATGCAGCACGACCCTCAGGCGCTGCAAGACCACCTCAAGAACCCAGACGTGGCCGCCAAGATACAGAAGCTGCTGGAGTCGGGCCTCATCGCCATCCACTAG

Protein sequence:

>DPOGS202781-PA
MEQVNHLKEKGNAALSSGQYAEAVKLYTSAIELDPKNHVLYSNRSAAHAKAGNYAEALEDANKTVSINPTWSKGYSRKGSALAYLGRHEEAIQAYEKGLQLDPSNQQLASGLAEVKKQAEEAELVSQTWIEKLRANPQTREWLNDPEYVQLVKNFNPSDPNSLNWPTQSERVLPTIAVLLGLNPEKGMPMDVDPPASEPKPKSAPKKEEPPKPKYDDLPENRRLALQEKDLGNEYYKKKDFDNAIQHYNKAIEHYPTDITFYTNLAAVFFEQKEYEKCIKECEKAIEIGRENRADFKLIAKAFTRIGNAYKKMEQWKLAKTYFEKSMSEHRTPEIKTLLSEVEKKIVEEEKKAYVDPVKAEQEKELGNEFFKQGDYSTAMKHYSEAIKRNPDDPKLYSNRAACYTKLAAFDLGLKDCEQCCKLDPKFIKGWIRKGKILQGMQQASKALTAYQKALELDPSNVEALEGYRACSTQLHSNPEEVRKRAMADPEVQQILRDPAMRCILEQMQHDPQALQDHLKNPDVAAKIQKLLESGLIAIH-
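
The transcript and protein lengths reported above can be cross-checked with simple arithmetic: 1623 bp is 541 codons, that is 540 amino acids plus one stop codon (the trailing "-" in the protein sequence). The sketch below illustrates that check. It assumes the transcript record holds only the coding sequence, from the ATG start through the stop codon, with no untranslated regions. The function names parse_fasta and cds_matches_protein are hypothetical helpers written for this illustration and are not part of any database tooling.

```python
def parse_fasta(text):
    """Parse FASTA-formatted text into a {header: sequence} dictionary."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line  # join wrapped sequence lines
    return records


def cds_matches_protein(transcript_bp, protein_aa):
    """Return True if a coding sequence of transcript_bp bases encodes
    protein_aa residues plus exactly one stop codon.

    Assumption: the transcript contains no UTR bases.
    """
    return transcript_bp == 3 * (protein_aa + 1)


# Lengths as reported in the record above.
print(cds_matches_protein(1623, 540))
```

To run the same check on the sequences themselves, pass the FASTA text through parse_fasta and compare len() of the nucleotide record with the protein record after removing its trailing "-".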