Monarch geneset OGS2.0

DPOGS210732
TranscriptDPOGS210732-TA1668 bp
ProteinDPOGS210732-PA555 aa
Genomic positionDPSCF300013 + 172867-177122
RNAseq coverage1516x (Rank: top 8%)
Annotation
HeliconiusHMEL0161720.093.32% 
BombyxBGIBMGA006263-TA0.092.07% 
DrosophilaT-cp1-PA0.081.12% 
EBI UniRef50UniRef50_P179870.073.33%T-complex protein 1 subunit alpha n=212 Tax=root RepID=TCPA_HUMAN
NCBI RefSeqXP_001659922.10.082.73%chaperonin [Aedes aegypti]
NCBI nr blastpgi|1571220170.082.73%chaperonin [Aedes aegypti]
NCBI nr blastxgi|1571220170.082.73%chaperonin [Aedes aegypti]
Group
Gene OntologyGO:00064570protein folding
GO:00510820unfolded protein binding
GO:00442672.2e-88cellular protein metabolic process
GO:00055242.2e-88ATP binding
KEGG pathway 
InterPro domain[1-548] IPR0024230Chaperonin Cpn60/TCP-1
[1-548] IPR0127150T-complex protein 1, alpha subunit
[33-49] IPR0179986.8e-30Chaperone, tailless complex polypeptide 1
Orthology groupMCL13632 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210732-TA
ATGTCGACGCTAGCTGCAGCACTATCTGTTGCGGGAACAAGACATTCTGGAGCCCCAGTTCGTACACAAAATGTAATGGCAGCAGCCTCAATCGCCAACATAGTGAAGAGTTCACTGGGTCCTGTGGGCTTGGACAAAATGATGGTAGATGACATTGGAGACGTCACAGTCACAAATGATGGAGCAACCATCCTTAAAATGTTGGAAGTAGAACATCCAGCTGCTAAAGTACTGGTAGAGTTGGCACAGTTACAGGACGAGGAGGTCGGAGATGGTACCACCTCTGTAGTTATCATTGCTGCTGAACTGTTGAAGAATGCCGATGAGCTGGTCAAGAACAAGATTCATCCCACGAGCATAATCTCCGGCTACAGACTGGCTTGCAAAGAGGCTGTGAAGTATATTCAAGACAACCTGACTGTAACCGTCGACTCCATCGGCAGATCATCCATCATCAACGCCGCTAAGACCACAATGTCTTCAAAACTCATTGGAGCTGATGCAGATTTCTTCTCCGAAATAATAGTTGATGCTGCACAAGCTATCAAGGTGACGGATCCTAAAGGGAATGCCGTGTACCCCATTAAAGCTGTGAACATCCTCAAAGCCCACGGCAGAAGTGCAAGGGAAAGTGTATTGATTAAGGGCTACGCCCTCAACTGTACCGTGGCCTCGCAGGCCATGCCTAAGAAGATTGTGAACGCTAAGATTGCTTGTCTCGATTTTTCGCTGCAGAAGACCAAAATGAAGATGGGTGTGCAAGTTCTCGTGTCCGACCCAGAGAAGCTGGAGGCCATCAGAGCTCGTGAGCTCGACATCACGAAGGAAAGACTACACAAGATACTGTCGACAGGGGTGAATGTCATCTTATCTACTGGCGGCATCGACGACCTCTGTCTCAAATACTTTGTAGAATGCGGTGCTATGGGAGTCCGGCGCTGCAAGAAGGCCGACCTGAAGAGAATCGCCAAGGCCACAGGAGCTACGTTCCTGACCTCCTTAACAAACATGGAAGGTGAGGAGGTGTTTGAACCGAGTATGATCGGTGAAGCTGCTGAAGTGGTCCAGGAACAAATTTGCGACGATCAGCTGATCCTCATTAAGGGGCCAGCGGCTCGCACTGCGGCGTCCATTATACTAAGAGGGCCGACAGACGCGTACTGTGATGAGATGGAACGGTCAGCACACGACGCGCTGTGTGCCGTCCGCAGAGTGATGGAATCAGGGCGGGTGGTGCCGGGAGGAGGAGCCGTGGAAGCGGCGCTGTCCATATACCTTGACAACTTCGCCACAACACTGAGCTCCCGAGAACAGCTTGCAATCGCTGCCTTCGCACAATCACTGCTAGTTATTCCGAAAACTCTGGCCGTTAACGCTGCTAAGGACGCTACAGACCTGGTCGCTAAACTACGAGCCTACCACAACTCTTCACAGACTAAGGTGGAACACGCAAACCTCAAATGGGTGGGTTTGGACCTGATTGAAGGCAGTCTCCGTGACAACCTCACAGCCGGAGTGCTAGAGCCAGCGATATCTAAGATTAAATCCCTGAAATTCGCCACAGAGGCTGCGATCACCATCCTCCGTATAGATGACATGATTAAGCTGGACCCGGAACAGAAGGGAAAGAGCTACGAAGATGCCTGCAACGCTGGGGAACTCGACTAG

Protein sequence:

>DPOGS210732-PA
MSTLAAALSVAGTRHSGAPVRTQNVMAAASIANIVKSSLGPVGLDKMMVDDIGDVTVTNDGATILKMLEVEHPAAKVLVELAQLQDEEVGDGTTSVVIIAAELLKNADELVKNKIHPTSIISGYRLACKEAVKYIQDNLTVTVDSIGRSSIINAAKTTMSSKLIGADADFFSEIIVDAAQAIKVTDPKGNAVYPIKAVNILKAHGRSARESVLIKGYALNCTVASQAMPKKIVNAKIACLDFSLQKTKMKMGVQVLVSDPEKLEAIRARELDITKERLHKILSTGVNVILSTGGIDDLCLKYFVECGAMGVRRCKKADLKRIAKATGATFLTSLTNMEGEEVFEPSMIGEAAEVVQEQICDDQLILIKGPAARTAASIILRGPTDAYCDEMERSAHDALCAVRRVMESGRVVPGGGAVEAALSIYLDNFATTLSSREQLAIAAFAQSLLVIPKTLAVNAAKDATDLVAKLRAYHNSSQTKVEHANLKWVGLDLIEGSLRDNLTAGVLEPAISKIKSLKFATEAAITILRIDDMIKLDPEQKGKSYEDACNAGELD-