Monarch geneset OGS2.0

DPOGS215973
Transcript: DPOGS215973-TA (1611 bp)
Protein: DPOGS215973-PA (536 aa)
Genomic position: DPSCF300078 - 620281-624416
RNAseq coverage: 1177x (Rank: top 11%)
Annotation
  Heliconius       HMEL004677        0.0  91.24%
  Bombyx           BGIBMGA001206-TA  0.0  92.76%
  Drosophila       CG7033-PA         0.0  75.70%
  EBI UniRef50     UniRef50_P78371   0.0  73.27%  T-complex protein 1 subunit beta n=74 Tax=Eukaryota RepID=TCPB_HUMAN
  NCBI RefSeq      NP_001040109.1    0.0  92.76%  chaperonin containing t-complex polypeptide 1 beta subunit [Bombyx mori]
  NCBI nr blastp   gi|114051313      0.0  92.76%  chaperonin containing t-complex polypeptide 1 beta subunit [Bombyx mori]
  NCBI nr blastx   gi|114051313      0.0  92.91%  chaperonin containing t-complex polypeptide 1 beta subunit [Bombyx mori]
Group
Gene Ontology:
  GO:0006457  0         protein folding
  GO:0051082  0         unfolded protein binding
  GO:0044267  2.7e-138  cellular protein metabolic process
  GO:0005524  2.7e-138  ATP binding
KEGG pathway: (none listed)
InterPro domain:
  [1-536]  IPR002423  0        Chaperonin Cpn60/TCP-1
  [1-536]  IPR012716  0        T-complex protein 1, beta subunit
  [35-51]  IPR017998  2.8e-34  Chaperone, tailless complex polypeptide 1
Orthology group: MCL14291 (Single-copy universal gene)

Nucleotide sequence:

>DPOGS215973-TA
ATGGTTTCACTCAATCCTGTAAGAATCTTGAAAAATGAAGCAGAAGAAGAGAAAGCCGAAGTTGCTCGGATGTCTAGTTTCATTGGAGCTATAGCTATAGGTGATTTAGTGAAAAGTACTCTTGGACCAAAAGGCATGGATAAAATTTTAGTCTCGTATGGAAGAAATGCAGGACAGGTTGAGGTTACAAATGATGGTGCTACTATACTTAAATCCGTGGGTGTTGACAATCCCGCGGCCAAAATTCTCGTTGATATGTCAAAAGTGCAGGATGATGAAGTTGGTGATGGCACTACATCAGTCACAGTTTTAGCTGCTGAGTTACTTCGAGAAGCTGAAAAATTGATCGAACAAAAACTGCATCCTCAGACAGTTATAGCCGGGTGGCGCATTGCTGTTGAAGCAGCTCGTCAAGCATTGGCTGAAGCTAGTTTTGATCATGAAAAAAGCATGAACGAAGCAGCATTAAGAGCAGACTTGGAAAATATTGCTCGTACCACTCTGAGCTCTAAAATTTTGTCTAATCACAAGGAACATTTCACAAAATTAGCAGTAGATGCAGTTCTTCGTCTTAAGGGTTCTGGTAATCTCAAGGCTATACAGATTATAAAAATTTCTGGAGGTTTATTGGAGGAATCATTTTTAGATGAAGGATTTTTATTAAATAAAAAGGTAGGAGTCCACCAACCCAAGAAAATTGAGAATGCTAATATATTGATAGCTAATACTCCTATGGACACTGACAAAATAAAAGTTTTTGGTTCAACTATAAAAGTAGACTCGATGGCTAAAATTGCTGAACTCGAAGTCGCTGAGAAAGAGAAAATGAAGGACAAAGTGAACAAGATTCTCAACCATAAATGCAATGTTTTTATTAATAGACAGTTGATTTACAACTACCCTGAACAGCTGTTTGCCGATGCTGGAGTCATGGCTATTGAACATGCCGATTTTGATGGAATTGAGCGTCTTGCTTTAGTCACCGGCGGTGAGATAGTATCTACATTTGATTCACCTGAAAAAGTTAAACTGGGACATTGCAAACTTATTGAACAGGTCCTCATTGGTGATGAATGTTTAATACGTTTCTCTGGTGTGGAGCTGGGTTCAGCTTGTACCATCGTAATCCGAGGTGCAACCCAGCAAGTTATAGATGAAGCCGAGCGTTCCTTGCACGACGCGCTCTGTGTACTCGCTGCTACTGTCAAGGAACCTAAAGTTGTCTACGGCGGTGGTGCCAGTGAGATGCTGATGGCGGAGGCTGTGTCTCGTGTTGCTGCACGGACTGCCGGCAAGGAGGCGGCGGCCGCTGAAGCCTTCGCCGTAGCTCTACGGCGTCTACCTTCCGCCGTAGCAGACAACGCCGGCTACGACAGCGCAGATCTCATAGCTCGCCTCAGAGCTTCTCACGCACAGGGAGAGAATACTATGGGGCTCGATATGGAGAATGGCTGCATCGGTGATATGAAGAAGTTGGGTATAACCGAGTCCTATGTGGTGAAGCGTCAAGTGTTGCTGTCAGCCAGTGAAGCTGCGGAGGTCATTCTACGTGTGGACAACATTCTCAAATCGGCACCCCGCAGGCGTGGGCCCGACCGCCGCCCCTGCTAA

Protein sequence:

>DPOGS215973-PA
MVSLNPVRILKNEAEEEKAEVARMSSFIGAIAIGDLVKSTLGPKGMDKILVSYGRNAGQVEVTNDGATILKSVGVDNPAAKILVDMSKVQDDEVGDGTTSVTVLAAELLREAEKLIEQKLHPQTVIAGWRIAVEAARQALAEASFDHEKSMNEAALRADLENIARTTLSSKILSNHKEHFTKLAVDAVLRLKGSGNLKAIQIIKISGGLLEESFLDEGFLLNKKVGVHQPKKIENANILIANTPMDTDKIKVFGSTIKVDSMAKIAELEVAEKEKMKDKVNKILNHKCNVFINRQLIYNYPEQLFADAGVMAIEHADFDGIERLALVTGGEIVSTFDSPEKVKLGHCKLIEQVLIGDECLIRFSGVELGSACTIVIRGATQQVIDEAERSLHDALCVLAATVKEPKVVYGGGASEMLMAEAVSRVAARTAGKEAAAAEAFAVALRRLPSAVADNAGYDSADLIARLRASHAQGENTMGLDMENGCIGDMKKLGITESYVVKRQVLLSASEAAEVILRVDNILKSAPRRRGPDRRPC-
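
Consistency check (illustrative sketch, not part of the original record):

The transcript and protein lengths listed above agree if the 1611 bp transcript is read as 536 codons plus one stop codon, since (536 + 1) x 3 = 1611. The Python sketch below checks this arithmetic and translates the first and last few codons of DPOGS215973-TA. It assumes the standard genetic code and assumes that the trailing "-" in the protein sequence marks the stop codon. The names `translate` and `CODON_TABLE` are hypothetical helpers introduced here for illustration only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Lengths taken from the record above.
transcript_bp = 1611
protein_aa = 536
lengths_agree = (protein_aa + 1) * 3 == transcript_bp  # +1 for the stop codon

# First 18 and last 15 nucleotides of DPOGS215973-TA, copied from the record.
first_codons = "ATGGTTTCACTCAATCCT"
last_codons = "CGCCGCCCCTGCTAA"

print(lengths_agree)
print(translate(first_codons))
print(translate(last_codons))
```

The two translated fragments can be compared by eye with the start (MVSLNP) and end (RRPC-) of DPOGS215973-PA. Translating the full sequence works the same way once it has been read from the FASTA entry.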