Monarch geneset OGS2.0

DPOGS203217
Transcript        DPOGS203217-TA  1641 bp
Protein           DPOGS203217-PA  546 aa
Genomic position  DPSCF300035 + 935752-938630
RNAseq coverage   1417x (Rank: top 9%)
Annotation
Heliconius      HMEL006493        0.0     89.01%
Bombyx          BGIBMGA011508-TA  8e-167  86.20%
Drosophila      CG8258-PA         0.0     73.31%
EBI UniRef50    UniRef50_P50990   0.0     64.04%  T-complex protein 1 subunit theta n=99 Tax=Eukaryota RepID=TCPQ_HUMAN
NCBI RefSeq     NP_001073348.1    0.0     87.55%  chaperonin [Bombyx mori]
NCBI nr blastp  gi|120444903      0.0     87.55%  chaperonin [Bombyx mori]
NCBI nr blastx  gi|120444903      0.0     87.55%  chaperonin [Bombyx mori]
Group
Gene Ontology   GO:0044267  0  cellular protein metabolic process
                GO:0006457  0  protein folding
                GO:0005524  0  ATP binding
                GO:0051082  0  unfolded protein binding
KEGG pathway 
InterPro domain  [1-544]  IPR002423  0        Chaperonin Cpn60/TCP-1
                 [1-544]  IPR012721  0        T-complex protein 1, theta subunit
                 [41-57]  IPR017998  2.1e-21  Chaperone, tailless complex polypeptide 1
Orthology group  MCL13170 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203217-TA
ATGGCTTTACATGTACCAAAAGCACCGGGTGTACCCCAAATGTTAAAAGAAGGAGCTCGGATGTTTTCCGGTTTAGAAGAAGCAGTATACCGTAACATAAATGCATGCAAACAATTTGCCCAAAGTGTGCGCTCAGCATATGGCCCGAATGGCATGAATAAAATGATCATCAATCACATTGACAAACAGTTTGTCACAAGTGATGCTGGTACTATCATCAGAGAATTAGATGTTGAACATCCAGCTGCCAAGTTAATGGTTCTAGCTAGTCAAATGCAAGATGCTGAGGTTGGAGATGGTACAAACTTTGTAATTGTATTATCTGGAGCTCTTCTTGAGGCAGCCGAAGAGCTTCTACGTCTAGGTGTGACGACGAGCGAAATTGCAGAAGGATACGAAAAGGCCCTTGATAAATGCCTAGAAATCCTACCCCAGTTAATTTGCGATGAAATTAAAGATTGCAGAAATATGGATACTGTTATTAAAGGCATTAAACCATCTATCATGTCCAAACAGTATGGTAATGAAGACTTCATTGCTGGGCTGGTAGCAAAAGCTTGTGTGGCTATTCTACCAGAAAATACAACATTTAATGTTGATAATGTCAGGATATGTAAGATCCTTGGTGCTGGGCTACTACAGTCTGAGGTATTGTCAGGAATGGTGTTTAAGAGAGAAGTTGAGGGTGATGTTGCCAGTGCATCAAAAGCCAAAGTAGCTGTTTATTCCTGTCCTGTTGACATCACTCAAACTGAAACAAAGGGAACAGTACTCATCAAAACAGCCGATGAACTACTTAACTTTAGTAAAGGAGAGGAGTCTTTGTTAGAAAAACAGATTAAAGCTATTGCCGACACTGGTGTAAAAGTTATTGTTGCTGGAGCTAAGTTTGGTGATATGGCTTTACATTTCCTTAATAAATATAACATTATGGCGGTTCGTCTCAACTCAAAGTTTGATATTCGTCGTCTCGCAAAGACTGTTAATGCTACTGTACTGCCAAAATTGACTACTCCAACGGCTGAAGAGCTTGGATACTGTGACACGGTCAGAGTTGATGAAGTCGGTGATACAAGAGTAGTTGTCTTCAGTATGGAAAGCAGTGAATCTAGAATTTCAACAGTTGTTATCAGAGGTTCAACTGACAATTACATGGATGACATTGAAAGAGCTATTAATGACGGTGTTAACACATTCAAGGGAATAGCAAGAGACGGACGATTCTTGGCTGGCGCTGGAGCTACTGAAATTGAATTAGCTCAACAGCTTTTACAGTATGCAGATACTCTTCCAGGACTTGAGCAATATGCAGTTCGTAAGTTTGCTGTAGCTCTTGAGGGTGTTCCGAAAGCTCTTGCAGAAAATTCTGGAGCAAATGCCACTGAAGTTGTGAACAATATCTATAAAGCACACAGGGAAAACAACAAGTATGCTGGTTTTGATATCGAATCTGAGAGTGCCGGAATTTGTGATGCAAAGGAAAAGGGAATTTTAGATTTATATGTCTTAAAATACTGGGGCCTTAAATATGCTGTTGGTGCTGCAACAACTATTTTGAAAGTTGATCAAATCATTATGGCTAAAAGGGCAGGAGGGCCAAAAGCACCTAAACCCAATGCCGGCAGTGATGATGAATCTTAA

Protein sequence:

>DPOGS203217-PA
MALHVPKAPGVPQMLKEGARMFSGLEEAVYRNINACKQFAQSVRSAYGPNGMNKMIINHIDKQFVTSDAGTIIRELDVEHPAAKLMVLASQMQDAEVGDGTNFVIVLSGALLEAAEELLRLGVTTSEIAEGYEKALDKCLEILPQLICDEIKDCRNMDTVIKGIKPSIMSKQYGNEDFIAGLVAKACVAILPENTTFNVDNVRICKILGAGLLQSEVLSGMVFKREVEGDVASASKAKVAVYSCPVDITQTETKGTVLIKTADELLNFSKGEESLLEKQIKAIADTGVKVIVAGAKFGDMALHFLNKYNIMAVRLNSKFDIRRLAKTVNATVLPKLTTPTAEELGYCDTVRVDEVGDTRVVVFSMESSESRISTVVIRGSTDNYMDDIERAINDGVNTFKGIARDGRFLAGAGATEIELAQQLLQYADTLPGLEQYAVRKFAVALEGVPKALAENSGANATEVVNNIYKAHRENNKYAGFDIESESAGICDAKEKGILDLYVLKYWGLKYAVGAATTILKVDQIIMAKRAGGPKAPKPNAGSDDES-
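
The two sequences in this record can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record. It assumes the standard genetic code (NCBI translation table 1) and that the trailing "-" in the protein sequence marks the stop codon. The names translate and lengths_consistent are hypothetical helpers introduced only for illustration.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def lengths_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """True if the transcript is exactly the protein's codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First 27 and last 15 bases of DPOGS203217-TA, copied from the record above.
print(translate("ATGGCTTTACATGTACCAAAAGCACCG"))
print(translate("GATGATGAATCTTAA"))
print(lengths_consistent(1641, 546))
```

Under those assumptions, the first nine codons should translate to the opening residues of DPOGS203217-PA (MALHVPKAP) and the final five to DDES followed by a stop. The stated lengths also agree: 546 residues plus one stop codon is 547 codons, or 1641 bp.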