Monarch geneset OGS2.0

DPOGS201820
TranscriptDPOGS201820-TA1596 bp
ProteinDPOGS201820-PA531 aa
Genomic positionDPSCF300145 + 342372-344547
RNAseq coverage1385x (Rank: top 9%)
Annotation
HeliconiusHMEL0083380.089.83% 
BombyxBGIBMGA013116-TA0.090.96% 
DrosophilaTcp-1zeta-PA0.078.99% 
EBI UniRef50UniRef50_P402270.071.37%T-complex protein 1 subunit zeta n=272 Tax=root RepID=TCPZ_HUMAN
NCBI RefSeqNP_001040108.10.090.96%chaperonin subunit 6a zeta [Bombyx mori]
NCBI nr blastpgi|1140507490.090.96%chaperonin subunit 6a zeta [Bombyx mori]
NCBI nr blastxgi|1140507490.090.96%chaperonin subunit 6a zeta [Bombyx mori]
Group
Gene OntologyGO:00442670cellular protein metabolic process
GO:00064570protein folding
GO:00055240ATP binding
GO:00510820unfolded protein binding
KEGG pathway 
InterPro domain[1-528] IPR0024230Chaperonin Cpn60/TCP-1
[1-528] IPR0127220T-complex protein 1, zeta subunit
[32-48] IPR0179981.1e-22Chaperone, tailless complex polypeptide 1
Orthology groupMCL10928 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201820-TA
ATGTCAGCAATAAGCTTATTAAATCCTAAAGCTGAATTCGCTCGCGCCGCCCAAGCTTTAGCTGTAAACATCACAGCTGCGAAAGGAATTCAAGATGTAATGAAAACTAACCTCGGGCCGAAAGGTACCATGAAAATGTTGGTGTCGGGGGCGGGGGATATCAAGATAACTAAGGATGGGAACGTTCTGCTTCATGAAATGCAGATCCAGCATCCTACCGCGTCTCTGATAGCTCGCGCTTCCACAGCTCAGGACGACGCCACAGGCGATGGCACAACTTCCACCGTGCTCCTCATTGGAGAACTGTTGAAGCAGGCTGATATTTATATCAGTGAAGGACTCCATCCAAGAATAATTACTGAAGGTTTTGATGTAGCCAGAAACAAGGCTCTTGAAGTTTTGGAATCCATGAAAATTCCTATTGAGATCAAAAGAGAAAACCTCATTGATATAACCCGTACAGCTTTAAAAACTAAGGTACATCCAAGTCTGGCTGAAGTTTTGACTGATGCTTGTGTGGATTCAGTACTGGCTATTAGAGTTGAGGGTAAACCAGTTGATCTTCACATGGTGGAGCTGATGGAAATGCAGCACAAAACCGCTACAGAGACTATACTTATAAAAGGTCTGGTAATGGATCATGGAGCTCGTCATCCTGATATGCCGAAACGTGTAGAAAATGCATACATCCTGACTTGCAATGTTTCACTGGAGTATGAAAAGACTGAGGTCAATTCTGGCTTCTTCTACAAGTCTGCTGAAGATAGGGAGAAGCTTATTGCAGCTGAGAGAGAATTTATTGACCAACGAGTCAAGAAGATTGTAGCTCTGAAGAAGAAACTATGTGACGGCACTAAAAAATCATTTGTGGTCATCAACCAGAAGGGTATTGATCCCCTTTCATTGGATGTACTTGCAAAAGAAGGTATCATTGCTCTGCGTAGGGCTAAGAGACGTAACATGGAGCGTCTAGCGTTGGCCTGCGGAGGCATTGCTATGAACTCTGTCGATGATTTGAGTGAAGAGTGCCTCGGCTATGCTGGTCTTGTGTATGAACATATTCTTGGAGAAGAGAAATACACTTTTGTAGAAGAATGCAAGAATCCACAGTCTGTAACCATCCTAATCAAAGGCCCTAACAAACACACACTCGCACAAATTAAGGATGCCGTAAGAGATGGCCTTAGAGCAATCAACAATGCCATTGAAGACAAATGTCTCGTACCTGGAGCTGCTGCTTTTGAAGTGAAGGCTAACAACGAATTATTGAAGTTCAAGGACACGGTTAAGGGAAAGTTGCGTTTAGGCATACAGGCTTATGCCGAAGCATTGTTGGTGATTCCAAAAACTTTGGCCGTGAACTCTGGATATGACGCCCAGGACACCATTGTGAAGCTACAAGAGGAATCTCGTTTGAACCCTGACCTTATTGGATTGGATCTTAGCACTGGTGAAGCGATAAAGCCAATTGATTTAGGCATTTACGACAACTATATTGTTAAGAAACAGATCCTTAACTCTTGTTCAGTCATCGCTAGCAACCTCCTCCTTGTGGATGAAATCATGAGAGCTGGGATGTCCAGCCTTAAGGGCTAG

Protein sequence:

>DPOGS201820-PA
MSAISLLNPKAEFARAAQALAVNITAAKGIQDVMKTNLGPKGTMKMLVSGAGDIKITKDGNVLLHEMQIQHPTASLIARASTAQDDATGDGTTSTVLLIGELLKQADIYISEGLHPRIITEGFDVARNKALEVLESMKIPIEIKRENLIDITRTALKTKVHPSLAEVLTDACVDSVLAIRVEGKPVDLHMVELMEMQHKTATETILIKGLVMDHGARHPDMPKRVENAYILTCNVSLEYEKTEVNSGFFYKSAEDREKLIAAEREFIDQRVKKIVALKKKLCDGTKKSFVVINQKGIDPLSLDVLAKEGIIALRRAKRRNMERLALACGGIAMNSVDDLSEECLGYAGLVYEHILGEEKYTFVEECKNPQSVTILIKGPNKHTLAQIKDAVRDGLRAINNAIEDKCLVPGAAAFEVKANNELLKFKDTVKGKLRLGIQAYAEALLVIPKTLAVNSGYDAQDTIVKLQEESRLNPDLIGLDLSTGEAIKPIDLGIYDNYIVKKQILNSCSVIASNLLLVDEIMRAGMSSLKG-