Monarch geneset OGS2.0

DPOGS205402
TranscriptDPOGS205402-TA963 bp
ProteinDPOGS205402-PA320 aa
Genomic positionDPSCF300407 - 105805-110016
RNAseq coverage1623x (Rank: top 8%)
Annotation
HeliconiusHMEL0222182e-11870.94% 
BombyxBGIBMGA001406-TA4e-16189.80% 
DrosophilaCctgamma-PB3e-14679.08% 
EBI UniRef50UniRef50_P493687e-13169.75%T-complex protein 1 subunit gamma n=209 Tax=root RepID=TCPG_HUMAN
NCBI RefSeqXP_001660595.13e-14980.12%chaperonin [Aedes aegypti]
NCBI nr blastpgi|1571250487e-14880.12%chaperonin [Aedes aegypti]
NCBI nr blastxgi|3123720272e-14180.75%hypothetical protein AND_20694 [Anopheles darlingi]
Group
Gene OntologyGO:00442676.8e-220cellular protein metabolic process
GO:00064576.8e-220protein folding
GO:00055246.8e-220ATP binding
GO:00510826.8e-220unfolded protein binding
KEGG pathway 
InterPro domain[1-320] IPR0024236.8e-220Chaperonin Cpn60/TCP-1
[1-320] IPR0127196.8e-220T-complex protein 1, gamma subunit
[155-177] IPR0179982.1e-06Chaperone, tailless complex polypeptide 1
Orthology groupMCL14385 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205402-TA
ATGTTCAATAAAGATGTCACACATCCAAAAATGAGGAGATATATTGAAAATCCTAGAATTATTCTCCTTGACTGCCCGTTGGAGTACAAGAAGGGTGAAAGCCAGACCAACATTGAGATTGTTGGTGAACAGGACTTCACTAAGCTTCTTCAACTAGAAGAAGAGCATGTTCAACGTCTTTGTGAAGATATCATAGCTTTGAAGCCGGATGTAGTCGTCACAGAAAAAGGAGTATCAGACTTGGCACAGCATTACCTCGTCAAGGCCGGAATTACAGCCATTCGCAGACTTCGTAAAACTGACAATAATCGTTTAGCTCGTGCTTGTGGAGCGACCATTGTGAATCGGACCGAGGAGTTGAAGGAATCAGATGTGGGGACCCAGGCCGGACGGTTTGAGATTAAGAAGATAGGAGATGATTACTTCACATTTGTCACTGAATGTAAGAATCCCAAGGCCTGCACTATTCTCCTCCGCGGTGCCTCCAAGGATATTTTGAATGAAATTGAAAGAAATCTCCAGGATGCCCTTCATGTTGCAAAGAATTTGGTGCTGAATCCTCGTCTCGTGTGTGGCGGCGGAGCTGTTGAGATGTCCGTTTCCGGGGAACTGGCGTCCAAAGCGCATCACGCGGCCCCTTACAGAGCAGTCGCACAAGCGCTCGAAATAATACCTCGAACATTGGCTCAGAACTGTGGTGCGAATACAATCCGTACCCTCACAGCACTGAGAGCAAAGCATGCAGCTGGTGAGAGGAATTGCGGCATCGACGGAGAGACCGGCCTCATAGTGGACATGGCACAGAAAGGGATATGGGAACCGCTGGCTGTTAAGTTACAGGTCTATAAAACGGCGGTGGAGACGGCGATATTTCTGTTAAGGATCGACGACATCGTCTCAGGGTCCAAGAAGAAGAACAAGGAAGGCGCGAACCCGGCTGAGATGGCGAACATGCAGGAATAG

Protein sequence:

>DPOGS205402-PA
MFNKDVTHPKMRRYIENPRIILLDCPLEYKKGESQTNIEIVGEQDFTKLLQLEEEHVQRLCEDIIALKPDVVVTEKGVSDLAQHYLVKAGITAIRRLRKTDNNRLARACGATIVNRTEELKESDVGTQAGRFEIKKIGDDYFTFVTECKNPKACTILLRGASKDILNEIERNLQDALHVAKNLVLNPRLVCGGGAVEMSVSGELASKAHHAAPYRAVAQALEIIPRTLAQNCGANTIRTLTALRAKHAAGERNCGIDGETGLIVDMAQKGIWEPLAVKLQVYKTAVETAIFLLRIDDIVSGSKKKNKEGANPAEMANMQE-