Monarch geneset OGS2.0

DPOGS213471
TranscriptDPOGS213471-TA1896 bp
ProteinDPOGS213471-PA631 aa
Genomic positionDPSCF300100 - 274987-281672
RNAseq coverage189x (Rank: top 48%)
Annotation
HeliconiusHMEL0168441e-16372.12% 
BombyxBGIBMGA004496-TA4e-17374.26% 
DrosophilaDpit47-PA4e-8545.21% 
EBI UniRef50UniRef50_E2BLB63e-9647.87%Tetratricopeptide repeat protein 4 n=1 Tax=Harpegnathos saltator RepID=E2BLB6_HARSA
NCBI RefSeqXP_002429064.14e-8949.42%Cyclophilin seven suppressor, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3287909642e-9749.20%PREDICTED: tetratricopeptide repeat protein 4 [Apis mellifera]
NCBI nr blastxgi|3838622592e-9948.94%PREDICTED: tetratricopeptide repeat protein 4-like [Megachile rotundata]
Group
Gene OntologyGO:00054886.5e-11binding
GO:00055158.7e-05protein binding
KEGG pathway 
InterPro domain[76-130] IPR0231142.1e-13Elongated TPR repeat-containing domain
[381-473] IPR0119906.5e-11Tetratricopeptide-like helical
Orthology groupMCL13045 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213471-TA
ATGACTGAGGAGGAAAGGATCGCGCTGTGTCAAAAATTGGACAAAGAACTTAATGATTTTATAGATGGCTTGGAGAAAAAGCGATATACTGAGGGATGGCCAGAAGACCGGTGGGAGGAGGAAATGGATAAACATCCGTTCTTCATGAAGTCTACTCCTGAGGACGGTGAGCTCTCCCCGTTAGCCGAGGGACTGGCCAAACTGAAATATGATCCTGAGGAAAATACCCCATTGGAACTGGCCACCAATTACAAAGAGGATGGAAATTTCAACTTCAAACACAAGAACTATCGACTTGCTATTATAGGGTACACAGAGGGTATCAAGGTAAGGTGTGATAATGCGGAAATAAACGCCAGCCTATACAATAATAGGGCTGCAGCACACTTCCATCTCAAGAACTACAGGTCAGCTTTGTATGACAGCGAGAAAGCATTGTCATTCAACCCAGGACACGAGAAATCGAGACTAAGAGCCGCAAAATCGGCACTGCAGATATCGAGATTTGATACGTGCATCGAACATTGTCAGCAACTTCTAGAACAAAAATCATCAGATAAAGAACTCTCTGAGCTAATGGCTGATGCCAAAAAGAAAAAGATGGTGGTGGCGCGAGATGAAAGAAAGAAGAAGAAGGTGGAAGCCAAGAGAAGCGAACAGAAGGATTTAGTCGTGAAAGCTGTAATCCAGCGCGGCATTAAAATATCCAAATGCGAAGATGAAGATGACATAGATCTATCAAAGTTAGAACCAACTTTGCCGGGGGCTCAGGAGTCTATAGTACATTTAGAGAATGGCATTTTAAAATGGCCGGTGCTATTCCTGTATCCGGAATATCAGACGTCCGATTTCGTGAAAGCCTGCCCAGAGGATGTGCCACTGATTCGCCAATTGGAGCAACTATTTCCAGCGCCTTGGGACGAGGCTAAAACTTACAACGTTAGAAGCATTAACGTCTATTACGAGGGTAGCGACAAAATGCCTCACGTGGTTGATCCGAAAAAGAATTTAGGTGAACTGTTAGTTGCTAAATATTATGAATTGAAGGCCGGCACTCCCGCGTTTTTCGTTATGGTACGCGGCAGTCGGGCGGAAAGCAGATTTATAGAATGTTATCTGTACCAAGAAGATGTCCAGGTAACAAGTAGAAAAGCAGAAAAGTATGCTTGCTTGTACAGCAATAGGGCTGCAGAGCATTGGAATCTCAGAAACTTCAAACAAGCACTGTATGACAGTGAGAAAGCCTTGCTACTAAACCCCGAAGACGACGAAACAAGACTAAGAGCTGCAAAATCAGCATTGGAAGCAGCTAAATATGACGCTTCTATTGAACATTGTCGGAAACTAATACAGAAAAATTGCACAGATATAGAACTCTTTGAGCTTTTGGCTATTGCTAAAATGAGGAAAAAGGAGGCTAAGACGGATGAATCGTCAGAATTAATCGTGAAGGCTGTACTTGAGCGCGGCATTAAAATATCTAAATGTAAAAATAAAAATGACATAGATATATCAAAATTAGAACCAACTTTACCGGGGGCTCGCGATTCTATGGTATATTTAGAGAATGGCGTTTTAAAATGGCCGATTCTATTTCTATATCCGGAATATGAAACCTCAGATTTCTTGACAGGCTGCCCAGAGAATGTGCCACTGATTTATCAATTGGAAAAACTGTTCCCGGCGCCTTGGGATAGGGGTAATAAATACTGCAGTGCCAATATTAAGGTTTATTACGAGGGTTGTGATAAAATGCCGCATATTGTGGACCCCAGGAGGAGCTTGGGTGAACTCTTAGTATCTACATATTATGAATTGAAAGCCGGCACCCCGATGTTTTTTGTCATGGTACGCGGCAGTTGGGTGGAGAGTATGTTTTTAGACTGTTACCTGTAA

Protein sequence:

>DPOGS213471-PA
MTEEERIALCQKLDKELNDFIDGLEKKRYTEGWPEDRWEEEMDKHPFFMKSTPEDGELSPLAEGLAKLKYDPEENTPLELATNYKEDGNFNFKHKNYRLAIIGYTEGIKVRCDNAEINASLYNNRAAAHFHLKNYRSALYDSEKALSFNPGHEKSRLRAAKSALQISRFDTCIEHCQQLLEQKSSDKELSELMADAKKKKMVVARDERKKKKVEAKRSEQKDLVVKAVIQRGIKISKCEDEDDIDLSKLEPTLPGAQESIVHLENGILKWPVLFLYPEYQTSDFVKACPEDVPLIRQLEQLFPAPWDEAKTYNVRSINVYYEGSDKMPHVVDPKKNLGELLVAKYYELKAGTPAFFVMVRGSRAESRFIECYLYQEDVQVTSRKAEKYACLYSNRAAEHWNLRNFKQALYDSEKALLLNPEDDETRLRAAKSALEAAKYDASIEHCRKLIQKNCTDIELFELLAIAKMRKKEAKTDESSELIVKAVLERGIKISKCKNKNDIDISKLEPTLPGARDSMVYLENGVLKWPILFLYPEYETSDFLTGCPENVPLIYQLEKLFPAPWDRGNKYCSANIKVYYEGCDKMPHIVDPRRSLGELLVSTYYELKAGTPMFFVMVRGSWVESMFLDCYL-