Monarch geneset OGS2.0

DPOGS216133
TranscriptDPOGS216133-TA1353 bp
ProteinDPOGS216133-PA450 aa
Genomic positionDPSCF300182 + 492980-499994
RNAseq coverage177x (Rank: top 50%)
Annotation
HeliconiusHMEL0218004e-9468.88% 
BombyxBGIBMGA009540-TA6e-14861.22% 
Drosophila% 
EBI UniRef50UniRef50_F4WTH11e-10245.99%Tetratricopeptide repeat protein 5 n=11 Tax=Neoptera RepID=F4WTH1_ACREC
NCBI RefSeqXP_394646.31e-9744.55%PREDICTED: similar to tetratricopeptide repeat domain 5 [Apis mellifera]
NCBI nr blastpgi|3071891761e-10345.12%Tetratricopeptide repeat protein 5 [Camponotus floridanus]
NCBI nr blastxgi|3407150703e-10045.79%PREDICTED: tetratricopeptide repeat protein 5-like [Bombus terrestris]
Group
Gene OntologyGO:00054888.7e-21binding
GO:00055155e-07protein binding
KEGG pathway 
InterPro domain[78-276] IPR0119908.7e-21Tetratricopeptide-like helical
[234-264] IPR0014405e-07Tetratricopeptide TPR-1
Orthology groupMCL16994 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216133-TA
ATGTCGAACGATACTGAAGAACCAGCGGAATTGCTGGAAGATGCATCAGAGCTTCTTTTGAATTTATCTCAAGATTTGCGAGACCTATATTCCTTTAGGGATCTTTTCTTTGAAAATCATCCTTTTGAAATGGCATCAGAGAAAAACAAATGTGTTGAAGAGAAAAAACAGAAGTTAGTCGAGAAATTTGAAAATATTGATGTTGACACACAAATACCATTTTCTCACCGAGCAGAATTCCTATACTTGAAGGGTAGATGCTACAACATTAGCTCCGTCTATGATCCTCAAGCTGCCCAATGCTTAAGCAAGGCGGTGAAACTAAACCCCAATTTAGTTAGTGCCTGGAATGAGCTCGGCGAATGCTATCTGAAGAATATGAATGTTAAGGAAGCTAAAAATAGTTTTGAAGGTGCTCTTAAACATGAACGTAATCGCGTGGCACTGCGATGCCTTTCAATAATACTACGTCAAGAAAACACTGGTAGCGCCAGTGAAGCGAAATCTGCGATCTTAGCCAGTGTAGTTATGGCCAAGGAGGCCGTAGCTCAAGATACAAAGGATGGTATATCGTGGACAGTTCTCGGCAACGCTTATTTATGTCAGTTCTTCATGGTCAAACAAGATCCGGCTACGTTGAAACTATGTATGAGCGCTTACAAACAGGCCTGGTCGGATCCAATAGCTAGGGGTCAACCAGATTTATATTACAACAAGGGTGTGGCGTTGAAATATGAAGAGCAGTACAACGAAGCTCTGGAGATGTTCCGCACAGCGATGCAGCTGGATCCAGGCTGGGCTCCAGCAGTTCGCGAGCTGACAGCTCTCAAGGCACACCTGGCAGCTGCAACCACACTCGTAAGGACGAGGGGAAGGATCAAGGCCAAGAGACTCGCGAATATGGTGCGATCCATAGATCAAAGAATGCTGGGGGACTATCTTCCACAAAACTTCCAAACCTTCGGTAACAGGAAGGACGTTTTGTTGGAGCACGTAACACTTGACAAGCTGCAAGATGGCAGCAATGAGAATAAAGTTATCTTGGGACGTGTCGTCGGATCCATACACCATGAGAACTGCGTGCCATTCACATTCGCCCTAACAGACGCGAGTACGCTGTGCGTTTTGGTGTCCGTGTATAATTGGGCTGAAGGACGCGGCCCAGTGGTGGGAGACGCGGTCGCACTCCCCGAACCAGTCCTCGAACATCATCAAGGCACACATGACGACCTGGAATACGAATTCAAGAGCATCAGAGTGAACAATCCCCTCTACTTGCTCATCAACGGTAAACGTGTGAATCGCTGTCAGTTTGCCTGCACGAGAGTCAAGAGCACGTATGAAATACACTAA

Protein sequence:

>DPOGS216133-PA
MSNDTEEPAELLEDASELLLNLSQDLRDLYSFRDLFFENHPFEMASEKNKCVEEKKQKLVEKFENIDVDTQIPFSHRAEFLYLKGRCYNISSVYDPQAAQCLSKAVKLNPNLVSAWNELGECYLKNMNVKEAKNSFEGALKHERNRVALRCLSIILRQENTGSASEAKSAILASVVMAKEAVAQDTKDGISWTVLGNAYLCQFFMVKQDPATLKLCMSAYKQAWSDPIARGQPDLYYNKGVALKYEEQYNEALEMFRTAMQLDPGWAPAVRELTALKAHLAAATTLVRTRGRIKAKRLANMVRSIDQRMLGDYLPQNFQTFGNRKDVLLEHVTLDKLQDGSNENKVILGRVVGSIHHENCVPFTFALTDASTLCVLVSVYNWAEGRGPVVGDAVALPEPVLEHHQGTHDDLEYEFKSIRVNNPLYLLINGKRVNRCQFACTRVKSTYEIH-