Monarch geneset OGS2.0

DPOGS201984
Transcript: DPOGS201984-TA, 1581 bp
Protein: DPOGS201984-PA, 526 aa
Genomic position: DPSCF300060 - 162815-168217
RNAseq coverage: 9x (Rank: top 85%)
Annotation
Heliconius: HMEL005616, 6e-155, 50.76%
Bombyx: BGIBMGA010395-TA, 3e-76, 52.80%
Drosophila: (none listed)
EBI UniRef50: UniRef50_UPI0002063D81, 6e-14, 22.98%, UPI0002063D81 related cluster n=3 Tax=unknown RepID=UPI0002063D81
NCBI RefSeq: (none listed)
NCBI nr blastp: gi|328782184, 2e-13, 22.98%, PREDICTED: tetratricopeptide repeat protein 18-like [Apis mellifera]
NCBI nr blastx: gi|328782184, 8e-14, 23.56%, PREDICTED: tetratricopeptide repeat protein 18-like [Apis mellifera]
Group
Gene Ontology: GO:0005488, 1.8e-15, binding
KEGG pathway: (none listed)
InterPro domain: [438-465] IPR011990, 1.8e-15, Tetratricopeptide-like helical
Orthology group: MCL25734, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201984-TA
ATGTTGGTGGATCTGGAGATAGAACCTACGAAGAGATCTTGCACTGGTCGAGGGCAACAGGAAAGATCTTGGAGTAATATCATACGTCTCGCGTCCTCTGGTCTCTCGCATGTGCCATACTTCGGAATGTCCGATATTTGCACAATAAATAGACAGCTGAGTGAAACGCGAACTCGAGTAGAGATTGTCACATCACTTTGGCAAGAAGCTGCAATTTTCGTCAATAACAATTTTGTAGTACACGATTTCCTAAATTCTGACGAAACGTTCCAGGAGATGGTAATGATAGCGCACGCTTGTCTCATGCGTATAACAAATGACGCCTTAATAGATTCAGATAAAAAACCTCCCCCGTATTCTGTACAAAAAGCTGCTAGACACGCGCGACAGCTCCACGACCTGTCACACGCTATGGATCTTTATCTACAGTTAATAGTCCAAGCGCCACGAGAAGCAGATAGCTGGCGTGAATTAGCTACGTGTTTACGAGATATAGATAAGGACTGGGCAAATGTTTGCATAGATAAATCAATTATACTAAATCCAAGACATCCGTTGAGTCTCCTCTCCAAAAGCTGTATGATATTCGAAGAAGATCCCAATGCAGCCGAGCCTTTCTTCTTGGCTCTATTAGCATTTTACCCGTTTTGGACTGCAATCTGGGCAGCTGCTAGCGCTTATTTTCTACACAAGGAAATGTTTCACATGTCGGATCAAATAATGGAACAAATGAGAAAGACACAAGCAGAGGGCTTAGCGAAGGAGCCGAGATTTCCTCGAACTTGGGAACAGGAACTGGGAGAGTGGTGGGAAGAAACCCCACTCCTACCTGGCACAAGTGTCTACTACGACGCTGCAGATCTATTGCTGAGACTGCGAGGCATTGACTTGGCAGAAATATGTATAGCCAAAGCACTTTTGGAAAACGGAGACTCAGCCGTGTATTACCATATGGTAGCTCTGTGCTGCAGACTCAAAGGGAACTACGCGGACGCTCTTTGTCATCTCCAAGAAGGAATCGACAAATATGGAGAAATTAGTTATCTTAGAAGCTTACAGGGTGAATGCTATCACAGGACAAAGGAATATAATTTATCGTTGGCTTCTTTTGAGAAATCAGGAAGTTGCAAATCCGCTTATACAACGCTGCTGTCGTTGCCTCGTCGTGACGGAGGAAGAACTCGTTCCATACTAACAGACCTGGTCCGCCGTCACCCAAGCGCATACGCATGGATGGCTTTCGCTGATGAATGGATGACGCGCAGTGCAGTAGGTGAAGGAGGAGATGCTAATGTAACAGAAGAACAAAGATCAGCATTAGCAAATGCAGAATCGTGTGCGTTCACTGCCCTGGAATACGACAGACGGGCTGCCCGAGCCTGGGCGCTACTGGCCACTTGTCTCACACCTTCAACTAGACGAAACTATTGCAGAGAAATGGCGATACTGTGTGGTTTCACGAAGAATTTGGATGACCGTCCAAAGACCTACAGCCGGGAATCGAGAGAATCATTGTGTTTCCGTATCGGCAGACCTCTGAGGGAATGTCGGTGTAAAATGTGCGAACACATTGCGTTGTAG

Protein sequence:

>DPOGS201984-PA
MLVDLEIEPTKRSCTGRGQQERSWSNIIRLASSGLSHVPYFGMSDICTINRQLSETRTRVEIVTSLWQEAAIFVNNNFVVHDFLNSDETFQEMVMIAHACLMRITNDALIDSDKKPPPYSVQKAARHARQLHDLSHAMDLYLQLIVQAPREADSWRELATCLRDIDKDWANVCIDKSIILNPRHPLSLLSKSCMIFEEDPNAAEPFFLALLAFYPFWTAIWAAASAYFLHKEMFHMSDQIMEQMRKTQAEGLAKEPRFPRTWEQELGEWWEETPLLPGTSVYYDAADLLLRLRGIDLAEICIAKALLENGDSAVYYHMVALCCRLKGNYADALCHLQEGIDKYGEISYLRSLQGECYHRTKEYNLSLASFEKSGSCKSAYTTLLSLPRRDGGRTRSILTDLVRRHPSAYAWMAFADEWMTRSAVGEGGDANVTEEQRSALANAESCAFTALEYDRRAARAWALLATCLTPSTRRNYCREMAILCGFTKNLDDRPKTYSRESRESLCFRIGRPLRECRCKMCEHIAL-
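
The transcript and protein lengths in this record (1581 bp and 526 aa) can be cross-checked against the two sequences. The following is a minimal sketch, not part of the original record. It assumes that the transcript is coding from its first base to its stop codon, that the standard nuclear genetic code applies, and that the trailing "-" in the protein sequence marks the stop codon. The names `translate` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Assumption: the record writes a stop codon as "-" rather than "*".
STOP_SYMBOL = "-"

CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3),
                         AMINO_ACIDS.replace("*", STOP_SYMBOL))
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, keeping the stop symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS of this length, excluding the stop codon."""
    return cds_length // 3 - 1


# First six and last five codons of DPOGS201984-TA, copied from the record.
print(translate("ATGTTGGTGGATCTGGAG"))   # MLVDLE
print(translate("CACATTGCGTTGTAG"))      # HIAL-
print(expected_protein_length(1581))     # 526
```

Passing the full DPOGS201984-TA sequence to `translate` allows the same comparison against the whole of DPOGS201984-PA.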