Monarch geneset OGS2.0

DPOGS209675
TranscriptDPOGS209675-TA2439 bp
ProteinDPOGS209675-PA812 aa
Genomic positionDPSCF300134 - 70231-83422
RNAseq coverage671x (Rank: top 19%)
Annotation
HeliconiusHMEL0021701e-14650.65% 
BombyxBGIBMGA000708-TA1e-13748.95% 
Drosophila% 
EBI UniRef50UniRef50_E2AAW24e-12138.86%Tetratricopeptide repeat protein 39B n=2 Tax=Camponotus floridanus RepID=E2AAW2_CAMFO
NCBI RefSeqXP_973681.19e-12236.11%PREDICTED: similar to TPR repeat-containing protein C9orf52 [Tribolium castaneum]
NCBI nr blastpgi|3320228718e-12340.32%Tetratricopeptide repeat protein 39B [Acromyrmex echinatior]
NCBI nr blastxgi|3320228712e-11940.57%Tetratricopeptide repeat protein 39B [Acromyrmex echinatior]
Group
KEGG pathway 
InterPro domain[159-615] IPR0194121.9e-102Outer membrane protein, IML2, mitochondrial/Tetratricopeptide repeat protein 39
Orthology groupMCL14663 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209675-TA
ATGTTTTATGTGACAAGAATAGCGACGCGTCACGACGTGCTCGGCGAAGACGAGCTGGGGAATCCGATCGTGATGCCGGAGTTTGTGGCAGCAGCAAGGACGCCGCCCAGCACACCGATGGCTACGACCGAATTTATCCAAGAAAGAATGACGGGACCCACTGCTACAGAACAACAATTAGATGCAACTACCGCTCTGGGGATTACGGCGGAGACACAATTCAATACATCAAATGGCGAACCGCAACCCAGTACGTCAGGATACCAACCTCAACGTGGACCAAGGAACCGTGACCGACGGGACAGCTACTCTGGCATCGCTCCAAGAGCGGAACCGGAGGACACTGATAGTGATGAATTCTTTGAAGCGGAAGATCATGGGTCTTCACCACCGAGCAGTCCAATAGTTCCCTGCAACCCTTCAACAGCAATATCGACTGAGTTGACACTCTACTCGGCGCTGGGAGACTGTGAACGTGCTCTTCAGCTCTTCTTCAACAATAATCTTGACGGAGCCATATCCATTATGACCCCGTGGAAAGACGTATCGCTATACCACGCTCACGGTGCTGCCATTTTCGAGTTCATACCGGCGATGCTAACCCTAGATCCGGTTCAAATACAAAAAGCAAACGACGCTTTGAGACATACTTTCGAACTGTGCTCACAACATCGCCGGAACTACAACTTTGTTCAGTCTATTGGAGCCATCATTAGGAGGCCAAATTACGCGACTTATACTGAGGAAGAGGCACACGCTGAACTCATTCATGCTGAGGCAGTGATGCTACAGGCTTGTATCGGAGCCCTCGAGAACGAAGATGTAGCGGGCTTAGTGAAAGCCTCCTTCCGCATTAAATCGTCCTTTAACAGCTACAAGCAATGCGCTAAGATATTGTCGCGTAAGGAATGGGAGAGTGCTGAATGTAAGGCTCACTTTGAGAGCGGCGTCAGACTCGGCCTGGCGACGTTCGATGTCATGATATCACTGCTACCTCCCCGCCTGATAACACTCTTGGAGTTTATAGGATTCAGCGCTAATAAGGAAAAAGGCATAGAGGATCTGAAGATGGAGGCCAAATCCCCAGGATTACGGTCTGTGCTCTGTTCACTGACTCTCTTGATCTATTATCTAGTTATTGGACACTTTGCCGCTATTGTACCGGATTATTTTGCTACGGAACAAATGATTTCTAGAGGATTGGCGAGGTATCCGAACAGCGTGTGGTTTACGATGTTCAAAGCTCGGATGCTACTACTTCGGGGTATGATCAACGAAGCGGTGGAATCGTATCAGATGGCGACTCACACTGAGAATCTATGGCCGCAACTAAAACATCTATGTTACTGGGAAATGATATGGGCATACGGAATAATGATGGAATGGTCTCAAGCCGCGTGGTATGCGAGTAGACTAGGCGTGGAGAGTAAGTGGTCCCGCACTATATACATGTACACGGAGGCCGCCATGCACTTAGAGCGGGGGGACGGACTCACCACCGACCAGAGGCGACACTGTGAACTACTACTCAGGAAGGCCACAAGCTACAAACAGAGAGTCATGGGTCGTTCGCTGCCAATGGAGAAATTCGTTATCCGTCGCTGCGAGAGATGGCTTCGTCGCGGATACCTCACCCTGCCCGGCGTCGAGTTGATGTGTTTGTGGAACATGTTTCCATCACTCGCCGCCGAGCCTCACTACGCCGACAGAATGCTCAAACACATCGAAAAATTATGCGACGAAGTCGAAACAACCCTGAGGAGCAGACGGAGCGACTCGTCCAGCGACACTCCGGACTATGATAAGGAGGACCTTGCCGTGGTCAAATATCTGAACGGCAGTCTATTGGCAGCCATGTCCTTGCCCAGGCTCGCGTTGAGACATCTCGAAGTGGTGCTGACGATGAAAGACGAAATTAAGGACGGCACACATCTAGTACCTTTCACGGCTGTCGAGATAGCCATGTGCCACTACGCGCTCGGCGACTCGTACCAGGCCGTGGATATATTACACGACGCGAGGAAAAAATACGCTGGTTACTTACTAGAAGCCAGACTACAGTTCCGCATACATTCGAAGCTGGAGTTGATAAACGCGGGTGCAGCCAACACGGTTGTGGCTGGGACCACGGCTAGTGCTGTACCGATGGCCAGCGTCAACGTGCTCGCAGCTCGACCATCGACATCGACATCGTCATCCAGCAGACCAATCACTACATCCAGAGAGGTCGTCGAGAACAGATCAGGCAGGATGACGACCAGGGAAGTTCTGAACACGATAGCCGATTCAAGGGCAATAATGGCGGCCAGTTCGCATGTTAGAAACGAACCTGAAGAGATCGCTCTGCAAAGAGCGAACCCAGGGCACGCGCACGTGAACGCGAACGCGAACACGAACGCGAACACGAACGCGAACGCGAACTCGAACACGAATAGATGA

Protein sequence:

>DPOGS209675-PA
MFYVTRIATRHDVLGEDELGNPIVMPEFVAAARTPPSTPMATTEFIQERMTGPTATEQQLDATTALGITAETQFNTSNGEPQPSTSGYQPQRGPRNRDRRDSYSGIAPRAEPEDTDSDEFFEAEDHGSSPPSSPIVPCNPSTAISTELTLYSALGDCERALQLFFNNNLDGAISIMTPWKDVSLYHAHGAAIFEFIPAMLTLDPVQIQKANDALRHTFELCSQHRRNYNFVQSIGAIIRRPNYATYTEEEAHAELIHAEAVMLQACIGALENEDVAGLVKASFRIKSSFNSYKQCAKILSRKEWESAECKAHFESGVRLGLATFDVMISLLPPRLITLLEFIGFSANKEKGIEDLKMEAKSPGLRSVLCSLTLLIYYLVIGHFAAIVPDYFATEQMISRGLARYPNSVWFTMFKARMLLLRGMINEAVESYQMATHTENLWPQLKHLCYWEMIWAYGIMMEWSQAAWYASRLGVESKWSRTIYMYTEAAMHLERGDGLTTDQRRHCELLLRKATSYKQRVMGRSLPMEKFVIRRCERWLRRGYLTLPGVELMCLWNMFPSLAAEPHYADRMLKHIEKLCDEVETTLRSRRSDSSSDTPDYDKEDLAVVKYLNGSLLAAMSLPRLALRHLEVVLTMKDEIKDGTHLVPFTAVEIAMCHYALGDSYQAVDILHDARKKYAGYLLEARLQFRIHSKLELINAGAANTVVAGTTASAVPMASVNVLAARPSTSTSSSSRPITTSREVVENRSGRMTTREVLNTIADSRAIMAASSHVRNEPEEIALQRANPGHAHVNANANTNANTNANANSNTNR-