Monarch geneset OGS2.0

DPOGS201959
TranscriptDPOGS201959-TA1524 bp
ProteinDPOGS201959-PA507 aa
Genomic positionDPSCF300384 + 30453-34084
RNAseq coverage52x (Rank: top 70%)
Annotation
HeliconiusHMEL0020594e-4685.71% 
BombyxBGIBMGA011215-TA3e-17167.93% 
DrosophilaPatr-1-PA4e-4031.17% 
EBI UniRef50UniRef50_D2A5J14e-5434.95%Putative uncharacterized protein GLEAN_15163 n=1 Tax=Tribolium castaneum RepID=D2A5J1_TRICA
NCBI RefSeqXP_001809160.17e-5534.95%PREDICTED: similar to protein associated with topo II-related 1 [Tribolium castaneum]
NCBI nr blastpgi|1892382861e-5334.95%PREDICTED: similar to protein associated with topo II-related 1 [Tribolium castaneum]
NCBI nr blastxgi|1892382864e-5334.88%PREDICTED: similar to protein associated with topo II-related 1 [Tribolium castaneum]
Group
KEGG pathwaytca:1001424892e-54 
 K12617 (PATL1, PAT1)maps-> RNA degradation
InterPro domain[43-333] IPR0191671.4e-34Topoisomerase II-associated protein PAT1
Orthology groupMCL14537 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201959-TA
ATGTTAGAGACGTTTCTATCGTTCGTCCCGGCGGGCGGAGGGTGCGCACCCCTCGCACCCTCGCGGCGGCCGTGCGCCGTTCGTCTGCACCCTCGCGAACGTCCGACTCCCGTCGTTGACGGGGAGCTGGATGAGTATGCAGGTCTAATGACGGCCAGAGAGAAACAATGGCTCATCAATATCCAGATGTTGCAGCTGAACACCGGAACACCTTACATACATGACTTCTATTATACAGTTTTCCTCGAGAGACAGGCAAGCAAGGAAAAGGAAGGTGTGAAGGAGGCTCATAAGGCCAATCAACAGAACCATCCCTTCTACAGCGGAGGCAAGCAGGAAGACAGTCACGCCATGAGACAGAGAGAGAGGCACAACTCGCACAGACACAACTCCACCGGAGAAGATCCCAGGACATATGTCCCTACACAGTTTGAGAACTCCCTGGGGAAACTGCAGTGCGGTAGCGTGACGGCTCCGAGGAAAATTATAGACGTGGAAGTCGTGGGGGCGGAGCCCGAGCAGCGAGCGAGCAGGGCGCCCTCTGTGGCCAGCAACAACCCCGCTGAGGTTCCGCGCGAGATGAGAAGGACCAAGCAACTGCTTCTGGATATTGAGGCGTTGTATCTCATACTGCTCAGACTGGAAGAGCTCAACGATCCCTTGGCAATATCTAACGCTTTAATATTGAAGGAAAGAGAAGAGAAGCAGAAGCAACTGGAAGCAGCACAGAAGGAAGCAGAAGATGACGATGATGGAAACTTATTCCTTAAGAATATAGAATCTGTACAGAGACGACCCAAGCAGGAGAGTCCCAAGAGCGAGAGTATCGATAAGAGACAGGTGGTGAACCTGACCACGAACCAGAAACCTCAACCGGCCAAGAACCTGCTGGATGAAGACAAGGAGGACCTGCTCAATAAGATGTTCTCCGGACTCCTGCACGGAGAGAGAGTGCCGCAGATACTGGCGGTTAGGAAGGGGAAGTCGTTGCTAGCTCGTTTCCTGGCCCGCACCCCCGAGACTCACCCCCGTCTTCGTCCTCTATGGTCGAGGGTGCTCCGCTGTCTGCCGACCGCTGCTCGCCGCGACGAAGGCGGCGCCCTAGTGTCCCTGGAGCCCCACTTCCGGCGCTACGCTCTCGCGGCGCCCGGCTGGCCGGCGGTGGCGGAGACCTGCGCCGCGCTAGCGGAGGCCCTGGACCCCGCCCGCGGCCACCACGCGCTCGACACGCGACTGACTCTGAGCTGCGTCTGCGCGCTCACCGAGAAAGCGATGATGCTGGTGGGGGTGTCGGAGTCGTCCGGCGGCGAGCGGCAGTGGTACAGGTTCCTCAAGACGGTGGCGAAGGCCCTGAGACACGCGCCGAGCGTCGCCCCGCCCACCCGGCCCGTGGCGGAGGCCCGGCTCTCGGCGCACCTGAAGCGGCTGGAGGCCCGCGCGGGGCTCGAGATCCTGCAGAGAGGCGCCACCATCGAGCTGTCCGCCAAGGACGCCGCGGAACACCTCGTGAGGCACCTCTGCTGA

Protein sequence:

>DPOGS201959-PA
MLETFLSFVPAGGGCAPLAPSRRPCAVRLHPRERPTPVVDGELDEYAGLMTAREKQWLINIQMLQLNTGTPYIHDFYYTVFLERQASKEKEGVKEAHKANQQNHPFYSGGKQEDSHAMRQRERHNSHRHNSTGEDPRTYVPTQFENSLGKLQCGSVTAPRKIIDVEVVGAEPEQRASRAPSVASNNPAEVPREMRRTKQLLLDIEALYLILLRLEELNDPLAISNALILKEREEKQKQLEAAQKEAEDDDDGNLFLKNIESVQRRPKQESPKSESIDKRQVVNLTTNQKPQPAKNLLDEDKEDLLNKMFSGLLHGERVPQILAVRKGKSLLARFLARTPETHPRLRPLWSRVLRCLPTAARRDEGGALVSLEPHFRRYALAAPGWPAVAETCAALAEALDPARGHHALDTRLTLSCVCALTEKAMMLVGVSESSGGERQWYRFLKTVAKALRHAPSVAPPTRPVAEARLSAHLKRLEARAGLEILQRGATIELSAKDAAEHLVRHLC-