Monarch geneset OGS2.0

DPOGS201958
TranscriptDPOGS201958-TA3003 bp
ProteinDPOGS201958-PA1000 aa
Genomic positionDPSCF300384 + 20428-30052
RNAseq coverage535x (Rank: top 23%)
Annotation
HeliconiusHMEL0020591e-7145.07% 
BombyxBGIBMGA011215-TA0.057.35% 
DrosophilaPatr-1-PA5e-4131.17% 
EBI UniRef50UniRef50_D2A5J12e-5333.91%Putative uncharacterized protein GLEAN_15163 n=1 Tax=Tribolium castaneum RepID=D2A5J1_TRICA
NCBI RefSeqXP_001809160.14e-5433.91%PREDICTED: similar to protein associated with topo II-related 1 [Tribolium castaneum]
NCBI nr blastpgi|1892382869e-5333.91%PREDICTED: similar to protein associated with topo II-related 1 [Tribolium castaneum]
NCBI nr blastxgi|1892382862e-5333.40%PREDICTED: similar to protein associated with topo II-related 1 [Tribolium castaneum]
Group
KEGG pathwaytca:1001424891e-53 
 K12617 (PATL1, PAT1)maps-> RNA degradation
InterPro domain[304-826] IPR0191679.3e-36Topoisomerase II-associated protein PAT1
Orthology groupMCL14537 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201958-TA
ATGGCAGATTCTTTTTTCGGGATCGATACTTCGTCATCGAACTTAAATGACGACGAGGGTGGCGGGGAGCCTTCGGAAGACGAGTATGATGCCCTCAACGATGAAACATTTGGACAGGATTCTGAAGAGTTTGACTGGGAGTATGAACACGAACAGCTGGCCGGACAGCTGGAGAGCAGTCGACGGAATGCAGCACTTGACGACGCTGACTCCAGGCTTGAGGCATCTCTCTCTCAGCTAGTGTTGGATGAAACGGATGCTCCACGAAGCCTCGGGTCCAGCGTCTGGAGACATGACGTTCCGTTTCCCACACCAACAACGCCGGTACAACAGCCAGCCCTCAAGAATGTGTGCACAGTGGAGGAACTGGAAAGACAGCTGCGACAGAACCAGCAACAGACGTACACACAGAACTACTTTCAACCGCGGTTTCCACCTGTTATCCTGCAACGGCCTCCAGGTCTACAGGCACCACTTCCGATACCGTTCGCACCACAGCAATCTATGGCACATAATATGAATCAAATGAACCAGCCGTTGAATAAAATGATTGGACAAAACTCACAAATCAACCAAATCGGCCAATCGAATATGATGAATCATATGAACCAGAACATAAATCAGATTGGTTCTAATATAAATCAGATGAATCAGAATGTAAATCAGATGGGTCAAGGTGTTAATCACATGGGACAAAATGTTATGGGTCAAAATCAAATGGGACAAAATGTTATGGGTCAGAATCAAATGGGGCACAATGTGAACCAATTCATGCAAAATTCAAACCAGATGAATCAGTTTCAAAACAACCAAATGGCACCGGCGATGATGAACCAAATGCATAACATGAATCTTCATCCCATGAATCAGAACAATCAAATGATTCAAAACCAAATGGGTCAAAACATGAATCAGATGGTACAAAACTCTGGGTCTAACATGAATCAAATGAATCAGGGGGGTCAAAACATGAATCAAATGGGCCAAAATATGAATCAAATGGGTCAAAATATGAACCAAATGGGTCAAAATATGAATCAGATGGGTCAAAATCAGATGATCCCAAACGGTAACCAGTTTGGTATGAATCAGTTCCAACAAATGATGGGTCAGCCTCGAATGATGGCGCCGCCCCCCGGCATGAATATGCCAAGACAAAATTTTGGCAATAATATGAATCAATTTAATACAAATTTTCGTGGACAAATGCAATCAAAAAACGATCAGCAAATATCACAACAGAACATGATTATGAACCACAATAAAAACCAGAACCAGGTAAAACCGAACCAGCTGTCACCGAAGCAGAAACCGGTACAGAATTTGAACCACATCCAGAATATGAACCAGGCTCAGAACAACAAGTTGCAGAAGAGTCGCGTATATAATAACAAGAGCTTAACATCTCAAAACCTGGTCCAGCTGATCCAGAACACGCATCCCATGTTGAACTTCAACAACAGCTTCCACAACGCCAGCCATCACCCCATCCTCAACAGAAACCATTTCAACAACCAGCTCATGAAACATCTCATGTTCGACAACAGGCAGAACGGGAATTTTGGCAACACAACCAGCAGAGCGAACACCTCAGTTGACGGGGAGCTGGATGAGTATGCAGGTCTAATGACGGCCAGAGAGAAACAATGGCTCATCAATATCCAGATGTTGCAGCTGAACACCGGAACACCTTACATACATGACTTCTATTATACAGTTTTCCTCGAGAGACAGGCAAGCAAGGAAAAGGAAGGTGTGAAGGAGGCTCATAAGGCCAATCAACAGAACCATCCCTTCTACAGCGGAGGCAAGCAGGAAGACAGTCACGCCATGAGACAGAGAGAGAGGCACAACTCGCACAGACACAACTCCACCGGAGAAGATCCCAGGACATATGTCCCTACACAGTTTGAGAACTCCCTGGGGAAACTGCAGTGCGGTAGCGTGACGGCTCCGAGGAAAATTATAGACGTGGAAGTCGTGGGGGCGGAGCCCGAGCAGCGAGCGAGCAGGGCGCCCTCTGTGGCCAGCAACAACCCCACTGAGGTTCCGCGCGAGATGAGAAGGACCAAGCAACTGCTTCTGGATATTGAGGCGTTGTATCTCATACTGCTCAGACTGGAAGAGCTCAACGATCCCTTGGCAATATCTAACGCTTTAATATTGAAGGAAAGAGAAGAGAAGCAGAAGCAACTGGAAGCGGCGCAGAAGGAAGCAGAAGATGACGATGATGGAAACTTATTCCTTAAGAATATAGAATCTGTACAGAGACGACCCAAGCAGGAGAGTCCCAAGAGCGAGAGTATCGATAAGAGACAGGTGGTGAACCTGACCACGAACCAGAAACCTCAACCGGCCAAGAACCTGCTGGATGAAGACAAGGAGGACCTGCTCAATAAGATGTTCTCCGGACTCCTGCACGGAGAGAGAGTGCCGCAGATACTGGCGGTTAGGAAGGGGAAGTCGTTGCTAGCTCGTTTCCTGGCCCGCACCCCCGAGACTCACCCCCGTCTTCGTCCTCTATGGTCGAGGGTGCTCCGCTGTCTGCCGACCGCTGCTCGCCGCGACGAAGGCGGCGCCCTAGTGTCCCTGGAGCCCCACTTCCGGCGCTACGCTCTCGCGGCGCCCGGCTGGCCGGCGGTGGCGGAGACCTGCGCCGCGCTAGCGGAGGCCCTGGACCCCGCCCGCGGCCACCACGCGCTCGACACGCGACTGACTCTGAGCTGCGTCTGCGCGCTCACCGAGAAAGCGATGATGCTGGTGGGGGTGTCGGAGTCGTCCGGCGGCGAGCGGCAGTGGTACAGGTTCCTCAAGACGGTGGCGAAGGCCCTGAGACACGCGCCGAGCGTCGCCCCGCCCACCCGGCCCGTGGCGGAGGCCCGGCTCTCGGCGCACCTGAAGCGGCTGGAGGCCCGCGCGGGGCTCGAGATCCTGCAGAGAGGCGCCACCATCGAGCTGTCCGCCAAGGACGCCGCGGAACACCTCGTGAGGCACCTCTGCTGA

Protein sequence:

>DPOGS201958-PA
MADSFFGIDTSSSNLNDDEGGGEPSEDEYDALNDETFGQDSEEFDWEYEHEQLAGQLESSRRNAALDDADSRLEASLSQLVLDETDAPRSLGSSVWRHDVPFPTPTTPVQQPALKNVCTVEELERQLRQNQQQTYTQNYFQPRFPPVILQRPPGLQAPLPIPFAPQQSMAHNMNQMNQPLNKMIGQNSQINQIGQSNMMNHMNQNINQIGSNINQMNQNVNQMGQGVNHMGQNVMGQNQMGQNVMGQNQMGHNVNQFMQNSNQMNQFQNNQMAPAMMNQMHNMNLHPMNQNNQMIQNQMGQNMNQMVQNSGSNMNQMNQGGQNMNQMGQNMNQMGQNMNQMGQNMNQMGQNQMIPNGNQFGMNQFQQMMGQPRMMAPPPGMNMPRQNFGNNMNQFNTNFRGQMQSKNDQQISQQNMIMNHNKNQNQVKPNQLSPKQKPVQNLNHIQNMNQAQNNKLQKSRVYNNKSLTSQNLVQLIQNTHPMLNFNNSFHNASHHPILNRNHFNNQLMKHLMFDNRQNGNFGNTTSRANTSVDGELDEYAGLMTAREKQWLINIQMLQLNTGTPYIHDFYYTVFLERQASKEKEGVKEAHKANQQNHPFYSGGKQEDSHAMRQRERHNSHRHNSTGEDPRTYVPTQFENSLGKLQCGSVTAPRKIIDVEVVGAEPEQRASRAPSVASNNPTEVPREMRRTKQLLLDIEALYLILLRLEELNDPLAISNALILKEREEKQKQLEAAQKEAEDDDDGNLFLKNIESVQRRPKQESPKSESIDKRQVVNLTTNQKPQPAKNLLDEDKEDLLNKMFSGLLHGERVPQILAVRKGKSLLARFLARTPETHPRLRPLWSRVLRCLPTAARRDEGGALVSLEPHFRRYALAAPGWPAVAETCAALAEALDPARGHHALDTRLTLSCVCALTEKAMMLVGVSESSGGERQWYRFLKTVAKALRHAPSVAPPTRPVAEARLSAHLKRLEARAGLEILQRGATIELSAKDAAEHLVRHLC-