Monarch geneset OGS2.0

DPOGS212160
Transcript        DPOGS212160-TA  1596 bp
Protein           DPOGS212160-PA  531 aa
Genomic position  DPSCF300038 + 602092-603687
RNAseq coverage   223x (Rank: top 45%)
Annotation
Heliconius       HMEL012539        0.0    76.69%
Bombyx           BGIBMGA006608-TA  0.0    69.57%
Drosophila       CG14815-PA        5e-93  37.71%
EBI UniRef50     UniRef50_D6WZG5   3e-97  39.06%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WZG5_TRICA
NCBI RefSeq      XP_970686.2       6e-98  39.06%  PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastp   gi|189241213      1e-96  39.06%  PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastx   gi|189241213      3e-97  38.83%  PREDICTED: similar to predicted protein [Tribolium castaneum]
Group
Gene Ontology    GO:0005488  4e-31    binding
                 GO:0005515  1.2e-05  protein binding
KEGG pathway     tca:659268  2e-97
                 K13342 (PEX5, PXR1)  maps-> Peroxisome
InterPro domain  [1-531]    IPR024111  4.7e-128  Peroxisomal targeting signal 1 receptor family
                 [1-531]    IPR024113  4.7e-128  Peroxisomal targeting signal 1 receptor
                 [364-476]  IPR011990  4e-31     Tetratricopeptide-like helical
Orthology group  MCL15042  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212160-TA
ATGTCATTAAACAAACTTGTCGGAGGTGATTGTGGCGGTAACAATTCACTTGTTAAACTAACAAATATTGTAGGTAGAGATGGTTCCATAACACAAAACTTATCACAATCTGACAGATTTGTCAATGAATTTCTGGCACAAAATTCTCAAGTTCCTCAAACTTTTAATATGAACGCCCTTCTTAATAATATGCCAGAAGTGGAGAAAGTGTCAAACATTACTGCTCAACCATCTACAAGTCAAATTTCTAATGTTCGTCCACATATGCCATCTCCTTGGATGCATGCTCCTTCAGCATCTTTCATGCCCTCGGCAATGAGACCTTTTCAAACACCATTCCAAATAATGAGACAACCACAGACATCAAATGTTCAGATACAATATGTTAATGAATCCGAGTTGCAAAAATCTGATAGTGATGTGAAAACTAAAGCTCAAGAATATGTCAACAGTGTTAAAGAAGATGACGAACTTGCTTATAATCAATTCATGTCATTTATGAAAAGAATAAGTTCAGGTGAATTAAATCTCGGAGAAAGTCTGGAGGGGGAACAAAAAAGTATGAGCAAAGATAAAATAGTCGAAGAGATGGCTGAAAAATACAAAGATGAATGGGCTAAGTTGAGTGATGTCAATGAATACTGGGATTCTGAAGCGGCAAATGGAATAGCAAAAGAATATACATTCGCGGAAGGGAATATGATGTTGGAAAATAAAAGTGCTCTAGAACTTGGTAAGGAGAAGTTGAAGATGGGTGATATTCCAGGTGCCGTTCTTTGTTTTGAGGCGGCAGCTCAGCAGCAACCCGATTCAGCTGAAGCTTGGTTCTTACTTGGCACAACACAAGCTGAAAATGAACAAGATCCTCTAGCAATAACAGCACTAAAAAAATCCCTAGCAATTGATCCAAGGCAACTGGAAGCATATATAACCTTAGCAGCTGCATACACCAATGAGAACATGGCTAAACATGCATATTTGACATTGCTGGATTGGTTGAAGGCCAGTAGTAAATATAGTGATTTGGTTCCCCAAGACATTGATCCTAACAAAATGAGTATTAAAGAATTGGAGGCCTATTCAACATCACTATATCTGAAAGCGGCACAATTAAACCCTGTTCAAGTGGATCCTGATGTGCAAAATGCATTGGGTGTAATTTGTAACATTAATCAGCAATATGATAAAGCGGTGGATTGTTTTAAAGCAGCTCTGGCTGTGGCTTCGGATAATGCTAAACTGTGGAACAGGCTAGGAGCCACTCTTGCCAACAGTGACAGGTCTGAGGAAGCCCTGGATGCTTATCATGAGGCTCTCAACCTAGAACCGGGTTTCATAAGAGCTAGATATAATGTTGGTATCACATGCATGAATTTAGGAGCTCATAAACAAGCAGCAGAGCATTTCTTAGTTGTACTGAATCAGCAATATAAAGCTCAAAGTTCGAACCCCAATGCTTCATCAGATATAAGCTCTTCAACCATTTGGACAACATTAAGAATGGTTTGTTCCTTTATGGGCGAGCATGATGCTGCAAAATTAGTTGATGATAGAAATCTTAGTGAGCTGAACAAATTTTTTGAAGTTGAGCCGTAA

Protein sequence:

>DPOGS212160-PA
MSLNKLVGGDCGGNNSLVKLTNIVGRDGSITQNLSQSDRFVNEFLAQNSQVPQTFNMNALLNNMPEVEKVSNITAQPSTSQISNVRPHMPSPWMHAPSASFMPSAMRPFQTPFQIMRQPQTSNVQIQYVNESELQKSDSDVKTKAQEYVNSVKEDDELAYNQFMSFMKRISSGELNLGESLEGEQKSMSKDKIVEEMAEKYKDEWAKLSDVNEYWDSEAANGIAKEYTFAEGNMMLENKSALELGKEKLKMGDIPGAVLCFEAAAQQQPDSAEAWFLLGTTQAENEQDPLAITALKKSLAIDPRQLEAYITLAAAYTNENMAKHAYLTLLDWLKASSKYSDLVPQDIDPNKMSIKELEAYSTSLYLKAAQLNPVQVDPDVQNALGVICNINQQYDKAVDCFKAALAVASDNAKLWNRLGATLANSDRSEEALDAYHEALNLEPGFIRARYNVGITCMNLGAHKQAAEHFLVVLNQQYKAQSSNPNASSDISSSTIWTTLRMVCSFMGEHDAAKLVDDRNLSELNKFFEVEP-
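
Consistency check (illustrative sketch, not part of the original record): the genomic span 602092-603687 covers 1596 positions, which equals the transcript length, and 1596 bp corresponds to 532 codons, that is, 531 residues plus one stop codon. The trailing "-" in the protein sequence appears to mark that stop (TAA). The Python below assumes the standard genetic code (NCBI translation table 1); the helper names `translate` and `span_length` are hypothetical and chosen only for this example.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop, 'X' an unknown codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate span."""
    return end - start + 1


# First ten and last five codons of DPOGS212160-TA, copied from the record above.
print(translate("ATGTCATTAAACAAACTTGTCGGAGGTGAT"))
print(translate("GAAGTTGAGCCGTAA"))
print(span_length(602092, 603687))
```

Applying `translate` to the full nucleotide sequence and replacing the final "*" with "-" should give a string directly comparable to the DPOGS212160-PA sequence listed above.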