Monarch geneset OGS2.0

DPOGS206817
TranscriptDPOGS206817-TA1008 bp
ProteinDPOGS206817-PA335 aa
Genomic positionDPSCF300001 - 3746719-3751438
RNAseq coverage98x (Rank: top 61%)
Annotation
HeliconiusHMEL0100056e-13366.57% 
BombyxBGIBMGA012760-TA2e-3346.55% 
Drosophilapex12-PA2e-3130.09% 
EBI UniRef50UniRef50_D2A4278e-5236.14%Putative uncharacterized protein GLEAN_14905 n=2 Tax=Tribolium castaneum RepID=D2A427_TRICA
NCBI RefSeqXP_967344.11e-5336.14%PREDICTED: similar to peroxisome assembly protein 12 [Tribolium castaneum]
NCBI nr blastpgi|910854072e-5236.14%PREDICTED: similar to peroxisome assembly protein 12 [Tribolium castaneum]
NCBI nr blastxgi|910854073e-5136.14%PREDICTED: similar to peroxisome assembly protein 12 [Tribolium castaneum]
Group
Gene OntologyGO:00070314.3e-32peroxisome organization
GO:00057784.3e-32peroxisomal membrane
GO:00055152.8e-05protein binding
GO:00082702.8e-05zinc ion binding
KEGG pathwaytca:6556903e-53 
 K13345 (PEX12, PAF3)maps-> Peroxisome
InterPro domain[25-252] IPR0068454.3e-32Pex, N-terminal
[280-331] IPR0130835.2e-11Zinc finger, RING/FYVE/PHD-type
Orthology groupMCL11962 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206817-TA
ATGGCAGTTTACGCTGCTCATTTGACCCGCACGTTACAGGGGACGCCGTCGGTCTTTCAGGTGACTGCGCAGGAAGCCCTCGGATCTACAGTAAAGCCCGCGCTACGAAAACTTGTTGAATATTTAGCAGCAGTATATCCTGATAAGCTTAGTTGGAGTGAGCGATGGTATGATGAGTTGTACCTACTGTTGGACTGCATAGTTCAATACCATTATCTCAAACATTATGCTGCTTCTTTTTCTGAGAGCTTCTATGGCTTGGTGAGGTCACCAATAAGTCCAAATCATGAGTTCAATTCAGGCCCCCGTTTACCGCATAAGCTTGAGCAAGCCTCATTACTATTCCTAGTGGGACTGCCTTATATGCAGGACAAGATTGATAAAATATTGGAGGGATGGAGAGAAGAACTGGACGAAGGACGCCTTGGTAAGAGTAAAGGAGATCAAGCCTGTAAAGCAGCCATCAGACTATACAAAATATCAAATTTTGTGGGTGAGGTCAGCAAATTGGTGGTATTGGCCCAATACTTGACTGGCAAGAGCCCGTCTCCTACATTGTCACTGCAAATTTTGGGGTTAACATTAAAAGATGCACCCTCAGAGGAACCAGATGATAATTCTTGGGGGGATTTGTTCAGAAATGTATTGATGGGACAATTTAGCAGTGCGGTGTTATCCTTTCGAATGTGTGTCGCAGCATGTCGACTTCTTATGGAACGTGGTGCGTTCGTGGTACAACTGCTTAGGTGGTGGGAGTCTCGTGCCCCCGCAGCATGTTCCGCGCTACCTCCACCACCACCACCTCAAGCACGGAGAGACGAGTACCGATGGTTGAACGCATGCCCTATATGTCTTCAATCGTGGAGGGTGCCCACCGTGTTGCCAGTGTCAGGTTACGTGTTCTGCTACACGTGTATCTCCCGTCATTTACGTCGCTCGGGCTCGTGTCCCGTAACGAGGCTTCCGGCCAGCGAGCGCTCACTGGTCAGGCTGTACCTGGACCTGTGA

Protein sequence:

>DPOGS206817-PA
MAVYAAHLTRTLQGTPSVFQVTAQEALGSTVKPALRKLVEYLAAVYPDKLSWSERWYDELYLLLDCIVQYHYLKHYAASFSESFYGLVRSPISPNHEFNSGPRLPHKLEQASLLFLVGLPYMQDKIDKILEGWREELDEGRLGKSKGDQACKAAIRLYKISNFVGEVSKLVVLAQYLTGKSPSPTLSLQILGLTLKDAPSEEPDDNSWGDLFRNVLMGQFSSAVLSFRMCVAACRLLMERGAFVVQLLRWWESRAPAACSALPPPPPPQARRDEYRWLNACPICLQSWRVPTVLPVSGYVFCYTCISRHLRRSGSCPVTRLPASERSLVRLYLDL-