Monarch geneset OGS2.0

DPOGS202717
TranscriptDPOGS202717-TA888 bp
ProteinDPOGS202717-PA295 aa
Genomic positionDPSCF300272 + 40714-43818
RNAseq coverage655x (Rank: top 20%)
Annotation
HeliconiusHMEL0041162e-12781.41% 
BombyxBGIBMGA004502-TA5e-10768.44% 
Drosophilapex13-PA6e-5343.93% 
EBI UniRef50UniRef50_D6WSW57e-5145.28%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WSW5_TRICA
NCBI RefSeqXP_001360774.16e-5846.69%GA18339 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastpgi|2897395892e-5848.57%peroxisomal bioproteinsis protein [Glossina morsitans morsitans]
NCBI nr blastxgi|1951230956e-6946.75%GI18763 [Drosophila mojavensis]
Group
Gene OntologyGO:00165606.4e-40protein import into peroxisome matrix, docking
GO:00160216.4e-40integral to membrane
GO:00057776.4e-40peroxisome
GO:00055152.1e-10protein binding
KEGG pathwaydpo:Dpse_GA183392e-57 
 K13344 (PEX13)maps-> Peroxisome
InterPro domain[41-191] IPR0072236.4e-40Peroxin 13, N-terminal
[205-279] IPR0014522.1e-10Src homology-3 domain
Orthology groupMCL14772 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202717-TA
ATGTCCTATGGTGGGATGGGAGGGATGGGAGGCATGGGATCGTATGGTATGGGGTATGGAGGTTACAACATGGGTGGTTATGGTATGGGAGGGATGGGTATGGGTATGAGCCCCTATGGTGGATACAACAGATATGGACCTATGTACGGAGACATTGAAAGCAGATTCATTCAAATAGCTGAAGAAGGCTCCCGTCCAGCATTTGAATCTATACAGAGTGTTGTGAATGCTGTCGGTAGCGTAGCTATGATGATGGAAAACACATTTTTTGCACTGACCAGCTCTTTCAGAGCAATTCTGGGTGTAGCAGAAAATTTTGGCCGTCTAAGGTCTCTGTTTGCCCAGTTCTGGTCAACGTTTGCCGTGGTTAGGAGTTTGAACTGGCTAGTCAGAAAACTGCTTGTCATGTTAGGAATCAGAACAGAGACCGAGTTCAAGGCATGGGCGGAAGCATTGGCAGCCACCCAGTCAGGAACGGGTACACCAGAACAGAAAGCCAAGGGTTCAAGTTGGCCGATACTACTGTTCTTCGGAGTCATAGCAGCTGCACCATACATTGTCCTCAAAATGCTTAACGGCATCTCATCGAGTATACACGAGAGATTAAATGACCCCTCTAGCTGGCAGAATCCTCTTCGCGCTGTGGCTCAACACGACTTCCAAGCTACTTCCCCTCAGGAAATCAGCTTCACAACGGACCAGGTCCTAACCTTGGCTCCCCAACATCTCCAAGGTCACCTTTGGAATTCAGGCTGGCTCATGGCATCCGCTGACAGACACACAGCTGGCCTGGTGCCAGTCACGTACATCAAAGTAATCAAACCAAGCGATACAAATAAAGCAGAACCCACAGACGACTTGCACAAATATTATAATCAGGAATTGTAA

Protein sequence:

>DPOGS202717-PA
MSYGGMGGMGGMGSYGMGYGGYNMGGYGMGGMGMGMSPYGGYNRYGPMYGDIESRFIQIAEEGSRPAFESIQSVVNAVGSVAMMMENTFFALTSSFRAILGVAENFGRLRSLFAQFWSTFAVVRSLNWLVRKLLVMLGIRTETEFKAWAEALAATQSGTGTPEQKAKGSSWPILLFFGVIAAAPYIVLKMLNGISSSIHERLNDPSSWQNPLRAVAQHDFQATSPQEISFTTDQVLTLAPQHLQGHLWNSGWLMASADRHTAGLVPVTYIKVIKPSDTNKAEPTDDLHKYYNQEL-