Monarch geneset OGS2.0

DPOGS204990
TranscriptDPOGS204990-TA1116 bp
ProteinDPOGS204990-PA371 aa
Genomic positionDPSCF300123 + 92867-93982
RNAseq coverage778x (Rank: top 17%)
Annotation
HeliconiusHMEL0094998e-18080.59% 
BombyxBGIBMGA010225-TA9e-16073.85% 
DrosophilaCG6859-PA3e-6838.34% 
EBI UniRef50UniRef50_Q2F5N62e-15172.24%Peroxisomal biogenesis factor 3 n=1 Tax=Bombyx mori RepID=Q2F5N6_BOMMO
NCBI RefSeqNP_001040264.13e-15272.24%peroxisomal biogenesis factor 3 [Bombyx mori]
NCBI nr blastpgi|1140526596e-15172.24%peroxisomal biogenesis factor 3 [Bombyx mori]
NCBI nr blastxgi|1140526594e-14672.24%peroxisomal biogenesis factor 3 [Bombyx mori]
Group
Gene OntologyGO:00070312e-46peroxisome organization
GO:00057792e-46integral to peroxisomal membrane
KEGG pathwaytca:6585692e-72 
 K13336 (PEX3)maps-> Peroxisome
InterPro domain[96-368] IPR0069662e-46Peroxin-3
Orthology groupMCL13246 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204990-TA
ATGTTCTCTACTATTAAACACTTTCTTTATCGTCATAGAAGGAAATTCGTAGTAACAGGTGCAGTGTTCGGTTCCTTATATTTATTACTCGGCTATGCACAAAAAAGACTGAGGGAATGGCAAGAGAAAGAAGCAAAAAAGTTCTTTGACATGACCCGTAAAAAACAGCATTTTGAAAGCACAGAGAGGACATGTAATCAAACTATACTTTCACTGTCTAAAATAGTTTCAGAAAGCATTGTTGGAATCATTGACACTGAAGATGTAGTTCAGAAACTACATAACAAACCTGAAAACAAAAAAACATTATGGGAAGAGCTCAAGATAATGATTTTCACAAGAATTTGTGTCCTTGTTTATGCTCTATCCATACTAAATGTCACTCTTAGAGTACAATTAAATGTTATAGGGGGATATCTTTATAAGGATTCTGTGCAGGAAGAGGAACCTCTTATAGATAGTGAATTACAAGCAAAATATCTGTCTTTATGCCACCATTTTGTTGGATCTGGAGTAGAAGACTTGGTCCGACAAATAGAAAAGGCTGTCAAGAAAGTTGTAGAATCCATTCCTTTGACCAAGAAAATAACCCTCCAAGAAGTAGAACAAGTGTTTTGGTCTGTGCAAACTATACTGTGCACGGATACCAACGGTGATCCTGTTAAAAAGATGGTTCATTACTTGGTCGACCACACAGTCATTAATGAAGCCAAGTTTGACACTATTGTTAAGGAAACAATGGATATTTTAGAAAGCGATGAAGTCATTTCAGTTGCTATGTCGACCGTCAGTAGGAGTTTCTCGTCTGTGGTGGATGAGGTCGCTAATATATTCTCCTCGAAATGGATCCCCACAAAGAAAAATCATTTGGAAGTCGTGGAGAATCATGTCGTAACTAATGGAGCGCTTAAATTGGACAGTTCAGAACCATTTGTGGATGTGAATAAAATTGAAATGAGTTTTGTATTGTTATTGGCACATATGAATAAACTTATCACGGAAAATAATTGTAAAGGCAACATAAACATTCCAGATCTAATAACCCAGCAGCTGACATTGAATGAAAAGTTAAAACTGTTGGGCGCTAACATTTACGAAGTTTTTAGCAGCCCGTAG

Protein sequence:

>DPOGS204990-PA
MFSTIKHFLYRHRRKFVVTGAVFGSLYLLLGYAQKRLREWQEKEAKKFFDMTRKKQHFESTERTCNQTILSLSKIVSESIVGIIDTEDVVQKLHNKPENKKTLWEELKIMIFTRICVLVYALSILNVTLRVQLNVIGGYLYKDSVQEEEPLIDSELQAKYLSLCHHFVGSGVEDLVRQIEKAVKKVVESIPLTKKITLQEVEQVFWSVQTILCTDTNGDPVKKMVHYLVDHTVINEAKFDTIVKETMDILESDEVISVAMSTVSRSFSSVVDEVANIFSSKWIPTKKNHLEVVENHVVTNGALKLDSSEPFVDVNKIEMSFVLLLAHMNKLITENNCKGNINIPDLITQQLTLNEKLKLLGANIYEVFSSP-