Monarch geneset OGS2.0

DPOGS215878
Transcript: DPOGS215878-TA (1263 bp)
Protein: DPOGS215878-PA (420 aa)
Genomic position: DPSCF300029 - 481221-487854
RNAseq coverage: 115x (Rank: top 59%)
Annotation
Heliconius: HMEL005400, E-value 4e-128, identity 75.36%
Bombyx: (no hit listed)
Drosophila: pex2-PA, E-value 1e-42, identity 35.19%
EBI UniRef50: UniRef50_Q7PFH7, E-value 2e-46, identity 33.93%, AGAP010836-PA n=4 Tax=Culicidae RepID=Q7PFH7_ANOGA
NCBI RefSeq: XP_309872.4, E-value 5e-47, identity 33.93%, AGAP010836-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|158288010, E-value 9e-46, identity 33.93%, AGAP010836-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|158288010, E-value 1e-44, identity 34.78%, AGAP010836-PA [Anopheles gambiae str. PEST]
Group: (none listed)
Gene Ontology:
 GO:0007031, E-value 6.6e-10, peroxisome organization
 GO:0005778, E-value 6.6e-10, peroxisomal membrane
 GO:0005515, E-value 0.0001, protein binding
 GO:0008270, E-value 0.0001, zinc ion binding
KEGG pathway: aga:AgaP_AGAP010836, E-value 1e-46
 K06664 (PEX2, PXMP3) maps-> Peroxisome
InterPro domain:
 [18-205] IPR006845, E-value 6.6e-10, Pex, N-terminal
 [214-269] IPR013083, E-value 2.1e-08, Zinc finger, RING/FYVE/PHD-type
Orthology group: MCL16177, Multiple-copy universal gene

Nucleotide sequence:

>DPOGS215878-TA
ATGTCTATTACTTACATCCCAAGGGTGACCCAGTTGGATTCTTTACAATTAGATACCGAATTGGAAGAACTATTTAAGCAAAAAATTTTTCAAGCTACGAAATATTTTGAGCCCGGCCTTCTCCAGCCTATTCTTCCTGAAATCGACTTGTCAATTCGGACATGGCTATTTTTAAATTCGGTGAAAACTAATAAAAGCACTTTTGGTCAGCAAATGCTTTCTTTAAAATATAAACCAGATAATTTTGCTAGAAGCAAATTGTACTGGTATTTTGTGTATGCTATTGGGTTAAAGTATCTAAGAGATAGAGCTTTATATAGTTTTACATCTAACACCCGAGTGCAGAACTTTCTATCTAAAATAGAAACATTCCAGCTAGTAGGTGACATTTTAAATTTTCTTCGGTTCATACAAAGTGGGAAACACCCTGCTCTTATAGATTTTATTCTTGGATTAGAGTTGACAGCAGATAAATTGACAAGGGAGGACCTCACTGACTTCTCTTGGACTCGGGAATTGTTATGGCATAATCTTATTGTAAGAACGGTGCTATCATTAGTAAATATGTTTGGTTTAAAACAAAGAATGACCACAATATTGAAGTATATGTGGTGGAAAAATCGTAAATCATACAAAGCCTCGGCGAGTCAAGCAACAATGACTTTGAACACAACATGTGCATTTTGCAACGAAAAACCCGTTCTGCCACACATTATGGGCTGCTCACACATATTTTGCTATTACTGTCTCTCGGCTAATAAGACTGCGGATCCAGATTTTACATGTCCAAAATGTGATTACAATGGAAAAGAAGTTACCAAATACATCGTGGCAATGTCTAGTAAAAAACTGGACTACAGTAGTTTCGGCGTTACACATCATATATGGAACGGTATTGCTAAGCTTTTAGTGTTGCTTGTCTTTGCTATTGTGAACGTTAGCGTGGAAACCGGTCTCCCAGTCAGCGACGTAGTTGACAGTTCGCACCGAACCGTCCGGTTCCACCAAAGAATCTACCTCTGGTGTCTACAGCAGCTACTTCTGGAGTGGAAACTGTTTGATGGGATACTGACGGTGCTATATATTGCGAGGCCTGTACAGCTGGCTGTACTCTTGAGTATTGAGTCACGACTGGTGCAGAATAATGAGTCACAGATGGAGCTGAGTATTGAGCGACTGGTGATGTATATTGAACTGCCGATTGTGTTACGTAATGAGCTGGAGAAGAGTATTGTACCGATGCCGTTTGCTGAGCTAAATTAG

Protein sequence:

>DPOGS215878-PA
MSITYIPRVTQLDSLQLDTELEELFKQKIFQATKYFEPGLLQPILPEIDLSIRTWLFLNSVKTNKSTFGQQMLSLKYKPDNFARSKLYWYFVYAIGLKYLRDRALYSFTSNTRVQNFLSKIETFQLVGDILNFLRFIQSGKHPALIDFILGLELTADKLTREDLTDFSWTRELLWHNLIVRTVLSLVNMFGLKQRMTTILKYMWWKNRKSYKASASQATMTLNTTCAFCNEKPVLPHIMGCSHIFCYYCLSANKTADPDFTCPKCDYNGKEVTKYIVAMSSKKLDYSSFGVTHHIWNGIAKLLVLLVFAIVNVSVETGLPVSDVVDSSHRTVRFHQRIYLWCLQQLLLEWKLFDGILTVLYIARPVQLAVLLSIESRLVQNNESQMELSIERLVMYIELPIVLRNELEKSIVPMPFAELN-
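
The transcript and protein lengths in this record are consistent with each other: 1263 bp is 421 codons, that is, 420 residues plus one stop codon. The following is a minimal sketch of that check, assuming the standard genetic code (the record does not state which translation table was used) and assuming the trailing "-" in the protein sequence marks the stop codon. The names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (assumption: the record does not name its translation table).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Length check using the figures from the record.
transcript_bp = 1263
codons = transcript_bp // 3      # 421 codons
residues = codons - 1            # 420 residues once the stop codon is excluded

# First 24 bases and last 15 bases of DPOGS215878-TA, copied from the record.
first_codons = "ATGTCTATTACTTACATCCCAAGG"
last_codons = "GCTGAGCTAAATTAG"

print(residues)                  # 420
print(translate(first_codons))   # MSITYIPR
print(translate(last_codons))    # AELN*
```

The translated ends match the start (MSITYIPR) and end (AELN followed by the stop marker) of DPOGS215878-PA. Passing the full nucleotide sequence to `translate` would extend the same comparison to the whole protein.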