Monarch geneset OGS2.0

DPOGS210896
TranscriptDPOGS210896-TA2913 bp
ProteinDPOGS210896-PA970 aa
Genomic positionDPSCF300045 - 739773-745736
RNAseq coverage113x (Rank: top 59%)
Annotation
HeliconiusHMEL0133000.068.10% 
BombyxBGIBMGA003775-TA0.064.35% 
Drosophilapex1-PA1e-9127.48% 
EBI UniRef50UniRef50_D6WNC91e-10629.60%Putative uncharacterized protein n=3 Tax=Endopterygota RepID=D6WNC9_TRICA
NCBI RefSeqXP_397107.32e-9829.12%PREDICTED: similar to lethal (3) 70Da CG6760-PA [Apis mellifera]
NCBI nr blastpgi|3287870172e-10128.82%PREDICTED: peroxisome biogenesis factor 1-like [Apis mellifera]
NCBI nr blastxgi|3504167823e-10228.88%PREDICTED: peroxisome biogenesis protein 1-like [Bombus impatiens]
Group
Gene OntologyGO:00055245.3e-34ATP binding
GO:00001661.4e-11nucleotide binding
GO:00171111.4e-11nucleoside-triphosphatase activity
GO:00070319.4e-08peroxisome organization
GO:00057779.4e-08peroxisome
KEGG pathwayame:4136666e-98 
 K13338 (PEX1)maps-> Peroxisome
InterPro domain[716-844] IPR0039595.3e-34ATPase, AAA-type, core
[712-848] IPR0035931.4e-11ATPase, AAA+ type, core
[90-145] IPR0153429.4e-08Peroxisome biogenesis factor 1, N-terminal
Orthology groupMCL12438 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210896-TA
ATGTTGGGGGGAAACAAATTAAAAGTATCATTTACACATGAAAGATCTTGTTTCGGATATATTAGTACTAAACATAGTACGAACAATGAACAGTCACAATGTGTGCAAGTATTGTATAGAGAGAAACAAATATTTCTCTGGGCAGTATTCAACAGTAATATTCCAGAGGGTAACATAGCGTGTAATCCTGTCTTCAGTAAAGTTGTTGGGTTAGACGAGGGTGTGGAGGTGTTTGTGGCTCCGTATGGAGACGTTAAGGTTCTAGACCAGTTGTACATTGACACCGATAGCCCCGATGACCAGGAAATACTCGAACACAATGTGGAGGTTTTGCAATTAAGGATTTTAGACCAATTAAGGCTTGTGGCTGCTGATCAGAAGGCTGTTGTGTGGATATCTACATCTATGCCTATCGTTTTCACTCCAAAGACGACCGGTCTACTGGTCAATCATAGCAGAATCATTGTCAAAATAGATGCTTTCAACAGTCTCGGTAGTGACTTTAGTAATCAGGGGGCGGCAACAAAGGGATCACACCAAAAATATAATAACATAGGTATTATCAACGATGGTCTTCTCAGTCCATACTTAAATCCACAAAACAGATTAATATTAAGAGCTCTGCCTATAGAGGGCGACGGGAAGAAAAATTTAATACATCCCTATACTGTGTTCATACATGAAGACTTGATTGATGAAAGTTATAAAGATTTAATGGTGATCCTGGCCACTATGAACCACATTCCATCGGTGTTGCAAGAAGATGAGGTTGAATCCCATAAAATAGACGGCATTTGTGTGGAAATAGTCACCATAGACAATACGGTCTTCAGAAGTCTCTGTAGAGAGGTTTACAATGAAAATATACCGACTGTACTCATACCGAAGTCACTGAACGTTATAATAAATGTTGAGATGGGTGTGAAGCTAATTTTTAACATAATCGGTGATAAAGTAGAGCTTCCCGATCACGTGGACATCATAACGTACTCGGAGAAAACTCAAACGGAGATTGATGTTATAGAAAAGTTTAAGAAGTGTGTCGTCGAAAACACACACTCCGGTAAAATGTTCCTCATAAATGACGGCATGGTGAAGCAGAACACACACATCAGTCACGGGTTTCTGAGGTTCAAGTTGCAACCTGAAAGGTTAAAATACACTATGCTGAACTCAGAGTCGTTTAGGAACTGCAGTGTGGCCGCCAAATGCTTGACTGATGCAGACTTCGACTGTCCCAAGAAAGTAACACATCAACTAGAATATGACTTTAAGAATTATTGTCGCAGTATAAAGTCAAGTCAGGAGCTGGTTGATGATATCCTGTCACACATACATTTTGAGATACAAAGGGAGGCGAGTTTCAAAGGGGTATCGGAGATCAAGAGCAATGTCCTTATAACCGGTTCAAGCGGCGCGGGAAAGTCTGCTATATGTCATATAGTACAAAAAGAATTAACTCTCTGGTCCCACATACTACACTGCCGGGCGCTCAAAGGCCGGAAGGACGTCACTGAAGTCATAGGGAAAGCTATACTGATGTGTCAGGAACATAGTCCATCAGTGCTGATATGTGACGACCTCGATGCTTTAATACCGCCGAACACAGAGGGCGCCTCCCCACAAGACATCGCATACTATCAGAGATTGGCGTCGTGTATTAAACAGATGTTACAGTCTTGTTCGTCTGTGTGCGTGTTGATGACGTCACTCAGTCTGAGGTCGTTGCACCCGGTCCTGAGACAGTTCAGCGGGCGACCCCTGTTCACGGCACACTTTGACATACCGCAGATGAATCAGGATGATAGAATAGAAGTTTTCAAACACTTACTGAATGATAAAATCCGCGAGTCGGTGCTGGTGGAGGAGCAGGACGTGGCGACCCTGGCCACCGACACGGCCGGCTGTAACGTCCGGGAGATACTGGAGTACTTCAATAAGAGAATATTCAAGGCCGTCAAGAATAAGTCCAAGCCGTCAGACAGGCCGCGTCTAATAGCCGACATATCCAAGGAGCTGGAGAAGGCGAACACCTTCGATATATGGGGTTCTGTGGGGGGGATGCACGACGTGAAGAGACAGATCACTGAGGCCATATTCTGGCCAATCATGTACCCGAGTCTGTTCCCATCATCTTCATGTGGCATCTTGCTGTATGGGCCTCCGGGGTCGGGGAAGTCGCTGATAGGATCCTGCTTGTCCTCGTTGACCAACATGAGGGTCCTCACAGTCAAGGGACCGGAATTACTGTCTAAATACATCGGGCAAAGCGAGAAGGCTGTCCGGGATATATTCGATAAAGCGGACATGCAGCGTCCCTGTATCCTGTTCTTCGACGAGTTCGACAGTCTGGCGCCCAAACGCGGCCACGACTCCACGGGTGTGACGGACCGCGTGGTGAACCAGCTGCTGTCTCGGCTGGACGGGGCGGAGGGCGGCGCGCGCGGGCCCGTGCTGGCGGCCACCTCGCGCCCGGACCTGGTGGACCCGGCGCTGCTGCGGCCCGGCCGGCTCATGCTGCACCTGTACTGTGGCCTGCCCGACCAGGCCGACCGTGTGGAGGTGCTCCGGTGCCTGTCGAGGAGCGTGTGCCTGTCCCGGGAGGTGGATCTGTCGTGGCTGTCGTGTCGTACTGAAGGCTACTCCGCCGCCGACCTCAAGTCCCTGCTAGTGACGGCGCAGCTCACCAGGCTCGAGAAGCAATTGGCCGCGAGCGACGACAAAACCTTGGAGTCGGTGGTAGTGTTGAAGGAGGACGTAGAGGACGCGCTCCGGGAGACTTCGCCCTCGCTGTCACCAGAACAGAGGCTGTTCTATGACACTATCTACCGTCGTTTCCGCGGGGAGCCGCTCTCCCCGCAACAGACGCGCCTGCAGCATCGCCTCGACCGCCAGAGAGTCACACTCGCCTGA

Protein sequence:

>DPOGS210896-PA
MLGGNKLKVSFTHERSCFGYISTKHSTNNEQSQCVQVLYREKQIFLWAVFNSNIPEGNIACNPVFSKVVGLDEGVEVFVAPYGDVKVLDQLYIDTDSPDDQEILEHNVEVLQLRILDQLRLVAADQKAVVWISTSMPIVFTPKTTGLLVNHSRIIVKIDAFNSLGSDFSNQGAATKGSHQKYNNIGIINDGLLSPYLNPQNRLILRALPIEGDGKKNLIHPYTVFIHEDLIDESYKDLMVILATMNHIPSVLQEDEVESHKIDGICVEIVTIDNTVFRSLCREVYNENIPTVLIPKSLNVIINVEMGVKLIFNIIGDKVELPDHVDIITYSEKTQTEIDVIEKFKKCVVENTHSGKMFLINDGMVKQNTHISHGFLRFKLQPERLKYTMLNSESFRNCSVAAKCLTDADFDCPKKVTHQLEYDFKNYCRSIKSSQELVDDILSHIHFEIQREASFKGVSEIKSNVLITGSSGAGKSAICHIVQKELTLWSHILHCRALKGRKDVTEVIGKAILMCQEHSPSVLICDDLDALIPPNTEGASPQDIAYYQRLASCIKQMLQSCSSVCVLMTSLSLRSLHPVLRQFSGRPLFTAHFDIPQMNQDDRIEVFKHLLNDKIRESVLVEEQDVATLATDTAGCNVREILEYFNKRIFKAVKNKSKPSDRPRLIADISKELEKANTFDIWGSVGGMHDVKRQITEAIFWPIMYPSLFPSSSCGILLYGPPGSGKSLIGSCLSSLTNMRVLTVKGPELLSKYIGQSEKAVRDIFDKADMQRPCILFFDEFDSLAPKRGHDSTGVTDRVVNQLLSRLDGAEGGARGPVLAATSRPDLVDPALLRPGRLMLHLYCGLPDQADRVEVLRCLSRSVCLSREVDLSWLSCRTEGYSAADLKSLLVTAQLTRLEKQLAASDDKTLESVVVLKEDVEDALRETSPSLSPEQRLFYDTIYRRFRGEPLSPQQTRLQHRLDRQRVTLA-