Monarch geneset OGS2.0

DPOGS210049
TranscriptDPOGS210049-TA1968 bp
ProteinDPOGS210049-PA655 aa
Genomic positionDPSCF300017 - 1110017-1124696
RNAseq coverage1186x (Rank: top 11%)
Annotation
HeliconiusHMEL0104241e-16269.11% 
BombyxBGIBMGA012688-TA0.087.98% 
DrosophilaCG12703-PA0.061.64% 
EBI UniRef50UniRef50_P282880.056.86%ATP-binding cassette sub-family D member 3 n=100 Tax=root RepID=ABCD3_HUMAN
NCBI RefSeqXP_310656.20.063.46%AGAP000440-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|583806070.063.46%AGAP000440-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|2420131690.064.42%peroxisomal membrane protein 70 abcd3, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00160202.8e-82membrane
GO:00068102.8e-82transport
GO:00055241.2e-09ATP binding
GO:00168871.2e-09ATPase activity
GO:00001668e-05nucleotide binding
GO:00171118e-05nucleoside-triphosphatase activity
KEGG pathwayaga:AgaP_AGAP0004400.0 
 K05677 (ABCD3, PMP70)maps-> Peroxisome
    ABC transporters
InterPro domain[65-336] IPR0105092.8e-82ABC transporter, N-terminal
[501-595] IPR0034391.2e-09ABC transporter-like
Orthology groupMCL14135 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210049-TA
ATGGCTCCAAATTTTAGTAAAATTACGTCAAGGACAGATGTGAAACTTGGACTTGCTGCCTCTGCGGCTCTAAGTGCTTGGATCTTGAGGAATTTCTTAAAGTCCAAGAAAAATGTCAAACAAGGCCAATTATCACCAGCAGAGACCGTGCAATATATGATAAAAGATAAAGACAGAAAAGGTCCAAAAGCACAAGTTGATGCACAGTTCTTTGCGGAGCTAAAGTCATTATGGAAGATCATGGTTCCTGGTCTTTGGACCAAGGAGAGTGGCTTTATGGCCCTGATAGCATTATCACTGATATCTAGAACATTATGTGACCTATGGCTTATTCAACATACAACACTTGTAGAAGGATCAATTATAACAATGAATTTAAACGAATTCAGAAGACTTCTCACCCAGCTGTTCATATCAATGCCGTTGTGGAGCATAGGTGAAGTGAAGCTGAGACTGCGTACGAATTTATCATTACATCTTTATCAACAATACCTGAAAGGCTTCACATACTACCAGGTGACGAATCTAGATAACCGGATATCAAACGCCGACCAGCTGCTTACTACGGACATCGACAAGTTCTGTGACACGGTCATCGACTTATACAGCAATATCAGCAAACCGATGCTCGATATATCGATTTACTTGTATCGGCTGACTGTAAATTTGGGACCGTCGACACCAGGTATCATGATGGCGTACCTGTTAGTCTCTGGCATATTTTTGACGTATCTTCGTAAACCAACAGCCAAAATGACGGTCCAGGAACAAAAACTCGAAGGTGAATTCAGATATGTTAATTCTAGGCTGATCACTAACTCCGAGGAGATCGCGTTCTATCAAGGCAACCATCGCGAGCAGTTGACAATACTAGCCAGTTTCTACAAGCTGACCAGACATTTGCGCAACTTCCTCAATTTCCGCGTCATGATGGGCTTCATTGACAACATCGTTGCCAAATACATCGCGATTACCGTCGGTTTCTACGCCGTGTCGAGGCCATTCTTCGTCAAAGATCACAACTTGTTGACAACAGGCACAGAACAGGACAGATTCCAGCACTACTATACTTATGGCCGTATGTTAGTCAAAATGGCGGAAGGTATCGGTAGATTAGTACTGTCCGGCAGGGAGCTTAGTAAATTGGCTGGTCTTACTGCTAGAGTTACTCAACTCAGACATGTATTGGAGGATATTAATAAAGGAAACTATAAGAGAACTATGGTGGAAAGACAAGCAAATGGCAATGGTCCCCCCATGCTTCTATCTCCTGGAGCAGGGAAGATCATATATCAGGATAAGATCATACGCTTTGATAAAGTACCGCTGGTAACGCCAAACGGTGATGTTCTCATCAAGGAATTAACCTTTGAAGTCAAATCTGGTATCAATGTATTAGTATGCGGACCTAACGGCTGCGGTAAGTCTTCTATGTTCCGTATGCTGGGTGAGCTTTGGCCTATCTTCGGAGGAACCCTGACCAAACCGCCGAAAGGGAAACTGTTCTATGTACCGCAGAGACCTTATATGACACTAGGAACGTTCAGAGATCAGGTGATATACCCCCAGATCCAACAGGAGATGATCCGTCGAGGCCGTACTGATGAGGAACTGCTCAAGTTCCTGGATATAGTCCAGCTGTCCTACCTGGTGACCAGGGACGGCGGCTGGGACGCAGTCGAAGACTGGATGGATGTACTGTCCGGGGGGGAGAAGCAAAGGATAGCGATGGCGCGACTTTTCTACCACGCGCCTCAGTTCGCGATCCTAGACGAGTGTACCAGCGCTGTGTCCGTCGACGTTGAGGGACAAATGTATAGATACTGTAGAGAGATGGGTATATCTCTGTTCACGGTGTCACACAGGAAGTCGCTCTGGCAGCATCACGATCACTTCCTGCAGATGGACGGACGAGGCGGATACGTGTTCGGGGAAATTGACAACGACACGCAGGAGTTCGGATCGTGA

Protein sequence:

>DPOGS210049-PA
MAPNFSKITSRTDVKLGLAASAALSAWILRNFLKSKKNVKQGQLSPAETVQYMIKDKDRKGPKAQVDAQFFAELKSLWKIMVPGLWTKESGFMALIALSLISRTLCDLWLIQHTTLVEGSIITMNLNEFRRLLTQLFISMPLWSIGEVKLRLRTNLSLHLYQQYLKGFTYYQVTNLDNRISNADQLLTTDIDKFCDTVIDLYSNISKPMLDISIYLYRLTVNLGPSTPGIMMAYLLVSGIFLTYLRKPTAKMTVQEQKLEGEFRYVNSRLITNSEEIAFYQGNHREQLTILASFYKLTRHLRNFLNFRVMMGFIDNIVAKYIAITVGFYAVSRPFFVKDHNLLTTGTEQDRFQHYYTYGRMLVKMAEGIGRLVLSGRELSKLAGLTARVTQLRHVLEDINKGNYKRTMVERQANGNGPPMLLSPGAGKIIYQDKIIRFDKVPLVTPNGDVLIKELTFEVKSGINVLVCGPNGCGKSSMFRMLGELWPIFGGTLTKPPKGKLFYVPQRPYMTLGTFRDQVIYPQIQQEMIRRGRTDEELLKFLDIVQLSYLVTRDGGWDAVEDWMDVLSGGEKQRIAMARLFYHAPQFAILDECTSAVSVDVEGQMYRYCREMGISLFTVSHRKSLWQHHDHFLQMDGRGGYVFGEIDNDTQEFGS-