Monarch geneset OGS2.0

DPOGS212301
TranscriptDPOGS212301-TA2787 bp
ProteinDPOGS212301-PA928 aa
Genomic positionDPSCF300491 + 17240-29058
RNAseq coverage598x (Rank: top 21%)
Annotation
HeliconiusHMEL0175870.085.26% 
BombyxBGIBMGA005473-TA0.079.49% 
DrosophilaHmt-1-PA7e-17665.49% 
EBI UniRef50UniRef50_F4W5V62e-16761.46%ATP-binding cassette sub-family B member 6, mitochondrial n=1 Tax=Acromyrmex echinatior RepID=F4W5V6_ACREC
NCBI RefSeqXP_307900.40.071.82%AGAP002278-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3800211860.058.04%PREDICTED: LOW QUALITY PROTEIN: ATP-binding cassette sub-family B member 6, mitochondrial-like [Apis florea]
NCBI nr blastxgi|3800211860.058.04%PREDICTED: LOW QUALITY PROTEIN: ATP-binding cassette sub-family B member 6, mitochondrial-like [Apis florea]
Group
Gene OntologyGO:00068103.6e-41transport
GO:00550853.6e-41transmembrane transport
GO:00055243.6e-41ATP binding
GO:00426263.6e-41ATPase activity, coupled to transmembrane movement of substances
GO:00160213.6e-41integral to membrane
GO:00168871.4e-19ATPase activity
GO:00001664.1e-18nucleotide binding
GO:00171114.1e-18nucleoside-triphosphatase activity
KEGG pathwaydme:Dmel_CG42256e-174 
 K05663 (ABC.ATM)maps-> ABC transporters
InterPro domain[200-468] IPR0115273.6e-41ABC transporter, transmembrane domain, type 1
[243-465] IPR0011409.8e-24ABC transporter, transmembrane domain
[685-810] IPR0034391.4e-19ABC transporter-like
[670-856] IPR0035934.1e-18ATPase, AAA+ type, core
Orthology groupMCL13549 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212301-TA
ATGGAGTACTGTCCGCCGAATATAACTTTAGAAGAGATCTGGATAGATCATGGAATTTCACAATGTTTTATGGAGACTGTAACTGCCACTCTAATTGGAGGATTTCTTCTGGTTTTCGGTATAACACAAATTGTCATGTATAAAAGATATGCTACAGAAATAACCGATGTCAGATCTTCAAAGCTGTTTGTAGTACAATTGTTCTTCACACTTTTTGTGCCAGTTTTGGCTATAATTCGTTTTCTGTTACAAGCTTTTGTATTTAAAGGAGGACATATCTATGGATACATGATAACATTACTTGTGATATTCCCACTGTCGGCGTATTTAGCTGTGATTGAAAGACGGTTCCTTTTACCGTCAGTGTTACCGAGGGGCCATGGTTTTGTATTGCTTGTATTTTGGGCAATGATATTTATATCAGAGAATTTATCATTTTTAAATTTGAATAAGGATGGATGGTGGTGGCATTTGAAAGATCTCCAAGATCGCCTCGAAATGGCCCTGTTTGTTGGTCGGTATGTCTCATGTATGTTAATGTTCATCCTCGGGATGAAAGCACCAGGTATTATGCATCCATTTGAATATCTCGACTCTGATGACGATAATCGTAGAAACATACCACCTAGGAATGACAGCGGTTCAACATTCCGAAATGTATTCGGCAAAATGCGTACCCTGCTGCCGTTCATGTGGCCCAGCAAGAGCGTTTGTCTGCAGATATACGTGTTCATATGTGTGCTAGCTCTGCTCGCTGGAAGGGTCATCAACCTTTACGTACCTATATATAACAAGAAAATAGTTGACAGTCTTTCAATACCGCCGCTTCACTTCCGATGGGATCTGGTGGTTTTGTACGTTTTCTTCAAGTTTCTCCAAGGAGGCGGCACTGGCGGTATGGGACTCTTGAACAACCTGAGATCCTTTCTCTGGATACGAGTCCAACAGTATACGACGAGAGAGCTACAGTTGAAGTTGTTTCGGCACCTACACGATTTACCTCTCAAGTGGCATTTATCGCGGAAGACGGGCGAGGTGTTGAGGGTTATGGACAGAGGCACGGACTCCATAGACAATCTCCTGTCCTACATACTATTTTCCATAACACCCACCATCATAGACATCTTAGTCGCCGTGGTGTACTTCGTGACAGCCTTCAACGCGTGGTTCGGACTCATCGTCTTCGCCACCATGGTTCTGTATATAATCGCAACAATAGCTGTAACAGAATGGCGTACGAAGTTCCAGCGTAGAATGAATCAAGCTGACAACGAGCAGAAAGCACGCTCAGTGGATTCGCTTCTCAATTTCGAAACAGTCAAATATTATGGCGCTGAGACTTATGAGGTTTACTCATACAAGGACGCCATCTTGAATTACCAGGTACACCAGATCTTACTATATAATTCAGAATTTCAAATTCAGAAACGTTTTCTACTCGCAACAATAGCTGTAACAGAATGGCGTACGAAGTTCCAGCGTAGAATGAATCAAGCTGACAACGAGCAGAAGGCACGCTCAGTGGATTCGCTTCTCAATTTCGAAACAGTCAAATATTATGGCGCTGAGACTTATGAGGTTTACTCATACAAGGACGCCATCTTGAATTACCAGAAAGAAGAATTCAAGTCTCTGTTAACACTGAGCATGTTGAATACTATGCAGAACATCATTATATGTGTGGGTCTACTGACGGGTTCTCTCCTGGCGATATCCATGGTGGTTAGAACCTACCAGCTAACCGTCGGTGACTATGTACTGTTTGCGTCATATATTGTCCAACTATACGTGCCTTTGAACTGGTTTGGGACCTATTACAGGGCTATCCAAAAGAACTTTGTTGATATGGAGAATATGTTCGATCTTCTCCGCGTGGACTCTGACGTGAAGGACGTGCCGGGCGCACCGGATTTACTCATCAGGAGGGGGGGCATCGAGTTCAAGCACGTGTCGTTCGGCTACGGACCGGAGAGATTGGTCTTGAATGATATCAGTTTCAAAGTGGCACCGGGATCCACCGTCGCCTTGGTTGGTCCAAGCGGAGCCGGTAAGTCGACCGTGATGCGTCTCCTGTTCCGTTTCTACGACGTTAATGGCGGCGCTGTCCTTGTCGACGGACAGGACGTGGCGACCGTGACTCAGGCCTCCCTGAGGGCCGCCATTGGTGTCGTGCCACAAGATACCGTGCTCTTTAACAACACTGTCAGATACAACATACAGTACGGCCGTCTGACAGCATCTTCCTCGGACATCATCGCGGCGGCGAAGAATGCGGACATCCACGACAGAATACTCACTTTCCCCGACGCTTACGACACTCAGGTAGGAGAGAGGGGTCTCCGTTTGAGCGGCGGAGAGAAGCAAAGGATAGCCATCGCTAGAACACTACTGAAGGACCCCGCTATAGTACTGTTGGACGAGGCGACCTCCGCGCTAGACACTAACACCGAAAGAAACATACAATCCGCCTTAGCCCGGGTATGCGCCAACAGAACGACGTTGATAATAGCCCATAGACTATCCACTATAATACACGCGGACGAAATTCTTGTACTTAAAGACGGGGAGATTGTCGAAAGGGGAAACCACGAGGCATTATTAGCATTGGAGGGTGTATACGCTTCGATGTGGCACCAACAGCTCGAGAGTAGAAATAGCAATGGCAACGGTAATGGGAACAACGAAACTAACGCTGAAGGAAACAACAACAACAACAACAGACCGAGCCAACAACAGAACGGCTCCAGCGTGTTTCCACAAGGCCACGGGCATGGCCATGGCCATTAA

Protein sequence:

>DPOGS212301-PA
MEYCPPNITLEEIWIDHGISQCFMETVTATLIGGFLLVFGITQIVMYKRYATEITDVRSSKLFVVQLFFTLFVPVLAIIRFLLQAFVFKGGHIYGYMITLLVIFPLSAYLAVIERRFLLPSVLPRGHGFVLLVFWAMIFISENLSFLNLNKDGWWWHLKDLQDRLEMALFVGRYVSCMLMFILGMKAPGIMHPFEYLDSDDDNRRNIPPRNDSGSTFRNVFGKMRTLLPFMWPSKSVCLQIYVFICVLALLAGRVINLYVPIYNKKIVDSLSIPPLHFRWDLVVLYVFFKFLQGGGTGGMGLLNNLRSFLWIRVQQYTTRELQLKLFRHLHDLPLKWHLSRKTGEVLRVMDRGTDSIDNLLSYILFSITPTIIDILVAVVYFVTAFNAWFGLIVFATMVLYIIATIAVTEWRTKFQRRMNQADNEQKARSVDSLLNFETVKYYGAETYEVYSYKDAILNYQVHQILLYNSEFQIQKRFLLATIAVTEWRTKFQRRMNQADNEQKARSVDSLLNFETVKYYGAETYEVYSYKDAILNYQKEEFKSLLTLSMLNTMQNIIICVGLLTGSLLAISMVVRTYQLTVGDYVLFASYIVQLYVPLNWFGTYYRAIQKNFVDMENMFDLLRVDSDVKDVPGAPDLLIRRGGIEFKHVSFGYGPERLVLNDISFKVAPGSTVALVGPSGAGKSTVMRLLFRFYDVNGGAVLVDGQDVATVTQASLRAAIGVVPQDTVLFNNTVRYNIQYGRLTASSSDIIAAAKNADIHDRILTFPDAYDTQVGERGLRLSGGEKQRIAIARTLLKDPAIVLLDEATSALDTNTERNIQSALARVCANRTTLIIAHRLSTIIHADEILVLKDGEIVERGNHEALLALEGVYASMWHQQLESRNSNGNGNGNNETNAEGNNNNNNRPSQQQNGSSVFPQGHGHGHGH-