Monarch geneset OGS2.0

DPOGS207746
TranscriptDPOGS207746-TA2646 bp
ProteinDPOGS207746-PA881 aa
Genomic positionDPSCF300042 - 689420-695910
RNAseq coverage274x (Rank: top 39%)
Annotation
HeliconiusHMEL0119730.072.66% 
BombyxBGIBMGA007494-TA0.070.36% 
DrosophilaMdr50-PA0.050.70% 
EBI UniRef50UniRef50_E9LP500.071.90%ATP-binding cassette sub-family B member 1 n=4 Tax=Obtectomera RepID=E9LP50_TRINI
NCBI RefSeqXP_002432260.10.052.38%multidrug resistance protein, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3198947620.071.90%ATP-binding cassette sub-family B member 1 [Trichoplusia ni]
NCBI nr blastxgi|3198947620.072.11%ATP-binding cassette sub-family B member 1 [Trichoplusia ni]
Group
Gene OntologyGO:00068101.3e-49transport
GO:00550851.3e-49transmembrane transport
GO:00055241.3e-49ATP binding
GO:00426261.3e-49ATPase activity, coupled to transmembrane movement of substances
GO:00160211.3e-49integral to membrane
GO:00168877.7e-26ATPase activity
GO:00001663.2e-19nucleotide binding
GO:00171113.2e-19nucleoside-triphosphatase activity
KEGG pathwayphu:Phum_PHUM5788200.0 
 K05658 (ABCB1)maps-> ABC transporters
InterPro domain[314-629] IPR0115271.3e-49ABC transporter, transmembrane domain, type 1
[353-588] IPR0011401.5e-36ABC transporter, transmembrane domain
[95-220] IPR0034397.7e-26ABC transporter-like
[80-266] IPR0035933.2e-19ATPase, AAA+ type, core
Orthology groupMCL10075 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207746-TA
ATGGGTTCAGCCAACTTTGGCATATCCTCCACCCTGATGGATGTATTCGGAGTCGCCCGTGGGGCCGGTGCACAAATATTCCATCTTATCGATAATGTCCCCCTCATAAACCCTCTGCTCAATCGTGGCATTGTACCGAATAGTGTTGAGGGAAAAATTGAATTGAAAAACGTGGTCTTCCACTATCCTTCTAGACCAGACGTTCCAGTTCTGAAAGGAGTTAATCTAAGTGTACAAAAAGGACAATCAGTTGCCCTCGTCGGCCATTCCGGCTGCGGTAAATCCACTATCATACAACTGTTATCAAGATATTATGACGTCATAGACGGCAGCGTACAAATTGATGGAAATGACGTAAGACAGTTGTCTGTACGATGGTTGAGAGCTCAGATCGGTTTAGTTGGTCAGGAGCCGGTCCTCTTTAATACAACAGTCCGGGAGAACATCAGGTATGGCCGAGAGGACGCTACTAATGAGGAAATAGAAAAAGTCGCGAAGCAAGCTAATGCTCATGAGTTCATTATGAAACTTCCTCAGGGTTATGACACAGTAGTTGGAGAGCGAGGTGCATCGATATCGGGAGGTCAAAAACAAAGAATTGCCATAGCTCGAGCCCTTGTACGAAACCCTAAAATATTATTGTTGGATGAAGCCACCAGCGCGTTAGATACTACCTCAGAGGCCAAAGTGCAAAAAGCTTTAGATAAAGCCCAAGAAGGTCGAACAACGATTATTGTCGCGCATAGACTGTCAACCATAAGGAACGTCGACAAAATATATGTTTTTAAAAAAGGAGATGTGGTAGAAAGCGGAGGTCATGACGAGCTTATGGACAAAAAAGGCTATTTCTATGATATGGTGATGCTTCAAAGGTCACCCAATCAATCAAATGAGAAAGATATGAAGAACAAATTCGAACGCAGCGAGTCCATCATGAGTGAAAAAGAAGAAGAGGAACTTGTGGAAACGAGAATCCAAAACGTCGAAGAGTCCAGTGCAGACACCGAAGTATCCTTCTTACGAGTTCTAAAACTGAACTCACCGGAGTGGAAGTCCATCACTGTGGCCAGCGTATGTGCCATCCTCAGCGGTTTCGCGATGCCGCTTTTAGCTATTGTCATGGGAGACTTTATGGGCGTGTTCATGTACAGTATAGCTGGAGAACATTTAACGTGCAGGTTGCGAAAATTACTCTTCCAACATTTACTGCAACAGGAAATCGGATTTTTTGATGATAAAAATAATTCAACTGGAGCTCTTTGTGCTCGAATATCAGGAGATGCTGCGTCAGTGCAAGGGGCTACAGGTCAAAGAATAGGAACAGTTTTACAAGCTTTCGGAACTCTTTGTTTCGCGTTGTCTCGCCTGTACTATGAATGGCGGCTGGGTTTGGTTGCTCTAGCGTTCGTGCCTATTATGGCTGCAATAGTTTATAAACAAGGGAGAATGGTAAATACGGAATCTTTTGGAACAGCGAAAACAATGGAAAAGAGTTCTAAGCTCGCAGTAGAAGCGGTAGCTAATATCCGCACCGTGGCATCATTAGGTCGCGAACCAATCATATTAAGCGACTACGCAATCCAGCTTCTGCCCGCACTTGAACTTGCCAAAAAATCATCGCATTGGAGAGGACTTGTTTTTGGATTATCTAGAGGGCTTTTCAACTTGGTGTACTCCGTGACTATGTTTTATGGTGGTCAACTGATAGTGTACCAGGGAATCGAATATAACACAGTACTTAAATCAGCTCAAACTTTATTAATGGGTTCGTCATCAGCAGCCCAAGCGCTTGCATTCGCACCTAACTTCCAAACCGGAATAAAAGCCGCGGGTCGTATTATCGTGACATTAGCAAGAAAATCAAAAATCATGGACCCCGAGAAACCTGCCATCGAAAACTTTAAAGGAACAGGTGAAGCAACGTTAACAGATGTAACATTTACTTATCCGACTAGGCCGCTTATACAAGTATTGAAGGATTGTAACTTGGAAATTCTGAACGGGAAAACAGTAGCTCTGGTCGGCGGGAGTGGATGCGGCAAGAGTACTATCATACAGTTATTAGAGAGATACTACGATCCCGACGAGGGCGTTGTGGCTCAGAATGGAACTCCCCTACCAAATCTCCGTTTGGCTGACTTAAGGCAGTCCATCGGCTTCGTGCAACAAGAACCTATATTATTTAACGGCACCATTAAAGAAAATATCGCTTATGGTGACAATTCCCGAACACACAGTACGAATGATGTTATTGAAGTCGCTAAGCAAGCTAACATACACAACTTTGTCGTATCTTTGCCTATGGGTTATGATACCAATATAGGTTCAAAGGGTACACAACTTTCTGGAGGTCAGAAACAAAGGATAGCCATAGCGAGAGCTTTAATAAGACGTCCAAAAATGTTGTTACTAGACGAAGCCACGAGTGCCTTGGACACGGAAAGCGAAAAAGTGGTTCAGGAAGCCCTGGACCAAGCTAAAGCGGGTCGTACGTGTGTCATGATCGCTCACCGACTGAGTACAGTGCGTGACGCTGACGTCATATGTGTCCTCAACAATGGAAGTGTCGCAGAGCGAGGAACACACGCAGAGCTGTTAGAACTCAAAGGACTTTATTACAATCTGTATAAACGCGGATACTCGTGA

Protein sequence:

>DPOGS207746-PA
MGSANFGISSTLMDVFGVARGAGAQIFHLIDNVPLINPLLNRGIVPNSVEGKIELKNVVFHYPSRPDVPVLKGVNLSVQKGQSVALVGHSGCGKSTIIQLLSRYYDVIDGSVQIDGNDVRQLSVRWLRAQIGLVGQEPVLFNTTVRENIRYGREDATNEEIEKVAKQANAHEFIMKLPQGYDTVVGERGASISGGQKQRIAIARALVRNPKILLLDEATSALDTTSEAKVQKALDKAQEGRTTIIVAHRLSTIRNVDKIYVFKKGDVVESGGHDELMDKKGYFYDMVMLQRSPNQSNEKDMKNKFERSESIMSEKEEEELVETRIQNVEESSADTEVSFLRVLKLNSPEWKSITVASVCAILSGFAMPLLAIVMGDFMGVFMYSIAGEHLTCRLRKLLFQHLLQQEIGFFDDKNNSTGALCARISGDAASVQGATGQRIGTVLQAFGTLCFALSRLYYEWRLGLVALAFVPIMAAIVYKQGRMVNTESFGTAKTMEKSSKLAVEAVANIRTVASLGREPIILSDYAIQLLPALELAKKSSHWRGLVFGLSRGLFNLVYSVTMFYGGQLIVYQGIEYNTVLKSAQTLLMGSSSAAQALAFAPNFQTGIKAAGRIIVTLARKSKIMDPEKPAIENFKGTGEATLTDVTFTYPTRPLIQVLKDCNLEILNGKTVALVGGSGCGKSTIIQLLERYYDPDEGVVAQNGTPLPNLRLADLRQSIGFVQQEPILFNGTIKENIAYGDNSRTHSTNDVIEVAKQANIHNFVVSLPMGYDTNIGSKGTQLSGGQKQRIAIARALIRRPKMLLLDEATSALDTESEKVVQEALDQAKAGRTCVMIAHRLSTVRDADVICVLNNGSVAERGTHAELLELKGLYYNLYKRGYS-