Monarch geneset OGS2.0

DPOGS214416
TranscriptDPOGS214416-TA4149 bp
ProteinDPOGS214416-PA1382 aa
Genomic positionDPSCF300069 + 213434-231589
RNAseq coverage494x (Rank: top 25%)
Annotation
HeliconiusHMEL0119730.035.32% 
BombyxBGIBMGA011228-TA0.054.64% 
DrosophilaMdr49-PA0.038.31% 
EBI UniRef50UniRef50_E9LP530.041.19%ATP-binding cassette sub-family B member 3 n=2 Tax=Trichoplusia ni RepID=E9LP53_TRINI
NCBI RefSeqXP_001654492.10.040.79%ATP-binding cassette transporter [Aedes aegypti]
NCBI nr blastpgi|3320245850.042.00%Multidrug resistance protein-like protein 49 [Acromyrmex echinatior]
NCBI nr blastxgi|3123735380.041.25%hypothetical protein AND_17301 [Anopheles darlingi]
Group
Gene OntologyGO:00068103.6e-41transport
GO:00550853.6e-41transmembrane transport
GO:00055243.6e-41ATP binding
GO:00426263.6e-41ATPase activity, coupled to transmembrane movement of substances
GO:00160213.6e-41integral to membrane
GO:00168871.2e-20ATPase activity
GO:00001663.2e-19nucleotide binding
GO:00171113.2e-19nucleoside-triphosphatase activity
KEGG pathwayaag:AaeL_AAEL0103790.0 
 K05658 (ABCB1)maps-> ABC transporters
InterPro domain[173-620] IPR0115273.6e-41ABC transporter, transmembrane domain, type 1
[167-404] IPR0011402.5e-40ABC transporter, transmembrane domain
[497-622] IPR0034391.2e-20ABC transporter-like
[1164-1353] IPR0035933.2e-19ATPase, AAA+ type, core
Orthology groupMCL10075 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214416-TA
ATGATAATGAAAGCGATAATCTGGCGCTTTTCGACATTGATTGAGAGGTCAGCTACAATCCTCGGCGCCGTCTTTGGCTTCATCTGCTCGATATGTCCCGTGGCTGCGGTCATTGTGTATGCTGAGATCACAGCTCTGATGATCAAGAGGCACAATGAGAAGAGTGTGGAAGGAAATACTATTGTACTCAATGTGTTCGGCGGGGGGAAAGACGTAGGAGAGAGCAACCGCACCTCCCATATGGACGCGCTAGTGGACGACTCCGTGTCCTACTTGATAGTCAGTGTCATCATCATGGCGTTGCAAATCGTCACAGCAGCTACAGCCCTCACTCTAACAAACTGGGCAGCTGGAAGGATGGCTTCTAGGTTACGGTTCAATCTGCTACGGTCTGTGTTGAGTCAAGAAATCGCATTCTTCGACACAAACGCTACAATGAATTTTGCAACTACTTTAACAGAAGGTGAGAGCAACCGCACCTCCCATATGGACGCGCTAGTGGATGACTCCGTGTCCTACTTGATAGTCAGTGTGATCATCATGGCACTGCAAATCGTCACAGCAGCTACAGCCCTCACTCTAACAAACTGGGCAGCTGGAAGGATGGCTTCTAGATTACGGTTCAATCTTCTACGCTCTGTGTTGAGTCAAGAAATCGCATTCTTCGACACAAACGCTACAATGAATTTTGCAACCACTTTAACAGATGACGTAGAAAAGCTGAAGAATGGGGTCGGCCATCACGTGGCTATTACGACGTACTTGGCCAGCAGCGTGGTGGTATCCTCGGTGGTCGCGTTGTTGTACGGATGGCAACTCACACTGGCGGGGATGGCTGTCATTCCAGTGGCTTTAATAGCAGTCAGCACTGTAACCAAGTATCAAACCCGTTGTTCTTCAGACGAAGTGTCGTCTCTCGGAACAGCTGGTAGGATAGTGGAGCAGGCTTTATCAGCTATTAGGACGGTACGAGCTTACTCCGGCGAGCACGTGGAAGTCGACAAATACTCCAAAGCCCTGGCTCCAGCCCGAAAGGCTGCGAGTCGCCGCAGTGTTTGGGCGGGCCTGGGCGCTGGTGTTGGCTGGTTCCTCACTTTCTTAATGAACGGAGTGATCCTTGTGTACGGAACCGCTCTCTGTGTGAGAGATGAGGAAGAAGGACATTATCATCCAGGGATCATGATAACTATCATGTTCTGTACGTACACGGCATCCCAGCACATAACGCTCTGCAACCCTCACATAGAAGTGATATCACAAGCAAAAGGAGCTGCGAAAAGTCTGTTCAAGATTCTGGAGAGGAAGTCAAAAATCAACGCTCTGGAAGACATCGGAACAAAACCTGAAGGCTTCAAAGGGAATATTGTGTTTGATAACCTGTACTTCAACTATCCCTCGAGACCTGATGTGAAGGTGTTGCGAGGTCTGTCTCTCACAGTAAATGCAGGTGAAACAGTAGCTCTGGTTGGCGGATCCGGCTGCGGAAAGTCAACCCTGCTGCAACTGCTGCAACGAGCGTACGAACCAGACAGCGGACAAATTTTCTTGGATGGACACAAACTGGACAGTTTGCTCTTGCACCACTATAGAAGAAGTATTGGTGTTGTCGGCCAGGAGCCAGTACTCTTCAGCGGCACTATCCGATACAACATAACCATTGGACTTGAAAATGTGTCAGAATCTGACATGATCAAAGCTGCTCAGATTGCGCACGCGCATCAATTTATCACAAAACTTAGTAACGGCTACGATACGGTTCTGGGCGAGTGTGGAGCCCTATTGTCGGGAGGACAAAAACAAAGGATCGCGATAGCTCGAGCTCTGGTTAGGAATCCAGCTGTGCTGCTCCTTGATGAACCTACATCAGCTCTGGACCCAGCCTCGGAAAGACAGGTCCAAGCGGCTTTGGATTCCGCCAGTGAAGGCAGGACCACGCTAGTGCTGCGAGGTCTGTCTCTGACAGTAAATGCAGGTGAAACAGTAGCTCTGGTTGGCGGATCCGGCTGCGGAAAGTCAACCCTGCTGCAACTGCTGCAACGAGCGTACGAACCAGACAGCGGACAAATTTTCTTGGATGGACACAAACTGGACAGTTTGCTCTTGCACCACTATAGAAGAAGTATTGTACCAAAAGAGTGTATGCTGTCGACTATAGTGAAAGCATCCCGAATAGTTTACGTGGAACAGGGGGCTGTTTTGGAACAAGGAACTCACGAAGAGCTGGTGGAGAAGAAGGGCGCCTATTGGAAACTGTTGCAGGAAGACATGACGCACAGAAGTTTTGCCAATTTGACAGCGGAAGATGTGGACGACGAGGTGGTGGAAGAAACGACTAACAAAAGAAGTAAAGTCAGGAGAAACAGCAGTATGTGTTCCATTAAAACAGTGAATTCTCTCCGTGATAGCATCATTGGTGGTAATCGACGTTTAGGTTCCATGGCCATACCGGAGTTACCAGATGTGTTTTGCGAGGAAGACGAGGAGCCTGTGGAGCCGGTGTCCGTTTGGAAGCTGATATCCTTCAACAAAGAGGAGTTACCGCAGCTTGCTGGTGGAATTTGCGCGTCATTAATCATCTGCTGTAGTTTTCCAGCTTTCGCGCTTCTTCTGTCCAAAATGTTCGGGATCTTCGCTGATTCTGATTCCGAGACGATTCTCCAGCAATCACAAATCTACGCGGCTATGTTCACGTGCTGTGCTATCCTAAGTGGAGACCAAGGATGGTTTGACATTCCAAAAAATTCAGTGGGCTCGCTGTGTGCACGACTGGCCACAGACTGCGCTGCTGTTCAGAGAGGGACTGGTACTGGTTTGGGGGTGATGCTGCAAGGATTAGGGACCATGATCCTAGCTGTTGTGATCGCTATGGCCTACTCCTGGAAGATAACACTCGTCAGTCTTCTGTGCATGCCCTGTGTGACAGTCAGTATGTATCTCGAGAGCTGGGCGTCCAGGAAGTGCGAGGAGAGGAACAGGGACGACCTCGAGGATGCGTCAAAAGTGGCTACGGAGGCTGTGCTCAATATACGAACAGTACACAGCCTGGGTGTTGAGCGCACGTTCCTGTCGCGTTTCTCTGAACAGTTGCAACGCTCCTCCAAGTTGTCTCTTTACATGCGTGGTCCAGTCTACGGTATTTGTTTAGCCATGCCGATGGTTGGATACGCTCTCTCACTAGCTTACGGAGGATACCTCGTGGCCAGCGAGGGCTTGCCGTATGAGTACGCTATATTGGTATCGGAGGAGCTCATTTTCGGTTCTTGGATGTTTGCGGAGGCTTTATCCTACGTGCCAACGGTGCTCGCTGGTAAACGTGCGTGTGCTAAAATCATCAGCGCCTTGGAGAGAAAACCTAGAGTTATAACGGAACCCACCGCCGTAGACGACGATTGGACTTCCAGCGGTAATCTGAAGTTCTCCAACATACACTTCCACTATCCAACTCGTCCGGAGACGACCGTCTTGAGGGGTCTGTCCCTGGACTTGCCCGTTGGTCGCACGCTGGCGCTCGTTGGACCCTCGGGCTGCGGGAAGTCTACCATCATGCATCTCCTCTTGAGGAGTTACGACCCTGTCAGCGGTAATGTCACGCTAGATGGCAGAGACATCAAAACGTCTCTGACGTTGAAGAAGCTGCGGAGTCAGCTGGGCCTGGTGCAGCAGGAGCCTGTGATGTTCGAGAGGAGCATCAGGGAGAATATAGCATACGGTGACAACACCAGGGAAGTACCGCTGCAAGAGATAGTCACGGCAGCACAGATGGCCAACGTTCATACCTTCATAGCCGGCTTACCGTCGGGTTACGAAACGGTTCTGGAAGCGGGTAGTGCTGCACTGTCTGGTGGTCAGAAGCAGCGTGTAGCAATCGCCAGGGCTCTCATCCGGAACCCCCGAGTATTACTGCTGGATGAGGCGACTTCCGCTTTGGATGCTGCCAGCGAGAAGGTGGTCCAAGCTGCTTTAGAAGTGGCGTCCAAAGATAGGACTACCATTATCATAGCTCACAGACTCGCCACAATCAGACACGCAGACCTTATATGTGTATTGGACAAAGGTGTGATAGCTGAGAGTGGTACACATGAAGAACTAGTTCGCAAACGTGGTTTGTATTGGGAGTTATTACAACAACAGGGACCGAATGGGGCGTGA

Protein sequence:

>DPOGS214416-PA
MIMKAIIWRFSTLIERSATILGAVFGFICSICPVAAVIVYAEITALMIKRHNEKSVEGNTIVLNVFGGGKDVGESNRTSHMDALVDDSVSYLIVSVIIMALQIVTAATALTLTNWAAGRMASRLRFNLLRSVLSQEIAFFDTNATMNFATTLTEGESNRTSHMDALVDDSVSYLIVSVIIMALQIVTAATALTLTNWAAGRMASRLRFNLLRSVLSQEIAFFDTNATMNFATTLTDDVEKLKNGVGHHVAITTYLASSVVVSSVVALLYGWQLTLAGMAVIPVALIAVSTVTKYQTRCSSDEVSSLGTAGRIVEQALSAIRTVRAYSGEHVEVDKYSKALAPARKAASRRSVWAGLGAGVGWFLTFLMNGVILVYGTALCVRDEEEGHYHPGIMITIMFCTYTASQHITLCNPHIEVISQAKGAAKSLFKILERKSKINALEDIGTKPEGFKGNIVFDNLYFNYPSRPDVKVLRGLSLTVNAGETVALVGGSGCGKSTLLQLLQRAYEPDSGQIFLDGHKLDSLLLHHYRRSIGVVGQEPVLFSGTIRYNITIGLENVSESDMIKAAQIAHAHQFITKLSNGYDTVLGECGALLSGGQKQRIAIARALVRNPAVLLLDEPTSALDPASERQVQAALDSASEGRTTLVLRGLSLTVNAGETVALVGGSGCGKSTLLQLLQRAYEPDSGQIFLDGHKLDSLLLHHYRRSIVPKECMLSTIVKASRIVYVEQGAVLEQGTHEELVEKKGAYWKLLQEDMTHRSFANLTAEDVDDEVVEETTNKRSKVRRNSSMCSIKTVNSLRDSIIGGNRRLGSMAIPELPDVFCEEDEEPVEPVSVWKLISFNKEELPQLAGGICASLIICCSFPAFALLLSKMFGIFADSDSETILQQSQIYAAMFTCCAILSGDQGWFDIPKNSVGSLCARLATDCAAVQRGTGTGLGVMLQGLGTMILAVVIAMAYSWKITLVSLLCMPCVTVSMYLESWASRKCEERNRDDLEDASKVATEAVLNIRTVHSLGVERTFLSRFSEQLQRSSKLSLYMRGPVYGICLAMPMVGYALSLAYGGYLVASEGLPYEYAILVSEELIFGSWMFAEALSYVPTVLAGKRACAKIISALERKPRVITEPTAVDDDWTSSGNLKFSNIHFHYPTRPETTVLRGLSLDLPVGRTLALVGPSGCGKSTIMHLLLRSYDPVSGNVTLDGRDIKTSLTLKKLRSQLGLVQQEPVMFERSIRENIAYGDNTREVPLQEIVTAAQMANVHTFIAGLPSGYETVLEAGSAALSGGQKQRVAIARALIRNPRVLLLDEATSALDAASEKVVQAALEVASKDRTTIIIAHRLATIRHADLICVLDKGVIAESGTHEELVRKRGLYWELLQQQGPNGA-