Monarch geneset OGS2.0

DPOGS207747
Transcript: DPOGS207747-TA; 2739 bp
Protein: DPOGS207747-PA; 912 aa
Genomic position: DPSCF300042 - 661830-675361
RNAseq coverage: 573x (Rank: top 22%)
Annotation
Heliconius: HMEL011973; 0.0; 69.29%
Bombyx: BGIBMGA007494-TA; 0.0; 68.55%
Drosophila: Mdr50-PA; 9e-165; 45.84%
EBI UniRef50: UniRef50_E9LP50; 0.0; 66.77%; ATP-binding cassette sub-family B member 1 n=4 Tax=Obtectomera RepID=E9LP50_TRINI
NCBI RefSeq: XP_001810982.1; 3e-180; 48.55%; PREDICTED: similar to Multi drug resistance 50 CG8523-PA [Tribolium castaneum]
NCBI nr blastp: gi|319894762; 0.0; 66.77%; ATP-binding cassette sub-family B member 1 [Trichoplusia ni]
NCBI nr blastx: gi|319894762; 0.0; 66.35%; ATP-binding cassette sub-family B member 1 [Trichoplusia ni]
Group
Gene Ontology:
  GO:0006810; 4.7e-57; transport
  GO:0055085; 4.7e-57; transmembrane transport
  GO:0005524; 4.7e-57; ATP binding
  GO:0042626; 4.7e-57; ATPase activity, coupled to transmembrane movement of substances
  GO:0016021; 4.7e-57; integral to membrane
  GO:0016887; 6.3e-27; ATPase activity
  GO:0000166; 9.2e-19; nucleotide binding
  GO:0017111; 9.2e-19; nucleoside-triphosphatase activity
KEGG pathway: tca:659847; 9e-180
  K05658 (ABCB1) maps-> ABC transporters
InterPro domain:
  [10-352] IPR011527; 4.7e-57; ABC transporter, transmembrane domain, type 1
  [39-321] IPR001140; 8.8e-51; ABC transporter, transmembrane domain
  [413-538] IPR003439; 6.3e-27; ABC transporter-like
  [398-584] IPR003593; 9.2e-19; ATPase, AAA+ type, core
Orthology group: MCL10075; Multiple-copy universal gene

Nucleotide sequence:

>DPOGS207747-TA
ATGTCTTATGTCATTTACAGAAATGTCGAAAAAGATACTGCCAAGAAAGATGCGGTCTCCAACATATCATTCAAAACACTGTTCCGTTTTGCCACAACGAAAGACAAAATGTTCATGTGTATAGCGATTGTGGCATCCGTGTTATGTGGATGTACGACACCGATTAATACTCTGCTCTTTTCTTCACTGCTTCAAAGTATGGTCAACTATGGAAAATCTATTGTGTTAAATGACCCTCAACCGGATATATTGCTTAGCGAAGTACAAGATTTTGCTATATACAATGCCGTTCTGGGACTTGTTATTATTATACTGTCATATATCGCAACAGTTCTCATGAACATGTCAGCCTATAATCAGGCACATCGCATTCGTCAAGAGTATTTGAGAGCTACATTGAATCAAGATTTTGAATATTTTGACACTCACAAAACTGGTGATTTCGCAAGCAAAGTCACGGACGATGTTTTAAAACTAGAGGATGGTATAGGAGAGAAACTAGCGACCTTTATATATTACCAAGTGACATTCATAAGCTCTATAATAATGGCGCTGGTGAAGGGCTGGAAACTAACACTCTTATGTTTGATCTCATTTCCTATCACTTGTCTGTTAATTGGAGTCACAGGCTTTTTTGGGGCGCGAATATCGTATAAAGAAGCTATTTCGGCGGCAAAAGCTGGTTCTATAGCTGAGGAAGTATTGTCGTCTATAAGAACAGTATTTGCTTTTAGTGGTCAGGAAAAAGAATTAAAAAGATACGAAAAATATACTGTGGAAATGAGAGGATTACACGCAAAGAAAAGCATTTTCAATGGCATGTTGATGGGGTTGATATACCTATGTTTGTTTGGGTCTTACTCTTTATGTTTTTGGTTCGGATATAAGTTCATGATGGATGAACCAGAATTATATGACGTCAACACCATGATTGCTGTACTTTTTAGTGTATTAATGGGTTCAACGAACTTTGGTATGTCGGCTACCATCATGGACGTGTTTGGATCAGCGCGTGGCGCTGGTGAACAAATCTTCAATTTAATAGACAATGTCCCCAAAATTAATCCGCTCCTGAACCTTGGAATTGCCCCCAAAAGTATTGAAGGAAACATTGAATTCAAAAACGTTTGCTTCCATTATCCTTCCAGACCTAACGTCAAAATATTAAAGGGAATTAATCTCAGTATCAAGAAAGGGCAATCAGTTGCGTTAGTTGGACATTCAGGTAGCGGCAAGTCTACTATCGTACAACTGATATCAAGAAACTATGATGTAATAAGTGGCAGTGTCCGAATTGATGGTAATGATGTCAAAGACCTTTCGGTGAAATGGTTGAGAGCTCAGATCGGTTTAGTCGGTCAGGAGCCGGTCCTCTTTAATACAACAGTCCGAGAGAACATCAGGTATGGCCGAGAGGACGCTACTAATGAGGAAATAGAAAAAGTCGCGAAGCAAGCTAATGCTCATGAGTTTATTATGAAACTTCCGTTAGGATATGATACATTAGTTGGAGAACGTGGTACATCACTATCGGGAGGTCAAAAACAAAGAATTGCGATAGCTCGAGCGCTTGTACGAAATCCAGCGATATTATTGCTAGATGAAGCCACTAGTGCACTAGATACTGCTTCAGAGGCTAAAGTGCAGAAGGCCTTAGACAGAGCCCAAGAAGGTCGTACAACTATTGTTGTTGCTCATAGACTCACGACCATAAGAAATGTTGACAAAATTTATGTTTTCAAAAGTGGAGATGTGATAGAAAGCGGAACTCATGACGAACTTATTGCCAAGAAAGGTCACTTTTACGATATGGTAAAACTACAAACATCAAACAATGTAAAGGAGAAAGGTCCATCTAATAAAATCGATAGAAGTGAATCATTGTTAAGTGAAAAAGAAGAAAATAAGCAAATGGAAACTAGAGAGCAAATCGCAGTAGAGGCGGTTGCTAATGTCCGGACAGTTTCATCGCTAGGACGTGAACAAATTATTCTCCAAGA
CTACGCAAACCAACTTCTGCCCGTACTGCAAATTGCTAAAAAAACAACACATTGGCAAGGAATTGTCTTTGGAATGTCTAGAGGACTTTTCAACCTGGTATATTCCATTACCATGTTTTACGGAGGCCATCTTATGGTGTACCAGGGAATCGGATATGAAATTGTTCTCAAATCAGCTCAAACTTTATTAATGGGTTCATCATCAGCAGCCCAAGCCTTTGCATATGCTCCTAATTTCCAGAGAGGGATTAAAGCTGCAGGAAGAATCATAATTACTTTAGCTAGACAAACAAAAATCACGGACCCCGTCAAACCTGCAGTAGAAAATTTTGTAGGAAACGGTGAGGCAAGCATAACGAATGTTACATTTACTTATCCGACTAGGCCCTTAATACAAGTATTGAAGGATTGTAACTTAGAAATTGAGAATGGGAAAACAGTAGCTCTGGTATATAAGATTCGTCAGGAGTATCTAAAAGCCGCTTTAAATCAAGACTTCGAATACTTCGATACCCATCAAACAGGAGATTTTGCTAGCAAAGTGACGAGTGATGTCATAAAACTAGAAGACGGTATAGGAGAAAAGCTGGCTACGTTCATGGCATCAAGATTGTCTTACAAAGAAGCCGTTGCTTCTGCAAAAGCTGGATCTGTAGCTGAGGAAGTATTGTCATCGATAAGAACAGTATTTGCTTTTAGTGGGCAAAAAAGGAAACTGAAAGATATGAAAAACACCTAA

Protein sequence:

>DPOGS207747-PA
MSYVIYRNVEKDTAKKDAVSNISFKTLFRFATTKDKMFMCIAIVASVLCGCTTPINTLLFSSLLQSMVNYGKSIVLNDPQPDILLSEVQDFAIYNAVLGLVIIILSYIATVLMNMSAYNQAHRIRQEYLRATLNQDFEYFDTHKTGDFASKVTDDVLKLEDGIGEKLATFIYYQVTFISSIIMALVKGWKLTLLCLISFPITCLLIGVTGFFGARISYKEAISAAKAGSIAEEVLSSIRTVFAFSGQEKELKRYEKYTVEMRGLHAKKSIFNGMLMGLIYLCLFGSYSLCFWFGYKFMMDEPELYDVNTMIAVLFSVLMGSTNFGMSATIMDVFGSARGAGEQIFNLIDNVPKINPLLNLGIAPKSIEGNIEFKNVCFHYPSRPNVKILKGINLSIKKGQSVALVGHSGSGKSTIVQLISRNYDVISGSVRIDGNDVKDLSVKWLRAQIGLVGQEPVLFNTTVRENIRYGREDATNEEIEKVAKQANAHEFIMKLPLGYDTLVGERGTSLSGGQKQRIAIARALVRNPAILLLDEATSALDTASEAKVQKALDRAQEGRTTIVVAHRLTTIRNVDKIYVFKSGDVIESGTHDELIAKKGHFYDMVKLQTSNNVKEKGPSNKIDRSESLLSEKEENKQMETREQIAVEAVANVRTVSSLGREQIILQDYANQLLPVLQIAKKTTHWQGIVFGMSRGLFNLVYSITMFYGGHLMVYQGIGYEIVLKSAQTLLMGSSSAAQAFAYAPNFQRGIKAAGRIIITLARQTKITDPVKPAVENFVGNGEASITNVTFTYPTRPLIQVLKDCNLEIENGKTVALVYKIRQEYLKAALNQDFEYFDTHQTGDFASKVTSDVIKLEDGIGEKLATFMASRLSYKEAVASAKAGSVAEEVLSSIRTVFAFSGQKRKLKDMKNT-
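
Consistency check:

The transcript and protein lengths listed above agree if the transcript is read as a coding sequence with no untranslated regions: 3 x (912 aa + 1 stop codon) = 2739 bp. The Python sketch below illustrates that arithmetic and translates the first and last codons of DPOGS207747-TA. It assumes the standard genetic code (NCBI translation table 1) and that the trailing "-" in the protein record denotes the stop codon. The function names are hypothetical and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered by TCAG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def expected_cds_length(protein_aa: int, has_stop: bool = True) -> int:
    """Length in bp of a coding sequence for a protein of protein_aa residues."""
    return 3 * (protein_aa + (1 if has_stop else 0))


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, matching the '-' used in the
    protein record above (an assumption about that notation).
    Trailing bases that do not fill a codon are ignored.
    """
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Lengths taken from the record above.
TRANSCRIPT_BP = 2739
PROTEIN_AA = 912
print(expected_cds_length(PROTEIN_AA) == TRANSCRIPT_BP)

# First 30 bases and last 12 bases of DPOGS207747-TA, copied from the record.
print(translate("ATGTCTTATGTCATTTACAGAAATGTCGAA"))
print(translate("AAAAACACCTAA"))
```

Both translated fragments can be compared against the start (MSYVIYRNVE) and end (KNT-) of DPOGS207747-PA.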